post

Day 5 – Bubble Charts

Rack Top Temp Bubble Chart

Bubble charts visualization takes accuracy to the next level because they can show three variables at one time.  In this case the parameters are rack, top temperature, and time. Each vertical line is a specific rack.  The graph above is displaying the top temperatures.  The bubbles on that vertical line represent the temperature of the rack during the time period (the temperatures are rounded to the closest 2°F increment).  The size of the bubble indicates how long the rack was at that temperature.  For example: the top temperatures at rack 6 were roughly 68°F, 70°F, 72°F, 74°F, and 76°F.  The larger bubbles at 72°F and 74°F indicates that rack is operating at those temperatures the majority of the time.

Takeaways

  • It is instantly obvious that racks 12, 17, and 20 have persistent hot spots.
  • There appears to be multiple racks that share the same temperature profile.  Racks 1 – 4 are all operating in the same temperature range.  Racks 5 – 10 are also similar.
  • Rack 15 seems to be an outlier operating cooler than its surrounding sensors.  Possibly an airflow issue?
  • Racks 31 – 34 are the coldest of the bunch.  There may be an excessive amount of floor tiles in this region.

Issues

  • Bubble charts can only show one measured parameter at a time. In this case, it represents rack top temperatures .Bubble charts are not good for comparing rack top and rack bottom temperatures
  • Also, bubble charts should be reviewed over a specific time duration.   The chart above displays data over the course of a week.  You could certainly look at data over the course of a month or longer, but it becomes more  difficult if you would like to compare two time periods such as one week versus another week.

Previous Posts
About the Data
Day 1 – From 10,000 Feet
Day 2 – Rack by Rack
Day 3 – Learn to Love Graphs
Day 4 – Wordle
Today – Bubble Charts
Tomorrow – Bubble Charts Delta

post

Day 2 – Rack by Rack

Rack by Rack Temperatures (truncated at 30 sensors for space)

This view shows the min, max, and average values for the top and bottom of each rack.  The actual table has 149 rows, but I cropped it for space purposes.  You’ll quickly realize that while this more detailed information is much more useful than yesterday’s table, it is difficult to extract any actionable information if we can’t sort this data.  Here is the same data table sorted to show the 20 hottest rack and 20 coldest based on their average.

Hottest Racks

Top Coldest

Takeaways

  • 13 racks average above the upper thermal limit of 80.6°F and 1 rack averages below the lower limit of 64.4°F.  The hot and cold spots have been identified.
  • On Day 1 we saw that the maximum rack top and rack bottom temperature were both 88.7°F.  Our question was if they came from the same location.  We now see that they don’t – rack 115 has a max top temp of 88.7°F and rack 20 has a max bottom temp of 88.7°F.

Issues

  • While this information is valuable, presenting it in a table is still overwhelming and difficult to quickly extract the most useful pieces of information.
  • Tables are useful for presenting static information but not very good at showing time dependent patterns.  Min, max, and average provide a good summary for the entire time period, however they cannot reveal any time dependencies.

Previous Posts
About the Data
Day 1 – From 10,000 Feet
Today – Rack by Rack
Tomorrow – Learn to Love Graphs

post

Day 1 – From 10,000 Feet

Data Summary

Let’s begin by taking a very high level view.  The table above calculates the minimum, maximum, and average temperature for the entire data set and display it in a simple table.

Takeaways

  • Maximum temperature exceeds the ASHRAE recommended upper thermal limit of 80.6°F and the minimum temperature exceeds the lower limit of 64.4°F.  It is evident that there are hot spots…and cold spots
  • Average temperatures for rack tops and bottoms are within ASHRAE recommended ranges
  • Top temperatures are hotter than bottom temperatures, as expected, for min value and average value, but not for max value.  How come?

Issues

  • It doesn’t make sense that the top and bottom temperatures would be the same for the maximum value.  Are they coming from the same location?
  • You can’t determine which specific racks are outside of thermal threshold so there is no way of locating the hot spots. Although this information is at a glance simple, useful and high level, a data center operator needs more granular visibility into the statistics for each rack

Are there any other important takeaways you can identify in the data?  Leave your thoughts in the comment section.

Posts in Series
Yesterday – About the Data
Today – From 10,000 Feet
Tomorrow – Rack by Rack

post

Data Confessional

“If you torture the data long enough, it will confess.”

                                                   -Ronald Coase

I love that quote because it gives data a humanistic capability…the capability to obtain useful information. Inside every set of data, no matter how simple or mundane, there is information to be obtained. Data not only contains information that is obvious, but also information that we don’t yet know exists. When looking through raw numbers, charts, and graphs from every perceivable angle and combination, there comes a moment that these small but important pieces of information will emerge. With persistence, an open mind, and a will to engage in a bit of targeted torture, you can make a little bit of data go a long way. Over the next seven days, I will take a set of seemingly simple temperature data and massage it, analyze it, and torture it until it confesses what it knows. Each day I will present the data using a new visual. I will show you how, by analyzing and displaying data in different ways, you gain entirely new actionable information and surprising insights.

About The Data

The data that I will be analyzing is one week of rack inlet air temperatures obtained from an intelligent energy management system in a data center. Temperatures were measured in the cold aisle at the top and bottom of the rack. The data was collected with 149 sensors resulting in 298 monitored points in the data set (149 top temps and 149 bottom temps). The data is comprised of hourly averages for each point for the duration of the week. Below is a sample of the raw data.

Raw Data (Top of Rack Temps)

Raw Data (Bottom of Rack Temps)

Tomorrow will be the first day of analyzing this information in the attempt to gain insightful and actionable information.  Stay tuned.

Tomorrows Post: From 10,000 Feet