📊 What data visualisation is (and isn't)

📊 What data visualisation is (and isn't)

Hello everyone! Today, I'd like to share some thoughts on something I find particularly important: Data visualisation. Far from being just a collection of eye-catching charts, it's about turning raw data into relatable stories, making complex numbers understandable and actionable for everyone.

Now, I know the term can feel a bit overused, but i feel there is often room to improve. Data visualization is more than a buzzword; it's a powerful tool that transforms raw, often confusing data into a digestible and visually appealing format. This transformation is not just about making data look good; it's about making it understandable and actionable. I think this alluvial plot from the NY Times is a good example of how it can be done:


The plot effectively portrays the evolving performance of countries across Olympic events through colorful flows. Clear labels and focused call-outs guide the viewer's attention, making the complex data easily understandable. Grouping countries together minimizes clutter, further aiding comprehension. Overall, the plot exemplifies how well-executed data visualization can serve as a powerful tool for communication, decision-making, and collaboration.

What is Data Visualization?

Data visualization is a storytelling medium. It's not merely a process of laying out numbers in graphical form; it's about converting those numbers into a narrative. That narrative can illuminate trends, patterns, and insights that might otherwise remain buried in spreadsheets. The storytelling aspect is crucial for making the data relatable and understandable, especially to those who aren't data experts.

One of the strengths of data visualization is that it can serve as a universal language that bridges the gap between Experts and Non-Experts. A well-crafted chart or graph can often say more than a lengthy report, making it a useful tool for quick and effective communication.

Now, it's not just about making data understandable; it's about making it Engaging and Memorable. Visual elements like colors and shapes not only make the data aesthetically pleasing but also evoke emotional responses. When used effectively, these elements can reinforce the core message of the data, making it memorable and impactful.

By distilling complex data into easily digestible visual formats, data visualization accelerates the decision-making process. Stakeholders can quickly identify trends, correlations, and outliers that would be less apparent in raw data. This clarity enhances the quality of decisions while also saving time and resources.

Moreover, data visualization promotes Collaboration and Communication. Its shareable nature makes it an excellent tool for team discussions, presentations, and reports. Visuals offer a shared point of reference, which can make these interactions more focused and productive.

What is Data Visualization Not?

However, let's not forget what Data Visualization Is Not: While a visually appealing chart can capture attention, the primary goal of data visualization is not aesthetic. Overemphasizing the visual aspect can distract from the core message and even lead to misinterpretations. Similarly, data visualization is Not One-Size-Fits-All. Different datasets and audiences might require different types of visual representation.

Using the same types of charts or dashboards for various kinds of data can result in either an information overload or a dearth of necessary information. And speaking of dashboards, while they are fantastic tools for tracking Key Performance Indicators (KPIs), they're not the be-all and end-all. It's crucial to understand what a dashboard can and cannot measure, ensuring that it complements, rather than replaces, thorough analysis.

Finally, while visualizations can aid in decision-making, they should Not Substitute for critical thinking. They offer a "snapshot" of the data, but they're not a replacement for thorough analysis or discussion. They should be the starting point, prompting further investigation and analytical thinking.

Here is an example which i took from this blog post:

Four Principles for Well-Designed Data Visualization

This may seem like a lot to consider when creating a data visualization, but the good news is that there are established frameworks to guide us. They help to implement these ideas effectively, ensuring that our visualizations are both functional and insightful. One such framework are has been outlined by Edward Tufte, a pioneer in the field. These four principles offer a robust foundation for anyone looking to master the art and science of data visualization:

  1. Firstly, the Maximize Data-Ink Ratio principle emphasizes that the ink used should primarily represent and communicate the data. This approach isn't about adding unnecessary visual elements like gridlines or heavy backgrounds, which can distract from the data's core message. By focusing on the data-ink ratio, the visualization remains a powerful tool for delivering clear, direct insights, aiding both storytelling and decision-making.
  2. Next, Use Simple Labels. Labels should be straightforward and aim to enhance, not distract from, the data presentation. Overly complex or flashy labels can make a visualization hard to understand, defeating its primary purpose. Using simple labels makes the visualization more accessible and facilitates a seamless connection between the data and the audience.
  3. The third principle, Avoid Chartjunk, advises against the use of unnecessary decorations like 3D effects or excessive color. Remember, data visualization is not an art project. Its focus should be on clarity and function, not aesthetics. By eliminating these distractions, the visualization becomes an effective tool for quick understanding and informed decision-making.
  4. Last but not least, Focus on Data. The data should be the central element, and all other components like lines or labels should only serve to make the data more comprehensible. By keeping the focus squarely on the data, the visualization aligns perfectly with its core purpose: to serve as a catalyst for collaboration and identify areas that might need improvement or further investigation.

Understanding and applying these principles can significantly enhance the effectiveness of your data visualization efforts, making them not just visually appealing but also highly functional and insightful.

Exploring the Dynamics of Electric Vehicle Charging Infrastructure

To illustrate the significance of advancements in electric vehicle (EV) charging infrastructure, let's examine some real-world data, particularly focusing on Germany. This is a timely issue, as the availability of public charging stations is a crucial factor for the widespread adoption of EVs, especially in urban settings. As of the end of 2022, there were 2.7 million public charging points globally—a 55% increase from the previous year. The ratio of electric light-duty vehicles (LDVs) to each charging point was around ten, according to global statistics provided by the International Energy Agency (IEA). Both the European Union and the United States have committed substantial funding to advance this sector further.

In the analyzed data from Germany, the average ratio of EVs to each charging station across various regions was 27.82. The ratios varied widely, with the highest being an astonishing 142.78. The data, sourced from an API provided by the German government, may not cover all the stations but nonetheless offers valuable information about regional variations.

To make sense of this data, I first plotted a graph that included the IEA's recommended ratio of 10 EVs per station as a target line.

The design of the graph is minimalist, focusing solely on essential elements to facilitate understanding. A white background and the absence of gridlines ensure that the viewer's attention is centered on the critical information: the ratio of EVs to charging stations. Next, data points representing the performance of different German regions were added to the graph.

Incorporating these data points adds real-world context but also introduces challenges due to the dispersion of points and differing population sizes. To address this, two key adjustments were made:

  • Resizing Data Points According to Population Size: This adds a third dimension to the plot, allowing viewers to gauge how the number of cars and stations relate to the population size.
  • Logarithmic Axis Scales: This technique spreads the data points more evenly, enabling easier interpretation.

Both changes enhance the graph's clarity and utility, aligning well with data visualization best practices.

After implementing these adjustments to our graph, it's evident that most geographical areas lag behind the goal of having 10 electric vehicles (EVs) per charging station. However, small to medium-sized regions are an exception to this rule. An interesting observation is that the data for the three largest cities in Germany cluster closely together. This indicates that these cities, despite having different population sizes, have a similar ratio of EVs to charging stations. The implication here is that in larger cities, the availability of charging stations is becoming increasingly aligned with the number of EVs, rather than the size of the population.

To build on this preliminary observation, the logical next step is to employ statistical methods. By doing so, we can estimate the number of charging stations needed for a specific number of EVs, while accounting for variables like regional differences within the Bundesländer, types of regions, and varying sizes in both area and population. Utilizing a statistical approach will enhance our understanding of the relationship between EVs and charging stations, making our initial observations from data visualization more robust.

Further analysis allows us to estimate the number of charging stations required per number of EVs, keeping other factors constant. The results indicate that most regions have an estimated 25 to 30 EVs per station, with smaller regions performing slightly better. At this point, there are two avenues we can explore further:

  1. We can examine how different variables influence our estimations and assess the reliability of these findings.
  2. We can delve into regional differences within the Bundesländer.

Interactive Plot

To give an idea of the second point, I've coloured the regions in the graph by federal state. While this provides some initial insights - for example, Bavaria seems to be doing well while North Rhine-Westphalia is lagging behind - the data can get a bit cluttered.

click for interactive plot

Future analysis could use techniques such as clustering or faceting to make these patterns clearer.

---

The data and subsequent analysis offer promising avenues for understanding the evolving landscape of EVs and charging stations. While there's still work to be done, these findings lay a strong foundation for future studies and improvements in the sector.

Keep up and stay tuned. 🐜

Henry




To view or add a comment, sign in

More articles by Henry Zehe

Insights from the community

Others also viewed

Explore topics