-from Wikipedia
EDS 240: Lecture 1.2
Data visualization: an intro
Week 1 | January 6th, 2024
What is data visualization?
“…the practice of designing and creating easy-to-communicate and easy-to-understand graphic or visual representations of a large amount of complex quantitative and qualitative data and information with the help of static, dynamic or interactive visual items.”
-from Wikipedia
What is data visualization?
“any graphical representation of information and data”
“part art and part science”
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
History of data visualization
Note: this is by no means a comprehensive history of the visual representation of information!
Images from BBC News, Wikipedia (a, b, c, d), Smithsonian, The Marginalian, Scientific American & Twitter
Why do we visualize data?
04:00
. . . to answer questions / derive insights
Fig Caption: Unusual climate anomalies in 2023 (the red line, which appears bold in print). Sea ice extent (a, b), temperatures (c–e), and area burned in Canada (f) are presently far outside their historical ranges. These anomalies may be due to both climate change and other factors. Sources and additional details about each variable are provided in supplemental file S1. Each line corresponds to a different year, with darker gray representing later years.
A nice Twitter thread on key takeaways from the above paper
. . . to explore & generate new questions
“Exploratory data analysis (EDA) is not a formal process with a strict set of rules. More than anything, EDS is a state of mind…you should feel free to investigate every idea that occurs to you. Some of these ideas will pan out, and some will be dead ends. As your exploration continues, you will hone in on a few particularly productive insights that you’ll eventually write-up and communicate to others.”
-Hadley Wickham, author of R for Data Science
. . . to identify trends
Same summary statistics, different distributions!
Anscombe’s Quartet
(Anscombe 1973)
Datasaurus Dozen
(Matejka & Fitzmaurice 2017, expanding upon Albert Cairo’s Datasaurus)
. . . to prompt discussion
. . . to prompt discussion
gif created from Antti Lipponen’s Temperature Anomolies.
. . . to prompt discussion
gif created from Mark SubbaRao’s Climate Spiral. For a similar visualization with accompanying {ggplot2}
code, see Nicola Rennie’s TidyTuesday contribution!
. . . to create art / tell a story
Patchwork Kingdoms, by Nadieh Bremer portraying the “digital divide” in schools across the world
. . . to create art / tell a story
To enlarge, Right click > Open Image in New Tab
Find the data on the TidyTuesday repo
Vertices of Visualization
Vertices of Communication by Albert Cairo, as presented by Cédric Scherer during his 2022 Graphic Design with ggplot2 workshop (slides)
Why R for data viz?
{ggplot2}
extensions)
Take a Break
05:00