Skip to main content

Same Stats, Different Graphs

The datasaurus dataset, transformed in 12 different ways
The datasaurus dataset, transformed in 12 different ways. Throughout the animation, the mean and sd do not change.

Justin Matejka and George Fitzmaurice show a convincing example of the value of data visualization. In this short paper, the authors transform the "Datasaurus" dataset into 12 different datasets, all with the same mean and standard deviation. 
This makes a very convincing argument to present more than only summary statistics, and it is an entertaining read as well!

...make both calculations and graphs. Both sorts of output should be studied; each will contribute to understanding.

F. J. Anscombe, 1973 
(and echoed in nearly all talks about data visualization...)

See https://www.autodeskresearch.com/publications/samestats for the full article.

How can we help you?