How to Find and Work with Data

A collection of resources on finding, accessing, evaluating, and working with data responsibly, critically, and ethically.

Data Viz Books

Why Visualize Data?

A series of four charts which share a linear regression but whose individual distributions are vastly different.

From Wikipedia: This graphic represents the four datasets defined by Francis Anscombe for which some of the usual statistical properties (mean, variance, correlation and regression line) are the same, even though the datasets are different. Reference: Anscombe, Francis J. (1973) Graphs in statistical analysis. American Statistician, 27, 17–21.

Data Visualization: Best Practices

Data Visualization: Worst Practices

Data Visualization Tools

A reminder about free and public tools: please be careful and cautious with the data you share with them. Dig into their privacy and data policies and err on the side of caution.