External Sources
Here are some good outside sources for finding datasets suitable for classes:
- Data is Plural, a weekly newsletter of interesting public datasets.
- SCORE Sports Data Repository, a collection of sports datasets for classroom use
- Stanford Large Network Dataset Collection
- Royal Society Open Science, an open-access general science journal whose papers have Data Availability statements and often have public data
- Scientific Data, a journal publishing descriptions of open datasets (often large and with unusual structure)
- Data in Brief, another journal publishing descriptions of open datasets
- Open Case Studies, a collection of data analysis case studies showing analyses from conception to conclusion
- UCI Machine Learning Repository, containing hundreds of datasets suitable for machine learning tasks
- TidyTuesday, a project collecting datasets for weekly data wrangling and visualization events