Sources of big data
- http://archive.ics.uci.edu/ml/index.php
- Kaggle
- Statista
- data.world
- DataHub
- AWS open data
- Google public data
Crime
- https://www.fbi.gov/services/cjis/ucr
- https://www.pcr.uu.se/research/UCDP/
- https://www.drugabuse.gov/drug-topics/trends-statistics
Internet
- https://wiki.dbpedia.org/
- https://trends.google.com/trends/explore
- Reddit datasets
Government
- https://www.ukdataservice.ac.uk/
- https://data.gov.uk/
- https://data.london.gov.uk/
- https://www.data.gov/
- https://opengovernmentdata.org/data/
- https://www.cia.gov/library/publications/the-world-factbook/
- https://data.gov.au/
- https://opendata.cityofnewyork.us/
- https://open.canada.ca/en/open-data
House prices
- UK house prices
Mathematics
Health
- https://healthdata.gov/
- https://digital.nhs.uk/data-and-information/data-collections-and-data-sets
- https://www.who.int/data/collections
- https://www.who.int/gho/maternal_health/reproductive_health/en/
- http://portals.broadinstitute.org/cgi-bin/cancer/datasets.cgi
- https://www.cdc.gov/datastatistics/
- https://www.fda.gov/drugs/drug-approvals-and-databases/drugsfda-data-files
- https://github.com/publichealthengland/coronavirus-dashboard
- 1000 Genomes project
- USDA food composition
Business
- https://www.glassdoor.com/research/type/data-sets/
- https://opencorporates.com/
Transport
- https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page
Climate
- Africa climate
- https://openaq.org/
- Historic weather
- NOAA tides and currents data
- Global temperatures
- NASA Earth data
- REEEP
Nature
- NYC squirrel census - see also squirrel attacks
Travel
- https://www.ustravel.org/research
Current affairs
- FiveThirtyEight
Entertainment
- BFI industry data insights
- Spotify developer API
Sport and gambling
- Historical Football Results and Betting Odds Data
Social media
- Facebook API
- Instagram API
- YouTube API
Finance
- Google Finance
- Cryptocompare API
- XE API (free trial)
- https://datahub.io/collections/stock-market-data
- https://data.imf.org/?sk=388dfa60-1d26-4ade-b505-a05a558d9a42
- https://atlas.cid.harvard.edu/
- World Bank Open Data
- Financial Times markets data
To sort
- https://www.gapminder.org/data/
- http://storage.googleapis.com/books/ngrams/books/datasetsv2.html
- https://github.com/rfordatascience/tidytuesday
- https://oxylabs.io
- https://scrapinghub.com
- https://import.io
- https://webscraper.io
References
- https://www.kdnuggets.com/2017/12/big-data-free-sources.html
- https://www.springboard.com/blog/free-public-data-sets-data-science-project/
- https://www.dataquest.io/blog/free-datasets-for-projects/
- https://www.forbes.com/sites/bernardmarr/2016/02/12/big-data-35-brilliant-and-free-data-sources-for-2016/#4166564fb54d
- https://www.tableau.com/en-gb/learn/articles/free-public-data-sets
- https://piktochart.com/blog/100-data-sets/
- https://github.com/awesomedata/awesome-public-datasets
- https://mode.com/analytics-dispatch/interesting-data-sets/
- https://github.com/BuzzFeedNews/nics-firearm-background-checks
- https://github.com/sindresorhus/awesome