Forums/Community Questions & Answers

Working with public data sets...

Alex Villamil
posted this on June 04, 2012 15:39

There are many data sets out there that Datameer can work with. Here's a list of publicly available data sets. You can download the data and upload them to the "Upload files" section of Datameer to work with them. In the case of API's, you will need to call the API to create the data file first before uploading. 


Regional:

https://data.sfgov.org/

http://nycopendata.socrata.com/
http://www.ordnancesurvey.co.uk/oswebsite/products/os-opendata.html
http://www.dados.gov.pt/pt/catalogodados/catalogodados.aspx
http://data.gov.uk/

Aggregators:
http://infochimps.com/
http://buzzdata.com/
http://aws.amazon.com/datasets?_encoding=UTF8&jiveRedirect=1
http://www.google.com/publicdata/directory
http://www.data.gov/
http://Junar.com
http://www.archives.gov/research/alic/tools/online-databases.html
http://www.delicious.com/jbaldwinconnect/DataSets
http://www.guardian.co.uk/news/datablog

Financial:
http://open.bloomberg.com/
http://data.worldbank.org/

Machine Learning

http://archive.ics.uci.edu/ml/

Wireless Data
http://crawdad.org/

Wikipedia:
http://en.wikipedia.org/wiki/Wikipedia:Database_download

Census:
http://factfinder2.census.gov/faces/nav/jsf/pages/index.xhtml

Google Ngrams
http://googleresearch.blogspot.com/2006/08/all-our-n-gram-are-belon...

NASA:
https://wist.echo.nasa.gov/wist-bin/api/ims.cgi?mode=MAINSRCH&JS=1

Stanford Collections
http://snap.stanford.edu/data/index.html

Sports
http://developer.espn.com/

Uncategorized:
http://ftp.ncbi.nih.gov/
http://gettingpastgo.socrata.com
http://platform.newscred.com
http://data.cityofchicago.org
http://data.govloop.com
http://data.gov.uk/
http://data.medicare.gov
http://data.seattle.gov
http://data.sunlightlabs.com
http://developer.yahoo.com/geo/geoplanet/data/
http://econ.worldbank.org/datasets
http://www.kasabi.com
http://linkeddata.org/
http://medihal.archives-ouvertes.fr
http://ngrams.googlelabs.com/
http://public.resource.org/
http://rechercheisidore.fr
http://reddit.com/r/datasets
https://datamarket.azure.com/
http://www.bls.gov/
http://www.crunchbase.com/
http://www.dartmouthatlas.org/
http://www.datakc.org
http://www.factual.com/
http://www.freebase.com/
http://www.kaggle.com/
http://www.ordnancesurvey.co.uk/oswebsite/products/os-opendata.html
http://build.kiva.org/
http://www.imdb.com/interfaces
http://timetric.com/public-data/
http://www2.jpl.nasa.gov/srtm