Our Datasets
Chinese venues
List of venues categorized with price and ratings
- type: venues
- cities: Shanghai
- source: www.dianping.com
- capture-type: mixed (scraping + API)
- spatial-precision: GPS coordinate
- temporal-precision: yearly capture
- dataset-precision: complete
- status: done
- count: 350k
Foursquare venues
List of venues categorized with ratings
- type: venues
- cities: Shanghai, Paris
- source: www.foursquare.com
- capture-type: API
- spatial-precision: GPS coordinate
- temporal-precision: continuous capture
- dataset-precision: few at a time
- status: paused
- count: pending
Yelp venues
List of venues categorized with ratings
- type: venues
- cities: Paris
- source: www.yelp.com
- capture-type: API
- spatial-precision: GPS coordinate
- temporal-precision: continuous capture
- dataset-precision: few at a time
- status: paused
- count: pending
in-city Weibo messages
Sina Weibo microblog messages sent from within Shanghai
- type: social network
- cities: Shanghai
- source: www.weibo.com
- capture-type: API
- spatial-precision: GPS coordinate
- temporal-precision: continuous capture
- dataset-precision: few at a time
- status: pending
- count: 2 million
Flickr photos
Photos taken inside cities, captured with their tags but without the picture itself
- type: social network
- cities: Shanghai, Paris, Beijing, Tokyo, NYC, London
- source: www.flickr.com
- capture-type: API
- spatial-precision: GPS coordinate
- temporal-precision: continuous capture
- dataset-precision: few at a time
- status: paused
- count: 400k
Air Quality Index
Air quality index with PM2.5, PM10 and other pollution indicators
- type: weather
- cities: Shanghai, Paris, Beijing, NYC
- source: www.aqicn.org
- capture-type: scraping
- spatial-precision: city wide
- temporal-precision: continuous capture
- dataset-precision: few at a time
- status: paused
- count: pending
weather information
Weather information for the listed cities
- type: weather
- cities: Shanghai, Paris
- source: www.yahooweather.com
- capture-type: API
- spatial-precision: city wide
- temporal-precision: continuous capture
- dataset-precision: few at a time
- status: paused
- count: pending
new building creation
List of remarkable buildings published on archdaily from architecture firms
- type: architecture
- cities: world
- source: www.archdaily.com
- capture-type: scraping
- spatial-precision: city wide
- temporal-precision: daily capture
- dataset-precision: complete
- status: paused
- count: pending
GitHub
List of GitHub user locations
- type: social network
- cities: world
- source: www.github.com
- capture-type: API
- spatial-precision: city wide
- temporal-precision: continuous capture
- dataset-precision: complete
- status: paused
- count: 3 million
roads
roads in graph format with geographic polyline
- type: transportation
- cities: Shanghai, France
- source: www.openstreetmap.org
- capture-type: dataset
- spatial-precision: GPS coordinate
- temporal-precision: monthly
- dataset-precision: complete
- status: running
- count: millions
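The storage format of the road graph is not specified here. As a minimal sketch, assuming each road segment is an edge between two node ids that carries its geometry as a list of (latitude, longitude) points, a graph with geographic polylines could look like the following. The names `RoadEdge`, `polyline_length_m` and `build_adjacency` are hypothetical and used only for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class RoadEdge:
    """One directed road segment (hypothetical layout, not the dataset's schema)."""
    source: str      # id of the start node
    target: str      # id of the end node
    polyline: list   # geometry as [(lat, lon), ...] in decimal degrees


def haversine_m(a, b):
    """Great-circle distance in meters between two (lat, lon) points."""
    radius_m = 6371000.0  # mean Earth radius
    lat1, lon1 = map(math.radians, a)
    lat2, lon2 = map(math.radians, b)
    dlat = lat2 - lat1
    dlon = lon2 - lon1
    h = math.sin(dlat / 2) ** 2 + math.cos(lat1) * math.cos(lat2) * math.sin(dlon / 2) ** 2
    return 2 * radius_m * math.asin(math.sqrt(h))


def polyline_length_m(polyline):
    """Sum the distances between consecutive points of a polyline."""
    return sum(haversine_m(p, q) for p, q in zip(polyline, polyline[1:]))


def build_adjacency(edges):
    """Map each node id to the list of edges leaving it."""
    adjacency = {}
    for edge in edges:
        adjacency.setdefault(edge.source, []).append(edge)
        adjacency.setdefault(edge.target, [])  # make sure sink nodes appear too
    return adjacency
```

Keeping the polyline on the edge lets the same structure serve both routing (via the adjacency map) and map rendering (via the geometry).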
bus lines
bus lines in a graph format with geographic polyline
- type: transportation
- cities: Shanghai, Paris
- source: www.baidu.com, RATP app
- capture-type: scraping, API, dataset
- spatial-precision: GPS coordinate
- temporal-precision: yearly
- dataset-precision: complete
- status: done
- count: 20k
subway lines
subway lines in a graph format with geographic polyline
- type: transportation
- cities: Shanghai, Paris
- source: www.openstreetmap.org, RATP app, Shanghai Metro App
- capture-type: dataset
- spatial-precision: GPS coordinate
- temporal-precision: yearly
- dataset-precision: complete
- status: running
- count: 2k
real estate
prices per square meter for locations
- type: real estate
- cities: Shanghai, Paris
- source: www.anjuke.com, meilleursagents.com
- capture-type: scraping
- spatial-precision: GPS coordinate
- temporal-precision: continuous capture
- dataset-precision: few at a time
- status: running
- count: 125k
cities in books
List of cities mentioned in English books
- type: culture
- cities: world
- source: www.gutenberg.org
- capture-type: dataset
- spatial-precision: city wide
- temporal-precision: yearly
- dataset-precision: complete
- status: running
- count: pending
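Because every entry above follows the same layout (a title line, a description line, then `- key: value` fields), the catalog can be read into records with a few lines of code. The sketch below is only an illustration under that assumption; `parse_catalog` and `with_status` are hypothetical helper names, not part of any existing tooling.

```python
def parse_catalog(text):
    """Parse the catalog layout into a list of {"name", "description", "fields"} dicts."""
    entries = []
    pending = []  # free-text lines seen since the last field line
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if line.startswith("- ") and ":" in line:
            if pending:
                # The two lines right before the first field are title and description;
                # anything earlier (such as a page heading) is ignored.
                name = pending[-2] if len(pending) >= 2 else pending[-1]
                description = pending[-1] if len(pending) >= 2 else ""
                entries.append({"name": name, "description": description, "fields": {}})
                pending = []
            if entries:
                key, value = line[2:].split(":", 1)
                entries[-1]["fields"][key.strip()] = value.strip()
        else:
            pending.append(line)
    return entries


def with_status(entries, status):
    """Keep only the entries whose status field equals the given value."""
    return [e for e in entries if e["fields"].get("status") == status]
```

For example, `with_status(parse_catalog(text), "running")` would return the datasets that are still being captured, given `text` holding the catalog above.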