You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
e02c4f1542 | 3 years ago | |
---|---|---|
.. | ||
README.md | 3 years ago | |
list_of_countries_sorted_gini.txt | 4 years ago | |
nycTaxiFares.gz | 4 years ago | |
nycTaxiRides.gz | 4 years ago | |
nyctaxi.csv | 3 years ago | |
posts_sample.xml | 4 years ago | |
programming-languages.csv | 4 years ago | |
stations.csv | 4 years ago | |
trips.csv | 4 years ago | |
warandsociety.txt | 4 years ago |
README.md
SF Bay Area Bike Share
https://www.kaggle.com/benhamner/sf-bay-area-bike-share
stations.csv
trips.csv
Stack Overflow Data Dump
https://archive.org/details/stackexchange
posts_sample.xml
sc.textFile("posts.xml").mapPartitions(_.take(1000)).repartition(1).saveAsTextFile("posts_sample.xml")
New York City Taxi Data(2010-2013)
https://databank.illinois.edu/datasets/IDB-9610843 или https://uofi.app.box.com/v/NYCtaxidata
nyctaxi.csv
схема: https://uofi.app.box.com/v/NYCtaxidata/file/33670345557
nycTaxiFares.gz
nycTaxiRides.gz
схема: https://github.com/apache/flink-training/blob/master/README.md#schema-of-taxi-ride-events
List of programming languages
https://en.wikipedia.org/wiki/List_of_programming_languages
programming-languages.csv