|
|
@ -11,6 +11,10 @@ https://archive.org/details/stackexchange
|
|
|
|
|
|
|
|
|
|
|
|
posts_sample.xml
|
|
|
|
posts_sample.xml
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
```
|
|
|
|
|
|
|
|
sc.textFile("posts.xml").mapPartitions(_.take(1000)).repartition(1).saveAsTextFile("posts_sample.xml")
|
|
|
|
|
|
|
|
```
|
|
|
|
|
|
|
|
|
|
|
|
### New York City Taxi Data(2010-2013)
|
|
|
|
### New York City Taxi Data(2010-2013)
|
|
|
|
|
|
|
|
|
|
|
|
https://databank.illinois.edu/datasets/IDB-9610843 или https://uofi.app.box.com/v/NYCtaxidata
|
|
|
|
https://databank.illinois.edu/datasets/IDB-9610843 или https://uofi.app.box.com/v/NYCtaxidata
|
|
|
|