From 6d68d5207ca1aca57c583d13c83fb3c3ca53bf8e Mon Sep 17 00:00:00 2001 From: Vladimir Protsenko Date: Thu, 26 Nov 2020 16:29:10 +0000 Subject: [PATCH] Update README.md --- data/README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/data/README.md b/data/README.md index 4b57c6c..299cf07 100644 --- a/data/README.md +++ b/data/README.md @@ -11,6 +11,10 @@ https://archive.org/details/stackexchange posts_sample.xml +``` +sc.textFile("posts.xml").mapPartitions(_.take(1000)).repartition(1).saveAsTextFile("posts_sample.xml") +``` + ### New York City Taxi Data(2010-2013) https://databank.illinois.edu/datasets/IDB-9610843 или https://uofi.app.box.com/v/NYCtaxidata