Apache Spark User List - how to split RDD by key and save to different path
1. be careful, HDFS are better for large files, not bunches of small files.2. if that's really what you want, roll it your own.
Read full article from Apache Spark User List - how to split RDD by key and save to different path
No comments:
Post a Comment