http://java.dzone.com/articles/apache-spark-next-big-data
http://www.cs.berkeley.edu/~matei/papers/2012/nsdi_spark.pdf
http://ampcamp.berkeley.edu/big-data-mini-course/
Spark’s primary abstraction is a distributed collection of items called a Resilient Distributed Dataset (RDD).
No comments:
Post a Comment