Saturday, August 23, 2014

Apache Spark

http://spark.apache.org/docs/latest/quick-start.html
http://java.dzone.com/articles/apache-spark-next-big-data
http://www.cs.berkeley.edu/~matei/papers/2012/nsdi_spark.pdf
http://ampcamp.berkeley.edu/big-data-mini-course/

Spark’s primary abstraction is a distributed collection of items called a Resilient Distributed Dataset (RDD).


No comments:

Related Posts Plugin for WordPress, Blogger...