Monday, May 05, 2014

Cloudera QuickStart VM

http://blog.cloudera.com/blog/2013/06/quickstart-vm-now-with-real-time-big-data/
http://www.cloudera.com/content/support/en/downloads/download-components/download-products.html?productID=F6mO278Rvo
https://github.com/kite-sdk/kite-examples

  • flume : Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data.
  • hbase
  • hdfs
  • hive
  • hue
  • impala
  • ks_indexer
  • mapreduce
  • oozie: Oozie is a workflow scheduler system to manage Apache Hadoop jobs.
  • solr : SolrTM is the popular, blazing fast open source enterprise search platform from the Apache LuceneTM project. 
  • sqoop : Apache Sqoop(TM) is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.
  • yarn : a resource-management platform responsible for managing compute resources in clusters and using them for scheduling of users' applications.
  • zookeeper




No comments:

Related Posts Plugin for WordPress, Blogger...