One of the announcements that caught my eye at the August San Francisco VMworld was the Hadoop Starter Kit (HSK) released by EMC's Open Innovation Lab. Customers already using Isilon and vSphere downloaded over 500 copies of the HSK 1.0 guide because the HSK is (a) free and (b) easy to use. It represented a low-barrier way to get started with Hadoop in an Isilon/vSphere environment.
Here are some of the use cases that the kit facilitates:
- Hadoop projects or sandbox environments for experimentation
- Using Hadoop to exploit large volumes of unstructured data already living in Isilon
- Bringing Hadoop processing in-house from a public provider
For the Barcelona VMworld Conference this week, the kit has been upgraded to include the following:
- New support for major Hadoop distributions: Apache Hadoop, Cloudera, Hortonworks, and PivotalHD
- Quick deployment via the Big Data Extensions toolkit
- GUI for simplified management (integrated with vSphere Web Client)
- Elastic scaling: the number of active compute virtual machines is adjusted based on specified configuration settings
For more information, you can view the HSK 2.0 page, stop by Booth D207 in Barcelona, or watch the Pivotal or Apache videos.
Steve
EMC Fellow