If you’re new to Big Data and want to get started using a Hadoop-based platform, check out Explore Hadoop and BigInsights, a new collection of hands-on exercises.

The first lab explains how you can get started using a free VMware image provided by IBM. The image contains many popular open source projects in the Hadoop ecosystem (including HBase, Hive, Pig, ZooKeeper, Oozie, Avro, and Flume) as well as some additional IBM technologies that can make it easier for you to manage your environment and to explore and analyze your data.

The second lab focuses on the Hadoop Distributed File System (HDFS) and MapReduce. You’ll learn a few basic commands, launch a sample job (application), inspect the output, and explore some details about your job’s execution.
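If MapReduce is new to you, a tiny in-memory sketch may help before you run a real job. The Python below imitates the map, shuffle, and reduce steps for a word count. It is only an illustration: it does not use Hadoop or HDFS, the function names are made up for this example, and word count is assumed here as a typical introductory job rather than taken from the lab itself.

```python
from collections import defaultdict


def map_phase(lines):
    """Emit a (word, 1) pair for every word in every input line."""
    for line in lines:
        for word in line.lower().split():
            yield word, 1


def shuffle_phase(pairs):
    """Group values by key, roughly what the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the counts collected for each word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(lines):
    """Run the three phases in order on an in-memory list of lines."""
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    # Hypothetical sample input; a real job would read its input from HDFS.
    sample = ["big data big insights", "hadoop stores big data"]
    for word, count in sorted(word_count(sample).items()):
        print(word, count)
```

On a real cluster the map and reduce steps run in parallel across many machines and the framework handles the grouping for you, but the flow of data is the same.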

Both labs are short: you should be able to complete each in about 15 minutes. Indeed, the idea behind this new lab series is to help beginners get started quickly. Future installments will include brief exercises on additional technologies using this same VMware image. So stay tuned for labs that introduce you to a Web console for exploring and administering your cluster, a spreadsheet-style tool for analyzing Big Data without writing any code, and Big SQL for querying data stored on your cluster.

If you’re not familiar with Hadoop at all, visit the “What is?” video collection on Hadoop Dev to learn the basics quickly. In particular, start with these videos:

