This video shows a community notebook which introduces basic Spark concepts and helps you to start using Spark for R. In this notebook, you’ll use the publicly available mtcars data set from Motor Trend magazine to learn some basic R. You’ll learn how to load data, create a Spark DataFrame, aggregate data, run mathematical formulas, and run SQL queries against the data. To do so, from within the IBM Watson Studio, click the Notebooks section in the Watson Community, and search for Spark R.

Free SparkR course

Looking to master Apache Spark with SparkR to perform large scale data analysis? SparkR provides a distributed data frame API that enables structured data processing with a syntax familiar to R users. This course will help you:

  • Learn why R is a popular statistical programming language with a number of extensions that support data processing and machine learning tasks.
  • Learn how SparkR, an R package that provides a light-weight frontend, uses Apache Spark from R.

Enroll in this free Big Data University course: Analyzing Big Data in R using Apache Spark

2 comments on"Community Notebook: Use Spark R to Load and Analyze Data"

  1. “This video shows a community notebook which introduces basic Spark concepts and helps you to start using Spark for R”
    The example in the video is about Python though

Join The Discussion

Your email address will not be published. Required fields are marked *