This video shows a community notebook that introduces basic Spark concepts and helps you get started with Spark for R. In this notebook, you’ll use the publicly available mtcars data set from Motor Trend magazine to learn some basic R. You’ll learn how to load data, create a Spark DataFrame, aggregate data, run mathematical formulas, and run SQL queries against the data. To find the notebook, open the Notebooks section of the Data Science Experience Community from within IBM Data Science Experience and search for Spark R.
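The steps the notebook covers can be sketched roughly as follows. This is not the notebook's own code; it is a minimal illustration that assumes Spark 2.0 or later with the SparkR package available (older releases used a different entry point). The app name, the `kpl` column, the `cars` view name, and the weight threshold are made up for the example.

```r
# Minimal SparkR sketch (assumes Spark 2.0+ and a local Spark installation).
library(SparkR)

# Start (or attach to) a Spark session; the app name is arbitrary.
sparkR.session(appName = "mtcars-example")

# Load data: mtcars ships with base R as a local data.frame.
# createDataFrame() converts it to a distributed Spark DataFrame.
# Note that the row names (car models) are not carried over.
df <- createDataFrame(mtcars)
head(df)

# Run a formula on a column: convert miles per gallon to km per litre.
# "kpl" is a hypothetical column name chosen for this example.
df <- withColumn(df, "kpl", df$mpg * 0.425144)

# Aggregate: average mpg for each cylinder count.
avg_by_cyl <- summarize(groupBy(df, df$cyl), avg_mpg = avg(df$mpg))
head(arrange(avg_by_cyl, avg_by_cyl$cyl))

# Run SQL: register the DataFrame as a temporary view, then query it.
createOrReplaceTempView(df, "cars")
heavy <- sql("SELECT cyl, COUNT(*) AS n FROM cars WHERE wt > 3 GROUP BY cyl")
head(heavy)

# Shut the session down when finished.
sparkR.session.stop()
```

In a hosted notebook environment a Spark session may already exist, in which case the `sparkR.session()` call attaches to it rather than starting a new one.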
Free SparkR course
Looking to master Apache Spark with SparkR to perform large-scale data analysis? SparkR provides a distributed data frame API that enables structured data processing with a syntax familiar to R users. This course will help you:
- Learn why R is a popular statistical programming language with a number of extensions that support data processing and machine learning tasks.
- Learn how SparkR, an R package, provides a lightweight frontend for using Apache Spark from R.