by Alok Singh | Updated March 28, 2019 - Published August 6, 2018
Analytics, Artificial intelligence, Data science
Archived date: 2019-06-04
This developer code pattern uses R4ML, a scalable R package, running on IBM Watson Studio to perform various machine-learning exercises. Developers who are new to Watson Studio and scalable machine learning, and who want to explore and prepare big data, will learn how to use R4ML, which augments the capabilities of the Apache Spark R framework.
In this code pattern, we will use R4ML, a scalable R package running on IBM Watson™ Studio, to perform various machine-learning exercises. For users who are unfamiliar with Watson Studio, it is an interactive, collaborative, cloud-based environment where data scientists, developers, and others interested in data science can use tools such as RStudio, Jupyter Notebooks, and Spark to collaborate, share, and gather insight from their data.
We live in the age of big data. Large volumes of data are generated every day, and analysts and data scientists need to analyze that data to produce business results. However, traditional data science tools like R and Python-based scikit-learn do not scale to big data on their own, which is why frameworks like Apache Spark and Apache Hadoop were created. R4ML is one approach to scaling R-based machine learning to big data.
R4ML provides various out-of-the-box tools and a pre-processing utility for feature engineering. It also provides utilities for sampling data and for exploratory analysis. This pattern provides an end-to-end example that demonstrates the ease and power of R4ML for data pre-processing and data exploration.
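To make the three kinds of steps named above (sampling, pre-processing, and exploratory analysis) more concrete, here is a minimal single-machine sketch in plain Python. It does not use R4ML or its API. The toy records, the column names, and the helper functions `sample_rows`, `impute_mean`, `one_hot`, and `summarize` are all hypothetical and exist only for illustration. R4ML's own utilities are described in the pattern's README.

```python
import random
import statistics

# Hypothetical toy records (not from the pattern's data set); None marks a missing value.
rows = [
    {"carrier": "AA", "distance": 300.0, "delay": 12.0},
    {"carrier": "UA", "distance": None, "delay": 5.0},
    {"carrier": "AA", "distance": 900.0, "delay": None},
    {"carrier": "DL", "distance": 600.0, "delay": 30.0},
]


def sample_rows(data, fraction, seed=0):
    """Sampling: return a random subset holding about `fraction` of the rows."""
    rng = random.Random(seed)
    k = max(1, round(len(data) * fraction))
    return rng.sample(data, k)


def impute_mean(data, column):
    """Pre-processing: fill missing values in `column` with the observed mean."""
    observed = [r[column] for r in data if r[column] is not None]
    mean = statistics.mean(observed)
    return [{**r, column: mean if r[column] is None else r[column]} for r in data]


def one_hot(data, column):
    """Pre-processing: expand a categorical column into 0/1 indicator columns."""
    levels = sorted({r[column] for r in data})
    encoded = []
    for r in data:
        new_row = {k: v for k, v in r.items() if k != column}
        for level in levels:
            new_row[f"{column}_{level}"] = 1 if r[column] == level else 0
        encoded.append(new_row)
    return encoded


def summarize(data, column):
    """Exploration: basic summary statistics for a numeric column."""
    values = [r[column] for r in data]
    return {"min": min(values), "max": max(values), "mean": statistics.mean(values)}


# Chain the steps: impute both numeric columns, encode the categorical one,
# then explore the full data and a 50% sample of it.
clean = impute_mean(impute_mean(rows, "distance"), "delay")
features = one_hot(clean, "carrier")
subset = sample_rows(features, fraction=0.5)

print(summarize(features, "distance"))
print(len(subset), "sampled rows")
```

This sketch holds all the data in memory on a single machine. A scalable package exists precisely so that the same kinds of steps can run against data too large for that.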
When you have completed this code pattern, you will understand how to:
Ready to put this code pattern to use? Complete details on how to get started running and using this application are in the README.