Data Refinery is part of IBM Watson, and comes with both the IBM Watson Studio and IBM Watson Knowledge Catalog applications. It’s a self-service data preparation client for data scientists, data engineers, and business analysts. With it, you can quickly transform large amounts of raw data into consumable, quality information that’s ready for analytics. Data Refinery makes it easy to explore, prepare, and deliver data that people across your organization can trust.
- Ability to access data wherever it resides: in the cloud, on-premises, or on your desktop
- Powerful shaping operations to clean, organize, fix, and validate data
- Scripting support for RStudio’s dplyr for the efficient and flexible manipulation of data sets
- Support for single- and multi-column operations and the creation of complex new columns from existing columns
- Ability to undo, redo, and delete steps in a data flow
- Monitoring of data preparation flows
- Interactive data validation and automatic detection of anomalies such as missing values, outliers, and duplicates
- Visualizations that provide insight into large amounts of data
Watch this short video to see how IBM Watson Studio can help guide your company’s success and assist every worker using data to ask questions, extract meaningful insights and accelerate business decisions.
Ready to get started?
Earn a Data Refinery Essentials Badge
If you’re using Data Refinery, consider earning the Data Refinery Essentials IBM Open Badge to share verified proof of your achievement.