Data Refinery is part of IBM Watson and comes with both the IBM Watson Studio and IBM Watson Knowledge Catalog applications. It’s a self-service data preparation client for data scientists, data engineers, and business analysts. With it, you can quickly transform large amounts of raw data into consumable, high-quality information that’s ready for analytics. Data Refinery makes it easy to explore, prepare, and deliver data that people across your organization can trust.

  • Ability to access data wherever it resides: in the cloud, on-premises, or on your desktop
  • Powerful shaping operations to clean, organize, fix, and validate data
  • Scripting support for RStudio’s dplyr for the efficient and flexible manipulation of data sets
  • Support for single- and multi-column operations and the creation of complex new columns from existing columns
  • Ability to undo, redo, and delete steps in a data flow
  • Monitoring of data preparation flows
  • Interactive data validation and automatic detection of anomalies such as missing values, outliers, and duplicates
  • Visualizations that provide insight into large amounts of data
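
To make the anomaly checks in the list above concrete, here is a minimal Python sketch of what flagging missing values, duplicates, and outliers in a single column can look like. It is an illustration only, not Data Refinery's implementation: the function name `profile_column`, the sample `ages` data, and the 1.5 × IQR rule used for outliers are all assumptions made for this example.

```python
from statistics import quantiles


def profile_column(values):
    """Summarize one column: missing count, duplicate count, and outliers.

    Hypothetical helper for illustration; not part of any IBM API.
    Missing values are represented as None.
    """
    present = [v for v in values if v is not None]
    missing = len(values) - len(present)

    # Every repeat of an already-seen value counts as one duplicate.
    duplicates = len(present) - len(set(present))

    # Assumed outlier rule: anything beyond 1.5 * IQR from the quartiles.
    q1, _, q3 = quantiles(present, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    outliers = [v for v in present if v < low or v > high]

    return {"missing": missing, "duplicates": duplicates, "outliers": outliers}


# Made-up sample column with two gaps, one repeated value, and one bad entry.
ages = [34, 29, None, 41, 29, 38, 250, 36, None, 33]
report = profile_column(ages)
print(report)
```

On this sample column, the sketch reports two missing values, one duplicate (the repeated 29), and 250 as the only outlier. A hand-written check like this covers one column at a time; the point of a tool such as Data Refinery is to surface the same kinds of issues interactively across a whole data set.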




Earn a Data Refinery Essentials Badge

If you’re using Data Refinery, consider earning the Data Refinery Essentials IBM Open Badge to share verified proof of your achievement.

Additional Watson Studio badges are available for Watson Studio Essentials, Streams Flows, Visual Recognition, and Dashboards.

Find articles, tutorials, notebooks, and more in the IBM Watson Studio Gallery.
