IBM Developer
Learning Path
Get started with Data Prep Kit (DPK)
Overview
Introducing Data Prep Kit (DPK)
Applying DPK for LLM applications
Preparing data for fine-tuning LLMs
Preparing data for a RAG pipeline
Preparing data for an agentic workflow
Deep dive into DPK modules
Exploring pre-built DPK transforms
Building custom data prep modules
Scaling data prep workflows
Summary
Tutorial
Scaling data preparation workflows in Data Prep Kit (DPK)
Implementing a Ray runtime and adding Kubeflow Pipelines extensions to existing transforms
By Maroun Touma, Shahrokh Daijavad, Revital Eres, Alexey Roytman, Mohammad Nassar