This article is part of the Learning path: Get started with Watson Machine Learning Accelerator series.
|An introduction to Watson Machine Learning Accelerator||Article|
|How to use Watson Machine Learning Accelerator||Article + notebook|
|Elastic Distributed Training in Watson Machine Learning Accelerator||Article + notebook|
The adoption of artificial intelligence (AI) has been increasing across all business sectors as more industry leaders understand the value that data and machine learning models can bring to their business. Faster times to create accurate models are essential to driving value for time to market.
IBM Cloud Pak® for Data provides a unified platform that integrates data and AI services to help you build, run and manage AI. IBM Watson® Machine Learning Accelerator is a capability designed to accelerate deep learning with end-to-end transparency and visibility that enables businesses or organizations to bring AI applications into production while making deep learning and machine learning more accessible.
What is Watson Machine Learning Accelerator?
Watson Machine Learning Accelerator is an enterprise AI infrastructure to make deep learning and machine learning more accessible and brings the benefits of AI to your business. It combines popular open source deep learning frameworks with efficient AI development tools.
Data scientists can accelerate their AI journey by scaling out their workload such as tuning their hyperparameters, while sharing GPU resources in an elastic manner with a growing number of data scientists based on fair share allocation or priority scheduling without interrupting jobs. To quickly learn more, watch the following video.
Watson Machine Learning Accelerator is now available on IBM Cloud Pak for Data, running on Red Hat® OpenShift® Container Platform and Intel® servers. Watson Machine Learning Accelerator continues to be available on premise for both accelerated IBM Power Systems™ servers and Intel servers.
Deploying Watson Machine Learning Accelerator
Watson Machine Learning Accelerator can be deployed as a service from IBM Cloud Pak for Data or installed as a stand-alone offering on-premises. To learn about deploying Watson Machine Learning Accelerator as a service, look at IBM Cloud Pak for Data and the Watson Machine Learning service. Or, if you’re installing and configuring Watson Machine Learning Accelerator on-premises, see Planning on installing Watson Machine Learning Accelerator 1.2.3.
Watson Machine Learning Accelerator capabilities
The following is an overview of the key capabilities of Watson Machine Learning Accelerator.
Accelerated deep learning utilizing GPUs
Watson Machine Learning Accelerator runs its deep learning jobs on GPU hardware. GPUs are specialized hardware that businesses do not want to sit idle. These specialized hardware allow for faster deep learning results and higher throughput. View this video for a notebook example that showcases the speed differences between running training workloads on GPU and CPU. You see that your deep learning training is up to 10 times faster on GPU.
Elastic Distributed Training
The Elastic Distributed Training capability in Watson Machine Learning Accelerator enables GPU sharing and reallocation across multiple running jobs using the resource policy defined. Resource policies can be defined between lines of business, projects, or users to ensure fair allocation and priority access to GPU resources. As data scientists submit deep learning training jobs, jobs are automatically allocated across shared resources, simplifying the distribution of training workloads.
See how Elastic Distributed Training balances deep learning jobs with this video.
Automated hyperparameter optimization
Automated hyperparameter optimization helps data scientists optimize the speed of training by automating hyperparameter searches in parallel. See how by using the Watson Machine Learning Accelerator API, you can automate hyperparameter optimization training.
Elastic distributed inference
Publish inference models as services. To learn more about downloading and configuring inference in Watson Machine Learning Accelerator for Cloud Pak for Data, see Downloading the command line tool. If using Watson Machine Learning Accelerator on-premises, configure the command line tool.
This article provided an overview of Watson Machine Learning Accelerator and its capabilities. To learn more about installing and configuring Watson Machine Learning Accelerator or how to use it, see How to use Watson Machine Learning Accelerator.