By Vinodh Mohan, Rich Hagarty, Ying Chen | Published November 5, 2018 - Updated November 5, 2018
AnalyticsArtificial IntelligenceData ScienceMachine LearningPython
The focus of this code pattern is to provide a start-to-finish workflow that demonstrates the features and capabilities available in the new release of IBM Watson Studio Local.
In this blog post we’ll:
In this code pattern, we attempt to use the chemical properties of wines to classify them into one of 3 categories. The wine properties are provided by a distributed data set from kaggle.
To extract the relevant properties required to classify a wine, Principal component analysis (PCA) is applied to the data set. PCA is a popular dimensionality reduction technique which is used to reduce N number of numerical variables into few principal components that are used as features in the machine learning model. These principal components capture a major percentage of the combined variance effect of all the variables.
For the classification model, Logistic regression (a popular machine learning model) is applied to the extracted components to predict the wine categories.
IBM Watson Studio Local is an out-of-the-box on-premises solution for data scientists and data engineers. It addresses the entire Data Science life cycle and provides an environment where data scientists can work with a variety of tools such as Spark, R, Python, and Anaconda – all integrated to work together in a productive collaborative experience. Either due to GDPR or other data privacy-related issues, Watson Studio Local is perfect for users wanting to perform complex data science related work in the security of their private network.
Aside from running notebooks, Watson Studio also provides projects for multi-tenancy and collaboration, identity hooks for LDAP, an admin console for management, a community tab for finding sample content, and integration with GitHub and GitHub Enterprise. In addition, it’s also deployable to IBM’s popular IBM Cloud Private.
Try the code pattern out. Check it out by going directly to our GitHub repo. The code pattern will walk the user through creating Watson Studio Local assets, running the notebook, and lastly interpreting the results.
Want to see the notebook results directly? Use NBViewer to view one of our code pattern notebooks, for example this one that performs feature engineering on our wine data set.
Keep an eye on IBM Code for more Watson Studio related patterns!
April 4, 2019
Get the Code »
November 15, 2018
Back to top