Data Science Lunch and Learn: Learn clustering algorithms using Python and scikit-learn


November 23, 2020 12:30 pm GMT

Interested in Machine Learning? Join us at this workshop where we compare different clustering algorithms with a detailed walk-through of each of them with a hands-on experience.

Clustering is the assignment of a set of observations into subsets (called clusters) so that observations in the same cluster are similar in some sense. It is a method of unsupervised learning and a common technique for statistical data analysis used in many fields. Categories of clustering algorithms we will be discussing:
– Centroid-based clustering
– Density-based clustering
– Hierarchical clustering

In this workshop, scikit-learn provides data sets that help to illustrate the clustering algorithm differences. 

We’ll use these where needed, but we also use our customer data set to help you visualize clustering with realistic data instead of obvious shapes.

We will run a Jupyter notebook in Watson Studio. If you want to try this out for yourself, please go here to sign up for a free account and follow these instructions to get up and running.

Presenter: Fawaz Siddiqi