
Clustering models for machine learning

Clustering is a machine learning task that looks for objects that resemble one another and groups them into clusters. What distinguishes clustering from other approaches in machine learning is that the grouping happens automatically, without labeled examples to learn from. In fact, it's fair to say it's the opposite of supervised learning.

Regional topic: clustering models for a Nigerian audience's musical taste 🎧

Nigeria's diverse audience has diverse musical tastes. Using data scraped from Spotify (inspired by this article), let's look at some music popular in Nigeria. This dataset includes data about various songs' 'danceability' score, 'acousticness', loudness, 'speechiness', popularity and energy. It will be interesting to discover patterns in this data!

A turntable

Photo by Marcela Laskoski on Unsplash

In this series of lessons, you will discover new ways to analyze data using clustering techniques. Clustering is particularly useful when your dataset lacks labels. If it does have labels, then classification techniques such as those you learned in previous lessons might be more useful. But in cases where you are looking to group unlabelled data, clustering is a great way to discover patterns.
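To make the idea of grouping unlabelled data concrete, here is a minimal sketch using scikit-learn's `KMeans`. It is only an illustration under stated assumptions: the data is randomly generated rather than taken from the Nigerian songs dataset, the column names simply borrow three of the features mentioned above, and the choice of two clusters is arbitrary.

```python
import numpy as np
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (NOT the real dataset): two made-up groups of songs
# with different typical danceability, acousticness and energy values.
rng = np.random.default_rng(0)
group_a = rng.normal(loc=[0.8, 0.1, 0.7], scale=0.05, size=(50, 3))
group_b = rng.normal(loc=[0.4, 0.7, 0.3], scale=0.05, size=(50, 3))
songs = pd.DataFrame(
    np.vstack([group_a, group_b]),
    columns=["danceability", "acousticness", "energy"],
)

# Scale the features so that no single column dominates the distance computation.
scaled = StandardScaler().fit_transform(songs)

# n_clusters=2 is an assumption made for this toy data; with real data the
# number of clusters usually has to be explored.
model = KMeans(n_clusters=2, n_init=10, random_state=0)
songs["cluster"] = model.fit_predict(scaled)

# Average feature values per discovered cluster.
print(songs.groupby("cluster").mean())
```

Note that no labels are passed to the model: the groups are inferred only from how close the rows are to one another in feature space.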

There are useful low-code tools that can help you learn about working with clustering models. Try Azure ML for this task.

Lessons

  1. Introduction to clustering
  2. K-Means clustering

Credits

These lessons were written with 🎶 by Jen Looper with helpful reviews by Rishit Dagli and Muhammad Sakib Khan Inan.

The Nigerian Songs dataset was sourced from Kaggle and was originally scraped from Spotify.

Useful K-Means examples that aided in creating this lesson include this iris exploration, this introductory notebook, and this hypothetical NGO example.