Week 4

Searching for the optimal number of clusters with the elbow method

Inertia is the sum of squared distance from each data point to the center of its cluster. Smaller inertia means that clusters are more tightly organized.

Idea: try different numbers of clusters, check how inertia is changing.

Project 2

Pandas

Series

DataFrame

loc and iloc

Boolean indexing

Sorting

Aggregations

Exercises