🧠 AI with Python – 🧩 Customer Segmentation using KMeans

Posted on: January 27, 2026

Description:

Understanding customers is at the heart of every successful business.

Instead of treating all customers the same, organizations segment them into groups based on behavior, spending patterns, and engagement levels.

In this project, we apply unsupervised machine learning to perform customer segmentation using KMeans clustering, a widely used technique in marketing, retail, and product analytics.

Understanding the Problem

Customer data often lacks explicit labels.

We don’t know in advance which customer belongs to which group — instead, we want the algorithm to discover patterns on its own.

This makes customer segmentation an unsupervised learning problem, where the objective is to group customers such that:

customers within a group are similar
customers across groups are different
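
One way to make "similar within a group" concrete is the within-cluster sum of squares, the quantity KMeans tries to minimize. The sketch below is illustrative only: the toy points and the helper name within_cluster_ss are made up for this example and are not part of the project's dataset or of scikit-learn.

```python
import numpy as np

def within_cluster_ss(points, labels):
    """Sum of squared distances from each point to the mean of its cluster."""
    total = 0.0
    for k in np.unique(labels):
        members = points[labels == k]
        centroid = members.mean(axis=0)
        total += ((members - centroid) ** 2).sum()
    return total

# Toy data (hypothetical): two tight groups of three points each.
points = np.array([[1.0, 1.0], [1.0, 2.0], [2.0, 1.0],
                   [8.0, 8.0], [8.0, 9.0], [9.0, 8.0]])

good_labels = np.array([0, 0, 0, 1, 1, 1])  # matches the visible groups
bad_labels = np.array([0, 1, 0, 1, 0, 1])   # mixes the two groups

wcss_good = within_cluster_ss(points, good_labels)
wcss_bad = within_cluster_ss(points, bad_labels)
print(wcss_good, wcss_bad)
```

A grouping that keeps nearby points together yields a much smaller value than one that mixes the groups, which is the sense in which KMeans prefers it.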

1. Loading Customer Data

We begin with a dataset containing customer income and spending behavior.

import pandas as pd

df = pd.read_csv("customers.csv")
df.head()

Each row represents a customer with attributes like income, spending score, order value, and purchase frequency.

2. Inspecting Feature Ranges

Before clustering, it’s important to understand feature scales.

print(df.describe())

Features such as income and spending score exist on very different numeric ranges, which affects distance-based algorithms.
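
To see why the ranges matter, here is a small sketch with three hypothetical customers. The values and units (income in dollars, spending score on a 1 to 100 scale) are assumptions chosen for illustration, not rows from customers.csv.

```python
import numpy as np

# Hypothetical customers: (annual income in dollars, spending score 1-100).
a = np.array([50_000.0, 10.0])
b = np.array([50_500.0, 90.0])  # similar income, very different spending
c = np.array([52_000.0, 12.0])  # slightly higher income, similar spending

# Euclidean distance on the raw, unscaled values
dist_ab = np.linalg.norm(a - b)
dist_ac = np.linalg.norm(a - c)
print(dist_ab, dist_ac)
```

On raw values, customer a ends up closer to b than to c, even though a and c spend almost identically. The income column dominates the distance simply because its numbers are larger.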

3. Feature Scaling

KMeans relies on Euclidean distance calculations, so features measured on very different scales should be standardized before clustering.

from sklearn.preprocessing import StandardScaler

X = df.drop("customer_id", axis=1)
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)

Scaling ensures that no single feature dominates the clustering process.
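
A quick sanity check of what StandardScaler does, sketched on made-up rows. The column names mirror the ones used in this post but are assumptions about the dataset.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Made-up rows for illustration; not taken from customers.csv.
demo = pd.DataFrame({
    "annual_income": [30_000, 45_000, 60_000, 120_000],
    "spending_score": [20, 55, 80, 40],
})

scaler = StandardScaler()
scaled = scaler.fit_transform(demo)

# After scaling, each column has mean 0 and (population) standard deviation 1.
col_means = scaled.mean(axis=0)
col_stds = scaled.std(axis=0)

# inverse_transform maps scaled values back to the original units.
restored = scaler.inverse_transform(scaled)
print(col_means, col_stds)
```

Keeping the fitted scaler around is useful later, since inverse_transform lets you report cluster centers in original units such as dollars.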

4. Applying KMeans Clustering

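We apply KMeans with four clusters to group customers into segments. The number of clusters is a modeling choice and should be checked against the data rather than assumed.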

from sklearn.cluster import KMeans

kmeans = KMeans(n_clusters=4, random_state=42)
df["segment"] = kmeans.fit_predict(X_scaled)

Each customer is now assigned to a segment based on similarity.
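
The post fixes n_clusters=4 without showing how that value was chosen. A common way to check it is to compare inertia (the elbow method) and the silhouette score across several values of k. The sketch below uses synthetic blobs as a stand-in for the scaled customer features, so the data, the range of k, and the variable names are all assumptions.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for X_scaled: four well-separated groups (hypothetical data).
centers = [[0, 0], [10, 0], [0, 10], [10, 10]]
X_demo, _ = make_blobs(n_samples=300, centers=centers,
                       cluster_std=0.6, random_state=42)
X_demo = StandardScaler().fit_transform(X_demo)

scores = {}
for k in range(2, 8):
    model = KMeans(n_clusters=k, n_init=10, random_state=42)
    labels = model.fit_predict(X_demo)
    scores[k] = {
        "inertia": model.inertia_,                          # lower is tighter
        "silhouette": silhouette_score(X_demo, labels),     # higher is better
    }

# Pick the k with the highest silhouette score.
best_k = max(scores, key=lambda k: scores[k]["silhouette"])
print(best_k)
```

On real customer data the peak is usually less clear-cut than on synthetic blobs, so the score is best read alongside whether the resulting segments make business sense.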

5. Visualizing Customer Segments

Visualization helps translate clusters into business insight.

import matplotlib.pyplot as plt

plt.scatter(
    df["annual_income"],
    df["spending_score"],
    c=df["segment"]
)
plt.xlabel("Annual Income")
plt.ylabel("Spending Score")
plt.title("Customer Segmentation using KMeans")
plt.show()

This plot shows how the segments differ in income and spending behavior. Because the clusters are fitted on all scaled features, groups may overlap in this two-feature view.

Interpreting the Segments

Typical customer segments might include:

low-income, low-spending customers
moderate-income, regular customers
high-income, selective buyers
high-value, frequent purchasers

These insights help tailor marketing campaigns, pricing strategies, and customer experiences.
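
Labels such as "high-income, selective buyers" come from profiling each segment after clustering. A minimal sketch of that step, on made-up customers with assumed column names and two clusters for brevity:

```python
import pandas as pd
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Made-up customers for illustration; not taken from customers.csv.
demo = pd.DataFrame({
    "annual_income": [20_000, 22_000, 25_000, 90_000, 95_000, 100_000],
    "spending_score": [15, 20, 18, 85, 90, 80],
})

X_scaled = StandardScaler().fit_transform(demo)
model = KMeans(n_clusters=2, n_init=10, random_state=42)
demo["segment"] = model.fit_predict(X_scaled)

# Segment size and average feature values, reported in original units.
profile = demo.groupby("segment").agg(
    customers=("annual_income", "size"),
    avg_income=("annual_income", "mean"),
    avg_spending=("spending_score", "mean"),
)
print(profile)
```

The segment numbers themselves are arbitrary, so the descriptive names should be assigned from the profile table, not from the label values.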

Key Takeaways

Customer segmentation is a core unsupervised ML use case.
KMeans groups customers based on similarity, not predefined labels.
Feature scaling is essential for distance-based clustering algorithms.
Visualizing clusters makes results interpretable and actionable.
Segmentation enables personalization and data-driven business decisions.

Conclusion

Customer segmentation using KMeans demonstrates how machine learning uncovers hidden structure in customer data.

By grouping customers based on behavioral patterns, businesses can move beyond generic strategies and deliver targeted, personalized experiences.

This project showcases a practical end-to-end unsupervised learning workflow, making it a strong addition to the AI with Python – Real-World Mini Projects (Advanced) series.

Code Snippet:

import pandas as pd

df = pd.read_csv("customers.csv")
df.head()


print(df.info())
print(df.describe())


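from sklearn.preprocessing import StandardScaler

# Scale only the features; the ID column carries no behavioral signal
X = df.drop("customer_id", axis=1)
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X)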


from sklearn.cluster import KMeans

kmeans = KMeans(
    n_clusters=4,
    random_state=42
)


df["segment"] = kmeans.fit_predict(X_scaled)


import matplotlib.pyplot as plt

# Plot named features; positional columns could pick up the ID column
plt.scatter(
    df["annual_income"],
    df["spending_score"],
    c=df["segment"]
)
plt.xlabel("Annual Income")
plt.ylabel("Spending Score")
plt.title("Customer Segmentation using KMeans")
plt.show()
