Skip to main content

Clustering

đŸŽ¯ KMeans​

Class: KMeans

HyperparameterValues
KNumber of clusters
MaxIterationsConvergence limit
ToleranceConvergence threshold
SeedReproducibility
InitMethodRandom, PlusPlus

Exposes after fit: Centroids, Inertia, Iterations


🔍 DBSCAN​

Class: DBSCAN

HyperparameterValues
EpsilonNeighbourhood radius
MinPointsMinimum density

Discovers K automatically. Noise points labeled -1. Exposes NoiseCount.


đŸŒŗ Agglomerative Clustering​

Class: AgglomerativeClustering

HyperparameterValues
KNumber of clusters
LinkageSingle, Complete, Average, Ward

Bottom-up hierarchical merging. Exposes Dendrogram.


📐 Clustering Evaluators​

All evaluators implement IClusteringEvaluator where higher score = better.

EvaluatorClassMetricNotes
SilhouetteSilhouetteEvaluators∈[−1,1]s \in [-1, 1]Higher = better
Inertia (Elbow)InertiaEvaluator−W-W (negated)Use RawInertia() for elbow curve
Davies-BouldinDaviesBouldinEvaluator−DB-DB (negated)Lower DB = better separation
Calinski-HarabaszCalinskiHarabaszEvaluatorCHCHHigher = better, fast