`sklearn.metrics`.silhouette_score¶

sklearn.metrics.silhouette_score(X, labels, metric='euclidean', sample_size=None, random_state=None, **kwds)[source]¶

Compute the mean Silhouette Coefficient of all samples.

The Silhouette Coefficient is calculated using the mean intra-cluster distance (a) and the mean nearest-cluster distance (b) for each sample. The Silhouette Coefficient for a sample is (b - a) / max(a, b). To clarify, b is the distance between a sample and the nearest cluster that the sample is not a part of. Note that Silhouette Coefficent is only defined if number of labels is 2 <= n_labels <= n_samples - 1.

This function returns the mean Silhouette Coefficient over all samples. To obtain the values for each sample, use silhouette_samples.

The best value is 1 and the worst value is -1. Values near 0 indicate overlapping clusters. Negative values generally indicate that a sample has been assigned to the wrong cluster, as a different cluster is more similar.

Examples using `sklearn.metrics.silhouette_score`¶

../../_images/plot_affinity_propagation1.png

Demo of affinity propagation clustering algorithm

Demo of DBSCAN clustering algorithm

A demo of K-Means clustering on the handwritten digits data

../../_images/plot_kmeans_silhouette_analysis1.png

Selecting the number of clusters with silhouette analysis on KMeans clustering

Clustering text documents using k-means

sklearn.metrics.silhouette_score¶

Examples using sklearn.metrics.silhouette_score¶

`sklearn.metrics`.silhouette_score¶

Examples using `sklearn.metrics.silhouette_score`¶