Assign each point to the cluster with the closest centroid in Python



18 Nov 2016

import numpy as np
from sklearn.metrics import pairwise_distances


def kmeans(X, k, maxiter, seed=None):
    """
    specify the number of clusters k and
    the maximum iteration to run the algorithm
    """
    n_row, n_col = X.shape

    # randomly choose k distinct data points as initial centroids
    # (replace=False prevents the same point from being picked twice)
    if seed is not None:
        np.random.seed(seed)

    rand_indices = np.random.choice(n_row, size=k, replace=False)
    centroids = X[rand_indices]

    for itr in range(maxiter):
        # compute distances between each data point and the set of centroids
        # and assign each data point to the closest centroid
        distances_to_centroids = pairwise_distances(X, centroids, metric='euclidean')
        cluster_assignment = np.argmin(distances_to_centroids, axis=1)

        # select all data points that belong to cluster i and compute
        # the mean of these data points (each feature individually)
        # this will be our new cluster centroids;
        # a cluster that received no points keeps its previous centroid
        new_centroids = np.array([
            X[cluster_assignment == i].mean(axis=0)
            if np.any(cluster_assignment == i) else centroids[i]
            for i in range(k)
        ])

        # if the updated centroid is still the same,
        # then the algorithm converged
        if np.all(centroids == new_centroids):
            break

        centroids = new_centroids

    return centroids, cluster_assignment
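The assignment step on its own can be sketched with plain NumPy broadcasting, without scikit-learn. This is a minimal illustration rather than part of the snippet above: the function name assign_to_nearest_centroid and the sample data are hypothetical, and it uses squared Euclidean distances, which give the same nearest centroid as the Euclidean distances used in the loop above.

```python
import numpy as np


def assign_to_nearest_centroid(X, centroids):
    """Return, for each row of X, the index of the closest centroid.

    Hypothetical helper for illustration; X has shape (n_points, n_features)
    and centroids has shape (k, n_features).
    """
    # broadcasting yields an array of shape (n_points, k, n_features)
    diffs = X[:, np.newaxis, :] - centroids[np.newaxis, :, :]
    # squared Euclidean distance from every point to every centroid: (n_points, k)
    sq_dists = (diffs ** 2).sum(axis=2)
    # index of the smallest distance in each row is the assigned cluster
    return np.argmin(sq_dists, axis=1)


# made-up sample data: two points near the origin, two near (10, 10)
X = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]])
centroids = np.array([[0.0, 0.5], [10.0, 10.5]])

labels = assign_to_nearest_centroid(X, centroids)
print(labels)  # [0 0 1 1]
```

The broadcasted array holds n_points * k * n_features values, so for large inputs a pairwise distance routine that avoids building it is likely the better choice.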
