Industrial Training




ML - Clustering Mean Shift Algorithm


Introduction to Mean-Shift Algorithm


As discussed earlier, it is another powerful clustering algorithm used in unsupervised learning. Unlike K-means clustering, it does not make any assumptions; hence it is a non-parametric algorithm.

Mean-shift algorithm basically assigns the datapoints to the clusters iteratively by shifting points towards the highest density of datapoints i.e. cluster centroid.

The difference between K-Means algorithm and Mean-Shift is that later one does not need to specify the number of clusters in advance because the number of clusters will be determined by the algorithm w.r.t data.


Working of Mean-Shift Algorithm


We can understand the working of Mean-Shift clustering algorithm with the help of following steps −

  • Step 1 − First, start with the data points assigned to a cluster of their own.
  • Step 2 − Next, this algorithm will compute the centroids.
  • Step 3 − In this step, location of new centroids will be updated.
  • Step 4 − Now, the process will be iterated and moved to the higher density region.
  • Step 5 − At last, it will be stopped once the centroids reach at position from where it cannot move further.

Advantages and Disadvantages


Advantages

The following are some advantages of Mean-Shift clustering algorithm −


  • It does not need to make any model assumption as like in K-means or Gaussian mixture.
  • It can also model the complex clusters which have nonconvex shape.
  • It only needs one parameter named bandwidth which automatically determines the number of clusters.
  • There is no issue of local minima as like in K-means.
  • No problem generated from outliers.

Disadvantages

The following are some disadvantages of Mean-Shift clustering algorithm −


  • Mean-shift algorithm does not work well in case of high dimension, where number of clusters changes abruptly.
  • We do not have any direct control on the number of clusters but in some applications, we need a specific number of clusters.
  • It cannot differentiate between meaningful and meaningless modes.


Hi I am Pluto.