In Kmeans clustering we can define number of cluster. But is it possible to define that cluster_1 will contain 20% data, cluster_2 will have 30% and cluster_3 will have rest of the data points?
I try to do it by python but couldn't.
In Kmeans clustering we can define number of cluster. But is it possible to define that cluster_1 will contain 20% data, cluster_2 will have 30% and cluster_3 will have rest of the data points?
I try to do it by python but couldn't.
Copyright © 2021 Jogjafile Inc.
Here is a discussion on how to modify KMeans so that the clusters all have the same size. You could modify it further to make the clusters have your desired respective sizes.