Kmeans without knowing the number of clusters? [duplicate]
One approach is cross-validation. In essence, you pick a subset of your data and cluster it into k clusters, and you ask how well it clusters, compared with the rest of the data: Are you assigning data points to the same cluster memberships, or are they falling into different clusters? If the memberships are roughly … Read more