Clustering
Gist
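Who is closer to whom? Group observations so that members of the same group are more similar to each other than to members of other groups.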
Types
- k-means clustering
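As a rough illustration of k-means, here is a minimal sketch of Lloyd's algorithm in plain Python. The function names (`squared_distance`, `kmeans`), the parameters (`n_iter`, `seed`), and the toy points are assumptions made for this example, not taken from any particular library. It alternates two steps: assign each point to its nearest centroid, then move each centroid to the mean of its assigned points.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, n_iter=100, seed=0):
    """Minimal Lloyd's algorithm. Returns (labels, centroids)."""
    rng = random.Random(seed)
    # Initialise centroids by picking k distinct data points at random.
    centroids = rng.sample(points, k)
    labels = []
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        # Update step: each centroid becomes the mean of its members.
        updated = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                updated.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                # Keep the old centroid if a cluster ends up empty.
                updated.append(centroids[j])
        if updated == centroids:
            break  # centroids stopped moving, so assignments are stable
        centroids = updated
    return labels, centroids


# Toy data (made up for illustration): two well-separated blobs.
points = [(0.0, 0.0), (0.1, 0.2), (0.2, 0.1), (5.0, 5.0), (5.1, 4.9), (4.9, 5.2)]
labels, centroids = kmeans(points, k=2)
print(labels)
```

Note that `k` has to be chosen up front, and the result can depend on the random initialisation. In practice a tested implementation (for example from a statistics or machine-learning library) is preferable to a hand-rolled one like this.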
Criticism
Any use of "data-driven clustering" should be met with some skepticism, since a clustering algorithm will return groups whether or not distinct groups really exist.
- Frank Harrell seems to be against it.
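One way to see why skepticism is warranted is to run k-means on data that has no groups by construction. The sketch below is an assumption-laden toy, not a reproduction of anyone's analysis: it simulates a single cloud of two correlated Gaussian variables (the correlation `rho = 0.8`, the sample size, and the seeds are arbitrary choices) and reuses a compact hand-written Lloyd's algorithm. For every `k` requested, the algorithm hands back `k` groups, and the within-cluster sum of squares drops relative to a single group, even though the data come from one population.

```python
import math
import random


def sqdist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, n_iter=100, seed=0):
    """Compact Lloyd's algorithm (illustrative). Returns (labels, centroids)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    labels = []
    for _ in range(n_iter):
        labels = [min(range(k), key=lambda j: sqdist(p, centroids[j])) for p in points]
        updated = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                updated.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                updated.append(centroids[j])
        if updated == centroids:
            break
        centroids = updated
    return labels, centroids


# One population, no true subgroups: y is correlated with x by construction.
rng = random.Random(42)
rho = 0.8  # assumed correlation, chosen only for illustration
points = []
for _ in range(300):
    z1, z2 = rng.gauss(0, 1), rng.gauss(0, 1)
    points.append((z1, rho * z1 + math.sqrt(1 - rho ** 2) * z2))

wcss = {}   # within-cluster sum of squares for each k
sizes = {}  # number of points per cluster for each k
for k in (1, 2, 3, 4):
    labels, centroids = kmeans(points, k)
    wcss[k] = sum(sqdist(p, centroids[lab]) for p, lab in zip(points, labels))
    sizes[k] = [labels.count(j) for j in range(k)]
    print(k, sizes[k], round(wcss[k], 1))
```

The "clusters" found here are just slices along the direction of correlation. Getting tidy-looking groups out of the algorithm is therefore not, by itself, evidence that real subgroups exist.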
References
- https://darrendahly.github.io/post/cluster/ (Interesting blog post on how, when two variables are correlated, clustering can readily produce a lot of groups. It would be weird if it didn't produce groups.)