Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The kmeans metric is exactly the metric you would want to optimize the performance of an algorithm like [bolt](https://arxiv.org/abs/1706.10283). In that and other discretization routines, the value of k is a parameter related to compression ratio, efficiency, and other metrics more predictable than some ethereal notion of how many clusters the data "naturally" has.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: