Nice work! I am testing the performance on a subset of imagenet (https://tiny-imagenet.herokuapp.com/) using InfoMin_800.pth. I changed the number of clusters to 200. I only tested accuracy on training set. I run the minibatch kmeans for several epochs and so far the accuracy seems to be around 10%. Tho the first couple dozen clusters can get very high accuracy (80~90%). Does this seem reasonable? I would imaging if it produced good results on the whole Imagenet with 1000 classes it should produce similar results on a subset. The only caveat I can think of is that I have smaller number of images (100k) so maybe if will affect the quality of k-means clustering.