Information Maximization Clustering via Multi-View Self-Labelling

Ntelemis, Foivos; Jin, Yaochu; Thomas, Spencer A.

doi:10.1016/j.knosys.2022.109042

Computer Science > Computer Vision and Pattern Recognition

arXiv:2103.07368 (cs)

[Submitted on 12 Mar 2021 (v1), last revised 18 Oct 2021 (this version, v2)]

Title:Information Maximization Clustering via Multi-View Self-Labelling

Authors:Foivos Ntelemis, Yaochu Jin, Spencer A. Thomas

View PDF

Abstract:Image clustering is a particularly challenging computer vision task, which aims to generate annotations without human supervision. Recent advances focus on the use of self-supervised learning strategies in image clustering, by first learning valuable semantics and then clustering the image representations. These multiple-phase algorithms, however, increase the computational time and their final performance is reliant on the first stage. By extending the self-supervised approach, we propose a novel single-phase clustering method that simultaneously learns meaningful representations and assigns the corresponding annotations. This is achieved by integrating a discrete representation into the self-supervised paradigm through a classifier net. Specifically, the proposed clustering objective employs mutual information, and maximizes the dependency between the integrated discrete representation and a discrete probability distribution. The discrete probability distribution is derived though the self-supervised process by comparing the learnt latent representation with a set of trainable prototypes. To enhance the learning performance of the classifier, we jointly apply the mutual information across multi-crop views. Our empirical results show that the proposed framework outperforms state-of-the-art techniques with the average accuracy of 89.1% and 49.0%, respectively, on CIFAR-10 and CIFAR-100/20 datasets. Finally, the proposed method also demonstrates attractive robustness to parameter settings, making it ready to be applicable to other datasets.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2103.07368 [cs.CV]
	(or arXiv:2103.07368v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2103.07368
Related DOI:	https://doi.org/10.1016/j.knosys.2022.109042

Submission history

From: Foivos Ntelemis [view email]
[v1] Fri, 12 Mar 2021 16:04:41 UTC (991 KB)
[v2] Mon, 18 Oct 2021 21:14:16 UTC (1,025 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Information Maximization Clustering via Multi-View Self-Labelling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Information Maximization Clustering via Multi-View Self-Labelling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators