Semi-supervised clustering with deep metric learning and graph embedding

Xiaocui Li, Hongzhi Yin, Ke Zhou*, Xiaofang Zhou

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

Abstract

As a common technology in social network, clustering has attracted lots of research interest due to its high performance, and many clustering methods have been presented. The most of existing clustering methods are based on unsupervised learning. In fact, we usually can obtain some/few labeled samples in real applications. Recently, several semi-supervised clustering methods have been proposed, while there is still much space for improvement. In this paper, we aim to tackle two research questions in the process of semi-supervised clustering: (i) How to learn more discriminative feature representations to boost the process of the clustering; (ii) How to effectively make use of both the labeled and unlabeled data to enhance the performance of clustering. To address these two issues, we propose a novel semi-supervised clustering approach based on deep metric learning (SCDML) which leverages deep metric learning and semi-supervised learning effectively in a novel way. To make the extracted features of the contribution of data more representative and the label propagation network more suitable for real applications, we further improve our approach by adopting triplet loss in deep metric learning network and combining bedding with label propagation strategy to dynamically update the unlabeled to labeled data, which is named as semi-supervised clustering with deep metric learning and graph embedding (SCDMLGE). SCDMLGE enhances the robustness of metric learning network and promotes the accuracy of clustering. Substantial experimental results on Mnist, CIFAR-10, YaleB, and 20-Newsgroups benchmarks demonstrate the high effectiveness of our proposed approaches.

Original languageEnglish
Pages (from-to)781-798
Number of pages18
JournalWorld Wide Web
Volume23
Issue number2
DOIs
Publication statusPublished - 1 Mar 2020
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2019, Springer Science+Business Media, LLC, part of Springer Nature.

Keywords

  • Clustering
  • Deep metric learning
  • Graph embedding
  • Semi-supervised learning
  • k-means

Fingerprint

Dive into the research topics of 'Semi-supervised clustering with deep metric learning and graph embedding'. Together they form a unique fingerprint.

Cite this