ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image Retrieval

Hao Ren, Ziqiang Zheng, Yang Wu, Hong Lu*, Yang Yang, Ying Shan, Sai Kit Yeung

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

Abstract

The huge domain gap between sketches and photos poses huge challenges for Sketch-Based Image Retrieval (SBIR). The Zero-Shot Sketch-Based Image Retrieval (ZS-SBIR) is more generic and practical but brings an even greater challenge: the additional knowledge gap between the seen and unseen categories. In order to simultaneously mitigate both gaps, we propose an Approaching-and-Centralizing Network (termed 'ACNet') to jointly optimize sketch-to-photo synthesis and image retrieval. The retrieval module guides the synthesis module to generate large amounts of diverse photo-like images that help the sketch domain gradually approach the photo domain to eliminate the domain gap, and thus better serves retrieval. Meanwhile, the retrieval module itself centralizes the embeddings of training samples for learning a similarity measurement to eliminate the knowledge gap. Our approach is simple yet effective, which achieves state-of-the-art performance on two widely used ZS-SBIR datasets and surpasses previous methods by a large margin (eg, 8.2% improvement in terms of mAP@all on TU-Berlin Extended dataset).

Original languageEnglish
Pages (from-to)5022-5035
Number of pages14
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume33
Issue number9
DOIs
Publication statusPublished - 1 Sept 2023

Bibliographical note

Publisher Copyright:
© 1991-2012 IEEE.

Keywords

  • Sketch-based image retrieval
  • deep learning
  • metric learning
  • zero-shot learning

Fingerprint

Dive into the research topics of 'ACNet: Approaching-and-Centralizing Network for Zero-Shot Sketch-Based Image Retrieval'. Together they form a unique fingerprint.

Cite this