Deep Learning for Weakly-Supervised Object Detection and Localization: A Survey

Feifei Shao, Long Chen*, Jian Shao, Wei Ji, Shaoning Xiao, Lu Ye, Yueting Zhuang, Jun Xiao

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

66 Citations (Scopus)

Abstract

Weakly-Supervised Object Detection (WSOD) and Localization (WSOL), i.e., detecting multiple and single instances with bounding boxes in an image using image-level labels, are long-standing and challenging tasks in object detection. Hundreds of WSOD and WSOL methods and numerous techniques have been proposed in the deep learning era. To this end, in this paper, we consider WSOL as a sub-task of WSOD and provide a comprehensive survey of the recent achievements of WSOD. Specifically, we firstly describe the formulation and setting of the WSOD, including the background, challenges, basic framework. Meanwhile, we summarize and analyze all advanced techniques and training and test tricks for improving detection performance. Then, we introduce the widely-used datasets and evaluation metrics of WSOD. Lastly, we discuss the future directions of WSOD. We believe that these summaries can help pave a way for future research on WSOD and WSOL.

Original languageEnglish
Pages (from-to)192-207
Number of pages16
JournalNeurocomputing
Volume496
DOIs
Publication statusPublished - 28 Jul 2022
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2022 Elsevier B.V.

Keywords

  • Basic framework
  • Future directions
  • Object detection and localization
  • Techniques
  • Weakly-supervised learning

Fingerprint

Dive into the research topics of 'Deep Learning for Weakly-Supervised Object Detection and Localization: A Survey'. Together they form a unique fingerprint.

Cite this