Hybrid Cross-Feature Interaction Attention Module for Object Detection in Intelligent Mobile Scenes

Di Tian, Yi Han*, Yongtao Liu, Jiabo Li, Ping Zhang, Ming Liu

*Corresponding author for this work

Research output: Contribution to journal › Journal Article › peer-review

4 Citations (Scopus)

Abstract

Object detection is a fundamental task in computer vision and is of particular importance in intelligent mobile scenes. This paper proposes a hybrid cross-feature interaction (HCFI) attention module for object detection in such scenes. First, we introduce multiple kernel (MK) spatial pyramid pooling (SPP), built on SPP, and use its structure to improve channel attention. The result is a hybrid cross-channel interaction (HCCI) attention module with better cross-channel interaction performance. Second, we strengthen spatial attention by incorporating dilated convolutions, yielding a cross-spatial interaction (CSI) attention module with better cross-spatial interaction performance. Combining the two modules gives the HCFI attention module, which avoids computationally expensive operations. In experiments across various detectors and datasets, the proposed method consistently improves performance, by 1.53% for YOLOX on COCO and by 2.05% for YOLOv5 on BDD100K. Furthermore, we propose a solution that combines HCCI and HCFI to address the challenge of extremely small output feature layers in detectors such as SSD. The experimental results indicate that the proposed method significantly improves the attention capability of object detection in intelligent mobile scenes.
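The abstract relies on two standard building blocks: pooling with several kernel sizes (as in SPP) and dilated convolutions. The following is a minimal sketch of the size arithmetic behind both, not the authors' implementation. The function names, the pooling kernel sizes (5, 9, 13), and the dilation rates are illustrative assumptions; the abstract does not state the values used in the paper.

```python
# Illustrative sketch only: helper names and all numeric settings below are
# assumptions for demonstration, not values taken from the paper.

def effective_kernel(kernel_size: int, dilation: int = 1) -> int:
    """Span covered by a kernel once gaps of (dilation - 1) are inserted."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def same_padding(kernel_size: int, dilation: int = 1) -> int:
    """Padding that preserves the input size at stride 1 (odd kernels)."""
    return (effective_kernel(kernel_size, dilation) - 1) // 2


def output_size(n: int, kernel_size: int, stride: int = 1,
                padding: int = 0, dilation: int = 1) -> int:
    """Output length of a convolution or pooling window along one axis."""
    k_eff = effective_kernel(kernel_size, dilation)
    return (n + 2 * padding - k_eff) // stride + 1


feature_size = 20  # hypothetical feature-map side length

# SPP-style pooling: several kernel sizes, stride 1, size-preserving padding,
# so the pooled maps can be concatenated along the channel axis.
pool_sizes = {
    k: output_size(feature_size, k, padding=same_padding(k))
    for k in (5, 9, 13)  # assumed kernel sizes
}

# Dilated 3x3 convolutions: the covered span grows with the dilation rate
# while the number of weights stays fixed at 3x3.
dilated_spans = {d: effective_kernel(3, d) for d in (1, 2, 3)}  # assumed rates

print(pool_sizes)
print(dilated_spans)
```

Under these assumptions, every pooling branch keeps the feature map at its input size, and a 3x3 kernel covers a wider region as the dilation rate increases without adding parameters, which is consistent with the abstract's claim of avoiding computationally expensive operations.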

Original language: English
Article number: 4991
Journal: Remote Sensing
Volume: 15
Issue number: 20
DOIs
Publication status: Published - Oct 2023

Bibliographical note

Publisher Copyright:
© 2023 by the authors.

Keywords

  • attention mechanism
  • computer vision
  • deep learning
  • intelligent mobile scenes
  • object detection
