Exploiting detected visual objects for frame-level video filtering

Xingzhong Du, Hongzhi Yin*, Zi Huang, Yi Yang, Xiaofang Zhou

*Corresponding author for this work

Research output: Contribution to journalJournal Articlepeer-review

Abstract

Videos are generated at an unprecedented speed on the web. To improve the efficiency of access, developing new ways to filter the videos becomes a popular research topic. One on-going direction is using visual objects to perform frame-level video filtering. Under this direction, existing works create the unique object table and the occurrence table to maintain the connections between videos and objects. However, the creation process is not scalable and dynamic because it heavily depends on human labeling. To improve this, we propose to use detected visual objects to create these two tables for frame-level video filtering. Our study begins with investigating the existing object detection techniques. After that, we find object detection lacks the identification and connection abilities to accomplish the creation process alone. To supply these abilities, we further investigate three candidates, namely, recognizing-based, matching-based and tracking-based methods, to work with the object detection. Through analyzing the mechanism and evaluating the accuracy, we find that they are imperfect for identifying or connecting the visual objects. Accordingly, we propose a novel hybrid method that combines the matching-based and tracking-based methods to overcome the limitations. Our experiments show that the proposed method achieves higher accuracy and efficiency than the candidate methods. The subsequent analysis shows that the proposed method can efficiently support the frame-level video filtering using visual objects.

Original languageEnglish
Pages (from-to)1259-1284
Number of pages26
JournalWorld Wide Web
Volume21
Issue number5
DOIs
Publication statusPublished - 1 Sept 2018
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2017, Springer Science+Business Media, LLC.

Keywords

  • Accuracy and effciency evaluation
  • Frame-level video filtering
  • Visual object

Fingerprint

Dive into the research topics of 'Exploiting detected visual objects for frame-level video filtering'. Together they form a unique fingerprint.

Cite this