End-to-end active object tracking via reinforcement learning

Wenhan Luo, Peng Sun, Fangwei Zhong, Wei Liu, Tong Zhang, Yizhou Wang

Research output: Chapter in Book/Conference Proceeding/ReportConference Paper published in a bookpeer-review

27 Citations (Scopus)

Abstract

We study active object tracking, where a tracker takes as input the visual observation (i.e., frame sequence) and produces the camera control signal (e.g., move forward, turn left, etc.). Conventional methods tackle the tracking and the camera control separately, which is challenging to tune jointly. It also incurs many human efforts for labeling and many expensive trial-and-errors in real- world. To address these issuer we propose, in this paper, an end-to-end solution via deep reinforcement learning, where a ConvNet-LSTM function approximater is adopted for the direct frame-to- action prediction. We further propose an environment augmentation technique and a customized reward function, which are crucial for a successful training. The tracker trained in simulators (ViZ- Doom, Unreal Engine) shows good generalization in the case of unseen object moving path, unseen object appearance, unseen background, and distracting object. It can restore tracking when occasionally losing the target. With the experiments over the VOT dataset, we also find that die tracking ability, obtained solely from simulators, can potentially transfer to real-world scenarios.

Original languageEnglish
Title of host publication35th International Conference on Machine Learning, ICML 2018
EditorsJennifer Dy, Andreas Krause
PublisherInternational Machine Learning Society (IMLS)
Pages5191-5200
Number of pages10
ISBN (Electronic)9781510867963
Publication statusPublished - 2018
Externally publishedYes
Event35th International Conference on Machine Learning, ICML 2018 - Stockholm, Sweden
Duration: 10 Jul 201815 Jul 2018

Publication series

Name35th International Conference on Machine Learning, ICML 2018
Volume7

Conference

Conference35th International Conference on Machine Learning, ICML 2018
Country/TerritorySweden
CityStockholm
Period10/07/1815/07/18

Bibliographical note

Publisher Copyright:
© Copyright 2018 by the author(s).

Fingerprint

Dive into the research topics of 'End-to-end active object tracking via reinforcement learning'. Together they form a unique fingerprint.

Cite this