TY - JOUR
T1 - CDTD
T2 - A Large-Scale Cross-Domain Benchmark for Instance-Level Image-to-Image Translation and Domain Adaptive Object Detection
AU - Shen, Zhiqiang
AU - Huang, Mingyang
AU - Shi, Jianping
AU - Liu, Zechun
AU - Maheshwari, Harsh
AU - Zheng, Yutong
AU - Xue, Xiangyang
AU - Savvides, Marios
AU - Huang, Thomas S.
N1 - Publisher Copyright:
© 2020, Springer Science+Business Media, LLC, part of Springer Nature.
PY - 2021/3
Y1 - 2021/3
N2 - Cross-domain visual problems, such as image-to-image translation and domain adaptive object detection, have attracted increasing attentions in the last few years, and also become new rising and challenging directions for the computer vision community. Recently, despite enormous efforts of the field in data collection, there are still few datasets covering the instance-level image-to-image translation and domain adaptive object detection tasks simultaneously. In this work, we introduce a large-scale cross-domain benchmark CDTD (contains 155,529 high-resolution natural images across four different modalities with object bounding box annotations. A summary of the entire dataset is provided in the following sections. Dataset is available at: http://zhiqiangshen.com/projects/INIT/index.html.) for the new instance-level translation and object detection tasks. We provide comprehensive baseline results of the benchmark on both of these two tasks. Moreover, we proposed a novel instance-level image-to-image translation approach called INIT and a gradient detach method for the domain adaptive object detection to harvest and exert dataset’s function of the instance level annotations across different domains.
AB - Cross-domain visual problems, such as image-to-image translation and domain adaptive object detection, have attracted increasing attentions in the last few years, and also become new rising and challenging directions for the computer vision community. Recently, despite enormous efforts of the field in data collection, there are still few datasets covering the instance-level image-to-image translation and domain adaptive object detection tasks simultaneously. In this work, we introduce a large-scale cross-domain benchmark CDTD (contains 155,529 high-resolution natural images across four different modalities with object bounding box annotations. A summary of the entire dataset is provided in the following sections. Dataset is available at: http://zhiqiangshen.com/projects/INIT/index.html.) for the new instance-level translation and object detection tasks. We provide comprehensive baseline results of the benchmark on both of these two tasks. Moreover, we proposed a novel instance-level image-to-image translation approach called INIT and a gradient detach method for the domain adaptive object detection to harvest and exert dataset’s function of the instance level annotations across different domains.
KW - Cross-domain benchmark
KW - Domain adaptive object detection
KW - Instance level image-to-image translation
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000592014000002
UR - https://openalex.org/W3106734733
UR - https://www.scopus.com/pages/publications/85096439050
U2 - 10.1007/s11263-020-01394-z
DO - 10.1007/s11263-020-01394-z
M3 - Journal Article
SN - 0920-5691
VL - 129
SP - 761
EP - 780
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
IS - 3
ER -