TY - JOUR
T1 - Spectral multimodal hashing and its application to multimedia retrieval
AU - Zhen, Yi
AU - Gao, Yue
AU - Yeung, Dit Yan
AU - Zha, Hongyuan
AU - Li, Xuelong
N1 - Publisher Copyright:
© 2015 IEEE.
PY - 2016/1
Y1 - 2016/1
N2 - In recent years, multimedia retrieval has sparked much research interest in the multimedia, pattern recognition, and data mining communities. Although some attempts have been made along this direction, performing fast multimodal search at very large scale still remains a major challenge in the area. While hashing-based methods have recently achieved promising successes in speeding-up large-scale similarity search, most existing methods are only designed for uni-modal data, making them unsuitable for multimodal multimedia retrieval. In this paper, we propose a new hashing-based method for fast multimodal multimedia retrieval. The method is based on spectral analysis of the correlation matrix of different modalities. We also develop an efficient algorithm that learns some parameters from the data distribution for obtaining the binary codes. We empirically compare our method with some state-of-the-art methods on two real-world multimedia data sets.
AB - In recent years, multimedia retrieval has sparked much research interest in the multimedia, pattern recognition, and data mining communities. Although some attempts have been made along this direction, performing fast multimodal search at very large scale still remains a major challenge in the area. While hashing-based methods have recently achieved promising successes in speeding-up large-scale similarity search, most existing methods are only designed for uni-modal data, making them unsuitable for multimodal multimedia retrieval. In this paper, we propose a new hashing-based method for fast multimodal multimedia retrieval. The method is based on spectral analysis of the correlation matrix of different modalities. We also develop an efficient algorithm that learns some parameters from the data distribution for obtaining the binary codes. We empirically compare our method with some state-of-the-art methods on two real-world multimedia data sets.
KW - Hash function learning
KW - Multimedia retrieval
KW - Spectral multimodal hashing (SMH)
UR - https://www.webofscience.com/wos/woscc/full-record/WOS:000367144300004
UR - https://openalex.org/W1920291194
UR - https://www.scopus.com/pages/publications/84961083586
U2 - 10.1109/TCYB.2015.2392052
DO - 10.1109/TCYB.2015.2392052
M3 - Journal Article
SN - 2168-2267
VL - 46
SP - 27
EP - 38
JO - IEEE Transactions on Cybernetics
JF - IEEE Transactions on Cybernetics
IS - 1
M1 - 7163570
ER -