Unsupervised Video Hashing via Deep Neural Network

被引:0
作者
Chao Ma
Yun Gu
Chen Gong
Jie Yang
Deying Feng
机构
[1] Shanghai Jiao Tong University,Institute of Image Processing and Pattern Recognition
[2] Nanjing University of Science and Technology,School of Computer Science and Engineering
[3] Liaocheng University,undefined
来源
Neural Processing Letters | 2018年 / 47卷
关键词
Video hashing; Unsupervised method; Deep neural network; Spatio-temporal feature;
D O I
暂无
中图分类号
学科分类号
摘要
Hashing is a common solution for content-based multimedia retrieval by encoding high-dimensional feature vectors into short binary codes. Previous works mainly focus on image hashing problem. However, these methods can not be directly used for video hashing, as videos contain not only spatial structure within each frame, but also temporal correlation between successive frames. Several researchers proposed to handle this by encoding the extracted key frames, but these frame-based methods are time-consuming in real applications. Other researchers proposed to characterize the video by averaging the spatial features of frames and then the existing hashing methods can be adopted. Unfortunately, the sort of “video” features does not take the correlation between frames into consideration and may lead to the loss of the temporal information. Therefore, in this paper, we propose a novel unsupervised video hashing framework via deep neural network, which performs video hashing by incorporating the temporal structure as well as the conventional spatial structure. Specially, the spatial features of videos are obtained by utilizing convolutional neural network, and the temporal features are established via long-short term memory. After that, the time series pooling strategy is employed to obtain the single feature vector for each video. The obtained spatio-temporal feature can be applied to many existing unsupervised hashing methods. Experimental results on two real datasets indicate that by employing the spatio-temporal features, our hashing method significantly improves the performance of existing methods which only deploy the spatial features, and meanwhile obtains higher mean average precision compared with the state-of-the-art video hashing methods.
引用
收藏
页码:877 / 890
页数:13
相关论文
共 50 条
[11]   Target Code Guided Binary Hashing Representations with Deep Neural Network [J].
Wang, Yunbo ;
Cao, Dong ;
Sun, Zhenan .
PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, :530-535
[12]   A Classification Retrieval Method for Encrypted Speech Based on Deep Neural Network and Deep Hashing [J].
Zhang, Qiuyu ;
Zhao, Xuejiao ;
Hu, Yingjie .
IEEE ACCESS, 2020, 8 :202469-202482
[13]   HITS : Binarizing physiological time series with deep hashing neural network [J].
Fu, Zhaoji ;
Wang, Can ;
Wei, Guodong ;
Zhang, Wenrui ;
Du, Shaofu ;
Hong, Shenda .
PATTERN RECOGNITION LETTERS, 2022, 156 :23-28
[14]   Structure Guided Deep Neural Network for Unsupervised Active Learning [J].
Li, Changsheng ;
Ma, Handong ;
Yuan, Ye ;
Wang, Guoren ;
Xu, Dong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 :2767-2781
[15]   Unsupervised Speech Denoising Method based on Deep Neural Network [J].
Chen, Xiaohan .
2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2018, :254-258
[16]   DeepCoder: A Deep Neural Network Based Video Compression [J].
Chen, Tong ;
Liu, Haojie ;
Shen, Qiu ;
Yue, Tao ;
Cao, Xun ;
Ma, Zhan .
2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
[17]   Feature Extraction of Video Using Deep Neural Network [J].
Hayakawa, Yoshihiro ;
Oonuma, Takanori ;
Kobayashi, Hideyuki ;
Takahashi, Akiko ;
Chiba, Shinji ;
Fujiki, Nahomi M. .
2016 IEEE 15TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC), 2016, :465-470
[18]   Pedestrian Detection Based on Deep Neural Network in Video Surveillance [J].
Zhang, Bo ;
Guo, Ke ;
Yang, Yunxiang ;
Guo, Jing ;
Zhang, Xueying ;
Hu, Xiaocheng ;
Jiang, Yinan ;
Zhang, Xinhai .
COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 :113-120
[19]   Unsupervised Adaptation for Deep Neural Network using Linear Least Square Method [J].
Hsiao, Roger ;
Ng, Tim ;
Tsakalidis, Stavros ;
Nguyen, Long ;
Schwartz, Richard .
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, :2887-2891
[20]   Deep associative neural network for associative memory based on unsupervised representation learning [J].
Liu, Jia ;
Gong, Maoguo ;
He, Haibo .
NEURAL NETWORKS, 2019, 113 :41-53