SUPERVISED DEEP HASHING FOR EFFICIENT AUDIO EVENT RETRIEVAL

被引:0
作者
Jati, Arindam [1 ]
Emmanouilidou, Dimitra [2 ]
机构
[1] Univ Southern Calif, Los Angeles, CA 90007 USA
[2] Microsoft Res, Audio & Acoust Res Grp, Redmond, WA USA
来源
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020年
关键词
Audio events; efficient retrieval; hashing; quantization; deep neural network; IMAGE RETRIEVAL; CLASSIFICATION; QUANTIZATION;
D O I
10.1109/icassp40776.2020.9053766
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Efficient retrieval of audio events can facilitate real-time implementation of numerous query and search-based systems. This work investigates the potency of different hashing techniques for efficient audio event retrieval. Multiple state-of-the-art weak audio embeddings are employed for this purpose. The performance of four classical unsupervised hashing algorithms is explored as part of off-the-shelf analysis. Then, we propose a partially supervised deep hashing framework that transforms the weak embeddings into a low-dimensional space while optimizing for efficient hash codes. The model uses only a fraction of the available labels and is shown here to significantly improve the retrieval accuracy on two widely employed audio event datasets. The extensive analysis and comparison between supervised and unsupervised hashing methods presented here, give insights on the quantizability of audio embeddings. This work provides a first look in efficient audio event retrieval systems and hopes to set baselines for future research.
引用
收藏
页码:4497 / 4501
页数:5
相关论文
共 27 条
[1]  
[Anonymous], 2008, P ADV NEURAL INFORM
[2]  
[Anonymous], P IEEE INT C AC SPEE
[3]   The Inverted Multi-Index [J].
Babenko, Artem ;
Lempitsky, Victor .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (06) :1247-1260
[4]  
Cai D., 2016, ARXIV161207545
[5]  
Cao Y, 2016, AAAI CONF ARTIF INTE, P3457
[6]  
Charikar M.S., 2002, P 34 ANN ACM S THEOR, P380, DOI DOI 10.1145/509907.509965
[7]  
Fonseca E., 2018, P WORKSH DET REC WIL, P69
[8]  
Gemmeke J. F., P IEEE INT C AC SPEE
[9]   Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval [J].
Gong, Yunchao ;
Lazebnik, Svetlana ;
Gordo, Albert ;
Perronnin, Florent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) :2916-2929
[10]   Quantization [J].
Gray, RM ;
Neuhoff, DL .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (06) :2325-2383