Automated recovery of damaged audio files using deep neural networks

被引:7
作者
Heo, Hee-Soo [1 ]
So, Byung-Min [2 ]
Yang, IL-Ho [1 ]
Yoon, Sung-Hyun [1 ]
Yu, Ha-Jin [1 ]
机构
[1] Univ Seoul, Sch Comp Sci, Coll Engn, 163 Siripdae Ro, Seoul 02504, South Korea
[2] Supreme Prosecutors Off, 157 Banpo Daero, Seoul 06590, South Korea
关键词
Audio files; Automated recovery; Deep neural networks; File carving; Long short-term memory;
D O I
10.1016/j.diin.2019.07.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose two methods to recover damaged audio files using deep neural networks. The presented audio file recovery methods differ from the conventional file carving-based recovery method because the former restore lost data, which are difficult to recover with the latter method. This research suggests that recovery tasks, which are essential yet very difficult or very time consuming, can be automated with the proposed recovery methods using deep neural networks. We apply feed-forward and Long Short Term Memory neural networks for the tasks. The experimental results show that deep neural networks can distinguish speech signals from non-speech signals, and can also identify the encoding methods of the audio files at the level of bits. This leads to successful recovery of the damaged audio files, which are otherwise difficult to recover using the conventional file-carving-based methods. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:117 / 126
页数:10
相关论文
共 14 条
[1]  
Abadi M., 2015, P 12 USENIX S OPERAT
[2]  
Chollet F., 2015, KERAS OTHERS
[3]  
Fant G., 1971, Acoustic Theory of Speech Production: With Calculations Based on X-ray Studies of Russian Articulations
[4]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[5]  
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[6]  
Hewlett D, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1535
[7]  
Hinton G., 2012, COURSERA VIDEO LECT
[8]   Deep Neural Networks for Acoustic Modeling in Speech Recognition [J].
Hinton, Geoffrey ;
Deng, Li ;
Yu, Dong ;
Dahl, George E. ;
Mohamed, Abdel-rahman ;
Jaitly, Navdeep ;
Senior, Andrew ;
Vanhoucke, Vincent ;
Patrick Nguyen ;
Sainath, Tara N. ;
Kingsbury, Brian .
IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) :82-97
[9]  
Ioffe S, 2015, 32 INT C MACH LEARN
[10]  
Kingma DP, 2014, ARXIV