A rule-based obfuscating focused crawler in the audio retrieval domain

被引:1
作者
Montanaro, Marco [1 ]
Rinaldi, Antonio Maria [1 ]
Russo, Cristiano [1 ]
Tommasino, Cristian [1 ]
机构
[1] Univ Naples Federico II, Dept Elect Engn & Informat Technol, Via Claudio 21, I-80125 Naples, Italy
关键词
Audio retrieval; Information retrieval; Deep neural networks; Web crawler; Focused crawler; Indexing; Pattern recognition; Copyright infringement; MULTIMEDIA; SEARCH; CLASSIFICATION;
D O I
10.1007/s11042-023-16155-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The detection of violations of intellectual properties on multimedia files is a critical problem for the current infrastructure of the Internet, especially within very large document collections. To contrast such a problem, either proactive or reactive methods are used. The first category prevents the upload of infringing files themselves by comparing illegal files with a reference collection, while the second one responds to reports made by third parties or artificial intelligence systems in order to delete files deemed illegal. In this article we propose an approach that is both reactive and proactive at the same time, with the aim of preventing the deletion of legal uploads of files (or modifications of such files, such as remixes, parodies and other edits) due to the presence of illegal uploads on a platform. We developed a rule-based obfuscating focused crawler able to work with audio files in the Audio Information Retrieval (AIR) domain, but its use can be easily extended to other multimedia file types, such as videos or textual documents. Our proposed model automatically scans multimedia files uploaded to the public collection only when a user query is submitted to it. We will also show experimental results obtained during tests on a known musical collection. Several combinations of specific Neural Network-Similarity Scorer solutions are shown, and we will discuss the strength and efficiency of each combination.
引用
收藏
页码:25231 / 25260
页数:30
相关论文
共 53 条
[1]   A State-of-the-Art Survey on Deep Learning Theory and Architectures [J].
Alom, Md Zahangir ;
Taha, Tarek M. ;
Yakopcic, Chris ;
Westberg, Stefan ;
Sidike, Paheding ;
Nasrin, Mst Shamima ;
Hasan, Mahmudul ;
Van Essen, Brian C. ;
Awwal, Abdul A. S. ;
Asari, Vijayan K. .
ELECTRONICS, 2019, 8 (03)
[2]  
[Anonymous], 1997, Neural-network models of cognition, DOI DOI 10.1016/S0166-4115(97)80111-2
[3]  
Baeza-Yates R, 1999, Modern information retrieval, V463
[4]  
Bartlett P, 2012, 26 ANN C NEURAL INFO
[5]  
Beckert S, 2018, Columbia Stud Hist U, P1
[6]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[7]  
Bogert B.P., 1963, Proceedings of the Symposium on Time Series Analysis, V15, P209
[8]  
Bokhari M.U., 2013, INT J COMPUTER APPL, V74
[9]   Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment [J].
Bosse, Sebastian ;
Maniry, Dominique ;
Mueller, Klaus-Robert ;
Wiegand, Thomas ;
Samek, Wojciech .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (01) :206-219
[10]   In Google we trust? [J].
Burguet, Roberto ;
Caminal, Ramon ;
Ellman, Matthew .
INTERNATIONAL JOURNAL OF INDUSTRIAL ORGANIZATION, 2015, 39 :44-55