WIKITAG: WIKIPEDIA-BASED KNOWLEDGE EMBEDDINGS TOWARDS IMPROVED ACOUSTIC EVENT CLASSIFICATION

被引:3
|
作者
Zhang, Qin [1 ]
Tang, Qingming [1 ]
Kao, Chieh-Chi [1 ]
Sun, Ming [1 ]
Liu, Yang [1 ]
Wang, Chao [1 ]
机构
[1] Amazon Inc, Seattle, WA 98109 USA
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
关键词
Audio classification; Wikipedia; semantic embedding; multi-view learning; AudioSet;
D O I
10.1109/ICASSP43922.2022.9747648
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Acoustic event classification (AEC) is the task of determining whether certain events occur in an audio clip. Inspired by previous research [1, 2, 3] that embeddings from event labels can be leveraged to facilitate the learning of new detectors with no or limited audio samples, we introduce Wikipedia-based text embeddings as auxiliary information to improve AEC. We describe how to extract label embeddings from multiple Wikipedia texts, and formulate the multi-view aligned AEC problem based on VGGish model. We show that our "wikiTAG" embeddings encode rich semantic information and are more informative than label embeddings for AEC tasks. Compared to a supervised baseline on AudioSet, the multiview model with "wikiTAG" embeddings achieves 7.3% and 1.3% relative improvement in mean average precision (mAP) using 10% and full AudioSet for training, respectively. To the author's knowledge, this is the first work in the AEC domain on building large-scale label representations by leveraging Wikipedia data in a systematic fashion.
引用
收藏
页码:136 / 140
页数:5
相关论文
共 7 条
  • [1] Towards perfect text classification with Wikipedia-based semantic Naive Bayes learning
    Kim, Han-joon
    Kim, Jiyun
    Kim, Jinseog
    Lim, Pureum
    NEUROCOMPUTING, 2018, 315 : 128 - 134
  • [2] Biomedical literature classification using encyclopedic knowledge: a Wikipedia-based bag-of-concepts approach
    Mourino Garcia, Marcos Antonio
    Perez Rodriguez, Roberto
    Anido Rifon, Luis E.
    PEERJ, 2015, 3
  • [3] Open domain question answering using Wikipedia-based knowledge model
    Ryu, Pum-Mo
    Jang, Myung-Gil
    Kim, Hyun-Ki
    INFORMATION PROCESSING & MANAGEMENT, 2014, 50 (05) : 683 - 692
  • [4] A wikipedia-based semantic relatedness framework for effective dimensions classification in online reputation management
    Qureshi, M. Atif
    Younus, Arjumand
    O'Riordan, Colm
    Pasi, Gabriella
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2018, 9 (05) : 1403 - 1413
  • [5] A wikipedia-based semantic relatedness framework for effective dimensions classification in online reputation management
    M. Atif Qureshi
    Arjumand Younus
    Colm O’Riordan
    Gabriella Pasi
    Journal of Ambient Intelligence and Humanized Computing, 2018, 9 : 1403 - 1413
  • [6] SoundSemantics: Exploiting Semantic Knowledge in Text for Embedded Acoustic Event Classification
    Islam, Md Tamzeed
    Nirjon, Shahriar
    IPSN '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INFORMATION PROCESSING IN SENSOR NETWORKS, 2019, : 217 - 228
  • [7] Event Detection in Wikipedia Edit History Improved by Documents Web Based Automatic Assessment
    Fisichella, Marco
    Ceroni, Andrea
    BIG DATA AND COGNITIVE COMPUTING, 2021, 5 (03)