A vector-based approach to broadcast audio database indexing and retrieval

被引:0
|
作者
Wang, Lei [1 ]
Li, Haizhou [1 ]
Chng, Eng Siong [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel framework to index and retrieve audio content from broadcast database that contains both speech and music. In this framework, we model the acoustic events using hidden Markov models, which are then used to decode the audio content. The decoding results in the form of acoustic token sequence and acoustic lattice are used to generate features for indexing and retrieval with the vector space model. Experiments were carried out on the TRECVID database and the results showed that the proposed framework is effective in audio information retrieval. The results also showed that the features generated from the acoustic lattice provide more accurate information than token sequence.
引用
收藏
页码:512 / 515
页数:4
相关论文
共 50 条
  • [1] Efficient color image indexing and retrieval using a vector-based scheme
    Androutsos, D
    Venetsanopoulos, AN
    Plataniotis, KN
    1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 15 - 20
  • [2] Indexing and retrieval of broadcast news
    Renals, S
    Abberley, D
    Kirby, D
    Robinson, T
    SPEECH COMMUNICATION, 2000, 32 (1-2) : 5 - 20
  • [3] Audio indexing of Arabic broadcast news
    Billa, J
    Noamany, M
    Srivastava, A
    Liu, D
    Stone, R
    Xu, J
    Makhoul, J
    Kubala, F
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 5 - 8
  • [4] Indexing and Retrieval of Audio: A Survey
    Goujun Lu
    Multimedia Tools and Applications, 2001, 15 : 269 - 290
  • [5] Indexing and retrieval of audio: A survey
    Lu, GJ
    MULTIMEDIA TOOLS AND APPLICATIONS, 2001, 15 (03) : 269 - 290
  • [6] An Audio Indexing and Retrieval Approach using a Video Surveillance Ontology
    Kazi Tani, Mohammed Yassine
    Ghomari, Abdelghani
    Dali Youcef, Lamia
    Lablack, Adel
    Bilasco, Ioan Marius
    2017 COMPUTING CONFERENCE, 2017, : 258 - 261
  • [7] A generic audio classification and segmentation approach for multimedia indexing and retrieval
    Kiranyaz, S
    Qureshi, AF
    Gabbouj, M
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 1062 - 1081
  • [8] A novel vector-based approach to color image retrieval using a vector angular-based distance measure
    Androutsos, D
    Plataniotis, KN
    Venetsanopoulos, AN
    COMPUTER VISION AND IMAGE UNDERSTANDING, 1999, 75 (1-2) : 46 - 58
  • [9] Novel vector-based approach to color image retrieval using a vector angular-based distance measure
    Androutsos, D.
    Plataniotis, K.N.
    Venetsanopoulos, A.N.
    Computer Vision and Image Understanding, 1999, 75 (01): : 46 - 58
  • [10] Image indexing and retrieval based on vector quantization
    Teng, Shyh Wei
    Lu, Guojun
    PATTERN RECOGNITION, 2007, 40 (11) : 3299 - 3316