Video handling with music and speech detection

被引:46
|
作者
Minami, K [1 ]
Akutsu, A [1 ]
Hamada, H [1 ]
Tonomura, Y [1 ]
机构
[1] Nippon Telegraph & Tel Corp, Human Interface Labs, Kanagawa 2390847, Japan
关键词
D O I
10.1109/93.713301
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The audio-based approach to video indexing described here detects music and speech independently even when they occur simultaneously. The indexed video segments, when presented on the Video Sound Browser, let users randomly access the video. The Video in Time system provides different video condensation levels based on video structuring that can link the video segments and the director's intentions.
引用
收藏
页码:17 / 25
页数:9
相关论文
共 50 条
  • [41] Music video hybrids
    Moltenbrey, K
    COMPUTER GRAPHICS WORLD, 2004, 27 (09) : 2 - 2
  • [42] Music training, music aptitude, and speech perception
    Schellenberg, E. Glenn
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (08) : 2783 - 2784
  • [43] MUSIC MODELS FOR MUSIC-SPEECH SEPARATION
    Hughes, Thad
    Kristjansson, Trausti
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4917 - 4920
  • [44] MUSIC MODELS FOR MUSIC-SPEECH SEPARATION
    Hughes, Thad
    Kristjansson, Trausti
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4917 - 4920
  • [46] Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface
    Futoshi Asano
    Kiyoshi Yamamoto
    Isao Hara
    Jun Ogata
    Takashi Yoshimura
    Yoichi Motomura
    Naoyuki Ichimura
    Hideki Asoh
    EURASIP Journal on Advances in Signal Processing, 2004
  • [47] Detection and separation of speech event using audio and video information fusion and its application to robust speech interface
    Asano, F
    Yamamoto, K
    Hara, I
    Ogata, J
    Yoshimura, T
    Motomura, Y
    Ichimura, N
    Asoh, H
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (11) : 1727 - 1738
  • [48] Music Video as postmodern format: The rupture of audiovisual codes through music video
    Rodriguez-Lopez, Jennifer
    DOXA COMUNICACION, 2016, (22): : 13 - 30
  • [49] The Music Video Value Chain: Music Video into Cultural Industries' Commercial Circuits
    Rodriguez-Lopez, Jennifer
    Aguaded-Gomez, Ignacio
    ADCOMUNICA-REVISTA CIENTIFICA DE ESTRATEGIAS TENDENCIAS E INNOVACION EN COMMUNICACION, 2015, (09): : 119 - 132
  • [50] MUSICAL CINEMA, MUSIC VIDEO, MUSIC TELEVISION
    ALLAN, B
    FILM QUARTERLY, 1990, 43 (03) : 2 - 14