A multi-modal dialogue system for information navigation and retrieval across spoken document archives with topic hierarchies

被引:0
|
作者
Pan, YC [1 ]
Wang, CC [1 ]
Hsieh, YC [1 ]
Lee, TH [1 ]
Lee, YS [1 ]
Fu, YS [1 ]
Huang, YT [1 ]
Lee, LS [1 ]
机构
[1] Natl Taiwan Univ, Grad Inst Comp Sci & Informat Engn, Taipei, Taiwan
来源
2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) | 2005年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unlike the written documents, the spoken documents are difficult to be shown on the screen and browsed by the user during retrieval. In this paper, we propose to use multi-modal dialogues to help the user to "navigate" across the spoken document archives and retrieve the desired documents based on a topic hierarchy constructed by the key terms extracted from the retrieved spoken documents. An initial prototype system for such functions has been developed, in which the broadcast news in Mandarin Chinese was taken as the example spoken documents, and the Named Entities (NEs) are taken as the key terms to construct the topic hierarchy.
引用
收藏
页码:375 / 380
页数:6
相关论文
共 50 条
  • [1] A Spoken Dialogue System for Document Information Retrieval Utilizing Topic Knowledge
    Kiriyama, S., 1600, John Wiley and Sons Inc. (35):
  • [2] Multi-modal Information Integration for Document Retrieval
    Hassan, Ehtesham
    Chaudhury, Santanu
    Gopal, M.
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 1200 - 1204
  • [3] AUTOMATIC TOPIC DETECTION STRATEGY FOR INFORMATION RETRIEVAL IN SPOKEN DOCUMENT
    Jin, Shan
    Misra, Hemant
    Sikora, Thomas
    Jose, Joemon
    2009 10TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES, 2009, : 300 - +
  • [4] Architecture of multi-modal dialogue system
    Fuchs, M
    Hejda, P
    Slavík, P
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 433 - 438
  • [5] Clip Retrieval using Multi-modal Biometrics in Meeting Archives
    Vajaria, Himanshu
    Sarkar, Sudeep
    Kasturi, Rangachar
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 58 - 61
  • [6] Multi-modal information retrieval using FINT
    van Zaanen, M
    de Croon, G
    MULTILINGUAL INFORMATION ACCESS FOR TEXT, SPEECH AND IMAGES, 2005, 3491 : 728 - +
  • [7] Multi-modal information retrieval with a semantic view mechanism
    Li, Q
    Yang, J
    Zhuang, YT
    19TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 1, PROCEEDINGS: AINA 2005, 2005, : 133 - 138
  • [8] SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval
    Wu, Siwei
    Li, Yizhi
    Zhu, Kang
    Zhang, Ge
    Liang, Yiming
    Ma, Kaijing
    Xiao, Chenghao
    Zhang, Haoran
    Yang, Bohao
    Chen, Wenhu
    Huang, Wenhao
    Al Moubayed, Noura
    Fu, Jie
    Lin, Chenghua
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 12560 - 12574
  • [9] Multi-modal Dialogue System with Sign Language Capabilities
    Hruz, M.
    Campr, P.
    Krnoul, Z.
    Zelezny, M.
    Aran, Oya
    Santemiz, Pinar
    ASSETS 11: PROCEEDINGS OF THE 13TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2011, : 265 - 266
  • [10] Utilizing Multi-modal Emotion Information in Dialogue Strategy Classification
    Jang, Jin Yea
    Kim, Jieun
    Jung, Minyoung
    Jung, Hyedong
    Shin, Saim
    11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 943 - 945