Automatic Keyphrase Extraction and Segmentation of Video Lectures

Cited by: 0
Authors
Balagopalan, Arun [1]
Balasubramanian, Lalitha Lakshmi [1]
Balasubramanian, Vidhya [1]
Chandrasekharan, Nithin [1]
Damodar, Aswin [1]
Affiliation
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Engn, Dept Comp Sci & Engn, Coimbatore, Tamil Nadu, India
Source
2012 IEEE INTERNATIONAL CONFERENCE ON TECHNOLOGY ENHANCED EDUCATION (ICTEE 2012) | 2012
Keywords
Automatic keyphrase extraction; meta-data extraction; lecture browser; segmentation; video lectures;
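DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812
Abstract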
Keyphrases are essential meta-data that summarize the contents of an instructional video. In this paper, we present a domain-independent, statistical approach for automatic keyphrase extraction from audio transcripts of video lectures. We identify new features in audio transcripts that capture key patterns characterizing keyphrases in lecture videos. We design a keyphrase extraction system that uses a supervised machine learning algorithm, based on a Naive Bayes classifier, to extract relevant keyphrases. Our extensive experimental studies show that our system extracts more relevant keyphrases than existing approaches. The paper also evaluates the performance of the proposed keyphrase extraction method for different categories of lectures. The extracted keyphrases are then used as features for automatic topic-based segmentation of the video lectures. This process of automatic keyphrase extraction and segmentation results in a section-wise annotated video lecture that can be viewed effectively in a lecture browser.
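As a rough illustration of the classification step described in the abstract, the following is a minimal sketch of a Naive Bayes scorer for candidate phrases. It is not the authors' implementation. The two features (a TF-IDF bucket and an "occurs early in the transcript" flag), the function names, and the toy training data are all assumptions chosen for illustration; the abstract does not list the paper's actual features.

```python
import math
from collections import defaultdict


def train_naive_bayes(examples):
    """Count class and feature-value frequencies.

    examples: list of (features, label) pairs, where features is a tuple of
    discrete values and label is True for a keyphrase, False otherwise.
    """
    class_counts = defaultdict(int)
    feature_counts = defaultdict(int)  # (label, feature index, value) -> count
    values = defaultdict(set)          # feature index -> values seen in training
    for features, label in examples:
        class_counts[label] += 1
        for i, v in enumerate(features):
            feature_counts[(label, i, v)] += 1
            values[i].add(v)
    return class_counts, feature_counts, values


def keyphrase_probability(model, features):
    """Return P(keyphrase | features) using Laplace-smoothed estimates."""
    class_counts, feature_counts, values = model
    total = sum(class_counts.values())
    log_scores = {}
    for label in (True, False):
        # Smoothed class prior over the two classes.
        score = math.log((class_counts[label] + 1) / (total + 2))
        for i, v in enumerate(features):
            numerator = feature_counts[(label, i, v)] + 1
            denominator = class_counts[label] + len(values[i] | {v})
            score += math.log(numerator / denominator)
        log_scores[label] = score
    # Normalize in log space to avoid underflow.
    top = max(log_scores.values())
    weights = {k: math.exp(s - top) for k, s in log_scores.items()}
    return weights[True] / (weights[True] + weights[False])


# Hypothetical training data: (tfidf_bucket, occurs_early) -> is_keyphrase.
training = [
    (("high", True), True),
    (("high", True), True),
    (("high", False), True),
    (("low", False), False),
    (("low", True), False),
    (("low", False), False),
    (("high", False), False),
]
model = train_naive_bayes(training)
likely = keyphrase_probability(model, ("high", True))
unlikely = keyphrase_probability(model, ("low", False))
```

In a setup like this, candidate phrases would be ranked by the returned probability and the top-scoring ones kept as keyphrases; the cutoff is a design choice the abstract does not specify.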
Pages: 10