Video Transcript Indexing and Retrieval Procedure

Cited by: 1
Authors
Turcu, Gabi [1]
Mihaescu, Marian Cristian [1]
Heras, Stella [2]
Palanca, Javier [2]
Julian, Vicente [2]
Affiliations
[1] Univ Craiova, Fac Automat Comp & Elect, Craiova, Romania
[2] Univ Politecn Valencia, Sistemas Informat & Computac, Valencia, Spain
Source
2019 27TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM) | 2019
Keywords
LDA; Spanish video transcripts; retrieval
DOI
10.23919/softcom.2019.8903790
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Finding appropriate e-learning resources within a repository of videos is critical for students. Given that transcripts are available for the entire set of videos, the problem reduces to obtaining a ranked list of video transcripts for a particular query. The paper presents a custom approach for searching the 16,012 video transcripts available from Media Website at Universitat Politecnica de Valencia. The proposed solution builds a bag-of-words representation, computes TF-IDF scores, clusters the transcripts, and builds a Latent Dirichlet Allocation (LDA) model for each cluster. An inherent difficulty of the problem is that the transcripts are in Spanish. The experimental results are satisfactory, especially when compared with the currently existing search mechanism.
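The first two stages named in the abstract (bag-of-words and TF-IDF scoring) can be illustrated with a minimal sketch that ranks transcripts against a query by cosine similarity. This is not the authors' implementation: the clustering and per-cluster LDA stages are omitted, and the function names, the smoothed IDF formula, the whitespace tokenizer, and the three toy Spanish transcripts are all assumptions made for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (assumption: a real system would
    also strip punctuation and remove Spanish stop words)."""
    return text.lower().split()


def vectorize(tokens, idf):
    """Bag-of-words counts weighted by IDF; terms unseen in the corpus are dropped."""
    counts = Counter(tokens)
    return {term: tf * idf[term] for term, tf in counts.items() if term in idf}


def build_tfidf(docs):
    """Return (TF-IDF vectors, IDF table) for a list of tokenized documents."""
    n = len(docs)
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))  # document frequency: count each term once per doc
    # Smoothed IDF (an assumption of this sketch, not taken from the paper).
    idf = {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}
    return [vectorize(tokens, idf) for tokens in docs], idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(query, transcripts):
    """Return (index, score) pairs for transcripts with a non-zero score,
    best match first; ties are broken by index."""
    docs = [tokenize(t) for t in transcripts]
    vectors, idf = build_tfidf(docs)
    query_vec = vectorize(tokenize(query), idf)
    scored = [(i, cosine(query_vec, vec)) for i, vec in enumerate(vectors)]
    scored = [(i, s) for i, s in scored if s > 0.0]
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Hypothetical toy transcripts, invented for this example.
transcripts = [
    "introducción a las redes neuronales y aprendizaje profundo",
    "álgebra lineal matrices y vectores",
    "redes de computadores y protocolos de comunicación",
]
results = rank("redes neuronales", transcripts)
print(results)  # transcript 0 ranks first; transcript 1 shares no query term
```

In a fuller pipeline along the lines the abstract describes, the TF-IDF vectors would feed a clustering step and a separate LDA model would be fitted per cluster; those stages are left out here to keep the sketch dependency-free.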
Pages: 48-53
Number of pages: 6
Related papers
18 in total
  • [1] [Anonymous], INT WORKSH CONT BAS
  • [2] Baeza-Yates R.A., 2011, Modern Information Retrieval: The Concepts and Technology Behind Search
  • [3] Bakar ZA, 2017, J TELECOMMUNICATION, V9, P43
  • [4] Basu Subhasree, 2016, MultiMedia Modeling. 22nd International Conference, MMM 2016. Proceedings, P238, DOI 10.1007/978-3-319-27671-7_20
  • [5] Research-paper recommender systems: a literature survey
    Beel, Joeran
    Gipp, Bela
    Langer, Stefan
    Breitinger, Corinna
    [J]. INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2016, 17 (04) : 305 - 338
  • [6] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [7] Multi-modal Language Models for Lecture Video Retrieval
    Chen, Huizhong
    Cooper, Matthew
    Joshi, Dhiraj
    Girod, Bernd
    [J]. PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 1081 - 1084
  • [8] Dand Pelleg, 2000, INT C MACH LEARN, DOI 10.1007/3-540-44491-2_3
  • [9] A generic framework for semantic video indexing based on visual concepts/contexts detection
    Elleuch, Nizar
    Ben Ammar, Anis
    Alimi, Adel M.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (04) : 1397 - 1421
  • [10] Fahmy Yousef Ahmed Mohamed., 2014, Video-based learning: A critical analysis of the research published in 2003-2013 and future visions