Web metadata extraction and semantic indexing for learning objects extraction

被引:11
作者
Atkinson, John [1 ]
Gonzalez, Andrea [1 ]
Munoz, Mauricio [1 ]
Astudillo, Hernan [2 ]
机构
[1] Univ Concepcion, Dept Comp Sci, Concepcion, Chile
[2] Univ Tecn Federico Santa Maria, Dept Informat, Valparaiso, Chile
关键词
Metadata extraction; Text mining; Semantic analysis; Machine learning; Learning objects; MANAGEMENT; SEARCH;
D O I
10.1007/s10489-014-0557-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Secondary-school teachers are in constant need of finding relevant digital resources to support specific didactic goals. Unfortunately, generic search engines do not allow them to identify learning objects among semi-structured candidate educational resources, much less retrieve them by teaching goals. This article describes a multi-strategy approach for semantically guided extraction, indexing and search of educational metadata; it combines machine learning, concept analysis, and corpus-based natural language processing techniques. The overall model was validated by comparing extracted metadata against standard search methods and heuristic-based techniques for Classification Accuracy and Metadata Quality (as evaluated by actual teachers), yielding promising results and showing that this semantically guided metadata extraction can effectively enhance access and use of educational digital material.
引用
收藏
页码:649 / 664
页数:16
相关论文
共 39 条
  • [1] Combining text and link analysis for focused crawling - An application for vertical search engines
    Almpanidis, G.
    Kotropoulos, C.
    Pitas, I.
    [J]. INFORMATION SYSTEMS, 2007, 32 (06) : 886 - 908
  • [2] [Anonymous], 2004, J INTERNET CATALOGIN
  • [3] [Anonymous], 2004, Introduction to Machine Learning
  • [4] [Anonymous], 2 INT C COMP ENG APP
  • [5] [Anonymous], P SAWM04 WORKSH ECML
  • [6] [Anonymous], METADATA GENERATION
  • [7] [Anonymous], 2009, P 39 IEEE FRONT ED C, DOI DOI 10.1109/FIE.2009.5350828
  • [8] [Anonymous], 10 INT C DOC AN REC
  • [9] [Anonymous], LACLO 2010 5 LAT AM
  • [10] [Anonymous], P WORLD C ED MULT HY