Highlighter: Automatic Highlighting of Electronic Learning Documents

Cited by: 7
Authors
Baralis, Elena [1]
Cagliero, Luca [1]
Affiliation
[1] Politecnico di Torino, Dipartimento di Automatica e Informatica, Corso Duca degli Abruzzi 24, I-10129 Turin, Italy
Keywords
E-learning; text mining; classification
DOI
10.1109/TETC.2017.2681655
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Electronic textual documents are among the most popular teaching content accessible through e-learning platforms. Teachers and learners with different levels of knowledge can access the platform and highlight the portions of text they deem particularly relevant. The highlighted documents can then be shared with the learning community to support oral lessons or individual study. However, highlights are often incomplete or unsuitable for learners with a different level of knowledge. This paper addresses the problem of predicting new highlights in partly highlighted electronic learning documents. To enrich the teaching content with additional features, text classification techniques are used to analyze the portions of documents that users with different levels of knowledge have highlighted manually and to generate ad hoc prediction models. The generated models are then applied to the remaining content to suggest highlights. To improve the quality of the learning experience, learners may explore highlights generated by models tailored to different levels of knowledge. We tested the prediction system on real and benchmark documents highlighted by domain experts and compared the performance of various classifiers in generating highlights. The results show that the predictions are highly accurate and that the proposed approach is applicable to real teaching documents.
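The train-then-suggest workflow described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not name the classifiers, the features, or the unit of highlighting, so the sketch uses a multinomial Naive Bayes classifier over bag-of-words tokens with whole sentences as the unit, and all function names and example sentences are hypothetical. It is not the authors' implementation.

```python
import math
import re
from collections import Counter


def tokenize(sentence):
    """Lowercase a sentence and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", sentence.lower())


def train(labeled_sentences):
    """Fit a multinomial Naive Bayes model on (sentence, is_highlighted) pairs.

    Assumption: the training data contains at least one highlighted and
    one non-highlighted sentence; otherwise a class prior would be zero.
    """
    word_counts = {True: Counter(), False: Counter()}
    class_counts = Counter()
    for sentence, is_highlighted in labeled_sentences:
        class_counts[is_highlighted] += 1
        word_counts[is_highlighted].update(tokenize(sentence))
    vocabulary = set(word_counts[True]) | set(word_counts[False])
    return word_counts, class_counts, vocabulary


def log_score(model, sentence, label):
    """Log prior plus Laplace-smoothed log likelihood of the sentence under `label`."""
    word_counts, class_counts, vocabulary = model
    total_words = sum(word_counts[label].values())
    score = math.log(class_counts[label] / sum(class_counts.values()))
    for token in tokenize(sentence):
        if token in vocabulary:  # words never seen in training are ignored
            smoothed = (word_counts[label][token] + 1) / (total_words + len(vocabulary))
            score += math.log(smoothed)
    return score


def suggest_highlights(model, sentences):
    """Return the sentences the model scores as more likely highlighted than not."""
    return [
        s for s in sentences
        if log_score(model, s, True) > log_score(model, s, False)
    ]


# Hypothetical manually highlighted portion of a document.
labeled = [
    ("A classifier assigns a class label to each record", True),
    ("Support vector machines are a classification technique", True),
    ("The lecture starts at nine in room twelve", False),
    ("Please bring your laptop to the lab session", False),
]
# Hypothetical remaining content without highlights.
unlabeled = [
    "Decision trees are another classification technique",
    "The lab session takes place in room five",
]

model = train(labeled)
print(suggest_highlights(model, unlabeled))
```

To mirror the idea of models tailored to different levels of knowledge, one such model could be trained per group of users (for example, one on highlights made by experts and one on highlights made by beginners) and applied separately to the same remaining content.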
Pages: 7-19
Number of pages: 13