A case study of duplications detection for educational domain thorough ad hoc search and identification NLP-based method

被引：1

作者：

Mikhaylov, S. N. ^{[1
]}

Chuikova, V. V. ^{[1
]}

Sokolova, Marina V. ^{[1
]}

Potapenko, A. M. ^{[1
]}

机构：

[1] Southwest State Univ, Kursk 305040, Russia

来源：

EXPERT SYSTEMS | 2017年 / 34卷 / 04期

关键词：

evaluation; information resource; information retrieval; natural language processing; visual interface for knowledge representation;

D O I：

10.1111/exsy.12200

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

During the organization and planning of lecture courses for a discipline, its content may be overlapped and partially delivered in more than one course. Sometimes this action causes time loss through unnecessary repeating. This paper introduces an automated tool for duplications detections adapting methods of natural language processing used for Web search. The experiment for unstructured electronic document repositories clustering for thematic duplicate identification in different documents in the case of educational domain is presented. A prototype of this Web service-based software search engine is being designed and discussed. The experiment aimed to identify thematic duplicates of various courses within one of the teaching disciplines is also presented.

引用

页数：11

共 2 条

[1] A NLP-based semi-automatic identification system for delays in follow-up examinations: an Italian case study on clinical referrals
Torri, Vittorio
Ercolanoni, Michele
Bortolan, Francesco
Leoni, Olivia
Ieva, Francesca
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
[2] An identification method for thin shale gas reservoirs based on the high-frequency recovery technology in frequency domain: A case study from deep shale gas in the Luzhou area of the Sichuan Basin
Kang K.
Yang W.
Li W.
Li H.
Wang M.
Lyu K.
Natural Gas Industry, 2022, 42 (10) : 54 - 62

← 1 →