Reference-Corpus Formation for Estimating the Closeness of Topical Texts to the Semantic Standard

Cited by: 0
Authors
D. V. Mikhaylov
G. M. Emelyanov
Affiliation
[1] Yaroslav-the-Wise Novgorod State University,
Source
Pattern Recognition and Image Analysis | 2022 / Vol. 32
Keywords
pattern recognition; intelligent text analysis; information theory; text complexity; lossless-in-sense text compression
DOI
Not available
Pages: 755-762
Page count: 7