Detection of Stopwords in Classical Chinese Poetry

Authors
Peng, Lei [1]
Ma, Xiaodong [2, 3]
Teng, Zheng [4]
Affiliations
[1] Chongqing Three Gorges Med Coll, Lib & Informat Sci Ctr, Chongqing, Peoples R China
[2] INTI Int Univ, Fac Data Sci & Informat Technol, Nilai, N Sembilan, Malaysia
[3] HuangHe Sci & Technol Univ, Sch Int, Zhengzhou, Henan, Peoples R China
[4] Chongqing Three Gorges Med Coll, Sch Med Technol, Chongqing, Peoples R China
Keywords
TF-IDF; stopwords; Chinese; poetry; frequency
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
In this research, we address the problem of stopword detection in classical Chinese poetry, an area that has not been explored previously. Stopword detection is crucial in text mining, because identifying and removing stopwords improves the performance of many natural language processing models. Inspired by the TF-IDF method, we propose a novel approach that uses external knowledge to reconstruct the term weight matrix. Our key finding is that incorporating external knowledge significantly refines the granularity of the term weights, thereby improving the effectiveness of stopword detection. Based on these findings, we conclude that external knowledge can strengthen text representation, especially for the short texts typical of classical Chinese poetry.
Pages: 255-261
Page count: 7
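
The abstract names TF-IDF as the starting point but does not spell out how external knowledge reshapes the term weight matrix, so that step is not reproduced here. As a rough illustration of the plain frequency baseline only, the sketch below treats each character as a term, computes a common smoothed inverse document frequency across poems, and lists the lowest-scoring characters as stopword candidates. The function names `char_idf` and `stopword_candidates`, the character-level tokenization, and the smoothing formula are assumptions made for this example, not the authors' method.

```python
import math
from collections import Counter


def char_idf(poems):
    """Return a smoothed IDF score for every character seen in `poems`.

    Assumption: one character is one term, and IDF is smoothed as
    log((1 + N) / (1 + df)) + 1, where N is the number of poems.
    """
    n = len(poems)
    df = Counter()
    for poem in poems:
        # Count each character at most once per poem (document frequency).
        df.update({ch for ch in poem if not ch.isspace()})
    return {ch: math.log((1 + n) / (1 + count)) + 1.0 for ch, count in df.items()}


def stopword_candidates(poems, top_k=5):
    """Return the `top_k` characters with the lowest IDF (most widespread)."""
    idf = char_idf(poems)
    # Sort by score, then by character so ties are broken deterministically.
    ranked = sorted(idf.items(), key=lambda item: (item[1], item[0]))
    return [ch for ch, _ in ranked[:top_k]]


if __name__ == "__main__":
    # Three well-known Tang poems, punctuation removed.
    poems = [
        "床前明月光疑是地上霜举头望明月低头思故乡",
        "白日依山尽黄河入海流欲穷千里目更上一层楼",
        "春眠不觉晓处处闻啼鸟夜来风雨声花落知多少",
    ]
    print(stopword_candidates(poems, top_k=3))
```

With so few and such short poems, most characters occur in only one document and tie on IDF, which hints at why a coarse frequency signal alone may struggle on short texts and why the abstract looks to external knowledge for finer-grained weights.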