Data-Driven Regular Expressions Evolution for Medical Text Classification Using Genetic Programming

被引:0
|
作者
Liu, Jiandong [1 ]
Bai, Ruibin [1 ]
Lu, Zheng [1 ]
Ge, Peiming [2 ]
Aickelin, Uwe [3 ]
Liu, Daoyun [2 ]
机构
[1] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
[2] Ping An Hlth Cloud Co Ltd China, Techonol Dept, Shanghai, Peoples R China
[3] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic, Australia
来源
2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2020年
关键词
text classification; genetic programming; co-occurrence matrix; EXPERT-SYSTEM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In medical fields, text classification is one of the most important tasks that can significantly reduce human workload through structured information digitization and intelligent decision support. Despite the popularity of learning-based text classification techniques, it is hard for human to understand or manually fine-tune the classification for better precision and recall, due to the black box nature of learning. This study proposes a novel regular expression-based text classification method making use of genetic programming (GP) approaches to evolve regular expressions that can classify a given medical text inquiry with satisfaction. Given a seed population of regular expressions (randomly initialized or manually constructed by experts), our method evolves a population of regular expressions, using a novel regular expression syntax and a series of carefully chosen reproduction operators. Our method is evaluated with real-life medical text inquiries from an online healthcare provider and shows promising performance. More importantly, our method generates classifiers that can be fully understood, checked and updated by medical doctors, which are fundamentally crucial for medical related practices.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Incorporating Adaptive Discretization into Genetic Programming for Data Classification
    Dufourq, Emmanuel
    Pillay, Nelishia
    2013 THIRD WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES (WICT), 2013, : 127 - 133
  • [32] A Comparison of Genetic Programming Representations for Binary Data Classification
    Dufourq, Emmanuel
    Pillay, Nelishia
    2013 THIRD WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES (WICT), 2013, : 134 - 140
  • [33] Classification of seafloor habitats using genetic programming
    Silva, Sara
    Tseng, Yao-Ting
    APPLICATIONS OF EVOLUTIONARY COMPUTING, PROCEEDINGS, 2008, 4974 : 315 - +
  • [34] Multiple Imputation and Genetic Programming for Classification with Incomplete Data
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    PROCEEDINGS OF THE 2017 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'17), 2017, : 521 - 528
  • [35] Genetic Programming Based Data Projections for Classification Tasks
    Estebanez, Cesar
    Aler, Ricardo
    Valls, Jose M.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 7, 2005, 7 : 56 - 61
  • [36] Genetic programming based data projections for classification tasks
    Estébanez, C
    Aler, R
    Valls, JM
    ENFORMATIKA, VOL 7: IEC 2005 PROCEEDINGS, 2005, : 56 - 61
  • [37] Unbalanced breast cancer data classification using novel fitness functions in genetic programming
    Devarriya, Divyaansh
    Gulati, Cairo
    Mansharamani, Vidhi
    Sakalle, Aditi
    Bhardwaj, Arpit
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 140
  • [38] Multiclass Classification on High Dimension and Low Sample Size Data Using Genetic Programming
    Wei, Tingyang
    Liu, Wei-Li
    Zhong, Jinghui
    Gong, Yue-Jiao
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2022, 10 (02) : 704 - 718
  • [39] Evolution of Fuzzy Classifiers Using Genetic Programming
    Muni, Durga Prasad
    Pal, Nikhil R.
    FUZZY INFORMATION AND ENGINEERING, 2012, 4 (01) : 29 - 49
  • [40] Data Augmentation for Genetic Programming-Driven Late Merging of HOG and Uniform LBP Features for Texture Classification
    Hazgui, Mohamed
    Ghazouani, Haythem
    Barhoumi, Walid
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2024, 11 (02) : 211 - 239