Data-Driven Regular Expressions Evolution for Medical Text Classification Using Genetic Programming

被引:0
|
作者
Liu, Jiandong [1 ]
Bai, Ruibin [1 ]
Lu, Zheng [1 ]
Ge, Peiming [2 ]
Aickelin, Uwe [3 ]
Liu, Daoyun [2 ]
机构
[1] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
[2] Ping An Hlth Cloud Co Ltd China, Techonol Dept, Shanghai, Peoples R China
[3] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic, Australia
来源
2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2020年
关键词
text classification; genetic programming; co-occurrence matrix; EXPERT-SYSTEM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In medical fields, text classification is one of the most important tasks that can significantly reduce human workload through structured information digitization and intelligent decision support. Despite the popularity of learning-based text classification techniques, it is hard for human to understand or manually fine-tune the classification for better precision and recall, due to the black box nature of learning. This study proposes a novel regular expression-based text classification method making use of genetic programming (GP) approaches to evolve regular expressions that can classify a given medical text inquiry with satisfaction. Given a seed population of regular expressions (randomly initialized or manually constructed by experts), our method evolves a population of regular expressions, using a novel regular expression syntax and a series of carefully chosen reproduction operators. Our method is evaluated with real-life medical text inquiries from an online healthcare provider and shows promising performance. More importantly, our method generates classifiers that can be fully understood, checked and updated by medical doctors, which are fundamentally crucial for medical related practices.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Genetic Programming for Image Classification with Unbalanced Data
    Bhowan, Urvesh
    Zhang, Mengjie
    Johnston, Mark
    2009 24TH INTERNATIONAL CONFERENCE IMAGE AND VISION COMPUTING NEW ZEALAND (IVCNZ 2009), 2009, : 316 - +
  • [22] Genetic programming for medical classification: a program simplification approach
    Mengjie Zhang
    Phillip Wong
    Genetic Programming and Evolvable Machines, 2008, 9 : 229 - 255
  • [23] Classification of gene expression data with genetic programming
    Driscoll, JA
    Worzel, B
    MacLean, D
    GENETIC PROGRAMMING THEORY AND PRACTICE, 2003, 6 : 25 - 42
  • [24] Genetic programming for medical classification: a program simplification approach
    Zhang, Mengjie
    Wong, Phillip
    GENETIC PROGRAMMING AND EVOLVABLE MACHINES, 2008, 9 (03) : 229 - 255
  • [25] Fault classification using genetic programming
    Zhang, Liang
    Nandi, Asoke K.
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2007, 21 (03) : 1273 - 1284
  • [26] GPSO: A FRAMEWORK FOR OPTIMIZATION OF GENETIC PROGRAMMING CLASSIFIER EXPRESSIONS FOR BINARY CLASSIFICATION USING PARTICLE SWARM OPTIMIZATION
    Jabeen, Hajira
    Baig, Abdul Rauf
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (1A): : 233 - 242
  • [27] Data Mining Classification: The Potential of Genetic Programming
    El Kadhi, Nabil H.
    Habib, Fatima A.
    SIXTH INTERNATIONAL MULTI-CONFERENCE ON COMPUTING IN THE GLOBAL INFORMATION TECHNOLOGY (ICCGI 2011), 2011, : 1 - 7
  • [28] Term-weighting learning via genetic programming for text classification
    Jair Escalante, Hugo
    Garcia-Limon, Mauricio A.
    Morales-Reyes, Alicia
    Graff, Mario
    Montes-y-Gomez, Manuel
    Morales, Eduardo F.
    Martinez-Carranza, Jose
    KNOWLEDGE-BASED SYSTEMS, 2015, 83 : 176 - 189
  • [29] Turkish Medical Text Classification Using BERT
    Celikten, Azer
    Bulut, Hasan
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [30] A Genetic Programming-Driven Data Fitting Method
    Chen, Hao
    Guo, Zi Yuan
    Duan, Hong Bai
    Ban, Duo
    IEEE ACCESS, 2020, 8 (08): : 111448 - 111459