Data-Driven Regular Expressions Evolution for Medical Text Classification Using Genetic Programming

被引:0
|
作者
Liu, Jiandong [1 ]
Bai, Ruibin [1 ]
Lu, Zheng [1 ]
Ge, Peiming [2 ]
Aickelin, Uwe [3 ]
Liu, Daoyun [2 ]
机构
[1] Univ Nottingham Ningbo China, Sch Comp Sci, Ningbo, Peoples R China
[2] Ping An Hlth Cloud Co Ltd China, Techonol Dept, Shanghai, Peoples R China
[3] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic, Australia
来源
2020 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC) | 2020年
关键词
text classification; genetic programming; co-occurrence matrix; EXPERT-SYSTEM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In medical fields, text classification is one of the most important tasks that can significantly reduce human workload through structured information digitization and intelligent decision support. Despite the popularity of learning-based text classification techniques, it is hard for human to understand or manually fine-tune the classification for better precision and recall, due to the black box nature of learning. This study proposes a novel regular expression-based text classification method making use of genetic programming (GP) approaches to evolve regular expressions that can classify a given medical text inquiry with satisfaction. Given a seed population of regular expressions (randomly initialized or manually constructed by experts), our method evolves a population of regular expressions, using a novel regular expression syntax and a series of carefully chosen reproduction operators. Our method is evaluated with real-life medical text inquiries from an online healthcare provider and shows promising performance. More importantly, our method generates classifiers that can be fully understood, checked and updated by medical doctors, which are fundamentally crucial for medical related practices.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Feature Selection For Text Classification Using Genetic Algorithms
    Bidi, Noria
    Elberrichi, Zakaria
    PROCEEDINGS OF 2016 8TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION & CONTROL (ICMIC 2016), 2016, : 806 - 810
  • [42] Auto Machine Learning Based on Genetic Programming for Medical Image Classification
    Herrera-Sanchez, David
    Acosta-Mesa, Hector-Gabriel
    Mezura-Montes, Efren
    ADVANCES IN COMPUTATIONAL INTELLIGENCE. MICAI 2023 INTERNATIONAL WORKSHOPS, 2024, 14502 : 349 - 359
  • [43] A new fitness function in genetic programming for classification of imbalanced data
    Kumar, Arvind
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2024, 36 (07) : 1021 - 1033
  • [44] Reusing Genetic Programming for Ensemble Selection in Classification of Unbalanced Data
    Bhowan, Urvesh
    Johnston, Mark
    Zhang, Mengjie
    Yao, Xin
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2014, 18 (06) : 893 - 908
  • [45] Genetic Programming Based ECOC for Multiclass Microarray Data Classification
    Wang JiaJun
    Liu KunHong
    Sun MengXin
    Hong QingQi
    2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL. 1, 2017, : 280 - 283
  • [46] Predicting Problem Difficulty for Genetic Programming Applied to Data Classification
    Trujillo, Leonardo
    Martinez, Yuliana
    Galvan-Lopez, Edgar
    Legrand, Pierrick
    GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 1355 - 1362
  • [47] Grinding Burn and Chatter Classification Using Genetic Programming
    Chen, Xun
    Griffin, James
    ADVANCES IN ABRASIVE TECHNOLOGY XI, 2009, 389-390 : 90 - +
  • [48] Optimizing Classification Techniques Using Genetic Programming Approach
    Saraee, Mohammad Hussein
    Sadjady, Razieh Sadat
    INMIC: 2008 INTERNATIONAL MULTITOPIC CONFERENCE, 2008, : 345 - +
  • [49] Cooperative Coevolutionary Multiobjective Genetic Programming for Microarray Data Classification
    Qing, Yang
    Ma, Chi
    Zhou, Yu
    Zhang, Xiao
    Xia, Haowen
    PROCEEDINGS OF THE 2021 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'21), 2021, : 804 - 811
  • [50] Directly Constructing Multiple Features for Classification with Missing Data using Genetic Programming with Interval Functions
    Cao Truong Tran
    Zhang, Mengjie
    Andreae, Peter
    Xue, Bing
    PROCEEDINGS OF THE 2016 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'16 COMPANION), 2016, : 69 - 70