Down Syndrome Prediction Using a Cascaded Machine Learning Framework Designed for Imbalanced and Feature-correlated Data

被引:6
|
作者
Li, Ling [1 ]
Liu, Wanying [1 ]
Zhang, Hongguo [2 ]
Jiang, Yuting [2 ]
Hu, Xiaonan [2 ]
Liu, Ruizhi [2 ]
机构
[1] Jilin Univ, Sch Commun Engn, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Hosp 1, Ctr Prenatal Diag, Ctr Reprod Med, Changchun 130021, Jilin, Peoples R China
关键词
Bioinformatics; down syndrome prediction; imbalanced learning; cascaded framework; ensemble learning; noninvasive prenatal diagnosis; DISEASE; DIAGNOSIS; TRISOMY-21; VALUES;
D O I
10.1109/ACCESS.2019.2929681
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Down syndrome (DS) caused by the presence of part or all of a third copy of chromosome 21 is the most common form of aneuploidy. The prenatal screening for DS is a key component of antenatal care and is recommended to be universally offered to women irrespective of age or background. The objective of this paper is to introduce a noninvasive and accurate diagnosis procedure for DS and to minimize social and financial cost of prenatal diagnosis. Recently, machine learning has received considerable attention in predictive analytics for medical problems. However, there is few its applications on DS prediction reported due to the difficulty of dealing with highly imbalanced and feature-correlated screening data. In this paper, we propose a cascaded machine learning framework designed for DS prediction based on three complementary stages: 1) pre-judgment with isolation forest technique, 2) model ensemble by voting strategy, and 3) final judgment using logistic regression approach. The experimental results show that the performance of this framework on maternal serum screening data set, when evaluated with different evaluation parameters, is superior to those of some machine learning methods. The best suggested combination of input features for DS screening is the group of alpha-fetoprotein, human chorionic gonadotropin, unconjugated estriol, and maternal age. In addition, our method has the potential to generate further accurate prediction for imbalanced and feature-correlated data, thereby providing a novel and effective approach for certain diseases analysis.
引用
收藏
页码:97582 / 97593
页数:12
相关论文
共 50 条
  • [1] Classification with machine learning algorithms after hybrid feature selection in imbalanced data sets
    Pulat, Meryem
    Kocakoc, Ipek Deveci
    OPERATIONS RESEARCH AND DECISIONS, 2024, 34 (04) : 157 - 183
  • [2] Efficient Bioinspired Feature Selection and Machine Learning Based Framework Using Omics Data and Biological Knowledge Data Bases in Cancer Clinical Endpoint Prediction
    Zenbout, Imene
    Bouramoul, Abdelkrim
    Meshoul, Souham
    Amrane, Mounira
    IEEE ACCESS, 2023, 11 : 2674 - 2699
  • [3] Ensemble Feature Selection Framework for Paddy Yield Prediction in Cauvery Basin using Machine Learning Classifiers
    Sathya, P.
    Gnanasekaran, P.
    COGENT ENGINEERING, 2023, 10 (02):
  • [4] Feature reduction for hepatocellular carcinoma prediction using machine learning algorithms
    Mostafa, Ghada
    Mahmoud, Hamdi
    Abd El-Hafeez, Tarek
    Elaraby, Mohamed E.
    JOURNAL OF BIG DATA, 2024, 11 (01)
  • [5] Classifying imbalanced data using BalanceCascade-based kernelized extreme learning machine
    Bhagat Singh Raghuwanshi
    Sanyam Shukla
    Pattern Analysis and Applications, 2020, 23 : 1157 - 1182
  • [6] Classifying imbalanced data using BalanceCascade-based kernelized extreme learning machine
    Raghuwanshi, Bhagat Singh
    Shukla, Sanyam
    PATTERN ANALYSIS AND APPLICATIONS, 2020, 23 (03) : 1157 - 1182
  • [7] Improved metabolomic data-based prediction of depressive symptoms using nonlinear machine learning with feature selection
    Takahashi, Yuta
    Ueki, Masao
    Yamada, Makoto
    Tamiya, Gen
    Motoike, Ikuko N.
    Saigusa, Daisuke
    Sakurai, Miyuki
    Nagami, Fuji
    Ogishima, Soichi
    Koshiba, Seizo
    Kinoshita, Kengo
    Yamamoto, Masayuki
    Tomita, Hiroaki
    TRANSLATIONAL PSYCHIATRY, 2020, 10 (01)
  • [8] Machine Learning Models for Classification of Cushing's Syndrome Using Retrospective Data
    Isci, Senol
    Kalender, Derya Sema Yaman
    Bayraktar, Firat
    Yaman, Alper
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (08) : 3153 - 3162
  • [9] Caller-Agent Pairing in Call Centers Using Machine Learning Techniques with Imbalanced Data
    Mehrbod, Negin
    Grilo, Antonio
    Zutshi, Aneesh
    2018 IEEE INTERNATIONAL CONFERENCE ON ENGINEERING, TECHNOLOGY AND INNOVATION (ICE/ITMC), 2018,
  • [10] An anomaly prediction framework for financial IT systems using hybrid machine learning methods
    Wang, Jingwen
    Liu, Jingxin
    Pu, Juntao
    Yang, Qinghong
    Miao, Zhongchen
    Gao, Jian
    Song, You
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2019, 14 (11) : 15277 - 15286