Incremental model of relevance in features with healthcare data analytics

被引:0
作者
Sujitha, R. [1 ]
Poornima, I. Gethzi Ahila [2 ]
Jambulingam, Umamageswaran [3 ]
机构
[1] PSNA Coll Engn & Technol, Dept Comp Sci & Engn, Dindigul, Tamilnadu, India
[2] Ramco Inst Technol, Dept Comp Sci & Engn, Rajapalayam, Tamilnadu, India
[3] Amrita Vishwa Vidyapeetham Chennai, Dept Comp & Engn, Amrita Sch Comp, Chennai, Tamilnadu, India
关键词
Map reduce; Redundancy; Relevancy; Apache Spark; Machine Learning; DEEP; CLASSIFICATION; NETWORK;
D O I
10.1016/j.bspc.2025.107504
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
With the rapid assembly of big data health frameworks for early lung cancer detection, this research provides solution for early and optimum diagnosis of lung cancer with high dimensional dataset by using, the extraction of features carried out with map-reduce framework followed by feature selection using correlation based model and relevance model. The model, in addition develops a Symmetric uncertainty and the Minimum Redundancy Maximum Relevancy (MRMR) model in combination with the weight computation model and incremental selection model. We advocate the use of the Map Reduce framework for processing large datasets that is to be used for classification. The results show the importance of our model with respect to the accuracy of the proposed model which has been increased and became stable at 91%, 92.04%, and 91.6% respectively for all high dimensional unstructured dataset Sputum images, Lung cancer, and thoracic datasets respectively. The error rate decreased to 8% for the lung cancer dataset, 9.5% for the sputum dataset, and 6.2% for the thoracic surgery dataset. Redundancy has been reduced as well as relevancy among features has been increased, 0.852 for sputum, 0.87 for lung cancer, and 0.90 for thoracic surgery datasets. Since it was able to eliminate redundancy and relevancy among several features, we also observed a reduced error rate in most of the datasets.
引用
收藏
页数:12
相关论文
共 33 条
[31]   A Fast Feature Selection Algorithm Based on Swarm Intelligence in Acoustic Defect Detection [J].
Zhang, Tao ;
Ding, Biyun ;
Zhao, Xin ;
Yue, Qianyu .
IEEE ACCESS, 2018, 6 :28848-28858
[32]   Diabetes-related avoidable hospitalisations and its relationship with primary healthcare resourcing in China: A cross-sectional study from Sichuan Province [J].
Zhao, Xiaoshuang ;
Zhang, Yumeng ;
Yang, Yili ;
Pan, Jay .
HEALTH & SOCIAL CARE IN THE COMMUNITY, 2022, 30 (04) :E1143-E1156
[33]   Maximum Relevance and Minimum Redundancy Feature Selection Methods for a Marketing Machine Learning Platform [J].
Zhao, Zhenyu ;
Anand, Radhika ;
Wang, Mallory .
2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, :442-452