An Intellectual Methodology for Secure Health Record Mining and Risk Forecasting Using Clustering and Graph-Based Classification

被引:6
|
作者
Irene, D. Shiny [1 ]
Surya, V [2 ]
Kavitha, D. [2 ]
Shankar, R. [3 ]
Thangaraj, S. John Justin [1 ]
机构
[1] Saveetha Inst Med & Tech Sci, Saveetha Sch Engn, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
[2] SRM Inst Sci & Technol, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
[3] Teegala Krishna Reddy Engn Coll, Elect & Commun Engn, Hyderabad, India
关键词
Feature extraction; feature selection; clustering; classification; privacy preservation; entropy; BIG DATA; CLOUD; INTERNET; IOT; ALGORITHM; CARE; THINGS; MODEL; FRAMEWORK; PROTOCOL;
D O I
10.1142/S0218126621501358
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of the research work is to analyze and validate health records and securing the personal information of patients is a challenging issue in health records mining. The risk prediction task was formulated with the label Cause of Death (COD) as a multi-class classification issue, which views health-related death as the "biggest risk." This unlabeled data particularly describes the health conditions of the participants during the health examinations. It can differ tremendously between healthy and highly ill. Besides, the problems of distributed secure data management over privacy-preserving are considered. The proposed health record mining is in the following stages. In the initial stage, effective features such as fisher score, Pearson correlation, and information gain is calculated from the health records of the patient. Then, the average values are calculated for the extracted features. In the second stage, feature selection is performed from the average features by applying the Euclidean distance measure. The chosen features are clustered in the third stage using distance adaptive fuzzy c-means clustering algorithm (DAFCM). In the fourth stage, an entropy-based graph is constructed for the classification of data and it categorizes the patient's record. At the last stage, for security, privacy preservation is applied to the personal information of the patient. This performance is matched against the existing methods and it gives better performance than the existing ones.
引用
收藏
页数:19
相关论文
共 43 条
  • [31] Spectral clustering and query expansion using embeddings on the graph-based extension of the set-based information retrieval model
    Kalogeropoulos, Nikitas-Rigas
    Kontogiannis, George
    Makris, Christos
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 263
  • [32] Semisupervised graph-based hyperspectral images classification using low-rank representation graph with considering the local structure of data
    Ahmadi, Seyyed Ali
    Mehrshad, Nasser
    Razavi, Seyyed Mohammad
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (06)
  • [33] A Bottom-up Method Using Texture Features and a Graph-based Representation for Lettrine Recognition and Classification
    Mehri, Maroua
    Gomez-Kramer, Petra
    Heroux, Pierre
    Coustaty, Mickael
    Lerouge, Julien
    Mullo, Remy
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 226 - 230
  • [34] Semi-supervised multi-label classification using an extended graph-based manifold regularization
    Li, Ding
    Dick, Scott
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (02) : 1561 - 1577
  • [35] Credit risk evaluation using clustering based fuzzy classification method
    Baser, Furkan
    Koc, Oguz
    Selcuk-Kestel, Sevtap
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 223
  • [36] Graph-Based Hyperspectral Image Classification Using Outliers Detection based on Spatial Information and Estimating of the Number of GMM Mixtures
    Lak, Mahboubeh
    Keshavarz, Ahmad
    Pourghassem, Hossein
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 196 - 200
  • [37] Mosque Typo-Morphological Classification for Pattern Recognition Using Shape Grammar Theory and Graph-Based Techniques
    Ali, Lana Abubakr
    Mustafa, Faris Ali
    BUILDINGS, 2023, 13 (03)
  • [38] Using Graph-based Metrics with Empirical Risk Minimization to Speed Up Active Learning on Networked Data
    Macskassy, Sofus A.
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 597 - 605
  • [39] Multi-class parkinsonian disorders classification with quantitative MR markers and graph-based features using support vector machines
    Morisi, Rita
    Manners, David Neil
    Gnecco, Giorgio
    Lanconelli, Nico
    Testa, Claudia
    Evangelisti, Stefania
    Talozzi, Lia
    Gramegna, Laura Ludovica
    Bianchini, Claudio
    Calandra-Buonaura, Giovanna
    Sambati, Luisa
    Giannini, Giulia
    Cortelli, Pietri
    Tonon, Caterina
    Lodi, Raffaele
    PARKINSONISM & RELATED DISORDERS, 2018, 47 : 64 - 70
  • [40] An Information Entropy-based Risk (IER) Index of Mining Safety Using Clustering and Statistical Methods
    Eshwar, Dharmasai
    Chatterjee, Snehamoy
    Kaunda, Rennie
    Miller, Hugh
    Majdara, Aref
    MINING METALLURGY & EXPLORATION, 2024, : 1693 - 1708