Learning And Predicting Diabetes Data Sets Using Semi-Supervised Learning

被引:0
|
作者
Tayal, Radhika [1 ]
Shankar, Achyut [2 ]
机构
[1] Noida Int Univ Uttar Pradesh, Dept Comp Sci, Noida, Uttar Pradesh, India
[2] Amity Univ Uttar Pradesh, ASET, Dept Comp Sci & Engn, Noida, Uttar Pradesh, India
关键词
D O I
10.1109/confluence47617.2020.9058276
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Now these days, many tools have been developed by the researchers to analyze the impact of diabetes disease on common people within a definite period. However, all these tools have predicted the results based on the labeled dataset or smaller dataset. But in a recent environment, we have collected a large amount of data using both online and offline media. Consequently, data are generated from heterogeneous sources, are in unstructured form and voluminous, etc. As a result, it is not possible to use huge data by using traditional prediction algorithms because they work only on the structured dataset. In this paper, we have used the semi-supervised learning approach that works on a partially labeled dataset for predicting diabetes disease. The partial dataset is the combination of a labeled and unlabelled dataset. For prediction, we have considered 80% unlabelled datasets and 20% labeled datasets. We developed a user based interface for the user to build their prediction model using labeled and unlabeled datasets and analyze the data according to their requirements and interest. Our main objective is to develop a diabetes prediction system that can be used by the researcher and the common people using with minimal labelled datasets.
引用
收藏
页码:385 / 389
页数:5
相关论文
共 50 条
  • [31] Semi-supervised Learning Using Siamese Networks
    Sahito, Attaullah
    Frank, Eibe
    Pfahringer, Bernhard
    AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 586 - 597
  • [32] Predicting Unlabeled Traffic For Intrusion Detection Using Semi-Supervised Machine Learning
    Murthy, Chidananda P.
    Manjunatha, A. S.
    Jaiswal, Anku
    Madhu, B. R.
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2016, : 218 - 222
  • [33] Sentiment analysis using semi-supervised learning with few labeled data
    Pan, Yuhao
    Chen, Zhiqun
    Suzuki, Yoshimi
    Fukumoto, Fumiyo
    Nishizaki, Hiromitsu
    2020 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW 2020), 2020, : 231 - 234
  • [34] Effective semi-supervised learning for structured data using Embedding GANs
    Deng, Xiaoheng
    Jiang, Ping
    Zhao, Dezheng
    Huang, Rong
    Shen, Hailan
    Pattern Recognition Letters, 2021, 151 : 127 - 134
  • [35] SEMI-SUPERVISED HIERARCHY LEARNING USING MULTIPLE-LABELED DATA
    Javadi, Ailar
    Gray, Alexander
    Anderson, David
    Berisha, Visar
    2011 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2011,
  • [36] Semi-supervised learning using multiple clusterings with limited labeled data
    Forestier, Germain
    Wemmert, Cedric
    INFORMATION SCIENCES, 2016, 361 : 48 - 65
  • [37] Semi-Supervised Machine Learning Method for Predicting Observed Individual Risk Preference Using Gallup Data
    Ahmed, Faroque
    Shamsuddin, Mrittika
    Sultana, Tanzila
    Shamsuddin, Rittika
    MATHEMATICAL AND COMPUTATIONAL APPLICATIONS, 2024, 29 (02)
  • [38] Behavioral Learning for Data Adjacent Graph Construction in Semi-supervised Learning
    Liu, Zhen
    Yang, Jun-an
    Liu, Hui
    Wang, Wei
    2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATIONS (CSA), 2015, : 125 - 129
  • [39] Semi-supervised learning by disagreement
    Zhou, Zhi-Hua
    Li, Ming
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 24 (03) : 415 - 439
  • [40] A survey on semi-supervised learning
    Jesper E. van Engelen
    Holger H. Hoos
    Machine Learning, 2020, 109 : 373 - 440