Automated machine learning tool: The first stop for data science and statistical model building

被引:0
|
作者
Gopagoni D. [1 ]
Lakshmi P.V. [1 ]
机构
[1] Department of Computer Science and Engineering, GIT GITAM (Deemed to be University), Vishakhapatnam, Andhra Pradesh
来源
International Journal of Advanced Computer Science and Applications | 2020年 / 02期
关键词
Artificial neural networks; Automated machine learning; Drug design; K-means clustering; Market analysis; Naive bayes classification; QSAR; QSPR; R program; Regression models; Shiny web app; Supervised learning; Support vector machines;
D O I
10.14569/ijacsa.2020.0110253
中图分类号
学科分类号
摘要
Machine learning techniques are designed to derive knowledge out of existing data. Increased computational power, use of natural language processing, image processing methods made easy creation of rich data. Good domain knowledge is required to build useful models. Uncertainty remains around choosing the right sample data, variables reduction and selection of statistical algorithm. A suitable statistical method coupled with explaining variables is critical for model building and analysis. There are multiple choices around each parameter. An automated system which could help the scientists to select an appropriate data set coupled with learning algorithm will be very useful. A freely available web-based platform, named automated machine learning tool (AMLT), is developed in this study. AMLT will automate the entire model building process. AMLT is equipped with all most commonly used variable selection methods, statistical methods both for supervised and unsupervised learning. AMLT can also do the clustering. AMLT uses statistical principles like R2 to rank the models and automatic test set validation. Tool is validated for connectivity and capability by reproducing two published works. © Science and Information Organization.
引用
收藏
页码:410 / 418
页数:8
相关论文
共 50 条
  • [21] A Comparison of Automated Machine Learning Tools for Predicting Energy Building Consumption in Smart Cities
    Soares, Daniela
    Pereira, Pedro Jose
    Cortez, Paulo
    Goncalves, Carlos
    PROGRESS IN ARTIFICIAL INTELLIGENCE, EPIA 2023, PT I, 2023, 14115 : 311 - 322
  • [22] Explainable Remaining Tool Life Prediction for Individualized Production Using Automated Machine Learning
    Krupp, Lukas
    Wiede, Christian
    Friedhoff, Joachim
    Grabmaier, Anton
    SENSORS, 2023, 23 (20)
  • [23] Energy Model Machine (EMM) Instant Building Energy Prediction using Machine Learning
    Asl, Mohammad Rahmani
    Das, Subhajit
    Tsai, Barry
    Molloy, Ian
    Hauck, Anthony
    ECAADE 2017: SHARING OF COMPUTABLE KNOWLEDGE! (SHOCK!), VOL 2, 2017, : 277 - 286
  • [24] Building machine learning models without sharing patient data: A simulation-based analysis of distributed learning by ensembling
    Tuladhar, Anup
    Gill, Sascha
    Ismail, Zahinoor
    Forkert, Nils D.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 106 (106)
  • [25] Comparison of merging strategies for building machine learning models on multiple independent gene expression data sets
    Krepel, Jessica
    Kircher, Magdalena
    Kohls, Moritz
    Jung, Klaus
    STATISTICAL ANALYSIS AND DATA MINING, 2022, 15 (01) : 112 - 124
  • [26] Data augmentation with automated machine learning: approaches and performance comparison with classical data augmentation methods
    Mumuni, Alhassan
    Mumuni, Fuseini
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, : 4035 - 4085
  • [27] Machine learning model for delay risk assessment in tall building projects
    Sanni-Anibire, Muizz O.
    Zin, Rosli Mohamad
    Olatunji, Sunday Olusanya
    INTERNATIONAL JOURNAL OF CONSTRUCTION MANAGEMENT, 2022, 22 (11) : 2134 - 2143
  • [28] Risky Driver Recognition with Class Imbalance Data and Automated Machine Learning Framework
    Wang, Ke
    Xue, Qingwen
    Lu, Jian John
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (14)
  • [29] Application of Statistical Machine Learning Algorithms for Classification of Bridge Deformation Data Sets
    Avendano, Juan C.
    Otero, Luis Daniel
    Otero, Carlos
    2021 15TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON 2021), 2021,
  • [30] A Supervised Machine Learning Model for Tool Condition Monitoring in Smart Manufacturing
    Ganeshkumar, S.
    Deepika, T.
    Haldorai, Anandakumar
    DEFENCE SCIENCE JOURNAL, 2022, 72 (05) : 712 - 720