Automated machine learning tool: The first stop for data science and statistical model building

被引:0
|
作者
Gopagoni D. [1 ]
Lakshmi P.V. [1 ]
机构
[1] Department of Computer Science and Engineering, GIT GITAM (Deemed to be University), Vishakhapatnam, Andhra Pradesh
来源
International Journal of Advanced Computer Science and Applications | 2020年 / 02期
关键词
Artificial neural networks; Automated machine learning; Drug design; K-means clustering; Market analysis; Naive bayes classification; QSAR; QSPR; R program; Regression models; Shiny web app; Supervised learning; Support vector machines;
D O I
10.14569/ijacsa.2020.0110253
中图分类号
学科分类号
摘要
Machine learning techniques are designed to derive knowledge out of existing data. Increased computational power, use of natural language processing, image processing methods made easy creation of rich data. Good domain knowledge is required to build useful models. Uncertainty remains around choosing the right sample data, variables reduction and selection of statistical algorithm. A suitable statistical method coupled with explaining variables is critical for model building and analysis. There are multiple choices around each parameter. An automated system which could help the scientists to select an appropriate data set coupled with learning algorithm will be very useful. A freely available web-based platform, named automated machine learning tool (AMLT), is developed in this study. AMLT will automate the entire model building process. AMLT is equipped with all most commonly used variable selection methods, statistical methods both for supervised and unsupervised learning. AMLT can also do the clustering. AMLT uses statistical principles like R2 to rank the models and automatic test set validation. Tool is validated for connectivity and capability by reproducing two published works. © Science and Information Organization.
引用
收藏
页码:410 / 418
页数:8
相关论文
共 50 条
  • [41] Machine learning as a tool to predict potassium concentration in soybean leaf using hyperspectral data
    Furlanetto, Renato Herrig
    Crusiol, Luis Guilherme Teixeira
    Goncalves, Joao Vitor Ferreira
    Nanni, Marcos Rafael
    de Oliveira Junior, Adilson
    de Oliveira, Fabio Alvares
    Sibaldelli, Rubson Natal Ribeiro
    PRECISION AGRICULTURE, 2023, 24 (06) : 2264 - 2292
  • [42] Automated Color Model-Based Concrete Detection in Construction-Site Images by Using Machine Learning Algorithms
    Son, Hyojoo
    Kim, Changmin
    Kim, Changwan
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2012, 26 (03) : 421 - 433
  • [43] Machine learning as a tool to predict potassium concentration in soybean leaf using hyperspectral data
    Renato Herrig Furlanetto
    Luís Guilherme Teixeira Crusiol
    João Vitor Ferreira Gonçalves
    Marcos Rafael Nanni
    Adilson de Oliveira Junior
    Fábio Alvares de Oliveira
    Rubson Natal Ribeiro Sibaldelli
    Precision Agriculture, 2023, 24 : 2264 - 2292
  • [44] Comparison of Machine Learning Algorithms and Fruit Classification using Orange Data Mining Tool
    Vaishnav, Devashree
    Rao, B. Rama
    PROCEEDINGS OF THE 2018 3RD INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2018), 2018, : 603 - 607
  • [45] A Connection Between Pattern Classification by Machine Learning and Statistical Inference With the General Linear Model
    Gorriz, J. M.
    Jimenez-Mesa, C.
    Segovia, F.
    Ramirez, J.
    Suckling, J.
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (11) : 5332 - 5343
  • [46] Automated machine learning-based framework of heating and cooling load prediction for quick residential building design
    Lu, Chujie
    Li, Sihui
    Penaka, Santhan Reddy
    Olofsson, Thomas
    ENERGY, 2023, 274
  • [47] Machine learning, artificial intelligence, and data science breaking into drug design and neglected diseases
    Pena-Guerrero, Jose
    Nguewa, Paul A.
    Garcia-Sosa, Alfonso T.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2021, 11 (05)
  • [48] Automated Framework for Developing Predictive Machine Learning Models for Data-Driven Drug Discovery
    Neves, Bruno J.
    Moreira-Filho, Jose T.
    Silva, Arthur C.
    Borba, Joyce V. V. B.
    Mottin, Melina
    Alves, Vinicius M.
    Braga, Rodolpho C.
    Muratov, Eugene N.
    Andrade, Carolina H.
    JOURNAL OF THE BRAZILIAN CHEMICAL SOCIETY, 2021, 32 (01) : 110 - 122
  • [49] Accuracy analyses and model comparison of machine learning adopted in building energy consumption prediction
    Liu, Zhijian
    Wu, Di
    Liu, Yuanwei
    Han, Zhonghe
    Lun, Liyong
    Gao, Jun
    Jin, Guangya
    Cao, Guoqing
    ENERGY EXPLORATION & EXPLOITATION, 2019, 37 (04) : 1426 - 1451
  • [50] Leveraging Automated Machine Learning for the Analysis of Global Public Health Data: A Case Study in Malaria
    Manduchi, Elisabetta
    Moore, Jason H.
    INTERNATIONAL JOURNAL OF PUBLIC HEALTH, 2021, 66 : 614296