Feasibility of Active Machine Learning for Multiclass Compound Classification

被引:30
|
作者
Lang, Tobias [1 ,2 ]
Flachsenberg, Florian [1 ]
von Luxburg, Ulrike [3 ]
Rarey, Matthias [1 ]
机构
[1] Univ Hamburg, Ctr Bioinformat, D-20146 Hamburg, Germany
[2] Univ Hamburg, Dept Comp Sci, Schluterstr 70, D-20146 Hamburg, Germany
[3] Univ Tubingen, Dept Comp Sci, D-72076 Tubingen, Germany
关键词
DISCOVERY; TOOL;
D O I
10.1021/acs.jcim.5b00332
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
A common task in the hit-to-lead process is classifying sets of compounds into multiple, usually structural classes, which build the groundwork for subsequent SAR studies. Machine learning techniques can be used to automate this process by learning classification models from training compounds of each class. Gathering class information for compounds can be cost-intensive as the required data needs to be provided by human experts or experiments. This paper studies whether active machine learning can be used to reduce the required number of training compounds. Active learning is a machine learning method which processes class label data in an iterative fashion. It has gained much attention in a broad range of application areas. In this paper, an active learning method for multiclass compound classification is proposed. This method selects informative training compounds so as to optimally support the learning progress. The combination with human feedback leads to a semiautomated interactive multiclass classification procedure. This method was investigated empirically on 15 compound classification tasks containing 86-2870 compounds in 3-38 classes. The empirical results show that active learning can solve these classification tasks using 10-80% of the data which would be necessary for standard learning techniques.
引用
收藏
页码:12 / 20
页数:9
相关论文
共 50 条
  • [21] Toward the explainability, transparency, and universality of machine learning for behavioral classification in neuroscience
    Goodwin, Nastacia L.
    Nilsson, Simon R. O.
    Choong, Jia Jie
    Golden, Sam A.
    CURRENT OPINION IN NEUROBIOLOGY, 2022, 73
  • [22] Psychometric and Machine Learning Approaches for Diagnostic Assessment and Tests of Individual Classification
    Gonzalez, Oscar
    PSYCHOLOGICAL METHODS, 2021, 26 (02) : 236 - 254
  • [23] A Review on Automatic Classification of Honey Botanical Origins using Machine Learning
    Al-Awadhi, Mokhtar A.
    Deshmukh, Ratnadeep R.
    2021 INTERNATIONAL CONFERENCE OF MODERN TRENDS IN INFORMATION AND COMMUNICATION TECHNOLOGY INDUSTRY (MTICTI 2021), 2021, : 25 - 29
  • [24] Gearbox faults feature selection and severity classification using machine learning
    Zuber, Ninoslav
    Bajric, Rusmir
    EKSPLOATACJA I NIEZAWODNOSC-MAINTENANCE AND RELIABILITY, 2020, 22 (04): : 748 - 756
  • [25] Beef Cut Classification Using Multispectral Imaging and Machine Learning Method
    Li, Ang
    Li, Chenxi
    Gao, Moyang
    Yang, Si
    Liu, Rong
    Chen, Wenliang
    Xu, Kexin
    FRONTIERS IN NUTRITION, 2021, 8
  • [26] Laser-based classification of olive oils assisted by machine learning
    Gazeli, Odhisea
    Bellou, Elli
    Stefas, Dimitrios
    Couris, Stelios
    FOOD CHEMISTRY, 2020, 302
  • [27] A systematic review of the application of machine learning in the detection and classification of transposable elements
    Orozco-Arias, Simon
    Isaza, Gustavo
    Guyot, Romain
    Tabares-Soto, Reinel
    PEERJ, 2019, 7
  • [28] Classification of Active and Weakly Active ST Inhibitors of HIV-1 Integrase Using a Support Vector Machine
    Yan, Aixia
    Xuan, Shouyi
    Hu, Xiaoying
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2012, 15 (10) : 792 - 805
  • [29] Active Machine Learning for Chemical Engineers: A Bright Future Lies Ahead!
    Ureel, Yannick
    Dobbelaere, Maarten R.
    Ouyang, Yi
    De Ras, Kevin
    Sabbe, Maarten K.
    Marin, Guy B.
    Van Geem, Kevin M.
    ENGINEERING, 2023, 27 : 23 - 30
  • [30] Research on Classification Method of Eggplant Seeds Based on Machine Learning and Multispectral Imaging Classification Eggplant Seeds
    Sun, Lei
    Fan, Xiaofei
    Huang, Sheng
    Luo, Shuangxia
    Zhao, Lili
    Chen, Xueping
    He, Yi
    Suo, Xuesong
    JOURNAL OF SENSORS, 2021, 2021