Classifying Force Spectroscopy of DNA Pulling Measurements Using Supervised and Unsupervised Machine Learning Methods

Cited by: 6
Authors
Karatay, Durmus U. [1]
Zhang, Jie [1]
Harrison, Jeffrey S. [1]
Ginger, David S. [1]
Affiliation
[1] Univ Washington, Dept Chem, Seattle, WA 98195 USA
Keywords
RANDOM FOREST; VALIDATION;
DOI
10.1021/acs.jcim.5b00722
Chinese Library Classification number
R914 [Medicinal chemistry]
Discipline classification code
100701
Abstract
Dynamic force spectroscopy (DFS) measurements on biomolecules typically require classifying thousands of repeated force spectra prior to data analysis. Here, we study classification of atomic force microscope-based DFS measurements using machine-learning algorithms in order to automate selection of successful force curves. Notably, we collect a data set that has a testable positive signal using photoswitch-modified DNA before and after illumination with UV (365 nm) light. We generate a feature set consisting of six properties of force-distance curves to train supervised models and use principal component analysis (PCA) for an unsupervised model. For supervised classification, we train random forest models for binary and multiclass classification of force-distance curves. Random forest models predict successful pulls with an accuracy of 94% and classify them into five classes with an accuracy of 90%. The unsupervised method using Gaussian mixture models (GMM) reaches an accuracy of approximately 80% for binary classification.
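The unsupervised binary step described in the abstract can be illustrated with a deliberately reduced sketch. It is not the authors' pipeline: it fits a two-component, one-dimensional Gaussian mixture by expectation-maximization on a single made-up feature, whereas the paper uses six features together with PCA. The feature ("rupture force" in pN), the cluster parameters, and all function names below are assumptions chosen for illustration, and the data are synthetic. True labels are used only to score the result, never to fit the model.

```python
import math
import random

random.seed(0)

# Synthetic stand-in data (an assumption, not the paper's measurements):
# one hypothetical feature per force curve, a "rupture force" in pN.
failed = [random.gauss(20.0, 5.0) for _ in range(200)]      # label 0
successful = [random.gauss(60.0, 8.0) for _ in range(200)]  # label 1
forces = failed + successful
labels = [0] * len(failed) + [1] * len(successful)


def normal_pdf(x, mean, var):
    """Density of a normal distribution with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(values, n_iter=100):
    """Fit a two-component 1D Gaussian mixture with plain EM."""
    n = len(values)
    grand_mean = sum(values) / n
    total_var = sum((v - grand_mean) ** 2 for v in values) / n
    means = [min(values), max(values)]   # start the components at the extremes
    variances = [total_var, total_var]
    weights = [0.5, 0.5]
    for _ in range(n_iter):
        # E-step: responsibility of each component for each point.
        resp = []
        for v in values:
            p = [weights[k] * normal_pdf(v, means[k], variances[k]) for k in range(2)]
            s = p[0] + p[1] + 1e-300     # guard against division by zero
            resp.append((p[0] / s, p[1] / s))
        # M-step: re-estimate weight, mean, and variance of each component.
        for k in range(2):
            nk = sum(r[k] for r in resp)
            means[k] = sum(r[k] * v for r, v in zip(resp, values)) / nk
            var_k = sum(r[k] * (v - means[k]) ** 2 for r, v in zip(resp, values)) / nk
            variances[k] = max(var_k, 1e-6)
            weights[k] = nk / n
    return weights, means, variances


weights, means, variances = fit_gmm_1d(forces)

# Assumption: the component with the larger mean force is the "successful" one.
success_component = 0 if means[0] > means[1] else 1


def predict(x):
    """Return 1 if the curve is assigned to the 'successful' component, else 0."""
    p = [weights[k] * normal_pdf(x, means[k], variances[k]) for k in range(2)]
    best = 0 if p[0] >= p[1] else 1
    return 1 if best == success_component else 0


predicted = [predict(x) for x in forces]
accuracy = sum(p == y for p, y in zip(predicted, labels)) / len(labels)
print(f"component means: {means[0]:.1f}, {means[1]:.1f}")
print(f"binary accuracy on synthetic data: {accuracy:.3f}")
```

Because the two synthetic clusters are well separated, the accuracy printed here says nothing about real force curves; the figures quoted in the abstract come from the authors' own data and models.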
Pages: 621-629
Page count: 9