Automatic clustering of single-molecule break junction data through task-oriented representation learning

被引:0
作者
Zhao, Yi-Heng [1 ]
Pang, Shen-Wen [1 ]
Huang, Heng-Zhi [1 ]
Wu, Shao-Wen [1 ]
Sun, Shao-Hua [1 ,2 ]
Liu, Zhen-Bing [1 ,2 ]
Pan, Zhi-Chao [1 ,2 ]
机构
[1] Guilin Univ Elect Technol, Sch Artificial Intelligence, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Guangxi Coll & Univ Key Lab AI Algorithm Engn, Guilin 541004, Peoples R China
关键词
Single-molecule conductance; Break junction; Deep clustering; Representation learning; Neural architecture search; CONDUCTANCE; ELECTRONICS; ROBUST; NOISE;
D O I
10.1007/s12598-024-03089-7
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Clustering is a pivotal data analysis method for deciphering the charge transport properties of single molecules in break junction experiments. However, given the high dimensionality and variability of the data, feature extraction remains a bottleneck in the development of efficient clustering methods. In this regard, extensive research over the past two decades has focused on feature engineering and dimensionality reduction in break junction conductance. However, extracting highly relevant features without expert knowledge remains an unresolved challenge. To address this issue, we propose a deep clustering method driven by task-oriented representation learning (CTRL) in which the clustering module serves as a guide for the representation learning (RepL) module. First, we determine an optimal autoencoder (AE) structure through a neural architecture search (NAS) to ensure efficient RepL; second, the RepL process is guided by a joint training strategy that combines AE reconstruction loss with the clustering objective. The results demonstrate that CTRL achieves excellent performance on both the generated and experimental data. Further inspection of the RepL step reveals that joint training robustly learns more compact features than the unconstrained AE or traditional dimensionality reduction methods, significantly reducing misclustering possibilities. Our method provides a general end-to-end automatic clustering solution for analyzing single-molecule break junction data. (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic) (CTRL), (sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic); (sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)k-means(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic), CTRL(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), CTRL(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).
引用
收藏
页码:3244 / 3257
页数:14
相关论文
共 70 条
[31]   Classification of conductance traces with recurrent neural networks [J].
Lauritzen, Kasper P. ;
Magyarkuti, Andras ;
Balogh, Zoltan ;
Halbritter, Andras ;
Solomon, Gemma C. .
JOURNAL OF CHEMICAL PHYSICS, 2018, 148 (08)
[32]   Unsupervised vector-based classification of single-molecule charge transport data [J].
Lemmer, Mario ;
Inkpen, Michael S. ;
Kornysheva, Katja ;
Long, Nicholas J. ;
Albrecht, Tim .
NATURE COMMUNICATIONS, 2016, 7
[33]   In-situ label-free single-molecule dynamic detection of thermal-reversible reactions [J].
Li, Jin ;
Long, Xia ;
Cao, Juexian ;
Hu, Yong .
CHEMICAL ENGINEERING JOURNAL, 2023, 451
[34]   Gate controlling of quantum interference and direct observation of anti-resonances in single molecule charge transport [J].
Li, Yueqi ;
Buerkle, Marius ;
Li, Guangfeng ;
Rostamian, Ali ;
Wang, Hui ;
Wang, Zixiao ;
Bowler, David R. ;
Miyazaki, Tsuyoshi ;
Xiang, Limin ;
Asai, Yoshihiro ;
Zhou, Gang ;
Tao, Nongjian .
NATURE MATERIALS, 2019, 18 (04) :357-+
[35]   Using Weakly Supervised Deep Learning to Classify and Segment Single-Molecule Break-Junction Conductance Traces [J].
Lin, Dongying ;
Zhao, Zhihao ;
Pan, Haoyang ;
Li, Shi ;
Wang, Yongfeng ;
Wang, Dong ;
Sanvito, Stefano ;
Hou, Shimin .
CHEMPHYSCHEM, 2021, 22 (20) :2107-2114
[36]   Spectral Clustering to Analyze the Hidden Events in Single-Molecule Break Junctions [J].
Lin, Luchun ;
Tang, Chun ;
Dong, Gang ;
Chen, Zhixin ;
Pan, Zhichao ;
Liu, Junyang ;
Yang, Yang ;
Shi, Jia ;
Ji, Rongrong ;
Hong, Wenjing .
JOURNAL OF PHYSICAL CHEMISTRY C, 2021, 125 (06) :3623-3630
[37]   Dissecting Time-Evolved Conductance Behavior of Single Molecule Junctions by Nonparametric Machine Learning [J].
Liu, Bo ;
Murayama, Sanae ;
Komoto, Yuki ;
Tsutsui, Makusu ;
Taniguchi, Masateru .
JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2020, 11 (16) :6567-6572
[38]   Transition from Tunneling Leakage Current to Molecular Tunneling in Single-Molecule Junctions [J].
Liu, Junyang ;
Zhao, Xiaotao ;
Zheng, Jueting ;
Huang, Xiaoyan ;
Tang, Yongxiang ;
Wang, Fei ;
Li, Ruihao ;
Pi, Jiuchan ;
Huang, Cancan ;
Wang, Lin ;
Yang, Yang ;
Shi, Jia ;
Mao, Bing-Wei ;
Tian, Zhong-Qun ;
Bryce, Martin R. ;
Hong, Wenjing .
CHEM, 2019, 5 (02) :390-401
[39]   Statistical approach to investigating transport through single molecules [J].
Loertscher, Emanuel ;
Weber, Heiko B. ;
Riel, Heike .
PHYSICAL REVIEW LETTERS, 2007, 98 (17)
[40]   Unsupervised feature recognition in single-molecule break junction data [J].
Magyarkuti, Andras ;
Balogh, Nora ;
Balogh, Zoltan ;
Venkataraman, Latha ;
Halbritter, Andras .
NANOSCALE, 2020, 12 (15) :8355-8363