Automatic clustering of single-molecule break junction data through task-oriented representation learning

被引:0
作者
Zhao, Yi-Heng [1 ]
Pang, Shen-Wen [1 ]
Huang, Heng-Zhi [1 ]
Wu, Shao-Wen [1 ]
Sun, Shao-Hua [1 ,2 ]
Liu, Zhen-Bing [1 ,2 ]
Pan, Zhi-Chao [1 ,2 ]
机构
[1] Guilin Univ Elect Technol, Sch Artificial Intelligence, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Guangxi Coll & Univ Key Lab AI Algorithm Engn, Guilin 541004, Peoples R China
关键词
Single-molecule conductance; Break junction; Deep clustering; Representation learning; Neural architecture search; CONDUCTANCE; ELECTRONICS; ROBUST; NOISE;
D O I
10.1007/s12598-024-03089-7
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Clustering is a pivotal data analysis method for deciphering the charge transport properties of single molecules in break junction experiments. However, given the high dimensionality and variability of the data, feature extraction remains a bottleneck in the development of efficient clustering methods. In this regard, extensive research over the past two decades has focused on feature engineering and dimensionality reduction in break junction conductance. However, extracting highly relevant features without expert knowledge remains an unresolved challenge. To address this issue, we propose a deep clustering method driven by task-oriented representation learning (CTRL) in which the clustering module serves as a guide for the representation learning (RepL) module. First, we determine an optimal autoencoder (AE) structure through a neural architecture search (NAS) to ensure efficient RepL; second, the RepL process is guided by a joint training strategy that combines AE reconstruction loss with the clustering objective. The results demonstrate that CTRL achieves excellent performance on both the generated and experimental data. Further inspection of the RepL step reveals that joint training robustly learns more compact features than the unconstrained AE or traditional dimensionality reduction methods, significantly reducing misclustering possibilities. Our method provides a general end-to-end automatic clustering solution for analyzing single-molecule break junction data. (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic) (CTRL), (sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic); (sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)k-means(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic), CTRL(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), CTRL(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic), (sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic)(sic).
引用
收藏
页码:3244 / 3257
页数:14
相关论文
共 70 条
[1]   Flicker Noise as a Probe of Electronic Interaction at Metal-Single Molecule Interfaces [J].
Adak, Olgun ;
Rosenthal, Ethan ;
Meisner, Jeffer ;
Andrade, Erick F. ;
Pasupathy, Abhay N. ;
Nuckolls, Colin ;
Hybertsen, Mark S. ;
Venkataraman, Latha .
NANO LETTERS, 2015, 15 (06) :4143-4149
[2]   Deep learning for single-molecule science [J].
Albrecht, Tim ;
Slabaugh, Gregory ;
Alonso, Eduardo ;
Al-Arif, S. M. Masudur R. .
NANOTECHNOLOGY, 2017, 28 (42)
[3]   Configurational Behavior and Conductance of Alkanedithiol Molecular Wires from Accelerated Dynamics Simulations [J].
Alexis Paz, S. ;
Zoloff Michoff, Martin E. ;
Negre, Christian F. A. ;
Olmos-Asar, Jimena A. ;
Mariscal, Marcelo M. ;
Sanchez, Cristian G. ;
Leiva, Ezequiel P. M. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2012, 8 (11) :4539-4545
[4]   Single-molecule junctions beyond electronic transport [J].
Aradhya, Sriharsha V. ;
Venkataraman, Latha .
NATURE NANOTECHNOLOGY, 2013, 8 (06) :399-410
[5]   Configuration-Specific Insight into Single-Molecule Conductance and Noise Data Revealed by the Principal Component Projection Method [J].
Balogh, Z. ;
Mezei, G. ;
Tenk, N. ;
Magyarkuti, A. ;
Halbritter, A. .
JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2023, 14 (22) :5109-5118
[6]   Unsupervised Segmentation-Based Machine Learning as an Advanced Analysis Tool for Single Molecule Break Junction Data [J].
Bamberger, Nathan D. ;
Ivie, Jeffrey A. ;
Parida, Keshaba N. ;
McGrath, Dominic, V ;
Monti, Oliver L. A. .
JOURNAL OF PHYSICAL CHEMISTRY C, 2020, 124 (33) :18302-18315
[7]  
Bergstra J., 2011, 24 INT C NEUR INF PR, V24
[8]   Trusting our machines: validating machine learning models for single-molecule transport experiments [J].
Bro-Jorgensen, William ;
Hamill, Joseph M. ;
Bro, Rasmus ;
Solomon, Gemma C. .
CHEMICAL SOCIETY REVIEWS, 2022, 51 (16) :6875-6892
[9]   A reference-free clustering method for the analysis of molecular break-junction measurements [J].
Cabosart, Damien ;
El Abbassi, Maria ;
Stefani, Davide ;
Frisenda, Riccardo ;
Calame, Michel ;
van der Zant, Herre S. J. ;
Perrin, Mickael L. .
APPLIED PHYSICS LETTERS, 2019, 114 (14)
[10]   From molecular to supramolecular electronics [J].
Chen, Hongliang ;
Fraser Stoddart, J. .
NATURE REVIEWS MATERIALS, 2021, 6 (09) :804-828