Topological data analysis with digital microscope leather images for animal species classification

被引:0
作者
Ehiro, Takuya [1 ]
Onji, Takeshi [1 ]
机构
[1] Osaka Res Inst Ind Sci & Technol, Res Div Polymer Funct Mat, Izumi, Osaka, Japan
关键词
Animal species classification; Machine learning; Topological data analysis; Persistent homology; Lifetime; MAXIMIZATION ALGORITHM; IDENTIFICATION;
D O I
10.1186/s42825-024-00187-1
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
This study presents a method for classifying cow and horse leather using a small number of digital microscope images and topological data analysis. In this method, hair pore coordinates in the images are used as essential information for classification. First, the coordinates were semiautomatically extracted using conventional image processing methods and persistent homology (PH) computation. Binary images with white pixels corresponding to the coordinates were generated, and their PHs were computed using filtration based on the Manhattan distance. In addition to the pairwise distance between the two pores, zeroth- and first-order lifetimes were used as explanatory variables to construct the classifier. Among the three explanatory variables, the zeroth-order lifetime resulted in the highest classification accuracy (86%) for the test data. Furthermore, we constructed logistic regression (LR) and random forest (RF) models using the zeroth-order lifetime computed from all images and conducted model interpretation. In both LR and RF, information on a zeroth-order lifetime of less than 10 was used as an important explanatory variable. Additionally, the inverse analysis of birth-death pairs suggested that the zeroth-order lifetime contains topological information distinct from the conventional pairwise distance. Our proposed method is designed to be robust in data-limited situations because it only uses hair pore coordinates as explanatory variables and does not require other information, such as hair pore density or pore size. This study demonstrates that accurate classifiers can be obtained using topological features related to hair pore arrangement.
引用
收藏
页数:15
相关论文
共 42 条
[1]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[2]  
Bergstra J., 2011, Proceedings of the 24th International Conference on Neural Information Processing Systems, NIPS'11, page, P2546
[3]  
Bradski G, 2000, DR DOBBS J, V25, P120
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   An Introduction to Topological Data Analysis: Fundamental and Practical Aspects for Data Scientists [J].
Chazal, Frederic ;
Michel, Bertrand .
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2021, 4
[6]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[7]   The shape of cancer relapse: Topological data analysis predicts recurrence in paediatric acute lymphoblastic leukaemia [J].
Chulian, Salvador ;
Stolz, Bernadette ;
Martinez-Rubio, Alvaro ;
Goni, Cristina Blazquez ;
Gutierrez, Juan Rodriguez ;
Velazquez, Teresa Caballero ;
Quintana, Agueda Molinos ;
Orellana, Manuel Ramirez ;
Robleda, Ana Castillo ;
Soler, Jose Luis Fuster M. ;
Puras, Alfredo Minguela ;
Sanchez, Maria V. Martinez D. ;
Rosa, Maria D. ;
Perez-Garcia, Victor M. D. ;
Byrne, Helen D. .
PLOS COMPUTATIONAL BIOLOGY, 2023, 19 (08)
[8]  
Edelsbrunner H, 2008, CONTEMP MATH, V453, P257
[9]   Descriptor generation from Morgan fingerprint using persistent homology [J].
Ehiro, T. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2024, 35 (01) :31-51
[10]   Topological Data Analysis for Multivariate Time Series Data [J].
El-Yaagoubi, Anass B. ;
Chung, Moo K. ;
Ombao, Hernando .
ENTROPY, 2023, 25 (11)