Hierarchical Classification of Animal Images Including Visually Similar Species: A Case Study on Parrot Images

被引:0
作者
Lee, Me Young [1 ]
Seong, Hyeon Ah [1 ]
Seong, Si won [1 ]
Lee, Eui Chul [2 ]
机构
[1] Sangmyung Univ, Grad Sch, Dept Artificial Intelligence & Informat, Seoul 03016, South Korea
[2] Sangmyung Univ, Dept Human Ctr Artificial Intelligence, Seoul 03016, South Korea
关键词
convolutional neural network; deep learning; hierarchical structure; image classification; parrot classification;
D O I
10.3837/tiis.2025.02.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, convolutional neural network (CNN)-based methods have gained significant popularity for image classification tasks. However, when faced with many classes, especially those with comparable features, CNN-based algorithms encounter difficulties in achieving effective differentiation. This limitation becomes particularly challenging when attempting to accurately classify animals and plants belonging to internationally endangered species, which often exhibit similarities with closely related species. To address this issue, this study proposes a method for classifying images of 11 parrot species using a hierarchical structure based on a CNN image classification model. The classification criteria in the first layer are defined by dividing the data based on the genus Cacatua and employing a random division approach. Experimental results comparing the proposed method with existing image classification techniques reveal that the precision improved from 0.917 to 0.952, the recall improved from 0.914 to 0.950, and the F1 score increased by 0.036, from 0.914 to 0.950. Additionally, when the images were randomly sorted in the first layer, the precision improved from 0.917 to 0.944, the recall improved from 0.914 to 0.943, and the F1 score increased by 0.029, from 0.914 to 0.943. These results indicate that the first-stage classification based on the biological taxonomic system led to an improvement of approximately 20% in terms of the F1 score compared to when the first-stage classification was done randomly in groups. Consequently, the proposed method demonstrates better performance in animal image classification problems when initially pre-classifying visually similar classes. On the other hand, if the external appearances of all classes are entirely different, the proposed method may not be suitable for application.
引用
收藏
页码:513 / 532
页数:20
相关论文
共 20 条
[1]   Deep learning: Applications, architectures, models, tools, and frameworks: A comprehensive survey [J].
Gheisari, Mehdi ;
Ebrahimzadeh, Fereshteh ;
Rahimi, Mohamadtaghi ;
Moazzamigodarzi, Mahdieh ;
Liu, Yang ;
Pramanik, Pijush Kanti Dutta ;
Heravi, Mohammad Ali ;
Mehbodniya, Abolfazl ;
Ghaderzadeh, Mustafa ;
Feylizadeh, Mohammad Reza ;
Kosari, Saeed .
CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (03) :581-606
[2]   Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1026-1034
[3]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[4]   Densely Connected Convolutional Networks [J].
Huang, Gao ;
Liu, Zhuang ;
van der Maaten, Laurens ;
Weinberger, Kilian Q. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2261-2269
[5]   Multi-Class Parrot Image Classification Including Subspecies with Similar Appearance [J].
Jang, Woohyuk ;
Lee, Eui Chul .
BIOLOGY-BASEL, 2021, 10 (11)
[6]   Image Identification of Multiple Parrot Species Belonging to CITES Using Deep Neural Networks [J].
Jang, Woohyuk ;
Seong, Si Won ;
Kim, Chang Bae ;
Lee, Eui Chul .
INTELLIGENT HUMAN COMPUTER INTERACTION, IHCI 2020, PT II, 2021, 12616 :248-253
[7]   Animal Sounds Classification Scheme Based on Multi-Feature Network with Mixed Datasets [J].
Kim, Chung-Il ;
Cho, Yongjang ;
Jung, Seungwon ;
Rew, Jehyeok ;
Hwang, Eenjun .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (08) :3384-3398
[8]  
Kingma DP., 2014, P 2 INT C LEARN REPR
[9]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[10]  
LeCun Y, 1999, The mnist database of handwritten digits