Multiple attentional pyramid networks for Chinese herbal recognition

被引:31
作者
Xu, Yingxue [1 ]
Wen, Guihua [1 ]
Hu, Yang [1 ]
Luo, Mingnan [1 ]
Dai, Dan [1 ]
Zhuang, Yishan [1 ]
Hall, Wendy [2 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Guangdong, Peoples R China
[2] Univ Southampton, Web Sci Inst, Southampton, Hants, England
基金
美国国家科学基金会;
关键词
Pyramid networks; Attention mechanism; Multi-scale features; Chinese herbal recognition; Chinese herbs image datasets;
D O I
10.1016/j.patcog.2020.107558
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Chinese herbs play a critical role in Traditional Chinese Medicine. Due to different recognition granular-ity, they can be recognized accurately only by professionals with much experience. It is expected that they can be recognized automatically using new techniques like machine learning. However, there is no Chinese herbal image dataset available. Simultaneously, there is no machine learning method which can deal with Chinese herbal image recognition well. Therefore, this paper begins with building a new standard Chinese-Herbs dataset. Subsequently, a new Attentional Pyramid Networks (APN) for Chinese herbal recognition is proposed, where both novel competitive attention and spatial collaborative attention are proposed and then applied. APN can adaptively model Chinese herbal images with different feature scales. Finally, a new framework for Chinese herbal recognition is proposed as a new application of APN. Experiments are conducted on our constructed dataset and validate the effectiveness of our methods. (c) 2020 Published by Elsevier Ltd.
引用
收藏
页数:14
相关论文
共 49 条
[31]   U-Net: Convolutional Networks for Biomedical Image Segmentation [J].
Ronneberger, Olaf ;
Fischer, Philipp ;
Brox, Thomas .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241
[32]  
Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
[33]  
Toth B.P., 2016, CLEF WORKING NOTES, V1609, P569
[34]  
van der Maaten L, 2008, J MACH LEARN RES, V9, P2579
[35]   Residual Attention Network for Image Classification [J].
Wang, Fei ;
Jiang, Mengqing ;
Qian, Chen ;
Yang, Shuo ;
Li, Cheng ;
Zhang, Honggang ;
Wang, Xiaogang ;
Tang, Xiaoou .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6450-6458
[36]   Deep Parametric Continuous Convolutional Neural Networks [J].
Wang, Shenlong ;
Suo, Simon ;
Ma, Wei-Chiu ;
Pokrovsky, Andrei ;
Urtasun, Raquel .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2589-2597
[37]  
WOO S, 2018, P EUR C COMP VIS ECC, DOI DOI 10.1007/978-3-030-01234-2_1
[38]   Aggregated Residual Transformations for Deep Neural Networks [J].
Xie, Saining ;
Girshick, Ross ;
Dollar, Piotr ;
Tu, Zhuowen ;
He, Kaiming .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5987-5995
[39]   Learning Feature Pyramids for Human Pose Estimation [J].
Yang, Wei ;
Li, Shuang ;
Ouyang, Wanli ;
Li, Hongsheng ;
Wang, Xiaogang .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :1290-1299
[40]   Deep Layer Aggregation [J].
Yu, Fisher ;
Wang, Dequan ;
Shelhamer, Evan ;
Darrell, Trevor .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2403-2412