Multiple attentional pyramid networks for Chinese herbal recognition

Cited by: 31
Authors
Xu, Yingxue [1]
Wen, Guihua [1]
Hu, Yang [1]
Luo, Mingnan [1]
Dai, Dan [1]
Zhuang, Yishan [1]
Hall, Wendy [2]
Affiliations
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Guangdong, Peoples R China
[2] Univ Southampton, Web Sci Inst, Southampton, Hants, England
Funding
U.S. National Science Foundation;
Keywords
Pyramid networks; Attention mechanism; Multi-scale features; Chinese herbal recognition; Chinese herbs image datasets;
DOI
10.1016/j.patcog.2020.107558
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Chinese herbs play a critical role in Traditional Chinese Medicine. Because they differ in recognition granularity, they can be recognized accurately only by experienced professionals. It is expected that they can be recognized automatically using new techniques such as machine learning. However, no Chinese herbal image dataset is available, and no machine learning method handles Chinese herbal image recognition well. Therefore, this paper begins by building a new standard Chinese-Herbs dataset. Subsequently, new Attentional Pyramid Networks (APN) for Chinese herbal recognition are proposed, in which novel competitive attention and spatial collaborative attention are introduced and applied. APN can adaptively model Chinese herbal images with different feature scales. Finally, a new framework for Chinese herbal recognition is proposed as a new application of APN. Experiments conducted on the constructed dataset validate the effectiveness of the proposed methods. (c) 2020 Published by Elsevier Ltd.
Pages: 14