GCA-PVT-Net: group convolutional attention and PVT dual-branch network for oracle bone drill chisel segmentation

Cited by: 4
Authors
Liu, Guoqi [1, 2, 3]
Yang, Yiping [1, 3]
Li, Xueshan [3, 4]
Liu, Dong [1, 2]
Ru, Linyuan [1, 3]
Han, Yanbiao [3, 4]
Affiliations
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Henan, Peoples R China
[2] Henan Normal Univ, Big Data Engn Lab Teaching Resource & accessment E, Xinxiang 453007, Henan, Peoples R China
[3] Oracle Bone Intelligent Comp Lab, Xinxiang 453007, Henan, Peoples R China
[4] Henan Normal Univ, Coll Hist & Culture, Xinxiang 453007, Henan, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Oracle bone drill chisel segmentation; Transformer; Convolutional neural networks; Pyramid vision transformer;
DOI
10.1186/s40494-024-01378-z
Chinese Library Classification (CLC) number
C [Social Sciences, General];
Discipline classification code
03; 0303;
Abstract
Oracle bones (OBs), primarily tortoise shells and animal bones, are a significant carrier of Shang dynasty civilization. Studying them gives a deeper understanding of the political, economic, religious, and cultural aspects of the Shang dynasty. The oracle bone drill chisel (OBDC) is considered an essential non-textual material. Segmenting the OBDC helps archaeologists determine the approximate age of the OBs, which has considerable research value. However, the breakage of OBs buried underground for thousands of years, the blurred edges of the areas burned at the OBDC, the varying shapes, and the inconsistent number of drill chisels make accurate OBDC segmentation challenging. In this article, we propose a group convolutional attention and PVT dual-branch network (GCA-PVT-Net) for OBDC segmentation, a hybrid convolutional neural network (CNN) and Transformer framework. To our knowledge, this paper is the first to study the automatic segmentation of the OBDC. The work offers the following contributions: (1) The OBDC images are labeled according to the delineation criteria for different drill chisel (DC) shapes to create the OBDC dataset. (2) A convolutional attention module (CAM) is proposed as both an encoder and a decoder. Its feature extraction process effectively integrates global and local information, which ensures better modeling of long-range correlations in images while preserving details. (3) A channel feature aggregation module (CFAM) is designed to enhance the integration of channel features, enabling feature fusion across branches and at different levels. (4) An edge deep supervision strategy is applied at the decoder's end to smooth the jagged edges of the predicted images. Extensive experiments on the OBDC dataset show that GCA-PVT-Net outperforms other state-of-the-art (SOTA) methods. In the comparative experiments, the model ranks first in both edge accuracy and segmentation accuracy.
Pages: 18