A Multi-Scale Hybrid Attention Network for Sentence Segmentation Line Detection in Dongba Scripture

被引:1
作者
Xing, Junyao [1 ]
Bi, Xiaojun [2 ,3 ]
Weng, Yu [2 ,3 ]
机构
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin 150009, Peoples R China
[2] Minzu Univ China, Sch Informat & Engineerin, Beijing 100081, Peoples R China
[3] Key Lab Ethn Language Intelligent Anal & Secur Gov, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
hybrid attention mechanism; multi-scale depthwise convolution; multi-scale features; sentence segmentation line detection; Dongba scripture sentence segmentation line detection dataset;
D O I
10.3390/math11153392
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Dongba scripture sentence segmentation is an important and basic work in the digitization and machine translation of Dongba scripture. Dongba scripture sentence segmentation line detection (DS-SSLD) as a core technology of Dongba scripture sentence segmentation is a challenging task due to its own distinctiveness, such as high inherent noise interference and nonstandard sentence segmentation lines. Recently, projection-based methods have been adopted. However, these methods are difficult when dealing with the following two problems. The first is the noisy problem, where a large number of noise in the Dongba scripture image interference detection results. The second is the Dongba scripture inherent characteristics, where many vertical lines in Dongba hieroglyphs are easily confused with the vertical sentence segmentation lines. Therefore, this paper aims to propose a module based on the convolutional neural network (CNN) to improve the accuracy of DS-SSLD. To achieve this, we first construct a tagged dataset for training and testing DS-SSLD, including 2504 real images collected from Dongba scripture books and sentence segmentation targets. Then, we propose a multi-scale hybrid attention network (Multi-HAN) based on YOLOv5s, where a multiple hybrid attention unit (MHAU) is used to enhance the distinction between important features and redundant features, and the multi-scale cross-stage partial unit (Multi-CSPU) is used to realize multi-scale and richer feature representation. The experiment is carried out on the Dongba scripture sentence segmentation dataset we built. The experimental results show that the proposed method exhibits excellent detection performance and outperforms several state-of-the-art methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Hyperspectral image classification based on multi-scale hybrid convolutional network
    Yang, Yun
    Zhou, Yao
    Chen, Jia-ning
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (03) : 368 - 377
  • [22] Parathyroid Gland Detection Based on Multi-Scale Weighted Fusion Attention Mechanism
    Liu, Wanling
    Lu, Wenhuan
    Li, Yijian
    Chen, Fei
    Jiang, Fan
    Wei, Jianguo
    Wang, Bo
    Zhao, Wenxin
    ELECTRONICS, 2025, 14 (06):
  • [23] MSDMAT-BTS: Multi-scale diffusion model and attention mechanism for brain tumor segmentation
    Gao, Tao
    Hu, Weijie
    Chen, Mingzhi
    Chen, Lingna
    Jiang, Hui
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 104
  • [24] Construction Vehicle Detection Method Based on Multi-Scale Residual Network
    Liu, Liangshuai
    Chen, Ze
    She, Kai
    Ji, Yanpeng
    Feng, Haiyan
    Ni, Yong
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 1399 - 1405
  • [25] Multi-scale Prototypical Network for Few-shot Anomaly Detection
    Wu, Jingkai
    Jiang, Weijie
    Huang, Zhiyong
    Lin, Qifeng
    Zheng, Qinghai
    Liang, Yi
    Yu, Yuanlong
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 1067 - 1076
  • [26] LMANet: A Lightweight Asymmetric Semantic Segmentation Network Based on Multi-Scale Feature Extraction
    Chen, Hui
    Xiao, Zhexuan
    Ge, Bin
    Li, Xuedi
    ELECTRONICS, 2024, 13 (17)
  • [27] MTC-Net: Multi-scale feature fusion network for medical image segmentation
    Ren S.
    Wang Y.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04) : 8729 - 8740
  • [28] Multi-Scale Feature Channel Attention Generative Adversarial Network for Face Sketch Synthesis
    Zheng, Jieying
    Wu, Yahong
    Song, Wanru
    Xu, Ran
    Liu, Feng
    IEEE ACCESS, 2020, 8 : 146754 - 146769
  • [29] Multi-scale attention guided network for end-to-end face alignment and recognition
    Shakeel, M. Saad
    Zhang, Yuxuan
    Wang, Xin
    Kang, Wenxiong
    Mahmood, Arif
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 88
  • [30] Multi-level and multi-scale deep saliency network for salient object detection
    Zhang, Qing
    Lin, Jiajun
    Zhuge, Jingling
    Yuan, Wenhao
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 415 - 424