A Multi-Scale Hybrid Attention Network for Sentence Segmentation Line Detection in Dongba Scripture

Citations: 1
Authors
Xing, Junyao [1]
Bi, Xiaojun [2, 3]
Weng, Yu [2, 3]
Affiliations
[1] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin 150009, Peoples R China
[2] Minzu Univ China, Sch Informat & Engineerin, Beijing 100081, Peoples R China
[3] Key Lab Ethn Language Intelligent Anal & Secur Gov, Beijing 100081, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
hybrid attention mechanism; multi-scale depthwise convolution; multi-scale features; sentence segmentation line detection; Dongba scripture sentence segmentation line detection dataset;
DOI
10.3390/math11153392
Chinese Library Classification number
O1 [Mathematics];
Subject classification codes
0701; 070101;
Abstract
Sentence segmentation of Dongba scripture is a basic and important task in the digitization and machine translation of Dongba scripture. Dongba scripture sentence segmentation line detection (DS-SSLD), a core technology of sentence segmentation, is challenging because of the distinctive characteristics of the scripture, such as high inherent noise interference and nonstandard sentence segmentation lines. Projection-based methods have recently been adopted, but they struggle with two problems. The first is noise: the large amount of noise in Dongba scripture images interferes with the detection results. The second is an inherent characteristic of the scripture: many vertical lines in Dongba hieroglyphs are easily confused with the vertical sentence segmentation lines. This paper therefore proposes a module based on a convolutional neural network (CNN) to improve the accuracy of DS-SSLD. We first construct a labeled dataset for training and testing DS-SSLD, comprising 2504 real images collected from Dongba scripture books together with their sentence segmentation targets. We then propose a multi-scale hybrid attention network (Multi-HAN) based on YOLOv5s, in which a multiple hybrid attention unit (MHAU) enhances the distinction between important and redundant features, and a multi-scale cross-stage partial unit (Multi-CSPU) provides a richer, multi-scale feature representation. Experiments on the Dongba scripture sentence segmentation dataset we built show that the proposed method achieves excellent detection performance and outperforms several state-of-the-art methods.
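The second weakness of projection-based detection named in the abstract can be illustrated with a toy example. The sketch below is not the method of the paper or of any cited baseline; it is a generic vertical projection profile written with NumPy, and the function name `vertical_line_candidates`, the `min_fill` threshold, and the synthetic 10 x 12 page are all assumptions made for illustration.

```python
import numpy as np

def vertical_line_candidates(binary, min_fill=0.8):
    """Return column indices whose ink fraction is at least min_fill.

    binary: 2-D array with 1 for ink and 0 for background (assumed input).
    min_fill: hypothetical threshold on the fraction of inked rows per column.
    """
    # Vertical projection profile: fraction of inked pixels in each column.
    profile = binary.sum(axis=0) / binary.shape[0]
    return np.flatnonzero(profile >= min_fill)

# Synthetic page, 10 rows by 12 columns.
page = np.zeros((10, 12), dtype=np.uint8)
page[:, 5] = 1      # full-height stroke standing in for a segmentation line
page[1:10, 9] = 1   # tall vertical stroke standing in for part of a glyph
page[4:6, 2] = 1    # short stroke, well below the threshold

candidates = vertical_line_candidates(page)
strict = vertical_line_candidates(page, min_fill=1.0)
```

With the default threshold, both column 5 and column 9 are reported, so the tall glyph stroke is mistaken for a segmentation line. Raising `min_fill` to 1.0 removes the false positive here, but on a real page it would also reject any segmentation line that is broken or partly faded, which is consistent with the motivation the abstract gives for a learned detector.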
Pages: 18