UFNet: A Multi-scale Fusion Feature based Text Detection Method

被引:0
|
作者
Chai, Zhengpeng [1 ]
Zhu, Rui [1 ]
Wang, Wei [2 ]
机构
[1] Wuhan Inst Technol, Sch Comp Sci & Engn, Wuhan, Peoples R China
[2] Wuhan 1 Hosp, Wuhan 430205, Peoples R China
来源
2023 THE 6TH INTERNATIONAL CONFERENCE ON ROBOT SYSTEMS AND APPLICATIONS, ICRSA 2023 | 2023年
关键词
feature fusion; binarization; segmentation network; text detection;
D O I
10.1145/3655532.3655558
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, the field of text detection has witnessed a growing trend, with more and more segmentation-based methods incorporating feature sampling. Segmentation methods possess a natural advantage in detecting text with both regular and irregular shapes due to their ability to effectively segment diverse targets and backgrounds that exhibit significant differences.The common sampling method in networks is typically the Feature Pyramid Network (FPN), which is used to match different dimensions for detecting the scale of images. However, due to the inherent limitations of scene text, such as variations in aspect ratio, dense text, and differences in width-to-height ratio, these general sampling methods (FPN) may not effectively address these issues. To ease this problem, we have proposed a novel network architecture called Unified Feature Fusion Network (UFNet), which integrates feature sampling. Compared to the DBU network, UFNet achieves significantly better performance in terms of accuracy and recall on English text detection datasets such as ICDAR2015 and the mixed English-Chinese dataset MSRA-TD500. Text detection results indicate that this algorithm solves the problem of poor performance in handling variations in aspect ratio and width-to-height ratio in images.
引用
收藏
页码:163 / 168
页数:6
相关论文
共 50 条
  • [1] Text Detection Algorithm Based on Multi-Scale Attention Feature Fusion
    She, Xiangyang
    Liu, Zhe
    Dong, Lihong
    Computer Engineering and Applications, 2024, 60 (01) : 198 - 206
  • [2] Hierarchical Feature Fusion With Text Attention For Multi-scale Text Detection
    Liu, Chao
    Zou, Yuexian
    Guan, Wenjie
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [3] A Multi-Scale Natural Scene Text Detection Method Based on Attention Feature Extraction and Cascade Feature Fusion
    Li, Nianfeng
    Wang, Zhenyan
    Huang, Yongyuan
    Tian, Jia
    Li, Xinyuan
    Xiao, Zhiguo
    SENSORS, 2024, 24 (12)
  • [4] Scene Text Detection Based on Multi-Scale Pooling and Bidirectional Feature Fusion
    Wei, Zheliang
    Li, Yueyang
    Luo, Haichi
    Computer Engineering and Applications, 2024, 60 (02) : 154 - 161
  • [5] Bridge crack detection method based on multi-scale feature fusion
    Wang, Yubian
    Zou, Chengzheng
    Song, Yajuan
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ALGORITHMS, SOFTWARE ENGINEERING, AND NETWORK SECURITY, ASENS 2024, 2024, : 559 - 562
  • [6] Drone Detection Based on Multi-scale Feature Fusion
    Zeng, Zhenni
    Wang, Zhenning
    Qin, Lang
    Li, Hui
    2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 194 - 198
  • [7] Vehicle detection method based on adaptive multi-scale feature fusion network
    Shen, Xuanjing
    Li, Hanyu
    Huang, Yongping
    Wang, Yu
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [8] Feature Adaptation and Multi-scale Attentional Fusion Based Anomaly Detection Method
    Zhang, Hanyuan
    Hao, Yingguang
    Wang, Hongyu
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1518 - 1523
  • [9] MTMFNet: multi-threshold and multi-scale feature fusion network for text detection
    Dai, Lei
    Gao, Wen
    Tang, Chengyu
    Wang, Min
    Chen, Zhihua
    VISUAL COMPUTER, 2025,
  • [10] UAV reaction detection based on multi-scale feature fusion
    He, Jianfeng
    Liu, Ming
    Yu, Chuanjiang
    2022 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, COMPUTER VISION AND MACHINE LEARNING (ICICML), 2022, : 640 - 643