PFCNet: Enhancing Rail Surface Defect Detection With Pixel-Aware Frequency Conversion Networks

Times cited: 0
Authors
Wu, Yue [1]
Qiang, Fangfang [1]
Zhou, Wujie [1]
Yan, Weiqing [2]
Affiliations
[1] Zhejiang Univ Sci & Technol, Sch Informat & Elect Engn, Hangzhou 310023, Peoples R China
[2] Yantai Univ, Sch Comp & Technol, Yantai 264005, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Convolution; Feature extraction; Rails; Object detection; Frequency conversion; Surface treatment; Semantics; Inspection; Frequency-domain analysis; Accuracy; Rail defect detection; frequency feature aggregation; DCT transform;
DOI
10.1109/LSP.2025.3525855
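Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract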
Applying computer vision techniques to rail surface defect detection (RSDD) is crucial for preventing catastrophic accidents. However, challenges such as complex backgrounds and irregular defect shapes persist. Previous methods have focused on extracting salient object information from a pixel perspective, thereby neglecting valuable high- and low-frequency image information, which can better capture global structural information. In this study, we design a pixel-aware frequency conversion network (PFCNet) to explore RSDD from a frequency domain perspective. We use different attention mechanisms and frequency enhancement for high-level and shallow features to explore local details and global structures comprehensively. In addition, we design a dual-control reorganization module to refine the features across levels. We conducted extensive experiments on an industrial RGB-D dataset (NEU RSDDS-AUG), and PFCNet achieved superior performance.
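As an illustration of the frequency-domain view the abstract refers to, the following is a minimal sketch of splitting a grayscale image into low- and high-frequency parts with a 2D DCT. It is not PFCNet's architecture: the function names `dct_matrix` and `split_frequencies`, the square `cutoff` mask, and the random test image are all assumptions made for this example.

```python
import numpy as np


def dct_matrix(n):
    """Return the orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]  # frequency index (rows)
    i = np.arange(n)[None, :]  # sample index (columns)
    mat = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    mat[0, :] = np.sqrt(1.0 / n)  # DC row uses a different scale factor
    return mat


def split_frequencies(image, cutoff):
    """Split a 2D array into low- and high-frequency components.

    `cutoff` is a hypothetical parameter: coefficients whose row and
    column indices are both below it count as low frequency.
    """
    h, w = image.shape
    dh, dw = dct_matrix(h), dct_matrix(w)
    coeffs = dh @ image @ dw.T  # forward 2D DCT

    mask = np.zeros((h, w), dtype=bool)
    mask[:cutoff, :cutoff] = True  # top-left block holds low frequencies

    # Inverse 2D DCT of each masked coefficient set
    low = dh.T @ np.where(mask, coeffs, 0.0) @ dw
    high = dh.T @ np.where(mask, 0.0, coeffs) @ dw
    return low, high


# Illustrative input only: a random 16 x 16 "image"
rng = np.random.default_rng(0)
image = rng.random((16, 16))
low, high = split_frequencies(image, cutoff=4)
```

Because the transform is orthonormal and the two masks are complementary, `low + high` reconstructs the input. The low-frequency part carries coarse global structure and the high-frequency part carries edges and fine detail, which is the distinction the abstract draws on.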
Pages: 606-610
Number of pages: 5