Understanding Point Clouds of Electrical Substations Using a Multi-View Multi-Scale Fusion With Transformer

被引:0
作者
Wang, Xinghua [1 ]
Meng, Yanxi [2 ]
Dong, Hantuo [3 ]
Jia, Tao [2 ]
机构
[1] Guangdong Power Grid Co Ltd, Dept Infrastruct, Guangzhou 510050, Peoples R China
[2] Wuhan Univ, Sch Remote Sensing Informat Engn, Wuhan 430072, Peoples R China
[3] Guangdong Power Grid Co Ltd, Planning Res Ctr, Guangzhou 510699, Peoples R China
关键词
Three-dimensional displays; Point cloud compression; Substations; Feature extraction; Solid modeling; Deep learning; Accuracy; Transformers; Convolution; Attention mechanisms; multi-view multi-scale fusion; transformer; 3D object classification; electrical substation point clouds;
D O I
10.1109/ACCESS.2025.3578226
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a deep learning-based network model, relying on multi-view multi-scale fusion with transformer, for understanding semantical information of electrical substation point clouds. Our method is inspired by human recognition of 3D objects using information from multiple perspectives and with multiple levels of details, and it is constructed by a dynamic integration of multi-view 3D morphological representation and multi-scale 3D geometric characterization. Additionally, we leverage the channel and spatial attention mechanism to capture the relationship between morphological representations at multiple views and utilize the self and cross attention mechanism to understand the relationship between geometric characterization at multiple scales. To verify our model, experiments were conducted based on our substation point clouds dataset and two benchmark datasets. Our model shows a better performance than the state-of-the-art methods, and it achieves an overall accuracy of 91.15% on our dataset, 93.30% on ModelNet40, and 92.80% on ModelNet40-C, indicating its effectiveness and robustness.
引用
收藏
页码:100492 / 100503
页数:12
相关论文
共 33 条
[1]   Automatic Object Extraction from Electrical Substation Point Clouds [J].
Arastounia, Mostafa ;
Lichti, Derek D. .
REMOTE SENSING, 2015, 7 (11) :15605-15629
[2]  
Cabrera JJ, 2024, Arxiv, DOI arXiv:2403.07593
[3]   4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks [J].
Choy, Christopher ;
Gwak, JunYoung ;
Savarese, Silvio .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3070-3079
[4]   PointVector: A Vector Representation In Point Cloud Analysis [J].
Deng, Xin ;
Zhang, Wenyu ;
Ding, Qing ;
Zhang, XinMing .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :9455-9465
[5]   Deep Learning for LiDAR Point Cloud Classification in Remote Sensing [J].
Diab, Ahmed ;
Kashef, Rasha ;
Shaker, Ahmed .
SENSORS, 2022, 22 (20)
[6]   Comprehensive Review of Deep Learning-Based 3D Point Cloud Completion Processing and Analysis [J].
Fei, Ben ;
Yang, Weidong ;
Chen, Wen-Ming ;
Li, Zhijun ;
Li, Yikang ;
Ma, Tao ;
Hu, Xing ;
Ma, Lipeng .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) :22862-22883
[7]   GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition [J].
Feng, Yifan ;
Zhang, Zizhao ;
Zhao, Xibin ;
Ji, Rongrong ;
Gao, Yue .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :264-272
[8]   Point-cloud based 3D object detection and classification methods for self-driving applications: A survey and taxonomy [J].
Fernandes, Duarte ;
Silva, Antonio ;
Nevoa, Rafael ;
Simoes, Claudia ;
Gonzalez, Dibet ;
Guevara, Miguel ;
Novais, Paulo ;
Monteiro, Joao ;
Melo-Pinto, Pedro .
INFORMATION FUSION, 2021, 68 :161-191
[9]  
Graham B, 2015, Arxiv, DOI arXiv:1505.02890
[10]   3D Semantic Segmentation with Submanifold Sparse Convolutional Networks [J].
Graham, Benjamin ;
Engelcke, Martin ;
van der Maaten, Laurens .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9224-9232