GPA-Net:No-Reference Point Cloud Quality Assessment With Multi-Task Graph Convolutional Network

被引:17
作者
Shan, Ziyu [1 ]
Yang, Qi [2 ]
Ye, Rui [1 ]
Zhang, Yujie [1 ]
Xu, Yiling [1 ]
Xu, Xiaozhong [2 ]
Liu, Shan [2 ]
机构
[1] Shanghai Jiao Tong Univ, Cooperat Media Innovat Ctr, Shanghai 200240, Peoples R China
[2] Tencent Media Lab, Shenzhen 518054, Peoples R China
基金
中国国家自然科学基金;
关键词
Point cloud compression; Measurement; Feature extraction; Distortion; Convolution; Task analysis; Multitasking; Graph convolutional network; multi-task learning; point cloud; quality assessment; MODEL;
D O I
10.1109/TVCG.2023.3282802
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
With the rapid development of 3D vision, point cloud has become an increasingly popular 3D visual media content. Due to the irregular structure, point cloud has posed novel challenges to the related research, such as compression, transmission, rendering and quality assessment. In these latest researches, point cloud quality assessment (PCQA) has attracted wide attention due to its significant role in guiding practical applications, especially in many cases where the reference point cloud is unavailable. However, current no-reference metrics which based on prevalent deep neural network have apparent disadvantages. For example, to adapt to the irregular structure of point cloud, they require preprocessing such as voxelization and projection that introduce extra distortions, and the applied grid-kernel networks, such as Convolutional Neural Networks, fail to extract effective distortion-related features. Besides, they rarely consider the various distortion patterns and the philosophy that PCQA should exhibit shift, scaling, and rotation invariance. In this paper, we propose a novel no-reference PCQA metric named the Graph convolutional PCQA network (GPA-Net). To extract effective features for PCQA, we propose a new graph convolution kernel, i.e., GPAConv, which attentively captures the perturbation of structure and texture. Then, we propose the multi-task framework consisting of one main task (quality regression) and two auxiliary tasks (distortion type and degree predictions). Finally, we propose a coordinate normalization module to stabilize the results of GPAConv under shift, scale and rotation transformations. Experimental results on two independent databases show that GPA-Net achieves the best performance compared to the state-of-the-art no-reference PCQA metrics, even better than some full-reference metrics in some cases.
引用
收藏
页码:4955 / 4967
页数:13
相关论文
共 46 条
  • [21] Multi-Task Y-Shaped Graph Neural Network for Point Cloud Learning in Autonomous Driving
    Zou, Xiaofeng
    Li, Kenli
    Li, Yangfan
    Wei, Wei
    Chen, Cen
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 9568 - 9579
  • [22] Attention based multi-task interpretable graph convolutional network for Alzheimer's disease analysis
    Jiang, Shunqin
    Feng, Qiyuan
    Li, Hengxin
    Deng, Zhenyun
    Jiang, Qinghong
    [J]. PATTERN RECOGNITION LETTERS, 2024, 180 : 1 - 8
  • [23] Exploring Contrast Multi-Attribute Representation With Deep Network for No-Reference Perceptual Quality Assessment
    Yang, Xiaodong
    Han, Zhenqi
    Wang, Yedong
    Liu, Lizhuang
    Zhao, Dan
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 902 - 906
  • [24] Robust multi-task learning network for complex LiDAR point cloud data preprocessing
    Zhao, Luda
    Hu, Yihua
    Yang, Xing
    Dou, Zhenglei
    Kang, Linshuang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [25] Learning structure of stereoscopic image for no-reference quality assessment with convolutional neural network
    Zhang, Wei
    Qu, Chenfei
    Ma, Lin
    Guan, Jingwei
    Huang, Rui
    [J]. PATTERN RECOGNITION, 2016, 59 : 176 - 187
  • [26] MULTI-TASK CENTER-OF-PRESSURE METRICS ESTIMATION FROM SKELETON USING GRAPH CONVOLUTIONAL NETWORK
    Du, Chen
    Graham, Sarah
    Jin, Shiwei
    Depp, Colin
    Truong Nguyen
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2313 - 2317
  • [27] Multi-task Temporal Convolutional Network for Predicting Water Quality Sensor Data
    Zhang, Yi-Fan
    Thorburn, Peter J.
    Fitch, Peter
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT IV, 2019, 1142 : 122 - 130
  • [28] Point Cloud Quality Assessment: Dataset Construction and Learning-based No-reference Metric
    Liu, Yipeng
    Yang, Qi
    Xu, Yiling
    Yang, Le
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [29] Improving cancer driver gene identification using multi-task learning on graph convolutional network
    Peng, Wei
    Tang, Qi
    Dai, Wei
    Chen, Tielin
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [30] MMMNet: An End-to-End Multi-Task Deep Convolution Neural Network With Multi-Scale and Multi-Hierarchy Fusion for Blind Image Quality Assessment
    Li, Fan
    Zhang, Yangfan
    Cosman, Pamela C.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (12) : 4798 - 4811