Unbiased 3D Semantic Scene Graph Prediction in Point Cloud Using Deep Learning

Cited by: 4

Authors:

Han, Chaolin [1]

Li, Hongwei [1]

Xu, Jian [1]

Dong, Bing [1]

Wang, Yalin [1]

Zhou, Xiaowen [1]

Zhao, Shan [1]

Affiliations:

[1] Zhengzhou Univ, Sch Geosci & Technol, Zhengzhou 450001, Peoples R China

Source:

APPLIED SCIENCES-BASEL | 2023 / Vol. 13 / Issue 09

Funding:

National Natural Science Foundation of China;

Keywords:

scene understanding; deep learning; 3D scene graph; prior knowledge; point cloud;

DOI:

10.3390/app13095657

Chinese Library Classification number:

O6 [Chemistry];

Discipline classification code:

0703;

Abstract:

As a core task of computer vision perception, 3D scene understanding has received widespread attention. However, current research focuses mainly on semantic understanding at the level of individual objects and often neglects the semantic relationships between objects in the scene. This paper proposes a deep-learning-based 3D scene graph prediction model that takes scanned point cloud data of indoor scenes and predicts a semantic graph of object classes and their relationships. The model uses a multi-scale pyramidal feature extraction network, MP-DGCNN, and fuses its features with learned category-related unbiased meta-embedding vectors. Relationship inference on the scene graph uses an ENA-GNN network that incorporates node and edge cross-attention. In addition, to account for the long-tail distribution effect, a category grouping re-weighting scheme is applied to the embedded prior knowledge and the loss function. Experiments on the indoor point cloud dataset 3DSSG show that the proposed model performs well against the latest baseline model, with substantially improved prediction effectiveness and accuracy.

Pages: 21

