No-reference mesh visual quality assessment via ensemble of convolutional neural networks and compact multi-linear pooling

被引:44
作者
Abouelaziz, Ilyass [1 ]
Chetouani, Aladine [2 ]
El Hassouni, Mohammed [5 ]
Latecki, Longin Jan [3 ]
Cherifi, Hocine [4 ]
机构
[1] Mohammed V Univ Rabat, Rabat, Morocco
[2] Univ Orleans, PRISME Lab, Orleans, France
[3] Temple Univ, Dept Comp & Informat Sci, Philadelphia, PA 19122 USA
[4] Univ Burgundy, LE2I, UMR CNRS 6306, Dijon, France
[5] Mohammed V Univ Rabat, FLSH, Rabat, Morocco
关键词
Blind mesh quality assessment; Convolutional neural network; Fine-tuning; Compact multi-linear pooling; Visual saliency; ERROR; METRICS; COMPRESSION; ATTENTION; MODEL;
D O I
10.1016/j.patcog.2019.107174
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Blind or No reference quality evaluation is a challenging issue since it is done without access to the original content. In this work, we propose a method based on deep learning for the mesh visual quality assessment without reference. For a given 3D model, we first compute its mesh saliency. Then, we extract views from the 3D mesh and the corresponding mesh saliency. After that, the views are split into small patches that are filtered using a saliency threshold. Only the salient patches are selected and used as input data. After that, three pre-trained deep convolutional neural networks are employed for feature learning: VGG, AlexNet, and ResNet. Each network is fine-tuned and produces a feature vector. The Compact Multi-linear Pooling (CMP) is used afterward to fuse the retrieved vectors into a global feature representation. Finally, fully connected layers followed by a regression module are used to estimate the quality score. Extensive experiments are executed on four mesh quality datasets and comparisons with existing methods demonstrate the effectiveness of our method in terms of correlation with subjective scores. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 52 条
[1]  
Abouelaziz I., 2018, Electronic Imaging, V2018, P423
[2]  
Abouelaziz I, 2018, IEEE IMAGE PROC, P3533, DOI 10.1109/ICIP.2018.8451763
[3]   Blind 3D mesh visual quality assessment using support vector regression [J].
Abouelaziz, Ilyass ;
El Hassouni, Mohammed ;
Cherifi, Hocine .
MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (18) :24365-24386
[4]  
Abouelaziz I, 2017, IEEE IMAGE PROC, P755, DOI 10.1109/ICIP.2017.8296382
[5]   A Curvature based method for blind mesh visual quality assessment using a general regression neural network [J].
Abouelaziz, Ilyass ;
El Hassouni, Mohammed ;
Cherifi, Hocine .
2016 12TH INTERNATIONAL CONFERENCE ON SIGNAL-IMAGE TECHNOLOGY & INTERNET-BASED SYSTEMS (SITIS), 2016, :793-797
[6]   No-Reference 3D Mesh Quality Assessment Based on Dihedral Angles Model and Support Vector Regression [J].
Abouelaziz, Ilyass ;
El Hassouni, Mohammed ;
Cherifi, Hocine .
IMAGE AND SIGNAL PROCESSING (ICISP 2016), 2016, 9680 :369-377
[7]   Multispectral Periocular Classification With Multimodal Compact Multi-Linear Pooling [J].
Algashaam, Faisal M. ;
Kien Nguyen ;
Alkanhal, Mohamed ;
Chandran, Vinod ;
Boles, Wageeh ;
Banks, Jasmine .
IEEE ACCESS, 2017, 5 :14572-14578
[8]   Recent advances in compression of 3D meshes [J].
Alliez, P ;
Gotsman, C .
ADVANCES IN MULTIRESOLUTION FOR GEOMETRIC MODELLING, 2005, :3-+
[9]  
[Anonymous], 2016, PROC C EMPIRICAL MET
[10]  
[Anonymous], PATTERN RECOGNITION