Leveraging Known Data for Missing Label Prediction in Cultural Heritage Context

Cited by: 28
Authors
Belhi, Abdelhak [1, 2]
Bouras, Abdelaziz [1]
Foufou, Sebti [3]
Affiliations
[1] Qatar Univ, CSE, POB 2713, Doha, Qatar
[2] Univ Lumiere Lyon 2, DISP Lab, F-69500 Lyon, France
[3] Univ Burgundy, Lab Le2i, F-21000 Dijon, France
Source
APPLIED SCIENCES-BASEL | 2018 / Vol. 8 / Issue 10
Keywords
cultural heritage; convolutional neural networks; multimodal classification; digital heritage; digital preservation; ARTISTS;
DOI
10.3390/app8101768
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Cultural heritage represents a reliable medium for history and knowledge transfer. Cultural heritage assets are often exhibited in museums and heritage sites all over the world. However, many assets are poorly labeled, which decreases their historical value. If an asset's history is lost, its historical value is also lost. The classification and annotation of overlooked or incomplete cultural assets increase their historical value and allow the discovery of various types of historical links. In this paper, we tackle the challenge of automatically classifying and annotating cultural heritage assets using their visual features as well as the available metadata. Traditional approaches rely mainly on image data alone and on machine-learning-based techniques to predict missing labels. Often, however, visual data are not the only information available. We present a novel multimodal classification approach for cultural heritage assets that relies on a multitask neural network in which a convolutional neural network (CNN) is designed for visual feature learning and a regular neural network is used for textual feature learning. These networks are merged and trained using a shared loss. The combined networks rely on both image and textual features to achieve better asset classification. Initial tests on painting assets showed that our approach performs better than traditional CNNs that rely only on images as input.
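The two-branch design described in the abstract can be illustrated with a minimal forward-pass sketch in plain Python. This is not the authors' implementation: the layer sizes, the random weights, the 8x8 grayscale input, the 4-value metadata vector, the single classification head, and every function name below are assumptions made for illustration. The "shared loss" is taken here to be one cross-entropy computed on the fused output, so that in training its gradient would reach both branches; the sketch does not implement training.

```python
import math
import random


def conv2d_valid(image, kernel):
    """Valid 2-D cross-correlation of a grayscale image with one kernel."""
    kh, kw = len(kernel), len(kernel[0])
    out_h, out_w = len(image) - kh + 1, len(image[0]) - kw + 1
    return [[sum(image[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(out_w)] for i in range(out_h)]


def dense(inputs, weights, biases):
    """Fully connected layer; weights holds one row per output unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def relu(values):
    return [max(0.0, v) for v in values]


def softmax(logits):
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def visual_branch(image, kernels):
    """Toy stand-in for the CNN: convolution, ReLU, global average pooling."""
    features = []
    for kernel in kernels:
        activated = relu([v for row in conv2d_valid(image, kernel) for v in row])
        features.append(sum(activated) / len(activated))
    return features


def textual_branch(metadata, weights, biases):
    """Toy stand-in for the regular (dense) network on encoded metadata."""
    return relu(dense(metadata, weights, biases))


def multimodal_forward(image, metadata, params):
    """Concatenate both feature vectors, then classify with one dense layer."""
    fused = (visual_branch(image, params["kernels"])
             + textual_branch(metadata, params["text_w"], params["text_b"]))
    return softmax(dense(fused, params["out_w"], params["out_b"]))


def cross_entropy(probs, target):
    """Loss on the fused prediction (assumed form of the shared loss)."""
    return -math.log(probs[target])


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


# Hypothetical sizes: 2 kernels of 3x3, 4 metadata inputs, 3 hidden text units,
# 3 output classes. The fused vector has 2 + 3 = 5 entries.
rng = random.Random(0)
params = {
    "kernels": [random_matrix(3, 3, rng) for _ in range(2)],
    "text_w": random_matrix(3, 4, rng),
    "text_b": [0.0, 0.0, 0.0],
    "out_w": random_matrix(3, 5, rng),
    "out_b": [0.0, 0.0, 0.0],
}
image = [[rng.random() for _ in range(8)] for _ in range(8)]  # placeholder pixels
metadata = [1.0, 0.0, 0.0, 1.0]  # placeholder encoded metadata fields

probs = multimodal_forward(image, metadata, params)
loss = cross_entropy(probs, target=2)
```

The point of the sketch is the fusion step: because the loss is computed after concatenation, a model of this shape can use metadata to separate classes that look similar in the image alone. A practical version would use a deep learning framework and learned weights rather than the random values shown here.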
Pages: 19