Heri-Graphs: A Dataset Creation Framework for Multi-Modal Machine Learning on Graphs of Heritage Values and Attributes with Social Media

被引:13
作者
Bai, Nan [1 ]
Nourian, Pirouz [2 ]
Luo, Renqian [3 ]
Roders, Ana Pereira [1 ]
机构
[1] Delft Univ Technol, UNESCO Chair Heritage & Values Heritage & Reshapi, NL-2628 BL Delft, Netherlands
[2] Delft Univ Technol, Genesis Lab Generat Design & Generat Sci, NL-2628 BL Delft, Netherlands
[3] Microsoft Res, Beijing 100080, Peoples R China
关键词
World Heritage; Flickr; multi-modal dataset; graph construction; machine and deep learning; USER-GENERATED CONTENT; SPACE SYNTAX; TOURISM; ANALYTICS; NETWORKS; SYSTEM;
D O I
10.3390/ijgi11090469
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Values (why to conserve) and Attributes (what to conserve) are essential concepts of cultural heritage. Recent studies have been using social media to map values and attributes conveyed by the public to cultural heritage. However, it is rare to connect heterogeneous modalities of images, texts, geo-locations, timestamps, and social network structures to mine the semantic and structural characteristics therein. This study presents a methodological framework for constructing such multi-modal datasets using posts and images on Flickr for graph-based machine learning (ML) tasks concerning heritage values and attributes. After data pre-processing using pre-trained ML models, the multi-modal information of visual contents and textual semantics are modelled as node features and labels, while their social relationships and spatio-temporal contexts are modelled as links in Multi-Graphs. The framework is tested in three cities containing UNESCO World Heritage properties-Amsterdam, Suzhou, and Venice- which yielded datasets with high consistency for semi-supervised learning tasks. The entire process is formally described with mathematical notations, ready to be applied in provisional tasks both as ML problems with technical relevance and as urban/heritage study questions with societal interests. This study could also benefit the understanding and mapping of heritage values and attributes for future research in global cases, aiming at inclusive heritage management practices. Moreover, the proposed framework could be summarized as creating attributed graphs from unstructured social media data sources, ready to be applied in a wide range of use cases.
引用
收藏
页数:38
相关论文
共 125 条
[101]   Instagram, Flickr, or Twitter: Assessing the usability of social media data for visitor monitoring in protected areas [J].
Tenkanen, Henrikki ;
Di Minin, Enrico ;
Heikinheimo, Vuokko ;
Hausmann, Anna ;
Herbst, Marna ;
Kajala, Liisa ;
Toivonen, Tuuli .
SCIENTIFIC REPORTS, 2017, 7
[102]   COMPUTER MOVIE SIMULATING URBAN GROWTH IN DETROIT REGION [J].
TOBLER, WR .
ECONOMIC GEOGRAPHY, 1970, 46 (02) :234-240
[103]  
UNESCO, 2020, HER URB CONT IMP DEV
[104]  
UNESCO, 1972, CONVENTION PROTECTIO, DOI DOI 10.1111/J.1468-0033.1973.TB02056.X
[105]  
Urry John, 1990, The Tourist Gaze
[106]  
Valese M., 2020, INT ARCH PHOTOGRAMM, VXLIII-B4-2020, P81, DOI [10.5194/isprs-archives-XLIII-B4-2020-81-2020, DOI 10.5194/ISPRS-ARCHIVES-XLIII-B4-2020-81-2020]
[107]   Flickr and the culture of connectivity: Sharing views, experiences, memories [J].
van Dijck, Jose .
MEMORY STUDIES, 2011, 4 (04) :401-415
[108]  
Vaswani A., 2017, C WORKSHOP NEURAL IN, P6000
[109]  
Veldpaus L., 2015, Phd Thesis
[110]   LEARNING FROM A LEGACY Venice to Valletta [J].
Veldpaus, Loes ;
Roders, Ana Pereira .
CHANGE OVER TIME-AN INTERNATIONAL JOURNAL OF CONSERVATION AND THE BUILT ENVIRONMENT, 2014, 4 (02) :244-263