DeepDiffusion: Unsupervised Learning of Retrieval-Adapted Representations via Diffusion-Based Ranking on Latent Feature Manifold

被引:2
作者
Furuya, Takahiko [1 ]
Ohbuchi, Ryutarou [1 ]
机构
[1] Univ Yamanashi, Dept Comp Sci & Engn, Kofu, Yamanashi 4008511, Japan
基金
日本学术振兴会;
关键词
Manifolds; Multimedia communication; Deep learning; Measurement; Feature extraction; Unsupervised learning; Representation learning; Unsupervised representation learning; multimedia information retrieval; deep learning;
D O I
10.1109/ACCESS.2022.3218909
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unsupervised learning of feature representations is a challenging yet important problem for analyzing a large collection of multimedia data that do not have semantic labels. Recently proposed neural network-based unsupervised learning approaches have succeeded in obtaining features appropriate for classification of multimedia data. However, unsupervised learning of feature representations adapted to content-based matching, comparison, or retrieval of multimedia data has not been explored well. To obtain such retrieval-adapted features, we introduce the idea of combining diffusion distance on a feature manifold with neural network-based unsupervised feature learning. This idea is realized as a novel algorithm called DeepDiffusion (DD). DD simultaneously optimizes two components, a feature embedding by a deep neural network and a distance metric that leverages diffusion on a latent feature manifold, together. DD relies on its loss function but not encoder architecture. It can thus be applied to diverse multimedia data types with their respective encoder architectures. Experimental evaluation using 3D shapes and 2D images demonstrates versatility as well as high accuracy of the DD algorithm. Code is available at https://github.com/takahikof/DeepDiffusion
引用
收藏
页码:116287 / 116301
页数:15
相关论文
共 62 条
[1]  
Abadi M, 2016, Large-scale machine learning on heterogeneous systems
[2]  
Achlioptas P, 2018, PR MACH LEARN RES, V80
[3]  
[Anonymous], 2007, Recent advances in data mining of enterprise data: algorithms and applications, DOI DOI 10.1142/9789812779861_0015
[4]  
[Anonymous], 2017, PR MACH LEARN RES
[5]  
[Anonymous], 2015, P INT C LEARN REPR
[6]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[7]   Deep Metric Learning to Rank [J].
Cakir, Fatih ;
He, Kun ;
Xia, Xide ;
Kulis, Brian ;
Sclaroff, Stan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1861-1870
[8]  
Caron M, 2020, ADV NEUR IN, V33
[9]   Deep Clustering for Unsupervised Learning of Visual Features [J].
Caron, Mathilde ;
Bojanowski, Piotr ;
Joulin, Armand ;
Douze, Matthijs .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :139-156
[10]  
Chen T, 2020, PR MACH LEARN RES, V119