Diffusion Models as Data Mining Tools

Cited by: 0

Authors:

Siglidis, Ioannis [1]

Holynski, Aleksander [2]

Efros, Alexei A. [2]

Aubry, Mathieu [1]

Ginosar, Shiry [2]

Affiliations:

[1] Univ Gustave Eiffel, LIGM, CNRS, Ecole Ponts, Marne La Vallee, France

[2] Univ Calif Berkeley, Berkeley, CA 94720 USA

Source:

COMPUTER VISION - ECCV 2024, PT LXI | 2025 / Vol. 15119

Funding:

European Research Council;

Keywords:

Visual Data Mining; Diffusion Models;

DOI:

10.1007/978-3-031-73030-6_22

Chinese Library Classification:

TP18 [Artificial Intelligence Theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper demonstrates how to use generative models trained for image synthesis as tools for visual data mining. Our insight is that since contemporary generative models learn an accurate representation of their training data, we can use them to summarize the data by mining for visual patterns. Concretely, we show that after finetuning conditional diffusion models to synthesize images from a specific dataset, we can use these models to define a typicality measure on that dataset. This measure assesses how typical visual elements are for different data labels, such as geographic location, time stamps, semantic labels, or even the presence of a disease. This analysis-by-synthesis approach to data mining has two key advantages. First, it scales much better than traditional correspondence-based approaches since it does not require explicitly comparing all pairs of visual elements. Second, while most previous works on visual data mining focus on a single dataset, our approach works on diverse datasets in terms of content and scale, including a historical car dataset, a historical face dataset, a large worldwide street-view dataset, and an even larger scene dataset. Furthermore, our approach allows for translating visual elements across class labels and analyzing consistent changes. Project page: https://diff-mining.github.io/.
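
The abstract does not spell out how the typicality measure is computed. As a rough illustration only, the sketch below assumes one plausible reading: an element is typical of a label when conditioning the denoiser on that label reduces its noise-prediction error relative to the unconditional prediction. The names `typicality`, `toy_denoiser`, `MEANS`, and `ALPHAS_BAR` are hypothetical, and the toy denoiser is a hand-written stand-in for a finetuned conditional diffusion model.

```python
import math
import random

# Hypothetical toy setup: two labels, each a point mass at a known mean.
MEANS = {"A": [2.0, 0.0], "B": [-2.0, 0.0]}
# Assumed cumulative noise schedule (values strictly between 0 and 1).
ALPHAS_BAR = [0.9, 0.7, 0.5, 0.3, 0.1]


def toy_denoiser(x_t, t, label):
    """Stand-in for a conditional noise predictor eps_theta(x_t, t, label).

    With label=None it plays the role of the unconditional model and
    uses the average of the class means instead of a specific class mean.
    """
    a = ALPHAS_BAR[t]
    if label is None:
        mean = [sum(m[i] for m in MEANS.values()) / len(MEANS)
                for i in range(len(x_t))]
    else:
        mean = MEANS[label]
    return [(xi - math.sqrt(a) * mi) / math.sqrt(1.0 - a)
            for xi, mi in zip(x_t, mean)]


def sq_err(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((ui - vi) ** 2 for ui, vi in zip(u, v))


def typicality(x, label, denoiser, alphas_bar, n_samples=200, seed=0):
    """Monte Carlo estimate of (unconditional error - conditional error).

    Assumption: a positive value means the label helps explain x.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        t = rng.randrange(len(alphas_bar))
        a = alphas_bar[t]
        eps = [rng.gauss(0.0, 1.0) for _ in x]
        # Forward diffusion: noise the clean sample to timestep t.
        x_t = [math.sqrt(a) * xi + math.sqrt(1.0 - a) * ei
               for xi, ei in zip(x, eps)]
        err_uncond = sq_err(eps, denoiser(x_t, t, None))
        err_cond = sq_err(eps, denoiser(x_t, t, label))
        total += err_uncond - err_cond
    return total / n_samples


score_a = typicality(MEANS["A"], "A", toy_denoiser, ALPHAS_BAR)
score_b = typicality(MEANS["A"], "B", toy_denoiser, ALPHAS_BAR)
print(score_a > score_b)
```

In this toy setting a sample drawn from label A scores higher under "A" than under "B". Each score needs only forward passes of the model on the element itself, which is in line with the abstract's point that no pairwise comparison of visual elements is required.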

Pages: 393-409

Page count: 17

