Discovering Latent Topics by Gaussian Latent Dirichlet Allocation and Spectral Clustering

被引：4

作者：

Yuan, Bo ^{[1
]}

Gao, Xinbo ^{[1
]}

Niu, Zhenxing ^{[2
]}

Tian, Qi ^{[3
]}

机构：

[1] Xidian Univ, 2 Taibai South Rd, Xian 710071, Shaanxi, Peoples R China

[2] Alibaba Grp, 969 Wenyi West Rd, Hangzhou 311121, Zhejiang, Peoples R China

[3] Univ Texas San Antonio, San Antonio, TX 78249 USA

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2019年 / 15卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Latent Dirichlet allocation; Gaussian; spectral clustering; image retrieval; diversity;

D O I：

10.1145/3290047

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Today, diversifying the retrieval results of a certain query will improve customers' search efficiency. Showing the multiple aspects of information provides users an overview of the object, which helps them fast target their demands. To discover aspects, research focuses on generating image clusters from initially retrieved results. As an effective approach, latent Dirichlet allocation (LDA) has been proved to have good performance on discovering high-level topics. However, traditional LDA is designed to process textual words, and it needs the input as discrete data. When we apply this algorithm to process continuous visual images, a common solution is to quantize the continuous features into discrete form by a bag-of-visual-words algorithm. During this process, quantization error will lead to information that inevitably is lost. To construct a topic model with complete visual information, this work applies Gaussian latent Dirichlet allocation (GLDA) on the diversity issue of image retrieval. In this model, traditional multinomial distribution is substituted with Gaussian distribution to model continuous visual features. In addition, we propose a two-phase spectral clustering strategy, called dual spectral clustering, to generate clusters from region level to image level. The experiments on the challenging landmarks of the DIV400 database show that our proposal improves relevance and diversity by about 10% compared to traditional topic models.

引用

页数：18

共 36 条

[1] World Explorer: Visualizing Aggregate Data from Unstructured Text in Geo-Referenced Collections [J].

Ahern, Shane ;

Naaman, Mor ;

Nair, Rahul ;

Yang, Jeannie .

PROCEEDINGS OF THE 7TH ACM/IEE JOINT CONFERENCE ON DIGITAL LIBRARIES: BUILDING & SUSTAINING THE DIGITAL ENVIRONMENT, 2007, :1-10

[2]

[Anonymous], 2004, Proceedings of the 12th ACM International Conference on Multimedia

[3]

[Anonymous], P ACM MULT

[4] User Preferences Modeling and Learning for Pleasing Photo Collage Generation [J].

Bianco, Simone ;

Ciocca, Gianluigi .

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2015, 12 (01) :1-23

[5] Latent Dirichlet allocation [J].

Blei, DM ;

Ng, AY ;

Jordan, MI .

JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022

[6]

Carbonell J., 1998, Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P335, DOI 10.1145/290941.291025

[7]

Clarke Charles L. A., 2008, SIGIR, P659, DOI DOI 10.1145/1390334.1390446

[8]

Das R, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P795

[9]

Deselaers Thomas., 2009, Proceedings of the ACM international conference on image and video retrieval, P1

[10] Multimodal Retrieval with Diversification and Relevance Feedback for Tourist Attraction Images [J].

Duc-Tien Dang-Nguyen ;

Piras, Luca ;

Giacinto, Giorgio ;

Boato, Giulia ;

De Natale, Francesco G. B. .

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2017, 13 (04)

← 1 2 3 4 →