A Tucker Deep Computation Model for Mobile Multimedia Feature Learning

被引：31

作者：

Zhang, Qingchen ^{[1
,2
]}

Yang, Laurence T. ^{[1
,2
]}

Liu, Xingang ^{[3
]}

Chen, Zhikui ^{[4
]}

Li, Peng ^{[4
]}

机构：

[1] Univ Elect Sci & Technol China, Chengdu, Sichuan, Peoples R China

[2] St Francis Xavier Univ, Dept Comp Sci, Antigonish, NS B2G 2W5, Canada

[3] Univ Elect Sci & Technol China, Sch Elect Engn, 2006 Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China

[4] Dalian Univ Technol, Sch Software Technol, 321 Tuqiang St, Dalian, Peoples R China

来源：

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2017年 / 13卷 / 03期

关键词：

Deep learning; Tucker decomposition; deep computation; mobile multimedia; back-propagation; DATA-MANAGEMENT;

D O I：

10.1145/3063593

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, the deep computation model, as a tensor deep learning model, has achieved super performance for multimedia feature learning. However, the conventional deep computation model involves a large number of parameters. Typically, training a deep computation model with millions of parameters needs high-performance servers with large-scale memory and powerful computing units, limiting the growth of the model size for multimedia feature learning on common devices such as portable CPUs and conventional desktops. To tackle this problem, this article proposes a Tucker deep computation model by using the Tucker decomposition to compress the weight tensors in the full-connected layers for multimedia feature learning. Furthermore, a learning algorithm based on the back-propagation strategy is devised to train the parameters of the Tucker deep computation model. Finally, the performance of the Tucker deep computation model is evaluated by comparing with the conventional deep computation model on two representative multimedia datasets, that is, CUAVE and SNAE2, in terms of accuracy drop, parameter reduction, and speedup in the experiments. Results imply that the Tucker deep computation model can achieve a large-parameter reduction and speedup with a small accuracy drop for multimedia feature learning.

引用

页数：18

共 38 条

[1] Audio Masking Effect on Inter-Component Skews in Olfaction-Enhanced Multimedia Presentations
Ademoye, Oluwakemi A.
Murray, Niall
Muntean, Gabriel-Miro
Ghinea, Gheorghita
[J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2016, 12 (04)
[2] [Anonymous], ACM T KNOWLEDGE DISC
[3] [Anonymous], 2012, P INT C NEUR INF PRO
[4] [Anonymous], IEEE CLOUD COMPUTING
[5] [Anonymous], 2013, NIPS
[6] [Anonymous], 2015, INT C LEARNING REPRE
[7] [Anonymous], NATURE
[8] [Anonymous], 2009, ICML
[9] Representation Learning: A Review and New Perspectives
Bengio, Yoshua
Courville, Aaron
Vincent, Pascal
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1798 - 1828
[10] Chen WL, 2015, PR MACH LEARN RES, V37, P2285

← 1 2 3 4 →