Adaptive Greedy Dictionary Selection for Web Media Summarization

被引:49
作者
Cong, Yang [1 ,2 ]
Liu, Ji [2 ]
Sun, Gan [1 ]
You, Quanzeng [2 ]
Li, Yuncheng [2 ]
Luo, Jiebo [2 ]
机构
[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Peoples R China
[2] Univ Rochester, Dept Comp Sci, Rochester, NY 14611 USA
关键词
Sparse representation; l(0) norm; dictionary learning; dictionary selection; forward-backward; greedy method; K-SVD; DANTZIG SELECTOR; SPARSE; ALGORITHM; FRAMEWORK; VIDEOS;
D O I
10.1109/TIP.2016.2619260
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Initializing an effective dictionary is an indispensable step for sparse representation. In this paper, we focus on the dictionary selection problem with the objective to select a compact subset of basis from original training data instead of learning a new dictionary matrix as dictionary learning models do. We first design a new dictionary selection model via l(2,0) norm. For model optimization, we propose two methods: one is the standard forward-backward greedy algorithm, which is not suitable for large-scale problems; the other is based on the gradient cues at each forward iteration and speeds up the process dramatically. In comparison with the state-of-the-art dictionary selection models, our model is not only more effective and efficient, but also can control the sparsity. To evaluate the performance of our new model, we select two practical web media summarization problems: 1) we build a new data set consisting of around 500 users, 3000 albums, and 1 million images, and achieve effective assisted albuming based on our model and 2) by formulating the video summarization problem as a dictionary selection issue, we employ our model to extract keyframes from a video sequence in a more flexible way. Generally, our model outperforms the state-of-the-art methods in both these two tasks.
引用
收藏
页码:185 / 195
页数:11
相关论文
共 54 条
[11]   Greedy Dictionary Selection for Sparse Representation [J].
Cevher, Volkan ;
Krause, Andreas .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (05) :979-988
[12]  
Christensen MG, 2007, CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, P550
[13]   Speeded Up Low-Rank Online Metric Learning for Object Tracking [J].
Cong, Yang ;
Fan, Baojie ;
Liu, Ji ;
Luo, Jiebo ;
Yu, Haibin .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (06) :922-934
[14]   Self-Supervised Online Metric Learning With Low Rank Constraint for Scene Categorization [J].
Cong, Yang ;
Liu, Ji ;
Yuan, Junsong ;
Luo, Jiebo .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (08) :3179-3191
[15]   Sparse Reconstruction Cost for Abnormal Event Detection [J].
Cong, Yang ;
Yuan, Junsong ;
Liu, Ji .
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, :1807-+
[16]   Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection [J].
Cong, Yang ;
Yuan, Junsong ;
Luo, Jiebo .
IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (01) :66-75
[17]  
Cui JY, 2007, CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, VOLS 1 AND 2, P367
[18]   Subspace Pursuit for Compressive Sensing Signal Reconstruction [J].
Dai, Wei ;
Milenkovic, Olgica .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2009, 55 (05) :2230-2249
[19]   Sparse Solution of Underdetermined Systems of Linear Equations by Stagewise Orthogonal Matching Pursuit [J].
Donoho, David L. ;
Tsaig, Yaakov ;
Drori, Iddo ;
Starck, Jean-Luc .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2012, 58 (02) :1094-1121
[20]   Message-passing algorithms for compressed sensing [J].
Donoho, David L. ;
Maleki, Arian ;
Montanari, Andrea .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2009, 106 (45) :18914-18919