Parallel Lasso for Large-Scale Video Concept Detection

被引:16
作者
Geng, Bo [1 ]
Li, Yangxi [1 ]
Tao, Dacheng [2 ]
Wang, Meng [3 ]
Zha, Zheng-Jun [4 ]
Xu, Chao [1 ]
机构
[1] Peking Univ, Key Lab Machine Percept, Minist Educ, Beijing 100871, Peoples R China
[2] Univ Technol Sydney, Ctr Quantum Computat & Intelligent, Fac Engn & Informat Technol, Broadway, NSW 2007, Australia
[3] Hefei Univ Technol, Sch Comp & Informat, Hefei, Peoples R China
[4] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
关键词
Incomplete cholosky factorization; lasso; parallel learning; video concept detection; REGULARIZATION;
D O I
10.1109/TMM.2011.2174781
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing video concept detectors are generally built upon the kernel based machine learning techniques, e. g., support vector machines, regularized least squares, and logistic regression, just to name a few. However, in order to build robust detectors, the learning process suffers from the scalability issues including the high-dimensional multi-modality visual features and the large-scale keyframe examples. In this paper, we propose parallel lasso (Plasso) by introducing the parallel distributed computation to significantly improve the scalability of lasso (the regularized least squares). We apply the parallel incomplete Cholesky factorization to approximate the covariance statistics in the preprocess step, and the parallel primal-dual interior-point method with the Sherman-Morrison-Woodbury formula to optimize the model parameters. For a dataset with samples in a d-dimensional space, compared with lasso, Plasso significantly reduces complexities from the original O(d(3)) for computational time and O(d(2)) for storage space to O(h(2)d/m) and O(hd/m), respectively, if the system has processors and the reduced dimension is much smaller than the original dimension. Furthermore, we develop the kernel extension of the proposed linear algorithm with the sample reweighting schema, and we can achieve similar time and space complexity improvements [time complexity from O(n(3)) to O(h(2)n/m) and the space complexity from O(n(2)) to O(hn/m), for a dataset with training examples]. Experimental results on TRECVID video concept detection challenges suggest that the proposed method can obtain significant time and space savings for training effective detectors with limited communication overhead.
引用
收藏
页码:55 / 65
页数:11
相关论文
共 41 条
  • [1] [Anonymous], 2006, NIPS
  • [2] [Anonymous], 2009, P 17 ACM INT C MULTI
  • [3] Bi J., 2003, Journal of Machine Learning Research, V3, P1229, DOI 10.1162/153244303322753643
  • [4] Bo Geng, 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), P2396, DOI 10.1109/CVPRW.2009.5206695
  • [5] Boyd S., 2004, CONVEX OPTIMIZATION, VFirst, DOI DOI 10.1017/CBO9780511804441
  • [6] Chang E. Y., 2007, ADV NEURAL INF PROCE, V20
  • [7] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893
  • [8] Least angle regression - Rejoinder
    Efron, B
    Hastie, T
    Johnstone, I
    Tibshirani, R
    [J]. ANNALS OF STATISTICS, 2004, 32 (02) : 494 - 499
  • [9] Efficient SVM training using low-rank kernel representations
    Fine, S
    Scheinberg, K
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2002, 2 (02) : 243 - 264
  • [10] Regularization Paths for Generalized Linear Models via Coordinate Descent
    Friedman, Jerome
    Hastie, Trevor
    Tibshirani, Rob
    [J]. JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01): : 1 - 22