SINGA: Putting Deep Learning in the Hands of Multimedia Users

被引:20
作者
Wang, Wei [1 ]
Chen, Gang [2 ]
Tien Tuan Anh Dinh [1 ]
Gao, Jinyang [1 ]
Ooi, Beng Chin [1 ]
Tan, Kian-Lee [1 ]
Wang, Sheng [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou, Zhejiang, Peoples R China
来源
MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE | 2015年
关键词
Deep learning; Multimedia application; Distributed training;
D O I
10.1145/2733373.2806232
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, deep learning techniques have enjoyed success in various multimedia applications, such as image classification and multi-modal data analysis. Two key factors behind deep learning's remarkable achievement are the immense computing power and the availability of massive training datasets, which enable us to train large models to capture complex regularities of the data. There are two challenges to overcome before deep learning can be widely adopted in multimedia and other applications. One is usability, namely the implementation of different models and training algorithms must be done by non-experts without much effort. The other is scalability, that is the deep learning system must be able to provision for a huge demand of computing resources for training large models with massive datasets. To address these two challenges, in this paper, we design a distributed deep learning platform called SINGA which has an intuitive programming model and good scalability. Our experience with developing and training deep learning models for real-life multimedia applications in SINGA shows that the platform is both usable and scalable.
引用
收藏
页码:25 / 34
页数:10
相关论文
共 33 条
[21]  
Mikolov Tomas, 2013, P 26 INT C NEUR INF, V2, P3111, DOI DOI 10.5555/2999792.2999959
[22]  
Paine T., 2013, CoRR
[23]  
Recht Benjamin, 2011, Advances in neural information processing systems, V24, P693
[24]  
Szegedy C, 2015, PROC CVPR IEEE, P1, DOI [10.1109/cvpr.2015.7298594, DOI 10.1109/CVPR.2015.7298594]
[25]   Deep Learning for Content-Based Image Retrieval: A Comprehensive Study [J].
Wan, Ji ;
Wang, Dayong ;
Hoi, Steven C. H. ;
Wu, Pengcheng ;
Zhu, Jianke ;
Zhang, Yongdong ;
Li, Jintao .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :157-166
[26]   Effective Multi-Modal Retrieval based on Stacked Auto-Encoders [J].
Wang, Wei ;
Ooi, Beng Chin ;
Yang, Xiaoyan ;
Zhang, Dongxiang ;
Zhuang, Yueting .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (08) :649-660
[27]   Improving Content-based and Hybrid Music Recommendation using Deep Learning [J].
Wang, Xinxi ;
Wang, Ye .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :627-636
[28]  
Wu R., 2015, CoRR
[29]   Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification [J].
Wu, Zuxuan ;
Jiang, Yu-Gang ;
Wang, Jun ;
Pu, Jian ;
Xue, Xiangyang .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :167-176
[30]  
Yadan O., 2013, CoRR