Redundancy Avoidance for Big Data in Data Centers: A Conventional Neural Network Approach

被引:21
作者
Xu, Chenhan [1 ,2 ]
Wang, Kun [3 ]
Sun, Yanfei [1 ,2 ]
Guo, Song [3 ]
Zomaya, Albert Y. [4 ]
机构
[1] Nanjing Univ Posts & Telecommun, Jiangsu Engn Res Ctr Commun & Network Technol, Nanjing 210003, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Natl Engn Res Ctr Commun & Networking, Nanjing 210003, Peoples R China
[3] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Peoples R China
[4] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2020年 / 7卷 / 01期
基金
中国博士后科学基金;
关键词
Data centers; redundancy avoidance; multimedia; storage; big data; convolution neural network; SYSTEM;
D O I
10.1109/TNSE.2018.2843326
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
As the innovative data collection technologies are applying to every aspect of our society, the data volume is skyrocketing. Such phenomenon poses tremendous challenges to data centers with respect to enabling storage. In this paper, a hybrid-stream big data analytics model is proposed to perform multimedia big data analysis. This model contains four procedures, i.e., data preprocessing, data classification, data recognition and data load reduction. Specifically, an innovative multi-dimensional Convolution Neural Network (CNN) is proposed to assess the importance of each video frame. Thus, those unimportant frames can be dropped by a reliable decision-making algorithm. In order to ensure video quality, minimal correlation and minimal redundancy (MCMR) are combined to optimize the decision-making algorithm. Simulation results show that the amount of processed video is significantly reduced, and the quality of video is preserved due to the addition of MCMR. The simulation also proves that the proposed model performs steadily and is robust enough to scale up to accommodate the big data crush in data centers.
引用
收藏
页码:104 / 114
页数:11
相关论文
共 42 条
[1]  
[Anonymous], CORR
[2]  
[Anonymous], 2007, METH SUBJ ASS VID QU
[3]  
[Anonymous], [No title captured]
[4]   Discriminative Non-Linear Stationary Subspace Analysis for Video Classification [J].
Baktashmotlagh, Mahsa ;
Harandi, Mehrtash ;
Lovell, Brian C. ;
Salzmann, Mathieu .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (12) :2353-2366
[5]  
Bottou L, 2007, ADV NEURAL INFORM PR, P161
[7]   Efficient Multicast Delivery for Data Redundancy Minimization Over Wireless Data Centers [J].
Chuang, Ching-Chih ;
Yu, Ya-Ju ;
Pang, Ai-Chun ;
Tseng, Hsueh-Wen ;
Lin, Hsin-Peng .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2016, 4 (02) :225-241
[8]   An incremental clustering scheme for data de-duplication [J].
Costa, Gianni ;
Manco, Giuseppe ;
Ortale, Riccardo .
DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 20 (01) :152-187
[9]   Energy Management in Cross-Domain Content Delivery Networks: A Theoretical Perspective [J].
Ge, Chang ;
Sun, Zhili ;
Wang, Ning ;
Xu, Ke ;
Wu, Jinsong .
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2014, 11 (03) :264-277
[10]   A General Communication Cost Optimization Framework for Big Data Stream Processing in Geo-Distributed Data Centers [J].
Gu, Lin ;
Zeng, Deze ;
Guo, Song ;
Xiang, Yong ;
Hu, Jiankun .
IEEE TRANSACTIONS ON COMPUTERS, 2016, 65 (01) :19-29