The Regression Learning of the Imbalanced and Big Data by the Online Mixture Model for the Mach Number Forecasting

被引:3
作者
Wang, Xiao-Jun [1 ]
Liu, Yan [2 ]
Yuan, Ping [2 ]
Zhou, Chang-Jun [3 ]
Zhang, Lin [4 ]
机构
[1] Dongbei Univ Finance & Econ, Sch Management Sci & Engn, Dalian 116025, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110004, Liaoning, Peoples R China
[3] Zhejiang Normal Univ, Coll Math Phys & Informat Engn, Jinhua 321004, Peoples R China
[4] China Aerodynam Res & Dev Ctr, High Speed Aerodynam Inst, Mianyang 621000, Peoples R China
基金
中国国家自然科学基金;
关键词
Big data; imbalanced data; mixture model; ensemble; online; regression; wind tunnel; TRANSONIC WIND-TUNNEL; NEURAL-NETWORKS;
D O I
10.1109/ACCESS.2018.2886752
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extracting valuable information to enhance the performance of forecasting models from the imbalanced and big data requires the scalable implementation of advanced statistical learning methods. This paper proposes the online mixture model (OMM) and applies it to the Mach number forecasting. Treating the key variable (e.g., Mach number) forecasting under all working conditions as an entire task, and viewing that of each individual working condition as a subtask, the OMM separates the dense samples from the sparse ones on the basis of subtasks. The subtask models are independently learnt on the samples with reduced volume, and updated for the new working conditions without retaining samples from the old working conditions. Moreover, the tree-structure ensemble (TSE)-feature subsets ensembles (FSEs) algorithm is presented to fit the nonlinear function of a subtask model, where the FSE local models with low-dimensional input features are established on the non-overlapping sample subsets constructed by the TSE method. The TSE-FSEs not only reduce the volume of data but also perform distributed computing with parallel structure, and thus has the advantage of the learning of big data. Experiments carried out on the measurement data of wind tunnel indicate that the OMM with the TSE-FSEs outperforms other learning algorithms for the Mach number forecasting, and meets the precision and forecasting speed requirements in engineering.
引用
收藏
页码:7368 / 7380
页数:13
相关论文
共 38 条
[1]   A Novel Technique for In Situ Efficiency Estimation of Three-Phase IM Operating With Unbalanced Voltages [J].
Al-Badri, Maher ;
Pillay, Pragasen ;
Angers, Pierre .
IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2016, 52 (04) :2843-2855
[2]   Steady flow quality assessment of a modified transonic wind tunnel [J].
Amiri, K. ;
Soltani, M. R. ;
Haghiri, A. .
SCIENTIA IRANICA, 2013, 20 (03) :500-507
[3]  
[Anonymous], IEEE T CYBERN
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Generation of vertical gusts in a transonic wind tunnel [J].
Brion, V. ;
Lepage, A. ;
Amosse, Y. ;
Soulevant, D. ;
Senecat, P. ;
Abart, J. C. ;
Paillart, P. .
EXPERIMENTS IN FLUIDS, 2015, 56 (07)
[6]   Feature Interaction Augmented Sparse Learning for Fast Kinect Motion Detection [J].
Chang, Xiaojun ;
Ma, Zhigang ;
Lin, Ming ;
Yang, Yi ;
Hauptmann, Alexander G. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (08) :3911-3920
[7]   Compound Rank-k Projections for Bilinear Analysis [J].
Chang, Xiaojun ;
Nie, Feiping ;
Wang, Sen ;
Yang, Yi ;
Zhou, Xiaofang ;
Zhang, Chengqi .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (07) :1502-1513
[8]   Big Data for Remote Sensing: Challenges and Opportunities [J].
Chi, Mingmin ;
Plaza, Antonio ;
Benediktsson, Jon Atli ;
Sun, Zhongyi ;
Shen, Jinsheng ;
Zhu, Yangyong .
PROCEEDINGS OF THE IEEE, 2016, 104 (11) :2207-2219
[9]   Optimized fixed-size kernel models for large data sets [J].
De Brabanter, K. ;
De Brabanter, J. ;
Suykens, J. A. K. ;
De Moor, B. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (06) :1484-1504
[10]  
de Brabanter K., 2011, TECH REP 0 146, P37