Advanced Machine Learning and Statistical Inference Approaches for Big Data Analytics and Information Fusion

Cited by: 0
Authors
Mehra, Raman K. [1]
Gandhe, Avinash [1]
Mansinghka, Vikash [2]
Shafto, Patrick [3]
Lovell, Dan [1]
Yu, Ssu-Hsin [1]
Affiliations
[1] Scientific Systems Company, Inc., 500 West Cummings Park, Suite 3000, Woburn, MA 01801 USA
[2] MIT, Cambridge, MA 02139 USA
[3] University of Louisville, Louisville, KY 40202 USA
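Source
SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION XXII | 2013 / Vol. 8745
Keywords
Machine Learning; Big Data; CrossCat; XDATA; MCMC; Predictive Databases; Non-parametric Bayesian
DOI
Not available
Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract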
The confluence of four major technologies has created a revolution in Big Data predictive analytics: (i) the availability of massive datasets, (ii) distributed cluster computing, (iii) advances in non-parametric Bayesian inference, and (iv) Markov Chain Monte Carlo (MCMC) methods for fast probability calculations and stochastic searches in high dimensions. The paper presents a historical perspective on the seminal developments in probabilistic reasoning and Bayesian inference leading up to the current state of the art in data science. This is followed by a discussion of challenges in Big Data analytics and the presentation of a method for automated Bayesian machine learning using a recently developed approach called CrossCat, which is based on non-parametric Bayesian inference and efficient use of MCMC numerical algorithms. Under the DARPA XDATA program, SSCI, MIT, and the University of Louisville are developing a Predictive Database System with an SQL-type front end and a CrossCat back end to facilitate the use of sophisticated machine learning methods by non-experts.
Pages: 3