Classification-oriented structure learning in Bayesian networks for multimodal event detection in videos

被引:0
作者
Guillaume Gravier
Claire-Hélène Demarty
Siwar Baghdadi
Patrick Gros
机构
[1] CNRS – IRISA,
[2] Technicolor,undefined
[3] INRIA,undefined
来源
Multimedia Tools and Applications | 2014年 / 70卷
关键词
Multimedia; Video analysis; Multimodal event detection; Bayesian networks; Structure learning;
D O I
暂无
中图分类号
学科分类号
摘要
We investigate the use of structure learning in Bayesian networks for a complex multimodal task of action detection in soccer videos. We illustrate that classical score-oriented structure learning algorithms, such as the K2 one whose usefulness has been demonstrated on simple tasks, fail in providing a good network structure for classification tasks where many correlated observed variables are necessary to make a decision. We then compare several structure learning objective functions, which aim at finding out the structure that yields the best classification results, extending existing solutions in the literature. Experimental results on a comprehensive data set of 7 videos show that a discriminative objective function based on conditional likelihood yields the best results, while augmented approaches offer a good compromise between learning speed and classification accuracy.
引用
收藏
页码:1421 / 1437
页数:16
相关论文
共 44 条
[1]  
Chow C(1968)Approximating discrete probability distributions with dependence trees IEEE Trans Inf Theory 11 462-467
[2]  
Liu C(1992)A Bayesian method for the induction of probabilistic networks from data Mach Learn 9 309-347
[3]  
Cooper GF(2008)Audiovisual integration with segment models for tennis video parsing Comput Vis Image Underst 111 142-154
[4]  
Herskovits E(1997)Bayesian network classifiers Mach Learn 29 131-163
[5]  
Delakis M(2000)Using Bayesian network to analyze expression data J Comput Biol 7 601-620
[6]  
Gravier G(2005)Structural extension to logistic regression: discriminative parameter learning of belief net classifiers Mach Learn J 59 297-322
[7]  
Gros P(2000)A semantic event-detection approach and its application to detecting hunts in wildlife video IEEE Trans Circuits Syst Video Technol 10 857-868
[8]  
Friedman N(2006)Semantic analysis of soccer video using dynamic bayesian network IEEE Trans Multimedia 15 1225-1233
[9]  
Geiger D(1990)Bayesian updating in recursive graphical models by local computations Computat Stat Q 4 269-282
[10]  
Goldszmid M(2006)Audiovisual integration for tennis broadcast structuring Multimed Tools Appl 30 289-311