Video affective content recognition based on genetic algorithm combined HMM

被引:0
作者
Sun, Kai [1 ]
Yu, Junqing [1 ]
机构
[1] Huazhong Univ Sci & Technol, Comp Coll Sci & Technol, Wuhan 430074, Peoples R China
来源
ENTERTAINMENT COMPUTING - ICEC 2007 | 2007年 / 4740卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video affective content analysis is a fascinating but seldom addressed field in entertainment computing research communities. To recognize affective content in video, a video affective content representation and recognition framework based on Video Affective Tree (VAT) and Hidden Markov Models (HMMs) was proposed. The proposed video affective content recognizer has good potential to recognize the basic emotional events of audience. However, due to Expectation-Maximization (EM) methods like the Baum-Welch algorithm tend to converge to the local optimum which is the closer to the starting values of the optimization procedure, the estimation of the recognizer parameters requires a more careful examination. A Genetic Algorithm combined HMM (GA-HMM) is presented here to address this problem. The idea is to combine a genetic algorithm to explore quickly the whole solution space with a Baum-Welch algorithm to find the exact parameter values of the optimum. The experimental results show that GA-HMM can achieve higher recognition rate with less computation compared with our previous works.
引用
收藏
页码:249 / +
页数:2
相关论文
共 9 条
[1]  
Coley D.A., 1999, An Introduction to Genetic Algorithms for Scientists and Engineers, DOI 10.1142/3904
[2]  
Goldstein E. B., 2021, Sensation and perception
[3]   Extracting moods from pictures and sounds [J].
Hanjalic, A .
IEEE SIGNAL PROCESSING MAGAZINE, 2006, 23 (02) :90-100
[4]   Affective video content representation and modeling [J].
Hanjalic, A ;
Xu, LQ .
IEEE TRANSACTIONS ON MULTIMEDIA, 2005, 7 (01) :143-154
[5]  
Kang Hang-Bong, 2003, ACM MM, P259
[6]  
McLachlan G., 1997, ALGORITHM EXTENSION
[7]   A TUTORIAL ON HIDDEN MARKOV-MODELS AND SELECTED APPLICATIONS IN SPEECH RECOGNITION [J].
RABINER, LR .
PROCEEDINGS OF THE IEEE, 1989, 77 (02) :257-286
[8]  
SUN K, 2007, IN PRESS VIDEO AFFEC
[9]  
2001, ISOIEC CD, P15938