On Parsing Visual Sequences with the Hidden Markov Model

被引:4
作者
Harte, Naomi [1 ]
Lennon, Daire [1 ]
Kokaram, Anil [1 ]
机构
[1] Trinity Coll Dublin, Sch Engn, Dublin 2, Ireland
基金
爱尔兰科学基金会;
关键词
RECOGNITION; SEGMENTATION; AUDIO; VIDEO; HMM;
D O I
10.1155/2009/924287
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Hidden Markov Models have been employed in many vision applications to model and identify events of interest. Their use is common in applications where HMMs are used to classify previously divided segments of video as one of a set of events being modelled. HMMs can also simultaneously segment and classify events within a continuous video, without the need for a separate first step to identify the start and end of the events. This is significantly less common. This paper is an exploration of the development of HMM frameworks for such complete event recognition. A review of how HMMs have been applied to both event classification and recognition is presented. The discussion evolves in parallel with an example of a real application in psychology for illustration. The complete videos depict sessions where candidates perform a number of different exercises under the instruction of a psychologist. The goal is to isolate portions of video containing just one of these exercises. The exercise involves rotating the head of a kneeling subject to the left, back to centre, to the right, to the centre, and repeating a number of times. By designing a HMM system to automatically isolate portions of video containing this exercise, issues such as the strategy of choice of event to be modelled, feature design and selection, as well as training and testing are reviewed. Thus this paper shows how HMMs can be more extensively applied in the domain of event recognition in video. Copyright (C) 2009 Naomi Harte et al.
引用
收藏
页数:13
相关论文
共 39 条
[1]  
[Anonymous], CUEDFINFENGTR38
[2]   Soccer highlights detection and recognition using HMMs [J].
Assfalg, J ;
Bertini, M ;
Del Bimbo, A ;
Nunziati, W ;
Pala, P .
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, :825-828
[3]   Object trajectory-based activity classification and recognition using hidden Markov models [J].
Bashir, Faisal I. ;
Khokhar, Ashfaq A. ;
Schonfeld, Dan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2007, 16 (07) :1912-1919
[4]  
Boreczky JS, 1998, INT CONF ACOUST SPEE, P3741, DOI 10.1109/ICASSP.1998.679697
[5]   Discovery and segmentation of activities in video [J].
Brand, M ;
Kettnaker, V .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (08) :844-851
[6]  
Chang P, 2002, IEEE IMAGE PROC, P609
[7]   Lipreading from color video [J].
Chiou, GI ;
Hwang, JN .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1997, 6 (08) :1192-1195
[8]   Activity modeling using event probability sequences [J].
Cuntoor, Naresh P. ;
Yegnanarayana, B. ;
Chellappa, Rama .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2008, 17 (04) :594-607
[9]  
DOYLE E, 2008, THESIS U DUBLIN DUBL
[10]   CEPSTRAL ANALYSIS TECHNIQUE FOR AUTOMATIC SPEAKER VERIFICATION [J].
FURUI, S .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (02) :254-272