Multimodal fusion for multimedia analysis: a survey

被引:766
作者
Atrey, Pradeep K. [1 ]
Hossain, M. Anwar [2 ]
El Saddik, Abdulmotaleb [2 ]
Kankanhalli, Mohan S. [3 ]
机构
[1] Univ Winnipeg, Dept Appl Comp Sci, Winnipeg, MB R3B 2E9, Canada
[2] Univ Ottawa, Multimedia Commun Res Lab, Ottawa, ON, Canada
[3] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
基金
加拿大自然科学与工程研究理事会;
关键词
Multimodal information fusion; Multimedia analysis; OPTIMAL SENSOR SELECTION; EVENT DETECTION; SEMANTIC ANNOTATION; MAXIMUM-ENTROPY; VIDEO; RECOGNITION; TRACKING; FEATURES; AUDIO; IDENTIFICATION;
D O I
10.1007/s00530-010-0182-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This survey aims at providing multimedia researchers with a state-of-the-art overview of fusion strategies, which are used for combining multiple modalities in order to accomplish various multimedia analysis tasks. The existing literature on multimodal fusion research is presented through several classifications based on the fusion methodology and the level of fusion (feature, decision, and hybrid). The fusion methods are described from the perspective of the basic concept, advantages, weaknesses, and their usage in various analysis tasks as reported in the literature. Moreover, several distinctive issues that influence a multimodal fusion process such as, the use of correlation and independence, confidence level, contextual information, synchronization between different modalities, and the optimal modality selection are also highlighted. Finally, we present the open issues for further research in the area of multimodal fusion.
引用
收藏
页码:345 / 379
页数:35
相关论文
共 158 条
[1]   Semantic indexing of multimedia content using visual, audio, and text cues [J].
Adams, WH ;
Iyengar, G ;
Lin, CY ;
Naphade, MR ;
Neti, C ;
Nock, HJ ;
Smith, JR .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (02) :170-185
[2]  
AGUILAR JF, 2003, INT C VID BAS BIOM P, P830
[3]   Audio-visual biometrics [J].
Aleksic, Petar S. ;
Katsaggelos, Aggelos K. .
PROCEEDINGS OF THE IEEE, 2006, 94 (11) :2025-2044
[4]   Particle methods for change detection, system identification, and control [J].
Andrieu, C ;
Doucet, A ;
Singh, SS ;
Tadic, VB .
PROCEEDINGS OF THE IEEE, 2004, 92 (03) :423-438
[5]  
[Anonymous], 2004, PROC ACM INT C MULTI
[6]  
[Anonymous], 2 INT WORKSH NETW SE
[7]  
[Anonymous], PETS PERFORMANCE EVA
[8]  
Argillander J, 2005, INT CONF ACOUST SPEE, P153
[9]  
Atrey PK, 2007, LECT NOTES COMPUT SC, V4352, P155
[10]   Goal-oriented optimal subset selection of correlated multimedia streams [J].
Atrey, Pradeep K. ;
Kankanhalli, Mohan S. ;
Oommen, John B. .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2007, 3 (01)