Scene recognition with audio-visual sensor fusion

被引:0
作者
Devicharan, D [1 ]
Mehrotra, KG [1 ]
Mohan, CK [1 ]
Varshney, PK [1 ]
Zuo, L [1 ]
机构
[1] Syracuse Univ, Dept EECS, Syracuse, NY 13244 USA
来源
Multisensor, Multisource Information Fusion: Architectures, Algorithms and Applications 2005 | 2005年 / 5813卷
关键词
multimodal sensor fusion; scene recognition; activity detection; audio and visual surveillance;
D O I
10.1117/12.605751
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Several surveillance applications are characterized by the ability to gather information about the scene from more than one sensor modality, and heterogeneous sensor data must then be fused by the decision-maker. hi this paper, we discuss the issues relevant to developing a model for fusion of information from audio and visual sensors, and present a framework to enhance decision-making capabilities. In particular, our methodology focuses on the issues of temporal reasoning, uncertainty representations, and coupling between features inferred from data streams coming from different sensors. We propose a conditional probability-based representation for uncertainty, along with fuzzy rules to assist decision-making, and a matrix representation of the coupling between sensor data streams. We also develop a fusion algorithm that utilizes these representations.
引用
收藏
页码:201 / 210
页数:10
相关论文
共 25 条
[1]  
ALBIOL A, 2004, SPIE MAGAZINE PHOTON
[2]  
Allegro S, 2001, AUTOMATIC SOUND CLAS
[3]   TIME AND TIME AGAIN - THE MANY WAYS TO REPRESENT TIME [J].
ALLEN, JF .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1991, 6 (04) :341-355
[4]  
American Nuclear Society, 2021, NUCL NEWS
[5]  
BARNARD M, 2003, P IEEE WORKSH NEUR N
[6]  
Beal M.J., 2002, P EUR C COMP VIS
[7]  
BOULAY B, 2003, P JOINT IEEE INT WOR
[8]  
BRAND M, 2000, IEEE T PATT AN MACH, V22
[9]  
BUECHLER M, 2002, THESIS SWISS FED I T
[10]  
COWLING M, 2004, THESIS GRIFFITH U