Attention to clapping - A direct method for detecting sound source from video and audio

被引:3
作者
Ikeda, T [1 ]
Ishiguro, IE [1 ]
Asada, M [1 ]
机构
[1] Osaka Univ, Grad Sch Engn, Dept Adapt Machine Syst, Suita, Osaka 5650871, Japan
来源
PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON MULTISENSOR FUSION AND INTEGRATION FOR INTELLIGENT SYSTEMS | 2003年
关键词
D O I
10.1109/MFI-2003.2003.1232668
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The research approaches utilizing ubiquitous sensors to support human activities have become of major interest lately. One of the required features of the ubiquitous sensor system is paying its attention to our signals, such as clapping hands and uttering keywords. To detect and localize these signs, it is useful to fuse visual and audio information. The sensor fusion in previous works is performed in the task-level layer through individual representations of the sensors. Therefore, it does not provide new information by fusing sensors. This paper proposes another method that fuses sensory signals based on mutual information maximization in the signal-level layer The fused signal provides us new information that cannot be obtained from individual sensors. As an example, this paper shows two experimental results of a sound source localization by audio-visual fusion.
引用
收藏
页码:264 / 268
页数:5
相关论文
共 50 条
[31]   Efficient method for detecting targets from remote sensing images based on global attention mechanism [J].
Gao, Zijun ;
Su, Jingwen ;
Li, Bo ;
Wang, Jue ;
Song, Zhankui .
IET IMAGE PROCESSING, 2025, 19 (01)
[32]   A method for estimating the orientation of a directional sound source from source directivity and multi-microphone recordings: Principles and application [J].
Guarato, Francesco ;
Jakobsen, Lasse ;
Vanderelst, Dieter ;
Surlykke, Annemarie ;
Hallam, John .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2011, 129 (02) :1046-1058
[33]   Method for measuring the low-frequency sound power from a complex sound source based on sound-field correction in a non-anechoic tank [J].
徐宏哲 ;
李琪 ;
唐锐 ;
尚大晶 .
Chinese Physics B, 2023, 32 (05) :578-593
[34]   Method for measuring the low-frequency sound power from a complex sound source based on sound-field correction in a non-anechoic tank [J].
Xu, Hongzhe ;
Li, Qi ;
Tang, Rui ;
Shang, Dajing .
CHINESE PHYSICS B, 2023, 32 (05)
[35]   A PROBABILISTIC EVALUATION METHOD FOR THE EFFECT OF DIRECT SOUND ON THE DIFFUSENESS OF REVERBERANT SOUND FIELD FROM THE VIEWPOINT OF AN N-DIMENSIONAL SIGNAL SPACE [J].
OHTA, M ;
MIYATA, S .
ACUSTICA, 1985, 58 (02) :75-82
[36]   Development of the numerical method for calculating sound radiation from a rotating dipole source in an opened thin duct [J].
Choi, Han-Lim ;
Lee, Duck Joo .
JOURNAL OF SOUND AND VIBRATION, 2006, 295 (3-5) :739-752
[37]   How to Annotate Freezing of Gait from Video: A Standardized Method Using Open-Source Software [J].
Gilat, Moran .
JOURNAL OF PARKINSONS DISEASE, 2019, 9 (04) :821-824
[38]   Direct Method for Reconstructing the Radiating Part of a Planar Source from Its Far-Fields [J].
Xiao, Gaobiao ;
Liu, Rui .
ELECTRONICS, 2022, 11 (23)
[39]   A Detecting and Compensation Method for the Errors from Broken Ground Control Points at the Application of Direct Geo-referencing [J].
Liu, Tong ;
Xu, Guochang ;
Yan, Wenlin ;
Xu, Tianhe .
2017 FORUM ON COOPERATIVE POSITIONING AND SERVICE (CPGPS), 2017, :174-178
[40]   On Explainable Closed-Set Source Device Identification Using Log-Mel Spectrograms From Video' Audio: A Grad-CAM Approach [J].
Korgialas, Christos ;
Tzolopoulos, Georgios ;
Kotropoulos, Constantine .
IEEE ACCESS, 2024, 12 :121822-121836