Joint correlation analysis of audio-visual dance figures

被引:0
作者
Ofli, F. [1 ]
Demir, Y. [1 ]
Erzin, E. [1 ]
Yemez, Y. [1 ]
Tekalp, A. M. [1 ]
机构
[1] Koc Univ, Goru Grafik Lab, TR-34450 Istanbul, Turkey
来源
2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3 | 2007年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we present a framework for analysis of dance figures from audio-visual data. Our audio-visual data is the multiview video of a dancing actor which is acquired using 8 synchronized cameras. The multi-camera motion capture technique of this framework is based on 3D tracking of the markers attached to the dancer's body, using stereo color information. The extracted 31) points are used to calculate the body motion features as 3D displacement vectors. On the other hand, MFC coefficients serve as the audio features. In the first stage of the two stage analysis task, we perform Hidden Markov Model (HMM) based unsupervised temporal segmentation of the audio and body motion features, separately, to extract the recurrent elementary audio and body motion patterns. In the second stage, the correlation of body motion patterns with audio patterns is investigated to create a correlation model that can be used during the synthesis of an audio-driven body animation.
引用
收藏
页码:604 / 607
页数:4
相关论文
共 50 条
[41]   AUDIO-VISUAL FOR THE PATIENT [J].
STUTTLE, FL .
JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 1959, 41 (07) :1362-1362
[42]   The Audio-Visual Reader [J].
不详 .
JOURNAL OF EDUCATIONAL RESEARCH, 1955, 48 (07) :552-553
[43]   Solos: A Dataset for Audio-Visual Music Analysis [J].
Montesinos, Juan F. ;
Slizovskaia, Olga ;
Haro, Gloria .
2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
[44]   The Grounded Theory and the Analysis of Audio-Visual Texts [J].
Figueroa, Silvana K. .
INTERNATIONAL JOURNAL OF SOCIAL RESEARCH METHODOLOGY, 2008, 11 (01) :1-12
[45]   Audio-visual Technology for Conversation Scene Analysis [J].
Otsuka, Kazuhiro ;
Araki, Shoko .
NTT Technical Review, 2009, 7 (02)
[46]   Audio-Visual Automatic Group Affect Analysis [J].
Sharma, Garima ;
Dhall, Abhinav ;
Cai, Jianfei .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) :1056-1069
[47]   Analysis of temporal perception for audio-visual stimulation [J].
Yu, Mi ;
Lee, Sang-Min ;
Piao, Yong-Jun ;
Kwon, Tae-Kyu ;
Kim, Nam-Gyun .
WORLD CONGRESS ON MEDICAL PHYSICS AND BIOMEDICAL ENGINEERING 2006, VOL 14, PTS 1-6, 2007, 14 :591-+
[48]   Saliency Prediction in Uncategorized Videos Based on Audio-Visual Correlation [J].
Qamar, Maryam ;
Qamar, Suleman ;
Muneeb, Muhammad ;
Bae, Sung-Ho ;
Rahman, Anis .
IEEE ACCESS, 2023, 11 :15460-15470
[49]   An audio-visual speech recognition system for testing new audio-visual databases [J].
Pao, Tsang-Long ;
Liao, Wen-Yuan .
VISAPP 2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2006, :192-+
[50]   Perceptual thresholds of audio-visual spatial coherence for a variety of audio-visual objects [J].
Stenzel, Hanne ;
Jackson, Philip J. B. .
2018 AES INTERNATIONAL CONFERENCE ON AUDIO FOR VIRTUAL AND AUGMENTED REALITY, 2018,