ColEnViSon: Color Enhanced Visual Sonifier - A Polyphonic Audio Texture and Salient Scene Analysis

Citations: 0
Authors
Ancuti, Codruta [1]
Ancuti, Cosmin [1]
Bekaert, Philippe [1]
Affiliations
[1] Hasselt Univ, UL IBBT, Expertise Ctr Digital Media, B-3590 Diepenbeek, Belgium
Source
VISAPP 2009: PROCEEDINGS OF THE FOURTH INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2 | 2009
Keywords
Blind Navigation; Visual Saliency; Color Transformation; IMAGE SEGMENTATION; MEAN SHIFT; SUBSTITUTION; QUANTIZATION; ATTENTION;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In this work we introduce a color-based image-audio system that enhances the perception of visually impaired users. Traditional sound-vision substitution systems mainly translate grayscale images into corresponding audio frequencies. However, these algorithms deprive the user of color information, a critical factor in object recognition and in attracting visual attention. We propose an algorithm that translates the scene into sound based on some classical computer vision algorithms. The most salient visual regions are extracted by a hybrid approach that blends the computed saliency map with the segmented image. The selected image region is simplified based on a reference color map dictionary. The centroids of the color space are translated into audio by different musical instruments. We chose to encode the audio file as a polyphonic music composition, reasoning that humans are capable of distinguishing more than one instrument at the same time, and also to reduce the playing duration. Tests of the prototype demonstrate that non-proficient blindfolded participants can easily interpret sequences of colored patterns and can also distinguish, for example, the quantity of a specific color contained in a given image.
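The color simplification and instrument mapping described in the abstract might be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the six-color palette, the instrument names, the use of squared Euclidean distance in RGB space, and the function names `nearest_color` and `sonification_plan` are all hypothetical and do not come from the paper. Saliency extraction and segmentation are omitted; the input is assumed to be the pixels of an already selected region.

```python
from collections import Counter

# Hypothetical reference color dictionary (RGB triples).
# The paper's actual dictionary is not given in the abstract.
REFERENCE_COLORS = {
    "red": (255, 0, 0),
    "green": (0, 255, 0),
    "blue": (0, 0, 255),
    "yellow": (255, 255, 0),
    "black": (0, 0, 0),
    "white": (255, 255, 255),
}

# Hypothetical assignment of one instrument per reference color.
INSTRUMENTS = {
    "red": "trumpet",
    "green": "flute",
    "blue": "piano",
    "yellow": "violin",
    "black": "cello",
    "white": "harp",
}


def nearest_color(pixel):
    """Return the name of the reference color closest to an RGB pixel.

    Closeness is squared Euclidean distance in RGB, an assumption made
    here for simplicity.
    """
    return min(
        REFERENCE_COLORS,
        key=lambda name: sum(
            (a - b) ** 2 for a, b in zip(pixel, REFERENCE_COLORS[name])
        ),
    )


def sonification_plan(pixels):
    """Map a region's pixels to instruments weighted by color share.

    Returns a dict from instrument name to the fraction of pixels whose
    nearest reference color is assigned to that instrument. All
    instruments in the result would sound together (polyphony).
    """
    counts = Counter(nearest_color(p) for p in pixels)
    total = sum(counts.values())
    return {INSTRUMENTS[name]: count / total for name, count in counts.items()}


# A toy region: three reddish pixels and one bluish pixel.
region = [(250, 10, 10)] * 3 + [(10, 10, 240)]
plan = sonification_plan(region)
```

In this toy setup, the fraction attached to each instrument could drive its loudness or note density, which is one conceivable way for a listener to judge how much of a given color a region contains.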
Pages: 566-572
Number of pages: 7