Spatial Speaker: 3D Java Text-to-Speech Converter

Cited by: 0
Authors
Sodnik, Jaka [1]
Tomazic, Saso [1]
Affiliations
[1] Univ Ljubljana, Fac Elect Engn, Ljubljana 61000, Slovenia
Source
WCECS 2009: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS I AND II | 2009
Keywords
HRTF; Java; signal processing; spatial sound; TTS
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Text-to-speech (TTS) converters are key components of various types of auditory displays. Such converters are extremely useful for visually impaired computer users who depend on synthesized speech read from the computer screen or directly from the web. In this paper we propose an enhancement of the Java FreeTTS speech synthesizer that adds spatial positioning of both the speaker and the listener. With our module, arbitrary text from a file or the web can be read to the user through headphones from a fixed or changing position in space. Our solution combines the following modules: the FreeTTS speech synthesizer, a custom-made speech processing unit, the MIT Media Lab HRTF library, the JOAL positioning library, and a Creative X-Fi sound card. Although the main focus of the paper is on the design of the "Spatial Speaker", three different applications are proposed, and some results of preliminary evaluation studies and user feedback are given. The entire system is developed as a single Java class that can be used in any auditory interface developed in Java.
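The abstract does not say how the HRTF library is applied to the synthesized speech. A common approach to headphone spatialization is to convolve the mono signal with a pair of head-related impulse responses, one per ear, for the desired direction. The sketch below illustrates only that general technique in plain Java. The class name `BinauralSketch`, its methods, and the placeholder impulse responses are hypothetical; it is not the authors' implementation and uses no FreeTTS, JOAL, or MIT HRTF data.

```java
import java.util.Arrays;

/** Illustrative only: binaural rendering by convolving mono audio with a left/right impulse-response pair. */
final class BinauralSketch {

    /** Direct-form FIR convolution; output length is signal.length + ir.length - 1. */
    static float[] convolve(float[] signal, float[] ir) {
        if (signal.length == 0 || ir.length == 0) {
            return new float[0]; // nothing to render
        }
        float[] out = new float[signal.length + ir.length - 1];
        for (int n = 0; n < signal.length; n++) {
            for (int k = 0; k < ir.length; k++) {
                out[n + k] += signal[n] * ir[k];
            }
        }
        return out;
    }

    /** Returns {left, right} channels for one source direction. */
    static float[][] spatialize(float[] mono, float[] irLeft, float[] irRight) {
        return new float[][] { convolve(mono, irLeft), convolve(mono, irRight) };
    }

    public static void main(String[] args) {
        float[] mono = {1.0f, 0.5f, -0.25f};
        // Placeholder responses, NOT measured HRTF data (assumption for illustration):
        // the left ear hears the source directly, the right ear hears it
        // one sample later and at half the amplitude.
        float[] irLeft = {1.0f};
        float[] irRight = {0.0f, 0.5f};

        float[][] stereo = spatialize(mono, irLeft, irRight);
        System.out.println("L: " + Arrays.toString(stereo[0]));
        System.out.println("R: " + Arrays.toString(stereo[1]));
    }
}
```

In a real system the two responses would come from a measured HRTF set selected by the source's azimuth and elevation, and a moving source would require switching or interpolating between response pairs over time.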
Pages: 1306-1310
Page count: 5