From science fiction to science fact: a smart-house interface using speech technology and a photo-realistic avatar

Cited by: 13
Authors
Filho, G. L. [1]
Moir, Tom J. [2]
Affiliations
[1] Guile 3D Studio, Curitiba, Parana, Brazil
[2] Massey Univ, Sch Engn & Adv Technol, Auckland, New Zealand
Keywords
speech recognition; smart-house; 3D avatar; artificial intelligence
DOI
10.1504/IJCAT.2010.034727
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
This paper explores the problems of speech recognition in a (sometimes) noisy environment. An adaptive acoustic beamformer based on the Griffiths-Jim method is proposed. It defines a 'hot-spot': speech is received inside a geometrically defined boundary and rejected outside it. This will be shown to give a certain amount of noise immunity and to improve the signal-to-noise ratio for the second stage, which is the speech recognition engine. The recognition engine used has a limited vocabulary, which gives rise to an excellent hit-rate and requires less training than an unlimited vocabulary. The technology here has improved vastly within the last decade, and it will be shown that a head-and-shoulders avatar that is both photo-realistic and has an appealing personality vastly enhances the experience of a speech interface. The paper will explore these technologies and investigate the convergence of many of them in the current Massey smart-office.
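The abstract names the Griffiths-Jim method but gives no detail in this record. As an illustration only, the following is a minimal two-microphone sketch of a Griffiths-Jim style beamformer (generalised sidelobe canceller). It is not the authors' implementation. Everything here is an assumption made for the example: the function names `griffiths_jim` and `demo`, a target that reaches both microphones in phase, a noise source that reaches the second microphone one sample late, the NLMS update, and the values of `taps` and `mu`. The geometric 'hot-spot' boundary is not modelled.

```python
import math
import random


def griffiths_jim(x1, x2, taps=16, mu=0.1, eps=1e-8):
    """Two-microphone Griffiths-Jim sketch (assumed setup, not the paper's).

    x1, x2: equal-length sample lists; the target is assumed to be in phase
    at both microphones, so their difference contains noise only.
    """
    delay = taps // 2                # lets the adaptive filter act non-causally
    w = [0.0] * taps                 # adaptive weights, start at zero
    buf = [0.0] * taps               # blocking-branch history, newest first
    dline = [0.0] * (delay + 1)      # delay line for the fixed-beamformer branch
    out = []
    for a, b in zip(x1, x2):
        fixed = 0.5 * (a + b)        # fixed beamformer: keeps the target
        blocked = a - b              # blocking branch: cancels the target
        dline = [fixed] + dline[:-1]
        buf = [blocked] + buf[:-1]
        noise_est = sum(wi * bi for wi, bi in zip(w, buf))
        y = dline[-1] - noise_est    # delayed fixed output minus noise estimate
        norm = eps + sum(bi * bi for bi in buf)
        # NLMS update: reduce output power using the noise-only reference
        w = [wi + mu * y * bi / norm for wi, bi in zip(w, buf)]
        out.append(y)
    return out


def demo(n=20000, seed=0, taps=16):
    """Return mean squared errors over the last quarter of a synthetic run."""
    rng = random.Random(seed)
    s = [math.sin(2 * math.pi * 0.01 * k) for k in range(n)]   # target
    v = [rng.gauss(0.0, 1.0) for _ in range(n)]                # noise source
    x1 = [s[k] + v[k] for k in range(n)]
    x2 = [s[k] + (v[k - 1] if k >= 1 else 0.0) for k in range(n)]
    y = griffiths_jim(x1, x2, taps=taps)
    d = taps // 2
    idx = range(3 * n // 4, n)
    m = len(idx)
    return {
        "mic": sum((x1[k] - s[k]) ** 2 for k in idx) / m,
        "fixed": sum((0.5 * (x1[k] + x2[k]) - s[k]) ** 2 for k in idx) / m,
        "gsc": sum((y[k] - s[k - d]) ** 2 for k in idx) / m,
    }


if __name__ == "__main__":
    print(demo())
```

Calling `demo()` compares the error of a single microphone, of the fixed sum alone, and of the adaptive output against the clean target. The delay of `taps // 2` samples on the fixed branch is a common design choice: it gives the adaptive filter access to reference samples on both sides of the sample being cleaned.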
Pages: 32-39
Number of pages: 8
Related papers
19 in total
[1] Agaiby H, 1997, DSP 97: 1997 13TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, P753
[2] Agostaro F., 2005, 9 C IT ASS ART INT S
[3] Campbell C., 1980, ELECT APPLIANCE CONT
[4] Diegal O., 2005, HOME ORIENTATED INFO, P13
[5] Gannot, S; Burshtein, D; Weinstein, E. Signal enhancement using beamforming and nonstationarity with applications to speech [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2001, 49 (08): 1614-1626
[6] Griffiths, LJ; Jim, CW. An alternative approach to linearly constrained adaptive beamforming [J]. IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 1982, 30 (01): 27-34
[7] Haykin S., 1986, ADAPTIVE FILTER THEO
[8] MAES P, 1994, COMMUN ACM, V37, P30
[9] Maj J-B., 2000, SIGN PROC S HILV NET
[10] McCowan, IA; Moore, DC; Sridharan, S. Near-field adaptive beamformer for robust speech recognition [J]. DIGITAL SIGNAL PROCESSING, 2002, 12 (01): 87-106