Speech-as-data technologies for personal information devices

被引:0
作者
Tucker, Roger C. F. [1 ]
Hickey, Marianne [1 ]
Haddock, Nick
机构
[1] Hewlett Packard Labs, Stoke Gifford, Bristol BS34 6QZ, Avon, England
关键词
Audio summarisation; Speech-as-data; Speech compression; Speech recognition; Wordspotting;
D O I
10.1007/s00779-002-0210-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For small, portable devices, speech input has the advantages of low-cost and small hardware, can be used on the move or whilst the eyes & hands are busy, and is natural and quick. Rather than rely on imperfect speech recognition we propose that information entered as speech is kept as speech and suitable tools are provided to allow quick and easy access to the speech-as-data records. This paper summarises our work on the technologies needed for these tools - for organising, browsing, searching and compressing the stored speech. These technologies go a long way towards giving stored speech the characteristics of text without the associated input problems.
引用
收藏
页码:22 / 29
页数:8
相关论文
共 33 条
[1]  
ADES S, 1986, P AVIOS 86
[2]  
Arons BM, 1994, THESIS MIT
[3]  
CHAZAN D, 2000, IEEE ICASSP, V3, P1299
[4]  
DIGALAKIS V, 1998, P ICASSP 98, V2, P989
[5]  
*ETSI ES, 2000, 201108 ETSI ES
[6]  
EULER S, 1994, P INT C AC SPEECH SI, V1, P621
[7]  
Garofolo J S., 2000, Proceedings of Content-Based Multimedia Information Access Conference, V1, P1
[8]  
HADDOCK N, 1996, P CHI 96
[9]  
JAMES DA, 1994, P ICASSP 94 AD
[10]  
JONES GJF, 1994, 335 U CAMBR COMP LAB