Speech-as-data technologies for personal information devices

Cited by: 0

Authors:

Tucker, Roger C. F. [1]

Hickey, Marianne [1]

Haddock, Nick

Affiliations:

[1] Hewlett Packard Labs, Stoke Gifford, Bristol BS34 6QZ, Avon, England

Source:

PERSONAL AND UBIQUITOUS COMPUTING | 2003 / Volume 7 / Issue 01

Keywords:

Audio summarisation; Speech-as-data; Speech compression; Speech recognition; Wordspotting;

DOI:

10.1007/s00779-002-0210-y

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

For small, portable devices, speech input has several advantages: it needs only low-cost, small hardware, can be used on the move or whilst the eyes and hands are busy, and is natural and quick. Rather than rely on imperfect speech recognition, we propose that information entered as speech is kept as speech, and that suitable tools are provided to allow quick and easy access to the speech-as-data records. This paper summarises our work on the technologies needed for these tools: organising, browsing, searching and compressing the stored speech. These technologies go a long way towards giving stored speech the characteristics of text without the associated input problems.

Pages: 22-29

Number of pages: 8