Speech and language processing for next-millennium communications services

被引:30
作者
Cox, RV [1 ]
Kamm, CA [1 ]
Rabiner, LR [1 ]
Schroeter, J [1 ]
Wilpon, JG [1 ]
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
关键词
dialogue management; speaker recognition; speech coding; speech processing; speech recognition; speech synthesis; spoken language understanding;
D O I
10.1109/5.880086
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the future, the world of telecommunications will be vastly different than it is today. The driving force will be the seamless integration of real-time communications (e.g., voice, video, music, etc.) and data into a single network, with ubiquitous access to that network anywhere. anytime, and by a wide range of devices. The only currently available ubiquitous access device to the network is the telephone, and the only ubiquitous user access technology mode is spoken voice commands and natural language dialogues with machines. In the future, new access devices and modes will augment speech in this role, but are unlikely to supplant the telephone and access by speech anytime soon. Speech technologies have progressed to the point where they are now viable for a broad range of communications services, including compression of speech for use over wired and wireless networks: speech synthesis, recognition, and understanding for dialogue access to information, people and messaging; and speaker verification for secure access to information and services. This paper provides brief overviews of these technologies, discusses some of the unique properties of wireless, plain old telephone service, and Internet protocol networks that make voice communication and control problematic, and describes the types of voice services available in the past and today, and those that we foresee becoming available over the next several years.
引用
收藏
页码:1314 / 1337
页数:24
相关论文
共 80 条
  • [1] [Anonymous], P 1995 ARPA HUM LANG
  • [2] [Anonymous], P IEEE WORKSH SPEECH
  • [3] [Anonymous], P ICASSP
  • [4] [Anonymous], P EUR RHOD GREEC
  • [5] *ANSI, 1999, T15211999 ANSI
  • [6] BATES M, 1994, VOICE COMMUNICATION BETWEEN HUMANS AND MACHINES, P238
  • [7] BEUTNAGEL M, P JOINT M ASA EAA DE
  • [8] BEUTNAGEL M, 1999, J ACOUST SOC AM 2, V105, P1030
  • [9] Automation of Telecom Italia Directory Assistance Service: Field trial results
    Billi, R
    Canavesio, F
    Rullent, C
    [J]. 1998 IEEE 4TH WORKSHOP INTERACTIVE VOICE TECHNOLOGY FOR TELECOMMUNICATIONS APPLICATIONS - IVTTA '98, 1998, : 11 - 16
  • [10] Field trial evaluations of two different information inquiry systems
    Billi, R
    Castagneri, G
    Danieli, M
    [J]. THIRD IEEE WORKSHOP ON INTERACTIVE VOICE TECHNOLOGY FOR TELECOMMUNICATIONS APPLICATIONS - IVTTA-96, PROCEEDINGS, 1996, : 129 - 134