An active approach to spoken language processing

被引：9

作者：

AT and T Labs - Research ^{[1
]}

不详 ^{[2
]}

不详 ^{[3
]}

不详 ^{[4
]}

机构：

[1] ICSI, University of California - Berkeley, Berkeley

[2] Department of Information and Communication Technology, University of Trento

[3] SRI International, Speech Technology and Research Lab., Menlo Park

来源：

ACM Trans. Speech Lang. Process. | 2006年 / 3卷 / 1-31期

关键词：

Active evaluation; Active learning; Adaptive learning; Automatic speech recognition; Passive learning; Speech and language processing; Spoken dialog systems; Spoken language understanding; Unsupervised learning;

D O I：

10.1145/1177055.1177056

中图分类号：

学科分类号：

摘要：

State of the art data-driven speech and language processing systems require a large amount of human intervention ranging from data annotation to system prototyping. In the traditional supervised passive approach, the system is trained on a given number of annotated data samples and evaluated using a separate test set. Then more data is collected arbitrarily, annotated, and the whole cycle is repeated. In this article, we propose the active approach where the system itself selects its own training data, evaluates itself and re-trains when necessary. We first employ active learning which aims to automatically select the examples that are likely to be the most informative for a given task. We use active learning for both selecting the examples to label and the examples to re-label in order to correct labeling errors. Furthermore, the system automatically evaluates itself using active evaluation to keep track of the unexpected events and decides on-demand to label more examples. The active approach enables dynamic adaptation of spoken language processing systems to unseen or unexpected events for nonstationary input while reducing the manual annotation effort significantly. We have evaluated the active approach with the AT&T spoken dialog system used for customer care applications. In this article, we present our results for both automatic speech recognition and spoken language understanding. © 2006 ACM.

引用

页码：1 / 31

页数：30

共 56 条

[1]

Abney S., Schapire R., Singer Y., Boosting applied to tagging and PP attachment, Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing (EMNLP) and Very Large Corpora, (1999)

[2]

Anoluin D., Queries and concept learning, Mach. Learn., 2, pp. 319-342, (1988)

[3]

Argamon-Engelson S., Dagan I., Committee-based sample selection for probabilistic classifiers, J. Artif. Intel. Res., 11, pp. 335-360, (1999)

[4]

Bacchiani M., Roark B., Unsupervised language model adaptation, Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), (2003)

[5]

Baldridge J., Osborne M., Active learning for HPSG parse selection, Proceedings of the Conference on Computational Natural Language Learning (CoNLL), (2003)

[6]

Blum A., Mitchell T., Combining labeled and unlabeled data with co-training, Proceedings of the Workshop on Computational Learning Theory (COLT), (1998)

[7]

Cohn D., Atlas L., Ladner R., Improving generalization with active learning, Mach. Learn., 15, pp. 201-221, (1994)

[8]

Digalakis V., Rtischev D., Neumeyeh L.G., Speaker adaptation using constrained estimation of gaussian mixtures, IEEE Trans. Speech Audio Process., 3, 5, pp. 357-366, (1995)

[9]

Eskin E., Detecting errors within a corpus using anomaly detection, Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), (2000)

[10]

Federico M., Bayesian estimation methods for n-gram language model adaptation, Proceedings of the International Conference on Spoken Language Processing (ICSLP), (1996)

← 1 2 3 4 5 6 →