Efficient corpus creation method for NLU using interview with probing questions

被引:0
|
作者
Shima K. [1 ]
Homma T. [2 ]
Motohashi M. [1 ]
Ikeshita R. [2 ]
Kokubo H. [2 ]
Obuchi Y. [3 ]
She J. [3 ]
机构
[1] Clarion Co., Ltd., 7-2 Shintoshin, Chuo-ku, Saitama, Saitama
[2] Research and Development Group, Hitachi, Ltd, 1-280 Higashi-koigakubo, Kokubunji, 185-8601, Tokyo
[3] Tokyo University of Technology, 1404-1 Katakura, Hachioji, Tokyo
来源
Journal of Advanced Computational Intelligence and Intelligent Informatics | 2019年 / 23卷 / 05期
关键词
Corpus; Interview; Morpheme; Natural language understanding; Probing;
D O I
10.20965/jaciii.2019.p0947
中图分类号
学科分类号
摘要
This paper presents an efficient method to build a corpus to train natural language understanding (NLU) modules. Conventional corpus creation methods involve a common cycle: a subject is given a specific situation where the subject operates a device by voice, and then the subject speaks one utterance to execute the task. In these methods, many subjects are required in order to build a large-scale corpus, which causes a problem of increasing lead time and financial cost. To solve this problem, we propose to incorporate a “probing question” into the cycle. Specifically, after a subject speaks one utterance, the subject is asked to think of alternative utterances to execute the same task. In this way, we obtain many utterances from a small number of subjects. An evaluation of the proposed method applied to interview-based corpus creation shows that the proposed method reduces the number of subjects by 41% while maintaining morphological diversity in a corpus and morphological coverage for user utterances spoken to commercial devices. It also shows that the proposed method reduces the total time for interviewing subjects by 36% compared with the conventional method. We conclude that the proposed method can be used to build a useful corpus while reducing lead time and financial cost. © 2019 Fuji Technology Press. All rights reserved.
引用
收藏
页码:947 / 955
页数:8
相关论文
共 8 条
  • [1] Efficient Corpus Creation Method for NLU Using Interview with Probing Questions
    Shima, Kazuaki
    Homma, Takeshi
    Motohashi, Masataka
    Ikeshita, Rintaro
    Kokubo, Hiroaki
    Obuchi, Yasunari
    She, Jinhua
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2019, 23 (05) : 947 - 955
  • [2] Encouraging the production of narrative responses to past-behaviour interview questions: effects of probing and information
    Brosy, Julie
    Bangerter, Adrian
    Ribeiro, Sandrine
    EUROPEAN JOURNAL OF WORK AND ORGANIZATIONAL PSYCHOLOGY, 2020, 29 (03) : 330 - 343
  • [3] A Method for Complex Schema Matching Using Corpus
    Qian, Ying
    Li, Yu-Xiang
    Zhang, Shuai
    Cui, Li
    2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 1, 2011, : 445 - 448
  • [4] Using the Web as an Efficient Source of Building an Arabic Corpus: Presentation and Evaluation
    Bakari, Wided
    Bellot, Patrice
    Neji, Mahmoud
    INNOVATION MANAGEMENT AND EDUCATION EXCELLENCE VISION 2020: FROM REGIONAL DEVELOPMENT SUSTAINABILITY TO GLOBAL ECONOMIC GROWTH, VOLS I - VI, 2016, : 3399 - 3412
  • [5] Code Suggestion of Method Call Statements using a Source Code Corpus
    Yamamoto, Tetsuo
    2017 24TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2017), 2017, : 666 - 671
  • [6] A Fast Probing Detection Method using Hybrid Machine Learning Algorithms
    Youm, Sungkwan
    Kim, Eui-Jik
    PROCEEDINGS OF THE 2019 THE TWENTIETH ACM INTERNATIONAL SYMPOSIUM ON MOBILE AD HOC NETWORKING AND COMPUTING (MOBIHOC '19), 2019, : 375 - 376
  • [7] Understanding Chinese MA Students' Interpersonal Stance of Anticipatory "It" Patterns: Using Corpus Results to Guide Questionnaire and Discourse-Based Interview
    Chen, Jing
    Jiang, Yi
    SAGE OPEN, 2025, 15 (01):
  • [8] Protocol for an interview-based method for mapping mental models using causal-loop diagramming and realist interviewing
    Kenzie, Erin S.
    Wakeland, Wayne
    Jetter, Antonie
    Lich, Kristen Hassmiller
    Seater, Mellodie
    Gunn, Rose
    Davis, Melinda M.
    EVALUATION AND PROGRAM PLANNING, 2024, 103