Fast word lattice generation in morphological analysis

被引:0
|
作者
机构
[1] Institute of Industrial Science, University of Tokyo
[2] National Institute of Informatics, Institute of Industrial Science, University of Tokyo
来源
| 1600年 / Japanese Society for Artificial Intelligence卷 / 29期
关键词
Fast algorithm; Morphological analysis; Unknown words; Word lattice;
D O I
10.1527/tjsai.29.268
中图分类号
学科分类号
摘要
This paper proposes a fast word lattice generation algorithm for Japanese morphological analysis. We conducted experiments on three Japanese data sets to demonstrate that the previously proposed pruning-based algorithm is in fact not efficient enough, and that the pipeline algorithm, which is introduced in this paper, achieves considerable speed-up without loss of accuracy. Moreover, the compactness of the lattice generated by the pipeline algorithm was investigated from both theoretical and empirical perspectives.
引用
收藏
页码:268 / 276
页数:8
相关论文
共 50 条
  • [1] Fast Morphological Analysis of Czech
    Smerk, Pavel
    RASLAN 2009: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2009, : 13 - 16
  • [2] A Hybrid Approach to Robust Word Lattice Generation Via Acoustic-Based Word Detection
    Han, Icksang
    Park, Chiyoun
    Cho, Jeongmi
    Kim, Jeongsu
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 210 - 213
  • [3] Improving Word Alignment Through Morphological Analysis
    Vuong Van Bui
    Thanh Trung Tran
    Nhat Bich Thi Nguyen
    Tai Dinh Pham
    Anh Ngoc Le
    Cuong Anh Le
    INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING, IUKM 2015, 2015, 9376 : 315 - 325
  • [4] Effective Integration of Automatic Word Spacing and Morphological Analysis in Korean
    Kim, Hongjin
    Kim, Harksoo
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 275 - 278
  • [5] UPDATING FIELD ASSOCIATION WORD DICTIONARY USING WORD ATTRIBUTES, MORPHOLOGICAL ANALYSIS, AND COMPOUND WORDS
    Atlam, El-Sayed
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (06): : 2097 - 2111
  • [6] MADAMIRA: A Fast, Comprehensive Tool for Morphological Analysis and Disambiguation of Arabic
    Pasha, Arfath
    Al-Badrashiny, Mohamed
    Diab, Mona
    El Kholy, Ahmed
    Eskander, Ramy
    Habash, Nizar
    Pooleery, Manoj
    Rambow, Owen
    Roth, Ryan M.
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1094 - 1101
  • [7] Tools for Fast Morphological Analysis Based on Finite State Automata
    Smerk, Pavel
    RASLAN 2014: RECENT ADVANCES IN SLAVONIC NATURAL LANGUAGE PROCESSING, 2014, : 147 - 150
  • [8] Morphological case and word order in Old English
    Pintzuk, S
    LANGUAGE SCIENCES, 2002, 24 (3-4) : 381 - 395
  • [9] Correcting Chinese Spelling Errors with Word Lattice Decoding
    Hsieh, Yu-Ming
    Bai, Ming-Hong
    Huang, Shu-Ling
    Chen, Keh-Jiann
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2015, 14 (04)
  • [10] Toward data-driven idea generation: Application of Wikipedia to morphological analysis
    Kwon, Heeyeul
    Park, Yongtae
    Geum, Youngjung
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2018, 132 : 56 - 80