Survey: Finite-state technology in natural language processing

被引:5
作者
Maletti, Andreas [1 ]
机构
[1] Univ Stuttgart, Inst Nat Language Proc, Pfaffenwaldring 5b, D-70569 Stuttgart, Germany
关键词
Finite-state automaton; Tree automaton; Context-free grammar; Natural language processing; Tokenization; Part-of-speech tagging; Parsing; Machine translation; MAXIMUM-LIKELIHOOD; PROBABILISTIC FUNCTIONS;
D O I
10.1016/j.tcs.2016.05.030
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this survey, we will discuss current uses of finite-state information in several statistical natural language processing tasks. To this end, we will review standard approaches in tokenization, part-of-speech tagging, and parsing, and illustrate the utility of finite-state information and technology in these areas. The particular problems were chosen to allow a natural progression from simple prediction to structured prediction. We aim for a sufficiently formal presentation suitable for readers with a background in automata theory that allows to appreciate the contribution of finite-state approaches, but we will not discuss practical issues outside the core ideas. We provide instructive examples and pointers into the relevant literature for all constructions. We close with an outlook on finite-state technology in statistical machine translation. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:2 / 17
页数:16
相关论文
共 50 条
  • [1] Finite-State Transducers with Multivalued Mappings for Processing of Rich Inflectional Languages
    Tukeyev, Ualsher
    Milosz, Marek
    Zhumanov, Zhandos
    NEW TRENDS IN INTELLIGENT INFORMATION AND DATABASE SYSTEMS, 2015, 598 : 271 - 280
  • [2] Natural Language Processing for Dialects of a Language: A Survey
    Joshi, Aditya
    Dabre, Raj
    Kanojia, Diptesh
    Li, Zhuang
    Zhan, Haolan
    Haffari, Gholamreza
    Dippold, Doris
    ACM COMPUTING SURVEYS, 2025, 57 (06)
  • [3] DEEP LEARNING IN NATURAL LANGUAGE PROCESSING: A STATE-OF-THE-ART SURVEY
    Chai, Junyi
    Li, Anming
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), 2019, : 535 - 540
  • [4] Natural language processing in finance: A survey
    Du, Kelvin
    Zhao, Yazhi
    Mao, Rui
    Xing, Frank
    Cambria, Erik
    INFORMATION FUSION, 2025, 115
  • [5] Natural Language Parsing: Using Finite State Automata
    Rangra, Rachana
    Madhusudan
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 456 - 463
  • [6] The fusion of fuzzy theories and natural language processing: A state-of-the-art survey
    Liu, Ming
    Zhang, Hongjun
    Xu, Zeshui
    Ding, Kun
    APPLIED SOFT COMPUTING, 2024, 162
  • [7] Natural language processing in the patent domain: a survey
    Jiang, Lekang
    Goetz, Stephan M.
    Artificial Intelligence Review, 2025, 58 (07)
  • [8] Quantum Natural Language Processing: A Comprehensive Survey
    Varmantchaonala, Charles M.
    Fendji, Jean Louis K. E.
    Schoning, Julius
    Atemkeng, Marcellin
    IEEE ACCESS, 2024, 12 : 99578 - 99598
  • [9] A Review of Natural Language Processing for Financial Technology
    Gao, Ruizhuo
    Zhang, Zeqi
    Shi, Zhenning
    Xu, Dan
    Zhang, Weijuan
    Zhu, Dewei
    INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [10] A Review of Natural Language Processing Technology for Chinese Language and Literature
    Zeng, Ling-Bin
    Su, Jing-Wen
    Yang, Cheng
    Qian, Yue
    2022 INTERNATIONAL COMMUNICATION ENGINEERING AND CLOUD COMPUTING CONFERENCE, CECCC, 2022, : 1 - 6