Natural language processing: state of the art, current trends and challenges

被引:433
作者
Khurana, Diksha [1 ]
Koli, Aditya [1 ]
Khatter, Kiran [2 ]
Singh, Sukhdev
机构
[1] Manav Rachna Int Inst Res & Studies, Dept Comp Sci, Faridabad, India
[2] BML Munjal Univ, Dept Comp Sci, Gurgaon, India
关键词
Natural language processing; Natural language understanding; Natural language generation; NLP applications; NLP evaluation metrics; DOCUMENT; REPRESENTATION; KNOWLEDGE; EXTRACTION; MODELS; TEXT;
D O I
10.1007/s11042-022-13428-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Natural language processing (NLP) has recently gained much attention for representing and analyzing human language computationally. It has spread its applications in various fields such as machine translation, email spam detection, information extraction, summarization, medical, and question answering etc. In this paper, we first distinguish four phases by discussing different levels of NLP and components of Natural Language Generation followed by presenting the history and evolution of NLP. We then discuss in detail the state of the art presenting the various applications of NLP, current trends, and challenges. Finally, we present a discussion on some available datasets, models, and evaluation metrics in NLP.
引用
收藏
页码:3713 / 3744
页数:32
相关论文
共 159 条
  • [121] SCHERRER JR, 1994, METHOD INFORM MED, V33, P174
  • [122] Seal Dibyendu, 2020, Information and Communication Technology for Sustainable Development. Proceedings of ICT4SD 2018. Advances in Intelligent Systems and Computing (AISC 933), P423, DOI 10.1007/978-981-13-7166-0_42
  • [123] Sha F, 2003, HLT-NAACL 2003: HUMAN LANGUAGE TECHNOLOGY CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE MAIN CONFERENCE, P213
  • [124] Sharifirad S, 2019, ARXIV PREPRINT ARXIV
  • [125] Sharma Shashank, 2016, P LANGUAGE RESOURCES, P47
  • [126] Shemtov, 1997, AMBIGUITY MANAGEMENT
  • [127] Small SL., 1988, LEXICAL AMBIGUITY RE
  • [128] Pfam: multiple sequence alignments and HMM-profiles of protein domains
    Sonnhammer, ELL
    Eddy, SR
    Birney, E
    Bateman, A
    Durbin, R
    [J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (01) : 320 - 322
  • [129] Srihari S, 2010, MACHINE LEARNING GEN
  • [130] Sun X, 2008, P 22 INT C COMP LING, V1, P841