Synergizing machine learning & symbolic methods: A survey on hybrid approaches to natural language processing

被引:1
作者
Panchendrarajan, Rrubaa [1 ]
Zubiaga, Arkaitz [1 ]
机构
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, 327 Mile End Rd, London E1 4NS, England
基金
英国科研创新办公室;
关键词
Hybrid NLP; Machine learning; Symbolic methods; Hybrid approaches; Natural language processing; GRAPH;
D O I
10.1016/j.eswa.2024.124097
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advancement of machine learning and symbolic approaches have underscored their strengths and weaknesses in Natural Language Processing (NLP). While machine learning approaches are powerful in identifying patterns in data, they often fall short in learning commonsense and the factual knowledge required for the NLP tasks. Meanwhile, the symbolic methods excel in representing knowledge-rich data. However, they struggle to adapt dynamic data and generalize the knowledge. Bridging these two paradigms through hybrid approaches enables the alleviation of weaknesses in both while preserving their strengths. Recent studies extol the virtues of this union, showcasing promising results in a wide range of NLP tasks. In this paper, we present an overview of hybrid approaches used for NLP. Specifically, we delve into the state-of-the-art hybrid approaches used for a broad spectrum of NLP tasks requiring natural language understanding, generation, and reasoning. Furthermore, we discuss the existing resources available for hybrid approaches for NLP along with the challenges and future directions, offering a roadmap for future research avenues.
引用
收藏
页数:15
相关论文
共 148 条
  • [1] Abels P, 2021, Arxiv, DOI arXiv:2102.02086
  • [2] Alec Radford, 2018, Improving Language Understanding by Generative Pre-Training
  • [3] Building for tomorrow: Assessing the temporal persistence of text classifiers
    Alkhalifa, Rabab
    Kochkina, Elena
    Zubiaga, Arkaitz
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (02)
  • [4] DBpedia: A nucleus for a web of open data
    Auer, Soeren
    Bizer, Christian
    Kobilarov, Georgi
    Lehmann, Jens
    Cyganiak, Richard
    Ives, Zachary
    [J]. SEMANTIC WEB, PROCEEDINGS, 2007, 4825 : 722 - +
  • [5] The structure of the FrameNet database
    Baker, CF
    Fillmore, CJ
    Cronin, B
    [J]. INTERNATIONAL JOURNAL OF LEXICOGRAPHY, 2003, 16 (03) : 281 - 296
  • [6] Bastings J., 2017, P 2017 C EMP METH NA, P1957, DOI DOI 10.48550/ARXIV.1806.08804
  • [7] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [8] The Unified Medical Language System (UMLS): integrating biomedical terminology
    Bodenreider, O
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : D267 - D270
  • [9] Bollacker K., 2008, P 2008 ACM SIGMOD IN, P1247, DOI DOI 10.1145/1376616.1376746
  • [10] Bordes A., 2013, Advances in neural information processing systems, V26