Combining Dependency Parsing and a Lexical Network Based on Lexical Functions for the Identification of Collocations

被引:4
|
作者
Fonseca, Alexsandro [1 ]
Sadat, Fatiha [1 ]
Lareau, Francois [2 ]
机构
[1] Univ Quebec Montreal, Comp Sci Dept, 201 President Kennedy Ave, Montreal, PQ H2X 3Y7, Canada
[2] Univ Montreal, Linguist & Translat Dept, CP 6128,Succ Ctr Ville, Montreal, PQ H3C 3J7, Canada
关键词
Meaning-Text Theory; Lexical function; Collocation identification; Dependency parsing; Lexical network;
D O I
10.1007/978-3-319-69805-2_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A collocation is a type of multiword expression formed by two parts: a base and a collocate. Usually, in a collocation, the base has a denotative or literal meaning, while the collocate has a connotative meaning. Examples of collocations: pay attention, easy as pie, strongly condemn, lend support, etc. The Meaning-Text Theory created the lexical functions to, among other objectives, represent the meaning existing between the base and the collocate or to represent the relation between the base and a support verb. For example, the lexical function Magn represents the meaning intensification, while the lexical function Caus, applied to a base, returns the support verb that represents the causality of the action expressed in the collocation. In a dependency parsing, each word (dependent) is directly associated with its governor in a phrase. In this paper, we show how we combine dependency parsing to extract collocation candidates and a lexical network based on lexical functions to identify the true collocations from the candidates. The candidates are extracted from a French corpus according to 14 dependency relations. The collocations identified are classified according to the semantic group of the lexical functions modeling them. We obtained a general precision (for all dependency types) of 76.3%, with a precision higher than 95% for collocations having certain dependency relations. We also found that about 86% of collocations identified belong to only four semantic categories: qualification, support verb, location and action/event.
引用
收藏
页码:447 / 461
页数:15
相关论文
共 50 条
  • [41] Precise Information Identification Method of Power Equipment Defect Text Based on Dependency Parsing
    Shao G.
    Wang H.
    Wu X.
    Lu J.
    Li J.
    He B.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2020, 44 (12): : 178 - 185
  • [42] A lexical-availability-based framework from short communications for automatic personality identification
    Ramirez-de-la-Rosa, Gabriela
    Jimenez-Salazar, Hector
    Villatoro-Tello, Esau
    Reyes-Meza, Veronica
    Rojas-Avila, Jaime
    COGNITIVE SYSTEMS RESEARCH, 2023, 79 : 126 - 137
  • [43] TranSentGAT: A Sentiment-Based Lexical Psycholinguistic Graph Attention Network for Personality Prediction
    Bajestani, Shahryar Salmani
    Khalilzadeh, Mohammad Mahdi
    Azarnoosh, Mahdi
    Kobravi, Hamid Reza
    IEEE ACCESS, 2024, 12 : 59630 - 59642
  • [44] English Lexical Analysis System of Machine Translation Based on Simple Recurrent Neural Network
    Zhu, Jingyan
    Computational Intelligence and Neuroscience, 2022, 2022
  • [45] Automatic Text Document Summarization Using Graph Based Centrality Measures on Lexical Network
    Yadav, Chandra Shakhar
    Sharan, Aditi
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2018, 8 (03) : 14 - 32
  • [46] English Lexical Analysis System of Machine Translation Based on Simple Recurrent Neural Network
    Zhu, Jingyan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [47] Sentiment Analysis of Online Users'Negative Emotions Based on Graph Convolutional Network and Dependency Parsing
    Fan T.
    Wang H.
    Wu P.
    Data Analysis and Knowledge Discovery, 2021, 5 (09) : 97 - 106
  • [48] A corpus based analysis of lexical richness of Beijing Mandarin speakers: variable identification and model construction
    Zhang, Yanhui
    LANGUAGE SCIENCES, 2014, 44 : 60 - 69
  • [49] Combining Lexical, Host, and Content-based features for Phishing Websites detection using Machine Learning Models
    Hamadouche, Samiya
    Boudraa, Ouadjih
    Gasmi, Mohamed
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (06):
  • [50] Fuzzy Contrast Set Based Deep Attention Network for Lexical Analysis and Mental Health Treatment
    Ahmed, Usman
    Lin, Jerry Chun-Wei
    Srivastava, Gautam
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)