Combining Dependency Parsing and a Lexical Network Based on Lexical Functions for the Identification of Collocations

被引：4

作者：

Fonseca, Alexsandro ^{[1
]}

Sadat, Fatiha ^{[1
]}

Lareau, Francois ^{[2
]}

机构：

[1] Univ Quebec Montreal, Comp Sci Dept, 201 President Kennedy Ave, Montreal, PQ H2X 3Y7, Canada

[2] Univ Montreal, Linguist & Translat Dept, CP 6128,Succ Ctr Ville, Montreal, PQ H3C 3J7, Canada

来源：

COMPUTATIONAL AND CORPUS-BASED PHRASEOLOGY, EUROPHRAS 2017 | 2017年 / 10596卷

关键词：

Meaning-Text Theory; Lexical function; Collocation identification; Dependency parsing; Lexical network;

D O I：

10.1007/978-3-319-69805-2_31

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A collocation is a type of multiword expression formed by two parts: a base and a collocate. Usually, in a collocation, the base has a denotative or literal meaning, while the collocate has a connotative meaning. Examples of collocations: pay attention, easy as pie, strongly condemn, lend support, etc. The Meaning-Text Theory created the lexical functions to, among other objectives, represent the meaning existing between the base and the collocate or to represent the relation between the base and a support verb. For example, the lexical function Magn represents the meaning intensification, while the lexical function Caus, applied to a base, returns the support verb that represents the causality of the action expressed in the collocation. In a dependency parsing, each word (dependent) is directly associated with its governor in a phrase. In this paper, we show how we combine dependency parsing to extract collocation candidates and a lexical network based on lexical functions to identify the true collocations from the candidates. The candidates are extracted from a French corpus according to 14 dependency relations. The collocations identified are classified according to the semantic group of the lexical functions modeling them. We obtained a general precision (for all dependency types) of 76.3%, with a precision higher than 95% for collocations having certain dependency relations. We also found that about 86% of collocations identified belong to only four semantic categories: qualification, support verb, location and action/event.

引用

页码：447 / 461

页数：15

共 50 条

[41] Precise Information Identification Method of Power Equipment Defect Text Based on Dependency Parsing
Shao G.
Wang H.
Wu X.
Lu J.
Li J.
He B.
Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2020, 44 (12): : 178 - 185
[42] A lexical-availability-based framework from short communications for automatic personality identification
Ramirez-de-la-Rosa, Gabriela
Jimenez-Salazar, Hector
Villatoro-Tello, Esau
Reyes-Meza, Veronica
Rojas-Avila, Jaime
COGNITIVE SYSTEMS RESEARCH, 2023, 79 : 126 - 137
[43] TranSentGAT: A Sentiment-Based Lexical Psycholinguistic Graph Attention Network for Personality Prediction
Bajestani, Shahryar Salmani
Khalilzadeh, Mohammad Mahdi
Azarnoosh, Mahdi
Kobravi, Hamid Reza
IEEE ACCESS, 2024, 12 : 59630 - 59642
[44] English Lexical Analysis System of Machine Translation Based on Simple Recurrent Neural Network
Zhu, Jingyan
Computational Intelligence and Neuroscience, 2022, 2022
[45] Automatic Text Document Summarization Using Graph Based Centrality Measures on Lexical Network
Yadav, Chandra Shakhar
Sharan, Aditi
INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2018, 8 (03) : 14 - 32
[46] English Lexical Analysis System of Machine Translation Based on Simple Recurrent Neural Network
Zhu, Jingyan
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[47] Sentiment Analysis of Online Users'Negative Emotions Based on Graph Convolutional Network and Dependency Parsing
Fan T.
Wang H.
Wu P.
Data Analysis and Knowledge Discovery, 2021, 5 (09) : 97 - 106
[48] A corpus based analysis of lexical richness of Beijing Mandarin speakers: variable identification and model construction
Zhang, Yanhui
LANGUAGE SCIENCES, 2014, 44 : 60 - 69
[49] Combining Lexical, Host, and Content-based features for Phishing Websites detection using Machine Learning Models
Hamadouche, Samiya
Boudraa, Ouadjih
Gasmi, Mohamed
EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2024, 11 (06):
[50] Fuzzy Contrast Set Based Deep Attention Network for Lexical Analysis and Mental Health Treatment
Ahmed, Usman
Lin, Jerry Chun-Wei
Srivastava, Gautam
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (05)

← 1 2 3 4 5 →