A Hybrid Approach for Arabic Multi-Word Term Extraction

Cited by: 0
Authors
Bounhas, Ibrahim [1 ]
Slimani, Yahya [1 ]
Affiliation
[1] Univ Tunis, Fac Sci Tunis, Dept Comp Sci, Tunis 1060, Tunisia
Source
IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING | 2009
Keywords
Arabic language processing; morpho-syntactic parsing; multi-word terms; terminology extraction;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Building a domain model from a specialized corpus requires identifying candidate terms as well as the semantic relations between them. Once constructed, this model can be used for many information retrieval tasks. Multi-word terms are of great importance in this process. On the one hand, they constitute domain-relevant candidate terms. On the other hand, the syntactic relations that link their constituents can be used to infer semantic relations between terms. In this paper, we propose to extract multi-word terms from Arabic specialized corpora. The proposed approach uses linguistic rules based on morphological features and part-of-speech (POS) tags to parse documents and retrieve candidate terms. Statistical measures are used to deal with ambiguities generated by the linguistic tools and to rank candidate terms according to their relevance. We present experiments on a corpus from the environment domain. We report high-quality results that confirm the targets set for the precision metric.
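The two stages described in the abstract (a linguistic filter over POS tags, followed by a statistical ranking) can be illustrated with a minimal sketch. This is not the authors' implementation: the POS patterns, the use of pointwise mutual information as the ranking measure, the function names, and the transliterated toy corpus are all assumptions made for illustration, since the abstract does not name the specific rules or measures.

```python
import math
from collections import Counter

# Hypothetical POS patterns for two-word candidate terms (an assumption,
# not taken from the paper): noun + noun and noun + adjective.
PATTERNS = {("NOUN", "NOUN"), ("NOUN", "ADJ")}


def extract_candidates(tagged_sentences):
    """Linguistic filter: count adjacent word pairs whose tags match a pattern."""
    candidates = Counter()
    for sentence in tagged_sentences:
        for (w1, t1), (w2, t2) in zip(sentence, sentence[1:]):
            if (t1, t2) in PATTERNS:
                candidates[(w1, w2)] += 1
    return candidates


def rank_by_pmi(candidates, tagged_sentences):
    """Statistical filter: rank candidates by pointwise mutual information.

    PMI is a stand-in here; the abstract only says "statistical measures".
    """
    words = Counter(w for s in tagged_sentences for w, _ in s)
    total_words = sum(words.values())
    total_pairs = sum(max(len(s) - 1, 0) for s in tagged_sentences)
    scores = {}
    for (w1, w2), freq in candidates.items():
        p_pair = freq / total_pairs
        p1 = words[w1] / total_words
        p2 = words[w2] / total_words
        scores[(w1, w2)] = math.log2(p_pair / (p1 * p2))
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy, already-tagged corpus with transliterated tokens (illustrative only).
corpus = [
    [("talawwuth", "NOUN"), ("al-hawa", "NOUN"), ("yazdad", "VERB")],
    [("talawwuth", "NOUN"), ("al-hawa", "NOUN"), ("khatir", "ADJ")],
    [("al-hawa", "NOUN"), ("naqi", "ADJ")],
]

candidates = extract_candidates(corpus)
ranking = rank_by_pmi(candidates, corpus)
for pair, score in ranking:
    print(" ".join(pair), round(score, 3))
```

In a real pipeline the tagged input would come from an Arabic morphological analyzer, and ambiguous analyses would produce competing candidates that the statistical stage has to resolve; the sketch skips that step and assumes one tag per token.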
Pages: 429 - 436
Page count: 8
Related papers
50 in total
  • [31] A Corpus Based Approach to Near Synonymy of German Multi-Word Expressions
    Huemmer, Christiane
    GWC 2004: SECOND INTERNATIONAL WORDNET CONFERENCE, PROCEEDINGS, 2003, : 142 - 149
  • [32] Chunks, multi-word units et cetera: The role of multi-word units in second language acquisition
    Aguado, Karin
    DEUTSCH ALS FREMDSPRACHE-ZEITSCHRIFT ZUR THEORIE UND PRAXIS DES FACHES DEUTSCH ALS FREMDSPRACHE, 2024, 61 (01):
  • [33] Phonological similarity in multi-word units
    Gries, Stefan Th.
    COGNITIVE LINGUISTICS, 2011, 22 (03) : 491 - 510
  • [34] Verbal Multi-Word Expressions in Yiddish
    Liebeskind, Chaya
    HaCohen-Kerner, Yaakov
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 205 - 216
  • [35] Lexical selection in multi-word production
    Janssen, Niels
    Caramazza, Alfonso
    FRONTIERS IN PSYCHOLOGY, 2011, 2
  • [36] A century in the life of multi-word verbs
    Claridge, C
    CORPUS-BASED STUDIES IN ENGLISH, 1997, (20): : 69 - 85
  • [37] Reactive multi-word synchronization for multiprocessors
    Ha, PH
    Tsigas, P
    12TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, PROCEEDINGS, 2003, : 184 - 193
  • [38] On the Structural Disambiguation of Multi-word Terms
    Cabezas-Garcia, Melania
    Leon-Arauz, Pilar
    COMPUTATIONAL AND CORPUS-BASED PHRASEOLOGY, EUROPHRAS 2019, 2019, 11755 : 46 - 60
  • [39] Analysis and evaluation of the use of the multi-word term in the indexing of a collection of photographic postcards
    Begona Lopez-Avila, Ma
    da Graca de Melo-Simoes, Maria
    Rodriguez-Bravo, Blanca
    DOCUMENTACION DE LAS CIENCIAS DE LA INFORMACION, 2021, 44 (01): : 79 - 85
  • [40] Hybrid approach to extraction of Arabic language terms
    Boulakiladel, Siharn
    Daille, Beatrice
    Aboutajdine, Driss
    INFORMATION MANAGEMENT IN THE MODERN ORGANIZATIONS: TRENDS & SOLUTIONS, VOLS 1 AND 2, 2008, : 999 - 1002