Automatic terminological collocations extraction from large corpus

被引:0
|
作者
Suarez, Octavio Santana [1 ]
Aguiar, Jose Perez [1 ]
Berriel, Isabel Sanchez [2 ]
Rodriguez, Virginia Gutierrez [2 ]
机构
[1] Univ Las Palmas Gran Canaria, Edificio Dept Informat & Matemat, Las Palmas Gran Canaria 35017, Spain
[2] Univ La Laguna, Edificio Fis & Matemat,Campus Univ Anchieta, San Cristobal la Laguna 38271, Spain
来源
PROCESAMIENTO DEL LENGUAJE NATURAL | 2011年 / 47期
关键词
automatic extraction of collocations; terminology; computational linguistics; text mining;
D O I
暂无
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
The automatic systems which deal with term's extractions constitute an important tool when they make reference to the labor of compilation of lexemes, which is restricted to a specific field or specialty. The textual analysis that are realized for this type of software must include strategies that could detect collocations in the field in which is done. In this topic is studied the viability of the use from extensive textual's corpus, that have not contain linguistic information, as happen with those textual's corpus that could be compiled from internet. The internet is used like a source of information for the recompilation of terminology's collocations. With that purpose is analyzed the behavior of different indicators based on the frequencies registered for a collection of economic terms in a Spanish corpus of 300.000 words.
引用
收藏
页码:145 / 152
页数:8
相关论文
共 50 条
  • [41] Automatic extraction of urban outdoor perception from geolocated free texts
    Frances A. Santos
    Thiago H. Silva
    Antonio A. F. Loureiro
    Leandro A. Villas
    Social Network Analysis and Mining, 2020, 10
  • [42] Terminological hybridity in institutional legal translation A corpus-driven analysis of key genres of EU and international law
    Prieto Ramos, Fernando
    Cerutti, Giorgina
    TERMINOLOGY, 2023, 29 (01): : 45 - 77
  • [43] Automatic Identification of Authors' Stylistics and Gender on the Basis of the Corpus of Russian Fiction Using Extended Set-theoretic Model with Collocation Extraction
    Osochkin, Alexandr
    Piotrowska, Xenia
    Fomin, Vladimir
    GLOTTOMETRICS, 2021, 50 : 76 - 89
  • [44] TERMINOLOGICAL INACCURACIES RESULTING FROM TRANSLATION IN FORENSIC LINGUISTICS
    Ramirez Salado, Mercedes
    REVISTA DE LINGUISTICA Y LENGUAS APLICADAS, 2021, 16 : 175 - 183
  • [45] Automatic Extraction of Potentially Contradictory Parameters from Specific Field Patent Texts
    Berdyugina, Daria
    Cavallucci, Denis
    CREATIVE SOLUTIONS FOR A SUSTAINABLE DEVELOPMENT (TFC 2021), 2021, 635 : 150 - 161
  • [46] Automatic Keyphrase Extraction from Persian Scientific Documents Using Semantic Relations
    Farahani, Bahare Davoodabadi
    Fatemi, Seied Omid
    Ghorbani, Mohsen
    2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019), 2019, : 1972 - 1978
  • [47] Automatic extraction of reference gene from literature in plants based on texting mining
    He Lin
    Shen Gengyu
    Li Fei
    Huang Shuiqing
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 12 (04) : 400 - 416
  • [48] Automatic Symptom Extraction from Texts to Enhance Knowledge Discovery on Rare Diseases
    Metivier, Jean-Philippe
    Serrano, Laurie
    Charnois, Thierry
    Cuissart, Bertrand
    Widloecher, Antoine
    ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2015), 2015, 9105 : 249 - 254
  • [49] Automatic Extraction of HLA-Disease Interaction Information from Biomedical Literature
    Chae, JeongMin
    Chae, JiEun
    Lee, Taemin
    Jung, YoungHee
    Oh, HeungBum
    Jung, SoonYoung
    ADVANCES IN COMPUTATIONAL SCIENCE AND ENGINEERING, 2009, 28 : 219 - +
  • [50] A Moral Judgment System Using an Automatic Created Moral Corpus
    Yamamoto, Masahiro
    Hagiwara, Masafumi
    2016 JOINT 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 17TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2016, : 616 - 621