Automatic Keyphrase Extraction from Persian Scientific Documents Using Semantic Relations

Cited by: 0
Authors
Farahani, Bahare Davoodabadi [1 ]
Fatemi, Seied Omid [1 ]
Ghorbani, Mohsen [1 ]
Affiliation
[1] Coll Engn, Sch ECE, Tehran, Iran
Source
2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019) | 2019
Keywords
Automatic keyphrase extraction; indexing; thesaurus; semantic relations; text mining
DOI
10.1109/iraniancee.2019.8786696
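Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code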
0808 ; 0809 ;
Abstract
With the rapid growth of Persian scientific documents in online databases, the need for quick access to these documents has increased in recent years. As a common way to provide a succinct summary of a document, Automatic Keyphrase Extraction (AKE) plays an important role in applications such as indexing, summarization, and information retrieval. Although many AKE methods have been proposed, they have not performed satisfactorily on Persian texts. The main reasons are their dependence on linguistic resources, training datasets, and specialized NLP tools that are not adequately available for Persian. To meet the need for an effective and efficient AKE method for Persian documents, we propose a novel thesaurus-based method. The method leverages semantic relations between the words of a given document to construct graphs of related words, and then uses a scientific and technical thesaurus to extract candidate keyphrases that are semantically and grammatically correct. Final keyphrases are selected with a weighting method in which the position of a phrase in the text (e.g., abstract or body) affects its weight. The experimental results show that our method outperforms comparable work on Persian texts under three evaluation metrics.
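The abstract names three ingredients: a thesaurus, a graph of semantically related words, and position-dependent weighting. The following is a minimal Python sketch of how such pieces could fit together, not the authors' implementation. The toy English thesaurus, the section weights, the degree-based boost, and all function names are assumptions made purely for illustration; the record does not specify any of them.

```python
from collections import defaultdict

# Hypothetical toy thesaurus: term -> related terms (not taken from the paper).
THESAURUS = {
    "keyphrase extraction": {"text mining", "indexing"},
    "text mining": {"keyphrase extraction", "information retrieval"},
    "indexing": {"keyphrase extraction"},
    "information retrieval": {"text mining"},
    "power grid": {"transformer"},
    "transformer": {"power grid"},
}

# Assumed position weights: earlier, more prominent sections count more.
SECTION_WEIGHTS = {"title": 3.0, "abstract": 2.0, "body": 1.0}


def find_terms(sections):
    """Return {term: position-weighted count} for thesaurus terms in the text."""
    scores = defaultdict(float)
    for name, text in sections.items():
        lowered = text.lower()
        for term in THESAURUS:
            count = lowered.count(term)  # naive substring match, sketch only
            if count:
                scores[term] += count * SECTION_WEIGHTS.get(name, 1.0)
    return dict(scores)


def build_graph(terms):
    """Link two found terms when the thesaurus lists them as related."""
    graph = {t: set() for t in terms}
    for t in terms:
        for related in THESAURUS[t]:
            if related in graph:
                graph[t].add(related)
                graph[related].add(t)
    return graph


def rank_keyphrases(sections, top_k=3):
    """Rank candidates by weighted count, boosted by graph degree (assumed)."""
    scores = find_terms(sections)
    graph = build_graph(scores)
    ranked = {t: s * (1 + len(graph[t])) for t, s in scores.items()}
    # Sort by descending score, then alphabetically for a stable order.
    return sorted(ranked, key=lambda t: (-ranked[t], t))[:top_k]
```

Calling `rank_keyphrases` with a dictionary that maps section names such as `"title"`, `"abstract"`, and `"body"` to their text returns the highest-scoring thesaurus terms. A real system for Persian would need proper tokenization and normalization in place of the substring matching used here.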
Pages: 1972-1978
Number of pages: 7
Related papers
50 in total
  • [1] An automatic keyphrase extraction system for scientific documents
    Wei You
    Dominique Fontaine
    Jean-Paul Barthès
    Knowledge and Information Systems, 2013, 34 : 691 - 724
  • [2] An automatic keyphrase extraction system for scientific documents
    You, Wei
    Fontaine, Dominique
    Barthes, Jean-Paul
    KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 34 (03) : 691 - 724
  • [3] Automatic Keyphrase Extraction from Scientific Documents Using N-gram Filtration Technique
    Kumar, Niraj
    Srinathan, Kannan
    DOCENG'08: PROCEEDINGS OF THE EIGHTH ACM SYMPOSIUM ON DOCUMENT ENGINEERING, 2008, : 199 - 208
  • [4] Automatic Extraction of Semantic Relations from Text Documents
    Ta, Chien D. C.
    Tuoi Phan Thi
    FUTURE DATA AND SECURITY ENGINEERING, FDSE 2016, 2016, 10018 : 344 - 351
  • [5] Automatic Keyphrase Extraction from Medical Documents
    Sarkar, Kamal
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PROCEEDINGS, 2009, 5909 : 273 - 278
  • [6] Automatic keyphrase extraction from scientific articles
    Su Nam Kim
    Olena Medelyan
    Min-Yen Kan
    Timothy Baldwin
    Language Resources and Evaluation, 2013, 47 : 723 - 742
  • [7] Automatic keyphrase extraction from scientific articles
    Kim, Su Nam
    Medelyan, Olena
    Kan, Min-Yen
    Baldwin, Timothy
    LANGUAGE RESOURCES AND EVALUATION, 2013, 47 (03) : 723 - 742
  • [8] Automatic keyphrase extraction from chinese news documents
    Wang, HF
    Li, SJ
    Yu, SW
    FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, PT 2, PROCEEDINGS, 2005, 3614 : 648 - 657
  • [9] Improved Automatic Keyphrase Extraction by Using Semantic Information
    Wang, XiaoLing
    Mu, DeJun
    Fang, Jun
    INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL 1, PROCEEDINGS, 2008, : 1061 - 1065
  • [10] Automatic keyphrase annotation of scientific documents using Wikipedia and genetic algorithms
    Joorabchi, Arash
    Mahdi, Abdulhussain E.
    JOURNAL OF INFORMATION SCIENCE, 2013, 39 (03) : 410 - 426