Selectivity-Based Keyword Extraction Method

被引:33
作者
Beliga, Slobodan [1 ]
Mestrovic, Ana [1 ]
Martincic-Ipsic, Sanda [1 ]
机构
[1] Univ Rijeka, Dept Informat, Rijeka, Croatia
关键词
Centrality Measures; Complex Network; Generalized Selectivity; Graph-Based Keyword Extraction; Keyword Expansion; Keyword Extraction; Keyword Ranking; Selectivity; CENTRALITY; LANGUAGE;
D O I
10.4018/IJSWIS.2016070101
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work the authors propose a novel Selectivity-Based Keyword Extraction (SBKE) method, which extracts keywords from the source text represented as a network. The node selectivity value is calculated from a weighted network as the average weight distributed on the links of a single node and is used in the procedure of keyword candidate ranking and extraction. The authors show that selectivity-based keyword extraction slightly outperforms an extraction based on the standard centrality measures: in/out-degree, betweenness and closeness. Therefore, they include selectivity and its modification - generalized selectivity as node centrality measures in the SBKE method. Selectivity-based extraction does not require linguistic knowledge as it is derived purely from statistical and structural information of the network. The experimental results point out that selectivity-based keyword extraction has a great potential for the collection-oriented keyword extraction task.
引用
收藏
页码:1 / 26
页数:26
相关论文
共 40 条
[1]   A keyword extraction method from twitter messages represented as graphs [J].
Abilhoa, Willyan D. ;
de Castro, Leandro N. .
APPLIED MATHEMATICS AND COMPUTATION, 2014, 240 :308-325
[2]  
Ahel R., 2009, THE FUTURE OF INFORM, P207
[3]  
[Anonymous], 2009, THESIS U WAIKATO
[4]  
[Anonymous], 2008, P 7 PYTHON SCI C
[5]  
[Anonymous], 2004, P 2004 C EMP METH NA
[6]  
[Anonymous], 2009, NATURAL LANGUAGE PRO, DOI DOI 10.1007/S10579-010-9124-X
[7]  
Bekavac M, 2013, P 4 BIENN INT WORKSH, P43
[8]  
Beliga S., 2014, CEUR P SDSW 2014 RIV, V1310, P1
[9]  
Beliga S, 2015, J INF ORGAN SCI, V39, P1
[10]  
Boudin F., 2013, INT JOINT C NAT LANG, P834