Keyphrase Extraction Using Sequential Patterns Mining Algorithm with One-Off and General Gaps Condition

被引:0
|
作者
Liu H.-T. [1 ,2 ]
Liu Z.-Z. [1 ,2 ]
Wang L.-L. [1 ,2 ]
Wu X.-D. [3 ,4 ]
机构
[1] Key Laboratory of Intelligent Computing and Signal Processing of the Ministry of Education, Anhui University, Hefei, 230039, Anhui
[2] School of Computer Science and Technology, Anhui University, Hefei, 230601, Anhui
[3] School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, 230009, Anhui
[4] School of Computing and Informatics, University of Louisiana at Lafayette, Lafayette
来源
Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2019年 / 47卷 / 05期
关键词
General gap; Keyphrase extraction; Machine learning; Sequential patterns mining;
D O I
10.3969/j.issn.0372-2112.2019.05.020
中图分类号
学科分类号
摘要
Keyphrases are used to summarize the document and high-quality keyphrases have great importance in text summarizing, reading and indexing. However, most studies of keyphrase extraction have strict limitation in the form of patterns, and are unable to achieve the semantic relation between words and phrases. The results are failure to autonomously extract keyphrases. Keyphrase extraction using sequential patterns mining with one-off and general gaps condition algorithm (KEING) is proposed in this paper. Taking into account one off condition and general gaps, SPING(Sequential Patterns mIning with oNe-off and General gaps condition)can catch semantic relations between words and phrases more effectively. Therefore, KEING will get effective candidate keyphrases and count their features. Then a supervised machine learning method is used to train features and construct a classification model, we can extract keyphrase with this model. Experimental results demonstrate KEING can effectively extract high quality keyphrases. © 2019, Chinese Institute of Electronics. All right reserved.
引用
收藏
页码:1121 / 1128
页数:7
相关论文
共 22 条
  • [1] Zhang J., Wang W., Wei X., Et al., Climate analytics workflow recommendation as a service-provenance-driven automatic workflow mashup, Proceedings of 2015 IEEE International Conference on Web Services, pp. 89-97, (2015)
  • [2] Zhao J.S., Zhu Q.M., Zhou G.D., Zhang L., Review of research in automatic keywordextraction, Journal of Software, 28, 9, pp. 2431-2449, (2017)
  • [3] Liu X., Song Y., Liu S., Et al., Automatic taxonomy construction from keywords, Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1433-1441, (2012)
  • [4] Luhn H.P., A statistical approach to mechanized encoding and searching of literary information, IBM Journal of Research and Development, 1, 4, pp. 309-317, (1957)
  • [5] Ma H.F., Liu F., Xia Q., Et al., Keywords extraction algorithm based on weighted hypergraph random walk, Acta Electronica Sinica, 46, 6, pp. 1410-1414, (2018)
  • [6] El-Beltagy S.R., Rafea A., KP-Miner: A keyphrase extraction system for English and Arabicdocuments, Information Systems, 34, 1, pp. 132-144, (2009)
  • [7] Witten I.H., Paynter G.W., Frank E., Et al., KEA: practical automatic keyphraseextraction, Proceedings of ACM Conference on Digital Libraries, pp. 254-255, (1999)
  • [8] Turney P.D., Learning algorithms for keyphraseextraction, Information Retrieval, 2, 4, pp. 303-336, (2000)
  • [9] Barker K., Cornacchia N., Using Noun Phrase Heads to Extract DocumentKeyphrases, Proceedings of Canadian Conference on AI 2000, pp. 40-52, (2000)
  • [10] Steier A.M., Belew R.K., Exporting phrases: a statistical analysis of topicallanguage, Second Symposium on Document Analysis & Information Retrieval, pp. 179-190, (1993)