A supervised keyphrase extraction method based on the logistic regression model for social question answering sites

被引:0
|
作者
Lin, Ge [1 ,2 ]
Xiang, Yi [2 ]
Wang, Zhong [1 ]
Wang, Ruomei [2 ]
机构
[1] School of Information Science and Technology, Sun Yat-sen University
来源
Journal of Information and Computational Science | 2014年 / 11卷 / 10期
关键词
Keyphrase extraction; Machine learning; SQA sites;
D O I
10.12733/jics20104019
中图分类号
学科分类号
摘要
This paper proposes a supervised machine learning method for the problem of automatic keyphrase extraction for Social Question Answering (SQA) sites. The method is developed by: 1) Analyzing the structural and activity characteristics of typical SQA sites, 2) Developing and categorizing four types of calculation features that can describe those characteristics, and 3) Developing customized logistic regression model to be trained by the real dataset from six popular SQA sites, in both English and Chinese. Experimental results show the influences from those proposed SQA related features vary, some are helpful to keyphrase extraction for SQA sites of both languages while some are only useful for a specific site. The results also demonstrate a generally better performance comparing to a typical keyphrase extraction algorithms published previously like KEA. 1548-7741/Copyright © 2014 Binary Information Press.
引用
收藏
页码:3571 / 3583
页数:12
相关论文
共 50 条
  • [41] A precision-based diagnostic model ADOBE-accurate detection of breast cancer using logistic regression approach
    Venkatesh, Veeramuthu
    Raj, M. M. Anishin
    Sajith, K. Mohamed
    Anushiadevi, R.
    Praba, Suriya T.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (06) : 8419 - 8426
  • [42] A regression model-based method for indoor positioning with compound location fingerprints
    Takayama, Tomofumi
    Umezawa, Takeshi
    Komuro, Nobuyoshi
    Osawa, Noritaka
    GEO-SPATIAL INFORMATION SCIENCE, 2019, 22 (02) : 107 - 113
  • [43] Neural Network Based Regression Model for Virtual Machines Migration Method Selection
    Altahat, Mohammad A.
    Agarwal, Anjali
    Goel, Nishith
    Zaman, Marzia
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [44] Using cognition and risk to explain the intention-behavior gap on bioenergy production: Based on machine learning logistic regression method
    He, Ke
    Ye, Lihong
    Li, Fanlue
    Chang, Huayi
    Wang, Anbang
    Luo, Sixuan
    Zhang, Junbiao
    ENERGY ECONOMICS, 2022, 108
  • [45] Quantitative risk assessment of submersible pump components using Interval number-based Multinomial Logistic Regression(MLR) model
    Bhattacharjee, Pushparenu
    Dey, Vidyut
    Mandal, U. K.
    Paul, Susmita
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2022, 226
  • [46] BIM investment decision model (BIDM): evaluation of features and proposal of a regression model based on the LASSO method
    Zheng, Yu
    Tang, Llewellyn
    Chau, Kwong Wing
    JOURNAL OF FINANCIAL MANAGEMENT OF PROPERTY AND CONSTRUCTION, 2024,
  • [47] Multi-language Person Social Relation Extraction Model Based on Distant Supervision
    Huang, Yangchen
    Jia, Yan
    Huang, Jiuming
    He, Zhonghe
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 368 - 374
  • [48] An In-Hospital Mortality Risk Model for Elderly Patients Undergoing Cardiac Valvular Surgery Based on LASSO-Logistic Regression and Machine Learning
    Zhu, Kun
    Lin, Hongyuan
    Yang, Xichun
    Gong, Jiamiao
    An, Kang
    Zheng, Zhe
    Hou, Jianfeng
    JOURNAL OF CARDIOVASCULAR DEVELOPMENT AND DISEASE, 2023, 10 (02)
  • [49] Comparison of machine learning and conventional logistic regression-based prediction models for gestational diabetes in an ethnically diverse population; the Monash GDM Machine learning model
    Belsti, Yitayeh
    Moran, Lisa
    Du, Lan
    Mousa, Aya
    De Silva, Kushan
    Enticott, Joanne
    Teede, Helena
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 179
  • [50] Disease Concept-Embedding Based on the Self-Supervised Method for Medical Information Extraction from Electronic Health Records and Disease Retrieval: Algorithm Development and Validation Study
    Chen, Yen-Pin
    Lo, Yuan-Hsun
    Lai, Feipei
    Huang, Chien-Hua
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (01)