Automated Classification of Author's Sentiments in Citation Using Machine Learning Techniques: A Preliminary Study

Cited by: 0
Authors
Kim, In Cheol [1]
Thoma, George R. [1]
Affiliations
[1] Natl Lib Med, Lister Hill Natl Ctr Biomed Commun, 8600 Rockville Pike, Bethesda, MD 20894 USA
Keywords
Citation analysis; author's sentiments; Comment-on; support vector machine; n-grams word statistics; MEDLINE
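DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract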
Scientific papers generally include citations to external sources such as journal articles, books, or Web links to refer to works that are related in an important way to the research. The reason for the citation appears within the sentences surrounding the citation tag in the body text, and represents the relationship between the citing and cited works as supportive, contrastive, corrective, etc. This could be an important clue for researchers seeking relevant previous work or approaches for a certain research purpose. We propose to develop an automated method to identify the citing author's sentiments toward the cited external sources, as expressed in citation sentences, using machine-learning techniques and linguistic cues. As a preliminary study, this paper presents a support vector machine (SVM)-based text categorization technique to classify the author's sentiments specifically toward Comment-on (CON) articles. CON, a MEDLINE citation field, indicates previously published articles commented on by the authors of a given article, who may express complimentary or contradictory opinions. An SVM with a radial basis function (RBF) kernel is implemented, and input feature vectors for the SVM are created from n-gram word statistics representing the distribution of words in CON sentences. Experiments conducted on a set of CON sentences collected from 414 different online biomedical journal titles show that the SVM with an RBF kernel yields the best result for an input feature vector combining uni-gram and bi-gram word statistics.
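The feature construction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the whitespace tokenizer, the use of raw counts as the "word statistics", the value of gamma, the example sentences, and the function names (ngram_counts, build_vocabulary, to_vector, rbf_kernel) are all assumptions made for illustration. The sketch builds a combined uni-gram and bi-gram count vector for each sentence and evaluates the RBF kernel, K(x, y) = exp(-gamma * ||x - y||^2), that an SVM would use to compare two such vectors.

```python
import math
from collections import Counter


def ngram_counts(sentence, n_values=(1, 2)):
    """Count word n-grams (uni-grams and bi-grams by default) in one sentence."""
    tokens = sentence.lower().split()  # assumption: simple whitespace tokenizer
    counts = Counter()
    for n in n_values:
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def build_vocabulary(sentences, n_values=(1, 2)):
    """Map every n-gram seen in the sentences to a fixed vector index."""
    grams = sorted({g for s in sentences for g in ngram_counts(s, n_values)})
    return {gram: idx for idx, gram in enumerate(grams)}


def to_vector(sentence, vocab, n_values=(1, 2)):
    """Turn a sentence into a count vector over the vocabulary."""
    vector = [0.0] * len(vocab)
    for gram, count in ngram_counts(sentence, n_values).items():
        if gram in vocab:  # n-grams outside the vocabulary are ignored
            vector[vocab[gram]] = float(count)
    return vector


def rbf_kernel(x, y, gamma=0.1):
    """Radial basis function kernel: exp(-gamma * squared Euclidean distance)."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)


# Hypothetical citation sentences, invented for this example only.
sentences = [
    "we confirm the earlier findings",
    "we dispute the earlier findings",
]
vocab = build_vocabulary(sentences)
x_support = to_vector(sentences[0], vocab)
x_contrast = to_vector(sentences[1], vocab)
similarity = rbf_kernel(x_support, x_contrast)
```

In practice these vectors and a label per sentence would be passed to an SVM trainer; an existing SVM library would normally handle the kernel computation and optimization rather than hand-written code.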
Pages: 488-494
Number of pages: 7