Using Semantic Models to Analyze Wikipedia Articles

被引:0
作者
Chen, Lin-Chih
机构
来源
INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMMUNICATION ENGINEERING (CSCE 2015) | 2015年
关键词
Web; 2.0; Wikipedia; Latent semantic analysis; Probabilistic latent semantic analysis;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The network architecture based on Web 2.0 is an example of the most common applications in use today. A famous example is Wikipedia, which is a free Internet encyclopedia that is created and maintained by general users. Like this site, users can change anything on it (subject to restrictions and further changes or reversions). Since the entry and maintenance of subject material can be edited by any Internet visitor, the information should not be considered to be complete or the final answer. Thus, this situation is easy to lead to the problem of information overload. The information overload always is caused by two main issues, synonyms (two terms are syntactically interchangeable expressions) and polysemy (a term has different meanings). In this paper, we adopt two semantic models, LSA (Latent Semantic Analysis) and PLSA (Probabilistic Latent Semantic Analysis) to solve these two issues. According to the results of simulation analysis, we conclude that using multiple semantic analysis techniques can give significant performance improvements.
引用
收藏
页码:170 / 176
页数:7
相关论文
共 50 条
  • [1] Latent Semantic Analysis Models on Wikipedia and TASA
    Stefanescu, Dan
    Banjade, Rajendra
    Rus, Vasile
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 1417 - 1422
  • [2] Computing semantic similarity based on novel models of semantic representation using Wikipedia
    Qu, Rong
    Fang, Yongyi
    Bai, Wen
    Jiang, Yuncheng
    INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (06) : 1002 - 1021
  • [3] Exploring the Domain of Information "Users": Semantic Analysis of Wikipedia Articles
    Joo, Soohyung
    JOURNAL OF LIBRARY AND INFORMATION STUDIES, 2020, 18 (01): : 1 - 23
  • [4] Semantic Wikipedia
    Kroetzsch, Markus
    Vrandecic, Denny
    Voelkel, Max
    Haller, Heiko
    Studer, Rudi
    JOURNAL OF WEB SEMANTICS, 2007, 5 (04): : 251 - 261
  • [5] Controversy detection in Wikipedia using semantic dissimilarity
    Jhandir, M. Zeeshan
    Tenvir, Ali
    On, Byung-Won
    Lee, Ingyu
    Choi, Gyu Sang
    INFORMATION SCIENCES, 2017, 418 : 581 - 600
  • [6] Computing semantic relatedness using Wikipedia features
    Taieb, Mohamed Ali Hadj
    Ben Aouicha, Mohamed
    Ben Hamadou, Abdelmajid
    KNOWLEDGE-BASED SYSTEMS, 2013, 50 : 260 - 278
  • [7] Automatic Extraction of Semantic Concept-Relation Triple Pattern from Wikipedia Articles
    Choi, Junho
    Choi, Chang
    Choi, Dongjin
    Kim, Jungin
    Kim, Pankoo
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (07): : 2755 - 2770
  • [8] INFORMATION UNIQUENESS IN WIKIPEDIA ARTICLES
    Kirtsis, Nikos
    Stamou, Sofia
    Tzekou, Paraskevi
    Zotos, Nikos
    WEBIST 2010: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGY, VOL 2, 2010, : 137 - 143
  • [9] Societal Controversies in Wikipedia Articles
    Borra, Erik
    Kaltenbrunner, Andreas
    Mauri, Michele
    Weltevrede, Esther
    Laniado, David
    Rogers, Richard
    Ciuccarelli, Paolo
    Magni, Giovanni
    Venturini, Tommaso
    CHI 2015: PROCEEDINGS OF THE 33RD ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2015, : 193 - 196
  • [10] Measuring Semantic Similarity between Words Using Wikipedia
    Lu Zhiqiang
    Shao Werimin
    Yu Zhenhua
    WISM: 2009 INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, : 251 - +