Native Language Identification using Probabilistic Graphical Models

被引:0
作者
Nicolai, Garrett [1 ]
Islam, Md Asadul [1 ]
Greiner, Russ [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
来源
2013 INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT) | 2013年
关键词
NLI; Machine Learning; SVM; Bayesian Methods; TAN;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Native Language Identification (NLI) is the task of identifying the native language of an author of a text written in a second language. Support Vector Machines and Maximum Entrophy Learners are the most common methods used to solve this problem, but we consider it from the point-of-view of probabilistic graphical models. We hypothesize that graphical models are well-suited to this task, as they can capture feature inter-dependencies that cannot be exploited by SVMs. Using progressively more connected graphical models, we show that these models out-perform SVMs on reduced feature sets. Furthermore, on full feature sets, even naive Bayes increases accuracy from 82.06% to 83.41% over SVMs on a 5-language classification task.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Introducing XGL - a lexicalised probabilistic graphical lemmatiser for isiXhosa
    Mzamo, Lulamile
    Helberg, Albert
    Bosch, Sonja
    PROCEEDINGS OF THE 2015 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), 2015, : 142 - 147
  • [22] Using Language Models for Enhancing the Completeness of Natural-Language Requirements
    Luitel, Dipeeka
    Hassani, Shabnam
    Sabetzadeh, Mehrdad
    REQUIREMENTS ENGINEERING: FOUNDATION FOR SOFTWARE QUALITY, REFSQ 2023, 2023, 13975 : 87 - 104
  • [23] Probabilistic Forecasts of Flood Inundation Maps Using Surrogate Models
    Zanchetta, Andre D. L.
    Coulibaly, Paulin
    GEOSCIENCES, 2022, 12 (11)
  • [24] Probabilistic Forecasting of El Nino Using Neural Network Models
    Petersik, Paul Johannes
    Dijkstra, Henk A.
    GEOPHYSICAL RESEARCH LETTERS, 2020, 47 (06)
  • [25] Using Machine Learning to Evaluate and Enhance Models of Probabilistic Inference
    Gloeckner, Andreas
    Jekel, Marc
    Lisovoj, Daria
    DECISION-WASHINGTON, 2024, 11 (04): : 633 - 651
  • [26] Digital polycrystalline microstructure generation using diffusion probabilistic models
    Fernandez-Zelaia, Patxi
    Cheng, Jiahao
    Mayeur, Jason
    Ziabari, Amir Koushyar
    Kirka, Michael M.
    MATERIALIA, 2024, 33
  • [27] Predicting Author's Native Language Using Abstracts of Scholarly Papers
    Baba, Takahiro
    Baba, Kensuke
    Ikeda, Daisuke
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2018), 2018, 11177 : 448 - 453
  • [28] Determining Native Language and Deception Using Phonetic Features and Classifier Combination
    Gosztolya, Gabor
    Grosz, Tamos
    Busa-Fekete, Robert
    Toth, Laszlo
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2418 - 2422
  • [29] Probabilistic Topic Models
    Blei, David
    Carin, Lawrence
    Dunson, David
    IEEE SIGNAL PROCESSING MAGAZINE, 2010, 27 (06) : 55 - 65
  • [30] An Adaptive Probabilistic Graphical Model for Representing Skills in PbD Settings
    Dindo, Haris
    Schillaci, Guido
    PROCEEDINGS OF THE 5TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI 2010), 2010, : 89 - 90