Native Language Identification using Probabilistic Graphical Models

被引:0
作者
Nicolai, Garrett [1 ]
Islam, Md Asadul [1 ]
Greiner, Russ [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
来源
2013 INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT) | 2013年
关键词
NLI; Machine Learning; SVM; Bayesian Methods; TAN;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Native Language Identification (NLI) is the task of identifying the native language of an author of a text written in a second language. Support Vector Machines and Maximum Entrophy Learners are the most common methods used to solve this problem, but we consider it from the point-of-view of probabilistic graphical models. We hypothesize that graphical models are well-suited to this task, as they can capture feature inter-dependencies that cannot be exploited by SVMs. Using progressively more connected graphical models, we show that these models out-perform SVMs on reduced feature sets. Furthermore, on full feature sets, even naive Bayes increases accuracy from 82.06% to 83.41% over SVMs on a 5-language classification task.
引用
收藏
页数:6
相关论文
共 50 条
[31]   Protein Language Models and Machine Learning Facilitate the Identification of Antimicrobial Peptides [J].
Medina-Ortiz, David ;
Contreras, Seba ;
Fernandez, Diego ;
Soto-Garcia, Nicole ;
Moya, Ivan ;
Cabas-Mora, Gabriel ;
Olivera-Nappa, Alvaro .
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (16)
[32]   Mechanistic modeling of social conditions in disease-prediction simulations via copulas and probabilistic graphical models: HIV case study [J].
Khosheghbal, Amir ;
Haas, Peter J. ;
Gopalappa, Chaitra .
HEALTH CARE MANAGEMENT SCIENCE, 2025, 28 (01) :28-49
[33]   Efficient learning of discrete graphical models* [J].
Vuffray, Marc ;
Misra, Sidhant ;
Lokhov, Andrey Y. .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2021, 2021 (12)
[34]   Acoustic Language Identification Using Fast Discriminative Training [J].
Castaldo, Fabio ;
Colibro, Daniele ;
Dalmasso, Emanuele ;
Laface, Pietro ;
Vair, Claudio .
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, :389-+
[35]   Identification of Spoken Language using Machine Learning Approach [J].
Shahriar, Md Asif ;
Aziz, Iftekhar ;
Banik, Shovan ;
Sattar, Abdus .
2020 23RD INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT 2020), 2020,
[36]   Towards Using Probabilistic Models to Design Software Systems with Inherent Uncertainty [J].
Serban, Alex ;
Poll, Erik ;
Visser, Joost .
SOFTWARE ARCHITECTURE (ECSA 2020), 2020, 12292 :89-97
[37]   Cyber Threat Hunting Using Large Language Models [J].
Tanksale, Vinayak .
PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, VOL 5, ICICT 2024, 2024, 1000 :629-641
[38]   Probabilistic Graphical Model on Detecting Insiders: Modeling with SGD-HMM [J].
Saaudi, Ahmed ;
Tong, Yan ;
Farkas, Csilla .
PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY (ICISSP), 2019, :461-470
[39]   A Probabilistic Graphical Model-Based Approach for the Label Ranking Problem [J].
Carlos Alfaro, Juan ;
Gonzalez Rodrigo, Enrique ;
Angel Aledo, Juan ;
Antonio Gamez, Jose .
SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, ECSQARU 2019, 2019, 11726 :351-362
[40]   Probabilistic models for melodic prediction [J].
Paiement, Jean-Francois ;
Bengio, Samy ;
Eck, Douglas .
ARTIFICIAL INTELLIGENCE, 2009, 173 (14) :1266-1274