Morphological Analysis of Malay Words for Resolving Ambiguity

Citations: 0
Authors
Yahaya, Mohd Fuad [1]
Abd Rahman, Nurazzah [1]
Abu Bakar, Zainab [2]
Affiliations
[1] Univ Teknol MARA, Fak Sains Komputer & Matemat, Shah Alam, Selangor, Malaysia
[2] Univ Al Madinah, Fak Sains Komputer & IT, Shah Alam, Selangor, Malaysia
Source
2018 FOURTH INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP) | 2018
Keywords
morphological analysis; part of speech tagging; disambiguation techniques; malay corpus; malay ambiguity words; SENSE DISAMBIGUATION;
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Morphological ambiguity is widely addressed in state-of-the-art Natural Language Processing (NLP). Ambiguity is mostly resolved using large manually annotated corpora and machine learning. However, such methods are not always applicable, because good training data is not available for all languages. In this paper, we present a method for disambiguation without gold-standard corpora that uses several statistical models, namely Braille Translation Algorithms and unambiguous N-grams from the automatically annotated corpus. All of the methods were tested on the Corpus of Glosbe and on the Corpus of Dewan Bahasa Pustaka (DBP). As a result, more than half of the words with ambiguous analyses were disambiguated in the two corpora, with high precision. Our method for morphological disambiguation shows that it is possible to eliminate some of the ambiguous analyses in a corpus without special linguistic resources, using only raw data in which all possible morphological analyses for each word are listed.
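The idea of disambiguating with unambiguous N-grams can be illustrated with a minimal Python sketch. This is not the authors' implementation: the bigram window, the toy tag set (PRON, VERB, NOUN, ADJ), the example sentences, and the function names `collect_unambiguous_bigrams` and `disambiguate` are all assumptions made for illustration. The sketch assumes a corpus in which every word carries the set of all its possible analyses, counts tag bigrams where both words have exactly one analysis, and then discards analyses of an ambiguous word that are never attested next to an unambiguous neighbor.

```python
from collections import Counter


def collect_unambiguous_bigrams(sentences):
    """Count tag bigrams in which both adjacent tokens have exactly one analysis."""
    counts = Counter()
    for sent in sentences:
        for (_, left), (_, right) in zip(sent, sent[1:]):
            if len(left) == 1 and len(right) == 1:
                counts[(next(iter(left)), next(iter(right)))] += 1
    return counts


def disambiguate(sent, counts):
    """Drop analyses that are never attested beside an unambiguous neighbor's tag."""
    result = []
    for i, (word, tags) in enumerate(sent):
        kept = set(tags)
        if len(kept) > 1:
            # Constrain by the left neighbor if it is unambiguous.
            if i > 0 and len(sent[i - 1][1]) == 1:
                left = next(iter(sent[i - 1][1]))
                supported = {t for t in kept if counts[(left, t)] > 0}
                if supported:  # never remove every analysis
                    kept = supported
            # Constrain by the right neighbor if it is unambiguous.
            if i + 1 < len(sent) and len(sent[i + 1][1]) == 1:
                right = next(iter(sent[i + 1][1]))
                supported = {t for t in kept if counts[(t, right)] > 0}
                if supported:
                    kept = supported
        result.append((word, kept))
    return result


# Toy corpus (hypothetical analyses): each token is (word, set of candidate tags).
corpus = [
    [("saya", {"PRON"}), ("makan", {"VERB"}), ("nasi", {"NOUN"})],
    [("dia", {"PRON"}), ("baca", {"VERB"}), ("buku", {"NOUN"})],
    [("saya", {"PRON"}), ("suka", {"VERB", "ADJ"}), ("buku", {"NOUN"})],
]

counts = collect_unambiguous_bigrams(corpus)
resolved = disambiguate(corpus[2], counts)
```

In this toy run, the ambiguous word in the third sentence keeps only the analysis that the unambiguous bigrams support; words with no supporting evidence would keep all of their analyses, so the procedure only ever narrows the candidate set.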
Pages: 31-35
Page count: 5