Phrase and Idiom Identification in Assamese

被引:0
|
作者
Borah, Shinjit Kamal [1 ]
Sharma, Utpal [1 ]
机构
[1] Tezpur Univ, Dept CSE, Napaam 784028, India
来源
PROCEEDING OF THE SEVENTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2015) | 2016年 / 84卷
关键词
Phrase; Idiom; Assamese; Context free grammar; Computational linguistics;
D O I
10.1016/j.procs.2016.04.067
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Identification of phrases and idioms is an indispensable part of computational linguistics work. In case of Assamese, this is a challenging topic mainly because of the cases and affixes used in the language. Though, this language is an Eastern Indo-Aryan language spoken by around 30 million people, this topic has not been studied much, as very little computational linguistics work has been done for this language. Assamese language is a relatively free word order language. Context Free Grammar (CFG) can be applied in phrase level by taking extra care in defining the production rules. In this paper, we explain about a method which can be considered as modified context free grammar. Different production rules for phrases can be defined using this modified context free grammar. In this method, the right hand side of the production rules is treated as a free string. So that free word order phenomenon can be dealt with. Different idioms are also analyzed in terms of their syntax and use, to find out the similarities among them to build a dictionary of idioms. Difficulties in parsing phrases and idioms are also discussed and some of the techniques are also provided to overcome those difficulties. (C) 2016 Published by Elsevier B.V.
引用
收藏
页码:65 / 69
页数:5
相关论文
共 50 条
  • [1] Automatic Identification of Assamese and Bodo Multiword Expressions
    Barman, Anup Kumar
    Sarmah, Jumi
    Sarma, Shikhar Kr.
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 26 - 30
  • [2] Idiom Polarity Identification using Contextual Information
    Priego Sanchez, Belem
    Pinto, David
    COMPUTACION Y SISTEMAS, 2018, 22 (01): : 27 - 33
  • [3] The role of idiom length and context in spoken idiom comprehension
    Fanari, Rachele
    Cacciari, Cristina
    Tabossi, Patrizia
    EUROPEAN JOURNAL OF COGNITIVE PSYCHOLOGY, 2010, 22 (03): : 321 - 334
  • [4] Speaker Identification Using Vector Quantization and I-vector with Reference to Assamese Language
    Bharali, Sruti Sruba
    Kalita, Sanjib Kr.
    2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 164 - 168
  • [5] Idiom properties influencing idiom production in younger and older adults
    Hyun, JungMoon
    Conner, Peggy S.
    Obler, Loraine K.
    MENTAL LEXICON, 2014, 9 (02) : 294 - 315
  • [6] Testing idiom comprehension in aphasic patients: The effects of task and idiom type
    Papagno, C.
    Caporali, A.
    BRAIN AND LANGUAGE, 2007, 100 (02) : 208 - 220
  • [7] A Brief Study of Idiom
    李政
    读与写(教育教学刊), 2013, 10 (07) : 5 - 5
  • [8] On natural sets on an idiom
    Sanchez-Hernandez, Jose Patricio
    COMMUNICATIONS IN ALGEBRA, 2020, 48 (11) : 5004 - 5025
  • [9] On Defining a Homeric Idiom
    Kip, A. Maria van Erp Taalman
    MNEMOSYNE, 2012, 65 (4-5) : 539 - 551
  • [10] A Dataset of Online Handwritten Assamese Characters
    Baruah, Udayan
    Hazarika, Shyamanta M.
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2015, 11 (03): : 325 - 341