Application of Formal Grammar in Text Mining and Construction of an Ontology

被引:0
作者
Kanev, Anton [1 ]
Cunningham, Stuart [2 ]
Valery, Terekhov [1 ]
机构
[1] Bauman Moscow State Tech Univ, Dept Informat & Control Syst, Moscow, Russia
[2] Glyndwr Univ, Sch Appl Sci Comp & Engn, Wrexham, Wales
来源
PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE INTERNET TECHNOLOGIES AND APPLICATIONS (ITA) | 2017年
关键词
text mining; text data mining; natural language processing; formal grammar;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This work describes an investigation of formal grammar with application to text mining. It is an important area since text is the most widespread type of data and it contains a lot of potentially useful information. Unstructured nature of text requires other methods for its processing, in contrast to other types of data mining. In this work, the authors propose an original approach to text mining by making a parse tree for each sentence using regular grammar and creating an ontology and provide a demonstration of this system being implemented in a constrained scenario. This ontology can be used for different tasks, ranging from expert systems to automatic machine translation. The ontology is a network consisting of concepts linked by relations. The authors developed a new system to implement proposed approach working in different languages.
引用
收藏
页码:53 / 57
页数:5
相关论文
共 24 条
[1]  
Akilan A, 2015, 2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), P1679, DOI 10.1109/ECS.2015.7124872
[2]  
Amarasinghe Kasun, 2015, 2015 Resilience Week (RWS). Proceedings, P1, DOI 10.1109/RWEEK.2015.7287440
[3]  
Amarasiri R., 2005, MACH LEARN APPL P 4, DOI [10.1109/ICMLA.2005.29, DOI 10.1109/ICMLA.2005.29]
[4]  
[Anonymous], INNOVATIONS DIGITAL
[5]   Tactical Clinical Text Mining for Improved Patient Characterization [J].
Champion, Heather ;
Pizzi, Nick ;
Krishnamoorthy, Raja .
2014 IEEE INTERNATIONAL CONGRESS ON BIG DATA (BIGDATA CONGRESS), 2014, :683-690
[6]  
da Costa Pinho Isis, 2013, 2013 IEEE Frontiers in Education Conference (FIE), P393, DOI 10.1109/FIE.2013.6684853
[7]  
Dunlavy D. M., 2010, P HPDC, P344, DOI DOI 10.1145/1851476.1851526
[8]   Text Mining in Radiology Reports [J].
Gong, Tianxia ;
Tan, Chew Lim ;
Leong, Tze Yun ;
Lee, Cheng Kiang ;
Pang, Boon Chuan ;
Lim, C. C. Tchoyoson ;
Tian, Qi ;
Tang, Suisheng ;
Zhang, Zhuo .
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :815-+
[9]   iProLINK: A Framework for Linking Text Mining with Ontology and Systems Biology [J].
Hu, Zhang-Zhi ;
Cohen, K. Bretonnel ;
Hirschman, Lynette ;
Valencia, Alfonso ;
Liu, Hongfang ;
Giglio, Michelle G. ;
Wu, Cathy H. .
2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2008, :467-+
[10]   Detecting themes of public concern: A text mining analysis of the Centers for Disease Control and Prevention's Ebola live Twitter chat [J].
Lazard, Allison J. ;
Scheinfeld, Emily ;
Bernhardt, Jay M. ;
Wilcox, Gary B. ;
Suran, Melissa .
AMERICAN JOURNAL OF INFECTION CONTROL, 2015, 43 (10) :1109-1111