A Mongolian Information Retrieval System Based on Solr

被引:0
作者
Ma, Lujia [1 ,2 ]
Bao, Wei [2 ]
Bao, Wugedele [2 ]
Yuan, Wuriga [2 ]
Huang, Tao [1 ,2 ]
Zhao, XiaoBing [1 ,2 ]
机构
[1] Minzu Univ China, Sch Informat Engn, Beijing, Peoples R China
[2] Minzu Univ China, Natl Language Resource Monitoring & Res Ctr Minor, Beijing, Peoples R China
来源
PROCEEDINGS OF 2017 9TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA) | 2017年
基金
美国国家科学基金会;
关键词
Mongolian; Information retrieval; Solr;
D O I
10.1109/ICMTMA.2017.86
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A Mongolian Information Retrieval System based on Solr is proposed in this paper. The system implements the retrieval of Mongolian data store in our local machine. Firstly, in this paper, we built a Mongolian corpus with one million words, which has been corrected manually. Secondly, after being transcoded, theses data was represented by Latin characters. Finally, we used Solr to build indexes and documents so that we do queries on millions of data within seconds.
引用
收藏
页码:335 / 338
页数:4
相关论文
共 8 条
[1]  
Agre GH, 2015, 2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), P1089, DOI 10.1109/ECS.2015.7124749
[2]  
Dongre, 2015, INT J ADV RES COMPUT, V4, P463
[3]  
Murengaowa, 2008, STUDY TAGGING MONGOL
[4]  
Wei J, 2009, RES MONGOLIAN INFORM
[5]  
Wu Jinxing, 2015, CONSTRUCTION INTEGRA
[6]  
Yan Wei J, 2011, RETRIEVAL MODEL BASE
[7]  
Yu Shiwen, 2007, INTRO COMPUTATIONAL
[8]   LSCrawler: A framework for an enhanced focused web crawler based on link semantics [J].
Yuvarani, M. ;
Iyengar, N. Ch. S. N. ;
Kannan, A. .
2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE, (WI 2006 MAIN CONFERENCE PROCEEDINGS), 2006, :794-797