Information Extraction for Additive Manufacturing Using News Data

被引:1
作者
Sehgal, Neha [1 ,2 ]
Crampton, Andrew [1 ]
机构
[1] Univ Huddersfield, Huddersfield, W Yorkshire, England
[2] 3MBIC, Valuechain, Huddersfield, W Yorkshire, England
来源
ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS (CAISE 2019) | 2019年 / 349卷
基金
“创新英国”项目;
关键词
Named Entity Recognition; News data; Additive Manufacturing; Text matching; Open data;
D O I
10.1007/978-3-030-20948-3_12
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recognizing named entities like Person, Organization, Locations and Date are very useful for web mining. Named Entity Recognition (NER) is an emerging research area which aims to address problems such as Machine Translation, Question Answering Systems and Semantic Web Search. The study focuses on proposing a methodology based on the integration of an NER system and Text Analytics to provide information necessary for business in Additive Manufacturing. The study proposes a foundation of utilizing the Stanford NER system for tagging news data related to the keywords "Additive Manufacturing". The objective is to first derive the organization names from news data. This information is useful to define the digital footprints of an organization in the Additive Manufacturing sector. The existence of an organization derived using the NER approach is validated by matching their names with companies listed on the Companies House portal. The organization names will be matched using a Fuzzy-based text matching algorithm. Further information on company profile, officers and key financial data is extracted to provide information about companies interested and working within the Additive Manufacturing sector. This data gives an insight into which companies have digital footprints in the Additive Manufacturing sector within the UK.
引用
收藏
页码:132 / 138
页数:7
相关论文
共 10 条
  • [1] [Anonymous], 2008, PROC AUSTRALAS LANG
  • [2] [Anonymous], P 2007 JOINT C EMP M
  • [3] Chieu HaiLeong., 2002, Proceedings of the 19th international conference on Computational linguistics-, V1, P1
  • [4] Florian R., 2003, Proceedings of CoNLL-2003, P168, DOI DOI 10.3115/1119176.1119201
  • [5] Named Entity Recognition in Query
    Guo, Jiafeng
    Xu, Gu
    Cheng, Xueqi
    Li, Hang
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 267 - 274
  • [6] Isozaki H, 2002, Proceedings of the 19th international conference on Computational linguistics-Volume, P1
  • [7] Li CL, 2012, SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P721, DOI 10.1145/2348283.2348380
  • [8] Nadeau D, 2007, LINGUIST INVESTIG, V30, P3
  • [9] Ritter Alan, 2011, P 2011 C EMPIRICAL M, P1524
  • [10] Zhou GD, 2002, 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P473