Natural language processing methods for knowledge management-Applying document clustering for fast search and grouping of engineering documents

被引:8
作者
Arnarsson, Ivar Orn [1 ]
Frost, Otto [2 ]
Gustavsson, Emil [2 ]
Jirstrand, Mats [2 ]
Malmqvist, Johan [1 ]
机构
[1] Chalmers Univ Technol, Dept Ind & Mat Sci, Chalmersplatsen 4, S-41296 Gothenburg, Sweden
[2] Fraunhofer Chalmers Ctr, Gothenburg, Sweden
来源
CONCURRENT ENGINEERING-RESEARCH AND APPLICATIONS | 2021年 / 29卷 / 02期
关键词
natural language processing; document clustering; semantic data processing; knowledge management; engineering change request;
D O I
10.1177/1063293X20982973
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Product development companies collect data in form of Engineering Change Requests for logged design issues, tests, and product iterations. These documents are rich in unstructured data (e.g. free text). Previous research affirms that product developers find that current IT systems lack capabilities to accurately retrieve relevant documents with unstructured data. In this research, we demonstrate a method using Natural Language Processing and document clustering algorithms to find structurally or contextually related documents from databases containing Engineering Change Request documents. The aim is to radically decrease the time needed to effectively search for related engineering documents, organize search results, and create labeled clusters from these documents by utilizing Natural Language Processing algorithms. A domain knowledge expert at the case company evaluated the results and confirmed that the algorithms we applied managed to find relevant document clusters given the queries tested.
引用
收藏
页码:142 / 152
页数:11
相关论文
共 35 条
  • [1] Aggarwal CC, 2012, MINING TEXT DATA, P77, DOI [DOI 10.1007/978-1-4614-3223-4_4, 10.1007/978-1-4614-3223-4]
  • [2] K-Harmonic means type clustering algorithm for mixed datasets
    Ahmad, Amir
    Hashmi, Sarosh
    [J]. APPLIED SOFT COMPUTING, 2016, 48 : 39 - 49
  • [3] [Anonymous], 2014, BUS INFORM REV
  • [4] Arnarsson IÖ, 2017, INT CONF ENG DES, P71
  • [5] Bird S., 2006, P COLING ACL INT PRE, P214
  • [6] Latent Dirichlet allocation
    Blei, DM
    Ng, AY
    Jordan, MI
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) : 993 - 1022
  • [7] ALPS - A LANGUAGE FOR PROCESS SPECIFICATION
    CATRON, BA
    RAY, SR
    [J]. INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 1991, 4 (02) : 105 - 113
  • [8] The latent semantic approach to studying design team communication
    Dong, A
    [J]. DESIGN STUDIES, 2005, 26 (05) : 445 - 461
  • [9] Text analysis for constructing design representations (Reprinted from Artificial Intelligence in Design, pg 21-38, 1996)
    Dong, A
    Agogino, AM
    [J]. ARTIFICIAL INTELLIGENCE IN ENGINEERING, 1997, 11 (02): : 65 - 75
  • [10] Change and customisation in complex engineering domains
    Eckert, C
    Clarkson, PJ
    Zanker, W
    [J]. RESEARCH IN ENGINEERING DESIGN, 2004, 15 (01) : 1 - 21