Using text mining to retrieve information about circular economy

被引:27
作者
Spreafico, Christian [1 ]
Spreafico, Matteo [1 ]
机构
[1] Univ Bergamo, Dept Management Informat & Prod Engn, Via Marconi 5, I-24044 Dalmine, BG, Italy
关键词
Circular economy; Text mining; Dependency patterns; Patents;
D O I
10.1016/j.compind.2021.103525
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper proposes a method of text mining to automatically retrieve knowledge from patents on how to recycle and reuse a waste. The main novelties are the introduction of a set of specific dependency patterns and the introduction of a partially revised TRIZ (Russian acronym for T spacing diaeresis heory of Inventive Problem Solving) spacing diaeresis ontology to classify the retrieved information. The proposed dependency patterns were manually extracted from a sample patents pool about waste recycling and reuse. The classification of the information is based on different classes: (1) what transformations can be carried out on the waste, (2) what technologies can be used to carry out these transformations, (3) what products can be obtained by transforming the waste, (4) what functions can be carried out by the waste, (5) with which technologies, and (6) on which entities. An automatic implementation of the proposed method, involving the manual check of the retrieved results, was tested through a case study about wood chip recycling and reuse. Compared to the dependency patterns from the literature, the proposed ones allowed to retrieve 28 % more pertinent information. This results mainly depends by better ability of the proposed patterns to better discriminate the relevant sentences from which to extract information, compared to the other patterns (i.e. + 40 %). The automatic classification of the information was also correctly performed: in almost each class, precision and recall were higher than 60 % and on average equal to 90 %. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 61 条
[21]   Introduction of the circular economy within developing regions: A comparative analysis of advantages and opportunities for waste valorization [J].
Ferronato, Navarro ;
Rada, Elena Cristina ;
Portillo, Marcelo Antonio Gorritty ;
Cioca, Lucian Ionel ;
Ragazzi, Marco ;
Torretta, Vincenzo .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2019, 230 :366-378
[22]  
GERO JS, 1990, AI MAG, V11, P26
[23]  
Ghosh S., 2012, International Journal of Advanced Research in Computer and Communication Engineering, V1, P7
[24]  
Grabar Natalia, 2019, Yearb Med Inform, V28, P218, DOI 10.1055/s-0039-1677937
[25]  
Gudivada A, 2018, 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), P250, DOI 10.1109/SSCI.2018.8628846
[26]  
Kobayashi T, 2017, 2017 INTERNATIONAL ELECTRONICS SYMPOSIUM ON KNOWLEDGE CREATION AND INTELLIGENT COMPUTING (IES-KCIC), P276, DOI 10.1109/KCIC.2017.8228599
[27]  
Korobkin D.M., 2017, 2017 8 INT C INF INT, P1
[28]   Recent advances in the theory and practice of Logical Analysis of Data [J].
Lejeune, Miguel ;
Lozin, Vadim ;
Lozina, Irina ;
Ragab, Ahmed ;
Yacout, Soumaya .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2019, 275 (01) :1-15
[29]   A framework for automatic TRIZ level of invention estimation of patents using natural language processing, knowledge-transfer and patent citation metrics [J].
Li, Zhen ;
Tate, Derrick ;
Lane, Christopher ;
Adams, Christopher .
COMPUTER-AIDED DESIGN, 2012, 44 (10) :987-1010
[30]  
Litvin S., 2004, P TRIZ FUT C FLOR