An Open Data Repository for Engineering Design: Using Text Mining with Open Government Data

被引:7
作者
Giordano, Vito [1 ,3 ]
Coli, Elena [1 ,3 ]
Martini, Antonella [2 ,3 ]
机构
[1] Dept Informat Engn, Via Girolamo Caruso 16, I-56122 Pisa, Italy
[2] Dept Energy Syst Terr & Construct Engn, Largo Lucio Lazzarino 2, I-56122 Pisa, Italy
[3] Business Engn Data Sci B4DS Res Lab, Pisa, Italy
关键词
Engineering Design; Natural Language Processing; Open Data; Open Government Data; Open Data Repository; BIG DATA; INNOVATION; BARRIERS; ANALYTICS; EDUCATION;
D O I
10.1016/j.compind.2022.103738
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Engineering Design (ED) is a complex process in which the reuse of knowledge is crucial: applying the knowledge consolidated in previous design activities to future design activities means performing them in a better way. The relevance of data in ED is even more crucial in a business context in which Data Science (DS) is literally revolutionizing the way companies operate and therefore also the way data are analyzed. Despite having been recognized as crucial for ED processes, data still remain closed in the domain and accessible only to their owners due to several constraints related to the private and proprietary nature of the acquired data. An answer to these challenges could be found in Open Data, but at the state of the art an operational Engineering Design framework to embrace them is still far to be achieved by both academia and industry. Given these issues, the aim of this paper is to give evidence that Text Mining can help to make a complex open database more effective to be used for the ED process, taking U.S. Open Government Data (OGD) repository as a case study. Open access to methods and data used within this research is provided. The results of this study allow us to understand for which purposes it is possible to apply the datasets and to comprehend the expertise and the data science methods needed for processing different data for-mats. Moreover, this work opens relevant implications and challenges for researchers, practitioners and policy makers operating in ED and DS domains that could become opportunities for future research and industrial applications. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 76 条
  • [1] A review of empirical data of sustainability initiatives in university campus operations
    Amaral, Ana Rita
    Rodrigues, Eugenio
    Gaspar, Adelio Rodrigues
    Gomes, Alvaro
    [J]. JOURNAL OF CLEANER PRODUCTION, 2020, 250 (250)
  • [2] [Anonymous], 2015, ISPIM C P
  • [3] Bang H, 2016, PROCEEDINGS OF THE ASME INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, 2016, VOL 7
  • [4] Bates J., 2012, The Journal of Community Informatics, V8
  • [5] Democratic constructivist science education: enabling egalitarian literacy and self-actualization
    Bencze, JL
    [J]. JOURNAL OF CURRICULUM STUDIES, 2000, 32 (06) : 847 - 865
  • [6] Bi Zhuming, 2018, BIOCONJUGATE TECHNIQ, DOI [10.1016/b978-0-12-809952-0.00001-7, DOI 10.1016/B978-0-12-809952-0.00001-7]
  • [7] Camba Jorge D., 2020, P 53 HAWAII INT C SY, DOI [10.24251/hicss.2020.048, DOI 10.24251/HICSS.2020.048]
  • [8] From Open Data to Open Innovation Strategies: Creating e-Services Using Open Government Data
    Chan, Calvin M. L.
    [J]. PROCEEDINGS OF THE 46TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2013, : 1890 - 1899
  • [9] Descriptive Models of Sequential Decisions in Engineering Design: An Experimental Study
    Chaudhari, Ashish M.
    Bilionis, Ilias
    Panchal, Jitesh H.
    [J]. JOURNAL OF MECHANICAL DESIGN, 2020, 142 (08)
  • [10] Chen HC, 2012, MIS QUART, V36, P1165