Learning I/O Variables from Scientific Software's User Manuals

被引:3
作者
Peng, Zedong [1 ]
Lin, Xuanyi [2 ]
Santhoshkumar, Sreelekhaa Nagamalli [1 ]
Niu, Nan [1 ]
Kanewala, Upulee [3 ]
机构
[1] Univ Cincinnati, Cincinnati, OH 45221 USA
[2] Oracle Amer Inc, Redwood Shores, CA 94065 USA
[3] Univ North Florida, Jacksonville, FL 32224 USA
来源
COMPUTATIONAL SCIENCE, ICCS 2022, PT IV | 2022年
关键词
Scientific software; User manual; Software documentation; Classification; Machine learning; THEORETICAL REPLICATION;
D O I
10.1007/978-3-031-08760-8_42
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Scientific software often involves many input and output variables. Identifying these variables is important for such software engineering tasks as metamorphic testing. To reduce the manual work, we report in this paper our investigation of machine learning algorithms in classifying variables from software's user manuals. We identify thirteen natural-language features, and use them to develop a multi-layer solution where the first layer distinguishes variables from non-variables and the second layer classifies the variables into input and output types. Our experimental results on three scientific software systems show that random forest and feedforward neural network can be used to best implement the first layer and second layer respectively.
引用
收藏
页码:503 / 516
页数:14
相关论文
共 50 条
[11]   METRIC: METamorphic Relation Identification based on the Category-choice framework [J].
Chen, Tsong Yueh ;
Poon, Pak-Lok ;
Xie, Xiaoyuan .
JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 116 :177-190
[12]  
Clarno K., 2006, AM NUCL SOC TOPICAL
[13]   Requirements Classification with Interpretable Machine Learning and Dependency Parsing [J].
Dalpiaz, Fabiano ;
Dell'Anna, Davide ;
Aydemir, Fatma Basak ;
Cevikol, Sercan .
2019 27TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2019), 2019, :142-152
[14]   EQUIVALENCE OF WEIGHTED KAPPA AND INTRACLASS CORRELATION COEFFICIENT AS MEASURES OF RELIABILITY [J].
FLEISS, JL ;
COHEN, J .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 1973, 33 (03) :613-619
[15]   Deep Learning for Smart Sewer Systems: Assessing Nonfunctional Requirements [J].
Gudaparthi, Hemanth ;
Johnson, Reese ;
Challa, Harshitha ;
Niu, Nan .
2020 IEEE/ACM 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN SOCIETY (ICSE-SEIS 2021), 2020, :35-38
[16]   Coverage-based resampling: Building robust consolidated decision trees [J].
Ibarguren, Igor ;
Perez, Jesus M. ;
Muguerza, Javier ;
Gurrutxaga, Ibai ;
Arbelaitz, Olatz .
KNOWLEDGE-BASED SYSTEMS, 2015, 79 :51-67
[17]   Metamorphic Testing: A Simple Yet Effective Approach for Testing Scientific Software [J].
Kanewala, Upulee ;
Chen, Tsong Yueh .
COMPUTING IN SCIENCE & ENGINEERING, 2019, 21 (01) :66-72
[18]   Advancing viewpoint merging in requirements engineering: a theoretical replication and explanatory study [J].
Khatwani, Charu ;
Jin, Xiaoyu ;
Niu, Nan ;
Koshoffer, Amy ;
Newman, Linda ;
Savolainen, Juha .
REQUIREMENTS ENGINEERING, 2017, 22 (03) :317-338
[19]   Automated Requirements Extraction for Scientific Software [J].
Li, Yang ;
Guzman, Emitza ;
Tsiamoura, Konstantina ;
Schneider, Florian ;
Bruegge, Bernd .
INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 :582-591
[20]   Finding Metamorphic Relations for Scientific Software [J].
Lin, Xuanyi ;
Peng, Zedong ;
Niu, Nan ;
Wang, Wentao ;
Liu, Hui .
2021 IEEE/ACM 43RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: COMPANION PROCEEDINGS (ICSE-COMPANION 2021), 2021, :254-255