A More Accurate Model for Finding Tutorial Segments Explaining APIs

被引:33
作者
Jiang, He [1 ,2 ,3 ]
Zhang, Jingxuan [1 ]
Li, Xiaochen [1 ]
Ren, Zhilei [1 ]
Lo, David [4 ]
机构
[1] Dalian Univ Technol, Sch Software, Dalian, Peoples R China
[2] Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian, Peoples R China
[3] Wuhan Univ, State Key Lab Software Engn, Wuhan, Peoples R China
[4] Singapore Management Univ, Sch Informat Syst, Singapore, Singapore
来源
2016 IEEE 23RD INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER), VOL 1 | 2016年
关键词
Application Programming Interface; Text Classification; Feature Construction;
D O I
10.1109/SANER.2016.59
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Developers prefer to utilize third-party libraries when they implement some functionalities and Application Programming Interfaces (APIs) are frequently used by them. Facing an unfamiliar API, developers tend to consult tutorials as learning resources. Unfortunately, the segments explaining a specific API scatter across tutorials. Hence, it remains a challenging issue to find the relevant segments. In this study, we propose a more accurate model to find the exact tutorial fragments explaining APIs. This new model consists of a text classifier with domain specific features. More specifically, we discover two important indicators to complement traditional text based features, namely co-occurrence APIs and knowledge based API extensions. In addition, we incorporate Word2Vec, a semantic similarity metric to enhance the new model. Extensive experiments over two publicly available tutorial datasets show that our new model could find up to 90% fragments explaining APIs and improve the state-of-the-art model by up to 30% in terms of F-measure.
引用
收藏
页码:157 / 167
页数:11
相关论文
共 35 条
[1]  
[Anonymous], 2012, Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering, FSE '12
[2]  
Bialecki A., 2012, SIGIR 2012 WORKSH OP, P1
[3]  
Buse RPL, 2012, PROC INT CONF SOFTW, P782, DOI 10.1109/ICSE.2012.6227140
[4]  
Chen Danqi, 2014, P EMNLP
[5]  
Dagenais B, 2012, PROC INT CONF SOFTW, P47, DOI 10.1109/ICSE.2012.6227207
[6]  
Daqing Hou, 2011, 2011 IEEE 27th International Conference on Software Maintenance, P233, DOI 10.1109/ICSM.2011.6080790
[7]  
De Roover C, 2013, CONF PROC INT SYMP C, P152, DOI 10.1109/ICPC.2013.6613843
[8]  
De Souza LB., 2014, P 22 INT C PROGR COM, P72, DOI DOI 10.1145/2597008.2597146
[9]  
Duala-Ekoko E, 2011, LECT NOTES COMPUT SC, V6813, P79, DOI 10.1007/978-3-642-22655-7_5
[10]   How Do API Documentation and Static Typing Affect API Usability? [J].
Endrikat, Stefan ;
Hanenberg, Stefan ;
Robbes, Romain ;
Stefik, Andreas .
36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014), 2014, :632-642