Sourcerer: An Internet-Scale Software Repository

被引:27
作者
Bajracharya, Sushil [1 ]
Ossher, Joel [1 ]
Lopes, Cristina [1 ]
机构
[1] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Irvine, CA 92717 USA
来源
2009 ICSE WORKSHOP ON SEARCH-DRIVEN DEVELOPMENT-USERS, INFRASTRUCTURE, TOOLS AND EVALUATION | 2009年
关键词
D O I
10.1109/SUITE.2009.5070010
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Vast quantities of open source code are now available online, presenting a great potential resource for software developers. Yet the current generation of open source code search engines fail to take advantage of the rich structural information contained in the code they index. We have developed Sourcerer an infrastructure for large-scale indexing and analysis of open source code. By taking full advantage of this structural information, Sourcerer provides a foundation upon which state of the art search engines and related tools easily be built. We describe the Sourcerer infrastructure, present the applications that we have built on top of it, and discuss how existing tools could benefit from using Sourcerer
引用
收藏
页码:1 / 4
页数:4
相关论文
共 13 条
[1]  
[Anonymous], 2007, P 22 IEEE ACM INT C
[2]   A C++ data model supporting reachability analysis and dead code detection [J].
Chen, YF ;
Gansner, ER ;
Koutsofios, E .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1998, 24 (09) :682-694
[3]  
Gosling J., 2005, JAVA TM LANGUAGE SPE, V3rd
[4]  
Holmes K, 2005, AUST J EDUC DEV PSYC, V5, P117
[5]  
HOLMES R, 2008, ICSR 08, P330
[6]   Code Conjurer: Pulling reusable software out of thin air [J].
Hummel, Oliver ;
Janjic, Werner ;
Atkinson, Colin .
IEEE SOFTWARE, 2008, 25 (05) :45-52
[7]  
LEMOS O, 2009, 24 ANN ACM S APPL CO
[8]  
LINSTEAD E, DATA MINING KNOWLEDG
[9]  
MANDELIN D, 2005, PLDI 05, P48, DOI [10.1145/1065010.1065018, DOI 10.1145/1065010.1065018]
[10]   CodeWeb: Data mining library reuse patterns [J].
Michail, A .
PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 2001, :827-828