Effective Keyword Search for Software Resources installed in Large-scale Grid Infrastructures

被引:0
作者
Pallis, George [1 ]
Katsifodimos, Asterios [1 ]
Dikaiakos, Marios D. [1 ]
机构
[1] Univ Cyprus, Dept Comp Sci, Nicosia, Cyprus
来源
2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1 | 2009年
关键词
RETRIEVAL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we investigate the problem of supporting keyword-based searching for the discovery of software resources that are installed on the nodes of large-scale, federated Grid computing infrastructures. We address a number of challenges that arise from the unstructured nature of software and the unavailability of software-related metadata on Grid sites. We present Minersoft, a Grid harvester that visits Grid sites, crawls their file-systems, identifies and classifies software resources, and discovers implicit associations between them. The results of Minersoft harvesting are encoded in a weighted, typed graph, named the Software Graph. A number of IR algorithms are used to enrich this graph with structural and content associations, to annotate software resources with keywords, and build inverted indexes to support keyword-based searching for software. Using a real testbed, we present an evaluation study of our approach, using data extracted from a production-quality Grid infrastructure. Experimental results show that our approach achieves high search efficiency.
引用
收藏
页码:482 / 489
页数:8
相关论文
共 28 条
[1]   The Claremont Report on Database Research [J].
Agrawal, Rakesh ;
Ailamaki, Anastasia ;
Bernstein, Philip A. ;
Brewer, Eric A. ;
Carey, Michael J. ;
Chaudhuri, Surajit ;
Doan, AnHai ;
Florescu, Daniela ;
Franklin, Michael J. ;
Garcia-Molina, Hector ;
Gehrke, Johannes ;
Gruenwald, Le ;
Haas, Laura M. ;
Halevy, Alon Y. ;
Hellerstein, Joseph M. ;
Ioannidis, Yannis E. ;
Korth, Hank F. ;
Kossmann, Donald ;
Madden, Samuel ;
Magoulas, Roger ;
Ooi, Beng Chin ;
O'Reilly, Tim ;
Ramakrishnan, Raghu ;
Sarawagi, Sunita ;
Stonebraker, Michael ;
Szalay, Alexander S. ;
Weikum, Gerhard .
SIGMOD RECORD, 2008, 37 (03) :9-19
[2]  
ALMASKARI A, 2007, SIGIR, P773
[3]  
AMES A, 2005, MSST 05 P 22 IEEE 13, P49
[4]  
[Anonymous], SIGSOFT SOFTW ENG NO
[5]  
[Anonymous], 2007, P 16 INT C WORLD WID
[6]  
[Anonymous], 2009, DEP ELECT ENG COMPUT
[7]  
[Anonymous], INFORM SERVICES LARG
[8]   Recovering traceability links between code and documentation [J].
Antoniol, G ;
Canfora, G ;
Casazza, G ;
De Lucia, A ;
Merlo, E .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2002, 28 (10) :970-983
[9]  
BASS L, 2008, WICSA 08, P249
[10]   On ranking techniques for desktop search [J].
Cohen, Sara ;
Domshlak, Carmel ;
Zwerdling, Naama .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2008, 26 (02)