Effective keyword search for software resources installed in large-scale Grid infrastructures

Conference Paper (2009)
Author(s)

George Pallis (University of Cyprus)

A. Katsifodimos (University of Cyprus)

Marios D. Dikaiakos (University of Cyprus)

Affiliation
External organisation
DOI related publication
https://doi.org/10.1109/WI-IAT.2009.82
More Info
expand_more
Publication Year
2009
Language
English
Affiliation
External organisation
Volume number
1
Pages (from-to)
482-489
ISBN (print)
9780769538013

Abstract

In this paper, we investigate the problem of supporting keyword-based searching for the discovery of software resources that are installed on the nodes of largescale, federated Grid computing infrastructures. We address a number of challenges that arise from the unstructured nature of software and the unavailability of software-related metadata on Grid sites.We presentMinersoft, a Grid harvester that visits Grid sites, crawls their file-systems, identifies and classifies software resources, and discovers implicit associations between them. The results of Minersoft harvesting are encoded in a weighted, typed graph, named the Software Graph. A number of IR algorithms are used to enrich this graph with structural and content associations, to annotate software resources with keywords, and build inverted indexes to support keyword-based searching for software. Using a real testbed, we present an evaluation study of our approach, using data extracted from a production-quality Grid infrastructure. Experimental results show that our approach achieves high search efficiency.

No files available

Metadata only record. There are no files for this record.