Search upon UML repositories with text matching techniques

Conference Paper (2012)
Author(s)

Bojana Bislimovska (Politecnico di Milano)

Alessandro Bozzon (Politecnico di Milano)

Marco Brambilla (Politecnico di Milano)

Piero Fraternali (Politecnico di Milano)

Affiliation
External organisation
DOI related publication
https://doi.org/10.1109/SUITE.2012.6225479 Final published version
More Info
expand_more
Publication Year
2012
Language
English
Affiliation
External organisation
Article number
6225479
Pages (from-to)
9-12
ISBN (print)
9781467318488
Event
2012 4th International Workshop on Search-Driven Development: Users, Infrastructure, Tools, and Evaluation, SUITE 2012 (2012-06-05 - 2012-06-05), Zurich, Switzerland
Downloads counter
156

Abstract

As the quantity of software artifacts, mainly source code and software models, stored in repositories increases, the need for their efficient search becomes more important. In this paper we propose content-based query (a.k.a query-by-example) approach for searching software model repositories, in order to retrieve significant models or model fragments. The query-by-example search conveys the user need in form of a model or pattern specified in a coarse way. Our approach incorporates analysis and indexing of models using textual information retrieval techniques, which exploit the knowledge of the metamodel the models conform to. This allows us to explore different segmentation granularities on models and different indexing techniques ranging from simple bag of words, to index structures which integrate metamodel information. We detail the proposed theoretical framework, the implementation of the method upon open-source architectures, and we discuss the results of our experiments upon a public dataset of UML models.