Time is Money

A similar repository recommender system that saves development time using tag hierarchies

Bachelor Thesis (2022)
Author(s)

A.C. Ionescu (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Contributor(s)

Maliheh Izadi – Mentor (TU Delft - Software Engineering)

A Van Deursen – Mentor (TU Delft - Software Technology)

George Iosifidis – Graduation committee member (TU Delft - Embedded Systems)

Faculty
Electrical Engineering, Mathematics and Computer Science
Copyright
© 2022 Andrei Ionescu
More Info
expand_more
Publication Year
2022
Language
English
Copyright
© 2022 Andrei Ionescu
Graduation Date
22-06-2022
Awarding Institution
Delft University of Technology
Project
['CSE3000 Research Project']
Programme
['Computer Science and Engineering']
Faculty
Electrical Engineering, Mathematics and Computer Science
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Developers do not want to reinvent the wheel when developing software systems. Open-source software repositories are packed with resources that may assist developers with their work. Since Github enabled repository tagging, a new opportunity arose to help developers find the needed resources tailored to their needs. The current work proposes two similar repository approaches enhanced by a tag hierarchy generation approach that is used in our recommender systems. The first approach provides advanced repository indexing, and it is constructed as a wrapper around the Google Programmable Search, and the second approach is based on the GitHub Search API. During the study, we developed and deployed a fully-fledged framework that allows us to create, label, weigh and evaluate any tag hierarchy and our recommending approaches.
Analyzing the results collected by our evaluation framework, we concluded that the Google Search approach is preferred over the GitHub approach from both accuracy and user perspective point of view. The Google Search approach outperformed the baseline by 18.75\%, but also the GitHub Search by more than 100\% concerning the MAP (Mean Average Precision) metric.

Files

Andrei_Ionescu_RP.pdf
(pdf | 1.25 Mb)
License info not available