S. Huang | TU Delft Repository

Navigating Repositories

Assessing the Impact of External Repositories on Packages in Maven Central

Bachelor thesis (2024) - J.W. Sandifort (author), Sebastian Proksch (mentor), S. Huang (mentor), Casper Poulsen (graduation committee member)

This paper presents a comprehensive experimental study on the use and impact of external repositories in the Maven ecosystem. For this research, the prevalence, naming patterns, and potential risks associated with external repositories were analyzed. We analyzed 199,188 packages a ...

Finding your digital sibling

Grouping GitHub projects that share certain attributes based on interactions and activities

Bachelor thesis (2024) - R.W. de Bruin (author), S. Proksch (mentor), S. Huang (mentor), Julia Olkhovskaya (graduation committee member)

This study explores the feasibility of categorizing GitHub projects based on their interactions and activities, aiming to assist both researchers and practitioners in navigating the vast landscape of open-source software. Through experiments and analysis, key attributes contribut ...

Discovering Digital Siblings

Quantifying Inter-Repository Similarity Through GitHub Dependency Structures

Bachelor thesis (2024) - Mateusz Rębacz (author), Sebastian Proksch (mentor), S. Huang (mentor), Julia Olkhovskaya (graduation committee member)

Open Source developers typically use Git repositories to transparently store the source code of projects and contribute to the code of others. There are millions of repositories actively hosted on platforms such as GitHub. This presents an opportunity for sharing knowledge betwee ...

Finding your digital sibling: which other GitHub projects are similar to yours?

Finding similar repositories based on the available documentation

Bachelor thesis (2024) - A.C. Turcu (author), S. Proksch (mentor), S. Huang (mentor), Julia Olkhovskaya (graduation committee member)

This paper aims to study the importance of considering the documentation side of GitHub repositories when assessing the similarity between two or more applications. Readme and Wiki files, along with Comments from the source files, are the dimensions proposed to be analyzed throug ...

Analyzing Similar Build Configurations Across Different GitHub Projects

Bachelor thesis (2024) - C.M. Manoli (author), S. Proksch (mentor), S. Huang (mentor), Julia Olkhovskaya (graduation committee member)

GitHub is the home of hundreds of millions of Open Source Software (OSS) repositories where users collaborate on projects and find inspiration for new ideas. Some of these projects have certain build configurations set up to make building, testing, and deploying the software more ...

Contribution of source code identifiers to GitHub project similarity

Which other GitHub projects are similar to yours?

Bachelor thesis (2024) - J.G.M. Crienen (author), S. Proksch (mentor), S. Huang (mentor), Julia Olkhovskaya (graduation committee member)

GitHub is an online platform that hosts millions of projects. Many of these projects have the same topic or share the same goal. Finding similar projects which can be used as role models, inspiration or examples can help developers meet their requirements faster and more efficien ...

GitHub Mining

Discover the Descriptive Metrics of the Context in Continuous Integration (CI) Project

Bachelor thesis (2023) - P.J. Hibbs (author), S. Huang (mentor), Sebastian Proksch (graduation committee member)

Continuous Integration (CI) systems automate building, testing, and possibly further steps. However, it is still unclear how CI should be implemented in different contexts. Therefore, this paper tries to answer the question "What metrics can be used to describe project activity", as p ...

Discovering the metrics for assessing a project’s maturity

An analysis of key indicators of maturity

Bachelor thesis (2023) - K. Sartori (author), S. Proksch (mentor), S. Huang (mentor), E. Aivaloglou (graduation committee member)

Continuous integration (CI) is a software engineering practice that promotes frequent code integration into a shared repository, improving the productivity within development teams as well as the quality of the software being developed. While CI adoption has gained traction, stud ...

Discovering the topics of Continuous Integration Projects on GitHub

Bachelor thesis (2023) - L. Ostrovskis (author), S. Huang (mentor), S. Proksch (graduation committee member), E.A. Aivaloglou (graduation committee member)

Continuous Integration (CI) is a software development technique that enhances software quality and development efficiency, but its implementation usually depends on the project's context. This creates an opportunity for studying real-world CI projects on GitHub, focusing on their ...

Exploring Descriptive Metrics of Build Performance

A Study of GitHub Actions in Continuous Integration Projects

Bachelor thesis (2023) - R. Constantinescu (author), S. Proksch (mentor), S. Huang (mentor), E. Aivaloglou (graduation committee member)

The Continuous Integration (CI) practice has been growing and developing rapidly ever since its introduction. This practice has consistently provided benefits to developers, such as early bug detection and feedback to development teams. In this study, we aim to identify the ...

GitHub Mining

The Implementation of Continuous Integration Pipelines

Bachelor thesis (2023) - A.C. de Vries (author), S. Proksch (mentor), S. Huang (mentor), E.A. Aivaloglou (graduation committee member)

While continuous integration has already been proven to positively affect software development, little is known about how it should be implemented based on project context. This paper investigates how CI pipelines are configured by analysing data mined from software projects on G ...