Semantic versioning and impact of breaking changes in the Maven repository

Journal Article (2017)
Author(s)

Steven Raemaekers (TU Delft - Software Engineering, ING Bank)

Arie Deursen (TU Delft - Software Technology)

Joost Visser (Software Improvement Group)

Research Group
Software Engineering
Copyright
© 2017 S.B.A. Raemaekers, A. van Deursen, Joost Visser
DOI related publication
https://doi.org/10.1016/j.jss.2016.04.008
More Info
expand_more
Publication Year
2017
Language
English
Copyright
© 2017 S.B.A. Raemaekers, A. van Deursen, Joost Visser
Research Group
Software Engineering
Issue number
July
Volume number
129
Pages (from-to)
140-158
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Systems that depend on third-party libraries may have to be updated when updates to these libraries become available in order to benefit from new functionality, security patches, bug fixes, or API improvements. However, often such changes come with changes to the existing interfaces of these libraries, possibly causing rework on the client system. In this paper, we investigate versioning practices in a set of more than 100,000 jar files from Maven Central, spanning over 7 years of history of more than 22,000 different libraries. We investigate to what degree versioning conventions are followed in this repository. Semantic versioning provides strict rules regarding major (breaking changes allowed), minor (no breaking changes allowed), and patch releases (only backward-compatible bug fixes allowed). We find that around one third of all releases introduce at least one breaking change. We perform an empirical study on potential rework caused by breaking changes in library releases and find that breaking changes have a significant impact on client libraries using the changed functionality. We find out that minor releases generally have larger release intervals than major releases. We also investigate the use of deprecation tags and find out that these tags are applied improperly in our dataset.