PyDriller

Python Framework for Mining Software Repositories

Conference Paper (2018)
Author(s)

Davide Spadini (TU Delft - Software Engineering)

Maurício Aniche (TU Delft - Software Engineering)

A. Bacchelli (University of Zurich)

Research Group
Software Engineering
Copyright
© 2018 D. Spadini, Maurício Aniche, A. Bacchelli
DOI related publication
https://doi.org/10.1145/3236024.3264598
More Info
Publication Year
2018
Language
English
Pages (from-to)
908-911
ISBN (print)
978-1-4503-5573-5
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Software repositories contain historical and valuable information about the overall development of software systems. Mining software repositories (MSR) is nowadays considered one of the most interesting growing fields within software engineering. MSR focuses on extracting and analyzing data available in software repositories to uncover interesting, useful, and actionable information about the system. Even though MSR plays an important role in software engineering research, few tools have been created and made public to support developers in extracting information from Git repositories. In this paper, we present PyDriller, a Python framework that eases the process of mining Git. We compare our tool against the state-of-the-art Python framework GitPython, demonstrating that PyDriller can achieve the same results with, on average, 50% less LOC and significantly lower complexity.
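
To give a feel for the kind of mining task the abstract refers to, the following is a minimal sketch, not code from the paper. It assumes PyDriller is installed and that the entry class is named `Repository` (PyDriller 2.x) or `RepositoryMining` (PyDriller 1.x). The helper names `mine_repository` and `commits_per_author` are hypothetical, and the stand-in commits are placeholder objects rather than real repository data.

```python
from collections import Counter
from types import SimpleNamespace


def mine_repository(path):
    """Return an iterator over the commits of a Git repository.

    Assumption: PyDriller is installed. The import is deferred so that the
    rest of this sketch can run without it.
    """
    try:
        from pydriller import Repository  # class name in PyDriller 2.x
    except ImportError:
        # Class name in PyDriller 1.x (the series available in 2018).
        from pydriller import RepositoryMining as Repository
    return Repository(path).traverse_commits()


def commits_per_author(commits):
    """Count commits per author name.

    Works on any iterable of objects exposing `author.name`, which is the
    attribute layout PyDriller commit objects provide.
    """
    return Counter(commit.author.name for commit in commits)


# Stand-in commits (hypothetical, not real data) so the counting helper
# can be exercised without PyDriller or a Git repository.
stand_in_commits = [
    SimpleNamespace(author=SimpleNamespace(name="alice")),
    SimpleNamespace(author=SimpleNamespace(name="bob")),
    SimpleNamespace(author=SimpleNamespace(name="alice")),
]
stand_in_counts = commits_per_author(stand_in_commits)
```

With PyDriller available, `commits_per_author(mine_repository("path/to/repo"))` would apply the same count to a real history, where `path/to/repo` is a placeholder for a local clone.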

URL: https://github.com/ishepard/pydriller

Materials: https://doi.org/10.5281/zenodo.1327363

Pre-print: https://doi.org/10.5281/zenodo.1327411

Files

Main.pdf
(pdf | 0.519 MB)
License info not available