On the Shoulders of Giants: A New Dataset for Pull-based Development Research

Conference Paper (2020)
Author(s)

Xunhui Zhang (National University of Defense Technology)

Ayushi Rastogi (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Yue Yu (National University of Defense Technology)

Research Group
Software Engineering
DOI related publication
https://doi.org/10.1145/3379597.3387489 Final published version
More Info
expand_more
Publication Year
2020
Language
English
Research Group
Software Engineering
Pages (from-to)
543-547
ISBN (electronic)
978-1-4503-7517-7
Event
17th International Conference on Mining Software Repositories (2020-10-05 - 2020-10-06), Seoul, Korea, Republic of
Downloads counter
184

Abstract

Pull-based development is a widely adopted paradigm for collaboration in distributed software development, attracting eyeballs from both academic and industry. To better study pull-based development model, this paper presents a new dataset containing 96 features collected from 11,230 projects and 3,347,937 pull re- quests. We describe the creation process and explain the features in details. To the best of our knowledge, our dataset is the most comprehensive and largest one toward a complete picture for pull-based development research.