Title: Mining Fork-Including Software Development Traces
Abstract: This dataset relates to the paper: Mining Fork-Including Development Traces (abstract below)<br> Authors: Iris Reinhartz-Berger and Amir Tomer<br> Starting point: readme.txt Open-source software development is a common practice that encourages collaborative development and reuse across projects. Forking is a way to make a copy of an existing project and explore it for different purposes. Two types of forks are commonly mentioned in the literature: <em>contributing forks</em> which continue the development lines of the forked projects and aim at merging the contribution back to the forked projects; and <em>independently developed forks</em> which open new lines of development deviating from the forked projects. In this study, we aim to explore characteristics of fork-involving software development traces. Analyzing 880 Java projects and their related action and observation events, with process mining and statistical techniques, we found that the occurrence of certain event types may predict the fork type, while the creation of certain fork types increase the involvement of users in the forked projects.