View on GitHub

RENE

Pull Requests or Commits? Which Method Should We Use to Study Contributors’ Behavior?

Replication package

This site contains scripts and datas to reproduce the paper intituled: Pull Requests or Commits? Which Method Should We Use to Study Contributors’ Behavior?

Guidelines

Selecting the projects

All projects used in this paper is available on this link

Collecting data

All data collected to our paper was builded in python, and stored in a MYSQL database.

  1. Create the database structure to store the data obtained by the collection tables structure

  2. Before executing the script GET_PR.py it is necessary to make some presets: Set the variables GH_USER and GH_PASSWD with your github platform user data (github registration required). Set the LANGUAGE variable with the language of the project to be collected.

  3. For eache project, run the script get_pr:

  get_pr('project_owner', 'project_name')
  1. Run the update_pr script to update some values
  update_pr()
  1. Run others scripts, available here

*** All commits and PR data is available here

Analysis