~berfr/commit-history-data-analysis

Commit history aggregation and analysis tools
Add license file
Initial commit

refs

master
browse log

clone

read-only
https://git.sr.ht/~berfr/commit-history-data-analysis
read/write
git@git.sr.ht:~berfr/commit-history-data-analysis

You can also use your local clone with git send-email.

commit-history-data-analysis

About

This project contains scripts to aggregate and analyze git commit history. The get-data.sh script will scan a base directory for git repositories and will output their log in CSV format in the data directory. In the end, it will merge all results into the data/results.csv file. To analyze the resulting data, simply open Jupyter Notebook as indicated below and rerun the whole notebook.

Instructions

# put `git-csvlog` in a directory in your `$PATH`
ln -s git-csvlog ~/bin/

# run `get-data.sh` with the base directory and author list as a parameters
./get-data.sh ~/code/ "author 1" "author 2"

# run data analysis on data
python3 -m venv venv && . venv/bin/activate
pip install -r requirements.txt
jupyter notebook analyze_results.ipynb