~berfr/commit-history-data-analysis

Commit history aggregation and analysis tools
Add berfr blog article link in readme file
Add license file
Initial commit

refs

master
browse  log 

clone

read-only
https://git.sr.ht/~berfr/commit-history-data-analysis
read/write
git@git.sr.ht:~berfr/commit-history-data-analysis

You can also use your local clone with git send-email.

#commit-history-data-analysis

#About

This project contains scripts to aggregate and analyze git commit history. The get-data.sh script will scan a base directory for git repositories and will output their log in CSV format in the data directory. In the end, it will merge all results into the data/results.csv file. To analyze the resulting data, simply open Jupyter Notebook as indicated below and rerun the whole notebook.

An article with discussions related to this project can be found here: https://berfr.me/posts/commit-history-data-analysis/.

#Instructions

# put `git-csvlog` in a directory in your `$PATH`
ln -s git-csvlog ~/bin/

# run `get-data.sh` with the base directory and author list as a parameters
./get-data.sh ~/code/ "author 1" "author 2"

# run data analysis on data
python3 -m venv venv && . venv/bin/activate
pip install -r requirements.txt
jupyter notebook analyze_results.ipynb