Commit history aggregation and analysis tools
Add berfr blog article link in readme file
Add license file
Initial commit


browse  log 



You can also use your local clone with git send-email.



This project contains scripts to aggregate and analyze git commit history. The get-data.sh script will scan a base directory for git repositories and will output their log in CSV format in the data directory. In the end, it will merge all results into the data/results.csv file. To analyze the resulting data, simply open Jupyter Notebook as indicated below and rerun the whole notebook.

An article with discussions related to this project can be found here: https://berfr.me/posts/commit-history-data-analysis/.


# put `git-csvlog` in a directory in your `$PATH`
ln -s git-csvlog ~/bin/

# run `get-data.sh` with the base directory and author list as a parameters
./get-data.sh ~/code/ "author 1" "author 2"

# run data analysis on data
python3 -m venv venv && . venv/bin/activate
pip install -r requirements.txt
jupyter notebook analyze_results.ipynb