ref: 52961ab33d317e10dd736f7c3e5c70a038690d45 commit-history-data-analysis/README.md -rw-r--r-- 806 bytes
52961ab3 — berfr Initial commit 9 months ago



This project contains scripts to aggregate and analyze git commit history. The get-data.sh script will scan a base directory for git repositories and will output their log in CSV format in the data directory. In the end, it will merge all results into the data/results.csv file. To analyze the resulting data, simply open Jupyter Notebook as indicated below and rerun the whole notebook.


# put `git-csvlog` in a directory in your `$PATH`
ln -s git-csvlog ~/bin/

# run `get-data.sh` with the base directory and author list as a parameters
./get-data.sh ~/code/ "author 1" "author 2"

# run data analysis on data
python3 -m venv venv && . venv/bin/activate
pip install -r requirements.txt
jupyter notebook analyze_results.ipynb