This tool aims to enhance the understanding and improvement of pull request (PR) reviews within a team.
It provides operational indicators and dashboards to analyze review dynamics, such as review time, number of comments per review, etc., across one or multiple repositories.
To quickly try the tool, load the demo dataset with the following steps:
- Execute `make start` to run the Docker containers.
- Use `make load_demo_dataset` to load the demo dataset into the database.
- Access the main dashboard at http://localhost:3000/dashboards with the following credentials:
  - User: admin
  - Password: admin
- Launch the script `./init.sh`.
- Fill in `GIT_BRANCHES` in `.env` to specify the repositories to load (an example single-line value is shown after this list).
  - The format should be a JSON array without line breaks (shown pretty-printed here for readability):

  ```json
  [
    {
      "name": "git branch name",
      "repository": {
        "organisation": "Azure or Git organization",
        "project": "project name (used by Azure)",
        "name": "remote repository name",
        "url": "remote SSH URL for cloning the repository",
        "token": "GitHub or Azure personal access token"
      }
    }
  ]
  ```

- Run the Docker containers with `make start`.
- Execute `make setup_db` to create and initialize the database.
- Run `make check_settings` to verify connections to the repositories.
  - If an error occurs, check `.env` or verify the correctness of the GitHub or Azure personal access token.
- Clone repositories with `make clone_repositories`.
  - This command copies SSH keys into `./git/.ssh` and clones repositories into `./git/repositories/*`.
- Collect data with `make load` to load information into the local database:
  - Review information (pull requests, reviewers, approvers, ...) from the Azure/GitHub APIs
  - Features added to the main branch (`GIT_BRANCHES[*].name` in `.env`) from git stats
- Access the main dashboard at http://localhost:3000/dashboards with the following credentials:
  - User: admin
  - Password: admin
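
For reference, here is a hypothetical single-line value for `GIT_BRANCHES` in `.env`, targeting one GitHub repository; the organisation, project, repository name, URL, and token are placeholders to replace with your own values:

```dotenv
# Hypothetical example (placeholder values): one branch of one GitHub repository, on a single line.
GIT_BRANCHES=[{"name":"main","repository":{"organisation":"my-org","project":"my-project","name":"my-repo","url":"git@github.com:my-org/my-repo.git","token":"<personal access token>"}}]
```
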
Use `make check_settings` to verify connections to the repositories for cloning and API calls.
This command ensures that `.env` is correct and that everything works as expected when loading pull requests, comments, and features.
To load pull requests and comments, use `make load_pull_requests_review_informations`.
This command loads data by delta, avoiding the need to reload all data each time. It calls the Azure and GitHub APIs to get pull requests and comments, then processes and uploads them into the database.
To load information about features added over time, use `make load_features_from_repositories`.
This command also loads data by delta. It clones repositories, extracts git information, and uploads it into the `features` database table.
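
Since both commands load by delta, they can be re-run regularly without reloading everything; a minimal sketch of a periodic refresh (how you schedule it is up to you) could be:

```shell
# Both loads are incremental (delta): re-running them only fetches the pull requests,
# comments, and features added since the previous run.
make load_pull_requests_review_informations
make load_features_from_repositories
```
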
For developers contributing with different accounts, use `make transcode_into_database` to group contributions:
- Copy `postgres/transco/transcoders.json.example` to `postgres/transco/transcoders.json`.
- Complete the `developers_names_by_email` section.
- Apply updates with `make transcode_into_database`.
Example
Some developers may contribute to a repository under different accounts. For instance, you may find commits or reviews associated with these email addresses:
- `alain.dupont@company.com`
- `a.dupont@company.com`
- `alain.dupont@perso.com`

In such cases, you will want to group all of them under the name "Alain DUPONT".
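
The exact schema of `transcoders.json` is given by `postgres/transco/transcoders.json.example`; assuming a simple email-to-name mapping (which may differ from the real file), the `developers_names_by_email` section could look like this:

```json
{
  "developers_names_by_email": {
    "alain.dupont@company.com": "Alain DUPONT",
    "a.dupont@company.com": "Alain DUPONT",
    "alain.dupont@perso.com": "Alain DUPONT"
  }
}
```
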
For setting pull request types based on branch names, use `make transcode_into_database`:
- Copy `postgres/transco/transcoders.json.example` to `postgres/transco/transcoders.json`.
- Complete the `pull_requests_type` section.
- Apply updates with `make transcode_into_database`.
Example
By default, the type is set to "feat" for all branches. However, if you have a branch-naming policy, such as prefixing branches with "feat/", "fix/", or "release/", you may want to filter or exclude certain branch types on dashboards. For example:
- "feat/add_button" and "feat/new_screen" could be of type "feat"
- "fix/infinite_loop" and "fix/display_bug" could be of type "fix"
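
Again, `postgres/transco/transcoders.json.example` is the authoritative reference; assuming a simple prefix-to-type mapping (which may differ from the actual format), the `pull_requests_type` section could look like this:

```json
{
  "pull_requests_type": {
    "feat/": "feat",
    "fix/": "fix",
    "release/": "release"
  }
}
```
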
Run `make setup_db` to clear all data and start importing from scratch.
Run `make dump_db` to dump the database to JSON files. Reimport it later with `make init_db_from_json_files`.
Run `make create_demo_dataset` to dump and anonymize the database. Reimport it later with `make load_demo_dataset`.
```mermaid
sequenceDiagram
    participant Azure
    participant Github
    participant ETL Service
    participant GitRepository
    participant LocalDatabase
    autonumber
    loop through branches/repositories defined in GIT_BRANCHES .env
        Azure ->>+ ETL Service: Getting pull requests and comments via API
        Github ->> ETL Service: Getting pull requests and comments via API
        GitRepository ->> ETL Service: Getting git stats (part of code which is modified, number of changes, etc)
        Note over GitRepository,ETL Service: To use this data, the main branch should have only one commit per feature.<br/>Commits from feature branches should be squashed during the merge into the main branch.
    end
    ETL Service ->> LocalDatabase: Process and store data
```

Dashboards, built with Grafana, provide operational indicators and data exploration:
- The "Metrics" dashboard displays operational indicators.
- The "Details: Pull Requests" and "Details: Comments" dashboards allow checking or displaying lists of pull requests and comments.
The project follows domain-driven design principles:
- Collected data is modeled into entities (cf. `domain/entities`).
- Repositories (in the DDD sense) load and/or upload entities from/to different sources (Azure, GitHub, JSON, Git, database).
- Basic domain use cases are defined in `domain/use_cases`, and more complex use cases are defined in `app/controllers`.
- API routes or commands in the presentation layer define how to invoke the use cases.
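
To make this layering concrete, here is a minimal sketch in Python; every name in it (`PullRequest`, `PullRequestRepository`, `load_pull_requests`) is invented for illustration and does not refer to the project's actual classes, modules, or implementation language:

```python
from dataclasses import dataclass
from typing import List, Protocol


# Hypothetical entity (domain/entities): plain data, no I/O.
@dataclass
class PullRequest:
    id: int
    author: str
    comments_count: int


# Hypothetical repository port (in the DDD sense): an interface that concrete
# adapters (Azure, GitHub, JSON, Git, database) would implement.
class PullRequestRepository(Protocol):
    def fetch_all(self) -> List[PullRequest]: ...
    def save_all(self, pull_requests: List[PullRequest]) -> None: ...


# Hypothetical use case (domain/use_cases): orchestrates repositories without
# knowing which concrete source or destination sits behind them.
def load_pull_requests(source: PullRequestRepository,
                       destination: PullRequestRepository) -> int:
    pull_requests = source.fetch_all()
    destination.save_all(pull_requests)
    return len(pull_requests)
```

The presentation layer (API routes or make commands) would then only invoke such use cases, keeping source-specific code inside the concrete repository implementations.
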
Run `make run_test_database` before running tests to start the database used as a mock for tests, and `make tests` to run all tests.
