A library for automating test execution and repository operations.
It simplifies working with Docker and local repositories and provides utilities for managing patches and tests.
Originally built to facilitate LLM generation testing at the repository level, aimed at easing workflows for data scientists.
- Install the latest version using pip:
pip install repositorytest- To run the project in debug mode, add the current folder to the Python path:
import sys
sys.path.append("/path/to/repotest")- Check which modes work on your current machine:
from repotest.scripts import check_perfomanceThe following diagram shows the architecture of the project and how different components interact:
from repotest.utils.clean import remove_all_containers, remove_all_images, clean_all, ...The project supports two execution modes: Docker-based and local.
Using Docker:
from repotest.core.docker.java import JavaDockerRepo
repo = JavaDockerRepo(
repo="Osiris-Team/SPPU",
base_commit="1a86a84e3c2dd7446b4778db8bb5dbaf44944f1c",
image_name="maven:3.9.9-eclipse-temurin-23-alpine"
)
repo.clean() # Reset the repo folder to the base commit state
# repo.build_env("", timeout=60*5) # Set up the environment
# repo.apply_patch(patch) # Apply a patch to the repo
repo.run_test("mvn test", timeout=60*5) # Run tests with a 5-min timeoutUsing Local Filesystem:
from repotest.core.local.java import JavaLocalRepo
repo = JavaLocalRepo(
repo="Osiris-Team/SPPU",
base_commit="1a86a84e3c2dd7446b4778db8bb5dbaf44944f1c",
)
repo.clean() # Reset the repo folder to the base commit state
# repo.build_env("", timeout=60*5)
# repo.apply_patch(patch)
repo.run_test("mvn test", timeout=60*5)The interface for *Repo classes is the same across modes.
The only difference is that JavaDockerRepo allows specifying an image and cache volumes.
By default, Maven cache is stored in a Docker volume.
The following code generates a git patch from changes in the repository:
from repotest.utils.git.git_diff_wrapper import GitDiffWrapper
changer = GitDiffWrapper(
repo=task['repo_name'],
base_commit=task['base_commit']
)
changer.change_test(
fn_test=task['test_file'],
str_test=task['output'],
str_source=task['source_code']
)
changer.fix_pom_file()
patch = changer.git_diff()All logs are saved to: repotest/logs/%Y-%m-%d.log
Only critical logs are printed to stdout by default.
To increase verbosity and print logs to stdout:
from repotest.logger import change_console_logger_level
from logging import DEBUG
change_console_logger_level(DEBUG)To build the package from source:
python -m buildTests follow the same file structure as the main library. To run them (takes ~1–2 minutes):
pytest testsAll datasets used for testing the library are stored in ../datasets/*.json.
Example of how to load one:
from datasets import load_dataset
ds = load_dataset('json', data_files="../datasets/java_from_1_2k_from_parser.jsonl")By default, logs are stored in ~/.cache/repotest/logs/{date}.log.
To toggle stdout logging:
from repotest.constants import enable_stdout_logs, disable_stdout_logs
enable_stdout_logs()
disable_stdout_logs()