This guide covers the public SkillBench setup flow for A-Evolve, including auto-bootstrap, manual overrides, and common troubleshooting steps.
- `git` in `PATH` for auto-bootstrap
- Docker for SkillBench task execution
- Python 3.11+ for A-Evolve
- `uv` if you want to use SkillBench's `harbor` mode
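
The prerequisites above can be checked before a run with a small preflight helper. This is a minimal sketch, not part of A-Evolve: the function name `missing_prerequisites` is hypothetical, it assumes the Docker CLI is installed as `docker`, and it treats `git` as always required even though it is only needed for auto-bootstrap.

```python
import shutil
import sys


def missing_prerequisites(mode="native", which=shutil.which, version=None):
    """Return the prerequisites that appear to be missing.

    Hypothetical helper for illustration only. `which` and `version`
    are injectable so the check can be exercised without real tools.
    """
    version = tuple(version or sys.version_info[:2])
    missing = []
    if version < (3, 11):
        missing.append("Python 3.11+")
    # Assumption: git is checked unconditionally, although the guide
    # only requires it for auto-bootstrap.
    required = ["git", "docker"]
    if mode == "harbor":
        required.append("uv")  # uv is only needed for harbor mode
    for tool in required:
        if which(tool) is None:
            missing.append(tool)
    return missing
```

Calling `missing_prerequisites("harbor")` returns an empty list when everything is in place, so it can gate a run with a simple `if` check.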
Install the A-Evolve SkillBench extras first:

    pip install -e ".[skillbench,dev]"

If you do not provide any SkillBench paths, A-Evolve bootstraps the public `benchflow-ai/skillsbench` repo automatically.
- Default repo URL: `https://github.com/benchflow-ai/skillsbench.git`
- Default pinned ref: `828bb921fb94dc065bfefd6bac4e8938be3f71e0`
- Default cache root: `~/.cache/agent-evolve/skillbench/<ref>/repo`
The bootstrap downloads the subset needed for both native and harbor:
- `tasks/`
- `tasks-no-skills/`
- `libs/`
- `pyproject.toml`
- `uv.lock`
- `.python-version`
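
To inspect what a bootstrap left behind, the default cache location can be computed and checked for the expected entries. The sketch below is illustrative only: `default_cache_repo` and `missing_bootstrap_entries` are hypothetical names, and the entry list assumes the bootstrapped subset is `tasks/`, `tasks-no-skills/`, `libs/`, `pyproject.toml`, `uv.lock`, and `.python-version`.

```python
from pathlib import Path

# Default pinned ref, as documented in this guide.
DEFAULT_REF = "828bb921fb94dc065bfefd6bac4e8938be3f71e0"

# Assumption: the bootstrapped subset consists of these six entries.
BOOTSTRAP_ENTRIES = (
    "tasks",
    "tasks-no-skills",
    "libs",
    "pyproject.toml",
    "uv.lock",
    ".python-version",
)


def default_cache_repo(ref=DEFAULT_REF, home=None):
    """Build ~/.cache/agent-evolve/skillbench/<ref>/repo for a given ref."""
    home = Path(home) if home is not None else Path.home()
    return home / ".cache" / "agent-evolve" / "skillbench" / ref / "repo"


def missing_bootstrap_entries(repo_root):
    """List expected entries that do not exist under repo_root."""
    root = Path(repo_root)
    return [name for name in BOOTSTRAP_ENTRIES if not (root / name).exists()]
```

An empty result from `missing_bootstrap_entries(default_cache_repo())` suggests the cache is complete; anything else points at a partial or failed bootstrap.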
A-Evolve resolves SkillBench paths in this order:
1. Explicit CLI arguments such as `--tasks-dir-with-skills` or `--harbor-repo`
2. Low-level env vars: `SKILLBENCH_TASKS_DIR`, `SKILLBENCH_TASKS_NO_SKILLS_DIR`, `SKILLBENCH_HARBOR_REPO`
3. Repo-level env vars: `SKILLBENCH_REPO_DIR`, `SKILLBENCH_REPO_REF`
4. Auto-bootstrap into the cache directory
Low-level path overrides always win over repo-level auto-derivation.
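
The precedence described above can be sketched as a single lookup function. This is not A-Evolve's actual resolver: `resolve_tasks_dir` and its parameters are hypothetical, and `cache_repo` stands in for wherever auto-bootstrap would place the repo.

```python
import os
from pathlib import Path


def resolve_tasks_dir(use_skills, cli_value=None, env=None, cache_repo=None):
    """Return (path, source) for the tasks directory, highest priority first.

    Hypothetical illustration of the documented order:
    CLI argument > low-level env var > repo-level env var > auto-bootstrap.
    """
    env = os.environ if env is None else env
    subdir = "tasks" if use_skills else "tasks-no-skills"
    low_level_var = (
        "SKILLBENCH_TASKS_DIR" if use_skills else "SKILLBENCH_TASKS_NO_SKILLS_DIR"
    )

    if cli_value:
        return Path(cli_value), "cli"
    if env.get(low_level_var):
        # Low-level overrides win over repo-level derivation.
        return Path(env[low_level_var]), "low-level env"
    if env.get("SKILLBENCH_REPO_DIR"):
        # The tasks directory is derived from the repo root.
        return Path(env["SKILLBENCH_REPO_DIR"]) / subdir, "repo-level env"
    if cache_repo is not None:
        return Path(cache_repo) / subdir, "auto-bootstrap"
    raise RuntimeError("no SkillBench tasks directory could be resolved")
```

Returning the source label alongside the path makes it easier to see which override took effect when a run picks up an unexpected directory.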
| Variable | Purpose |
|---|---|
| `SKILLBENCH_REPO_DIR` | Path to a local public SkillsBench clone; A-Evolve derives `tasks/`, `tasks-no-skills/`, and the Harbor repo root from it |
| `SKILLBENCH_REPO_REF` | Ref used for auto-bootstrap when `SKILLBENCH_REPO_DIR` is not set |
| `SKILLBENCH_TASKS_DIR` | Direct override for `tasks/` |
| `SKILLBENCH_TASKS_NO_SKILLS_DIR` | Direct override for `tasks-no-skills/` |
| `SKILLBENCH_HARBOR_REPO` | Direct override for the Harbor-capable SkillsBench repo root |
Native mode with skills:

    python examples/skillbench_examples/skillbench_solve_one.py \
      --mode native \
      --use-skills true

Harbor mode without skills:

    python examples/skillbench_examples/skillbench_solve_one.py \
      --mode harbor \
      --use-skills false

Using a local SkillsBench clone:

    export SKILLBENCH_REPO_DIR=/path/to/skillsbench
    python examples/skillbench_examples/skillbench_solve_one.py --mode native

Using direct task directory overrides:

    export SKILLBENCH_TASKS_DIR=/path/to/tasks
    export SKILLBENCH_TASKS_NO_SKILLS_DIR=/path/to/tasks-no-skills
    python examples/skillbench_examples/skillbench_evolve_in_situ_cycle.py --use-skills false

Auto-bootstrap requires `git`. Install it or set `SKILLBENCH_REPO_DIR` / `SKILLBENCH_TASKS_DIR` manually.
If bootstrap fails, verify:
- the machine has internet access to GitHub
- the pinned ref still exists
- `git` can clone `https://github.com/benchflow-ai/skillsbench.git`

You can bypass bootstrap by cloning the repo yourself and setting `SKILLBENCH_REPO_DIR`.
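
A manual clone amounts to two `git` invocations followed by pointing `SKILLBENCH_REPO_DIR` at the result. The sketch below builds those commands with the default URL and pinned ref from this guide; the helper names are hypothetical, and it performs a full clone rather than whatever subset logic auto-bootstrap uses.

```python
import subprocess

REPO_URL = "https://github.com/benchflow-ai/skillsbench.git"
PINNED_REF = "828bb921fb94dc065bfefd6bac4e8938be3f71e0"


def manual_clone_commands(dest, url=REPO_URL, ref=PINNED_REF):
    """Return the git commands for a full clone checked out at ref."""
    return [
        ["git", "clone", url, str(dest)],
        ["git", "-C", str(dest), "checkout", ref],
    ]


def run_manual_clone(dest):
    """Run the commands in order; raises CalledProcessError on failure."""
    for cmd in manual_clone_commands(dest):
        subprocess.run(cmd, check=True)
```

After `run_manual_clone("/path/to/skillsbench")` succeeds, export `SKILLBENCH_REPO_DIR=/path/to/skillsbench` before launching the examples.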
For native mode, A-Evolve only requires the selected tasks directory:
- `use_skills=true` needs `tasks/`
- `use_skills=false` needs `tasks-no-skills/`
Check your direct overrides first, then repo-level overrides.
`harbor` mode needs a full public SkillsBench repo root containing at least:

- `libs/`
- `pyproject.toml`

If you only set task directories, `harbor` can still fail. Point `SKILLBENCH_HARBOR_REPO` or `SKILLBENCH_REPO_DIR` at a full SkillsBench repo.
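
The native and harbor path requirements can be checked up front with a small validator. This is a sketch under assumptions: `check_mode_paths` is a hypothetical helper, and it reads the harbor requirement as a `libs/` directory plus a root-level `pyproject.toml`.

```python
from pathlib import Path


def check_mode_paths(mode, use_skills=True, tasks_dir=None, harbor_repo=None):
    """Return a list of problems for the given mode; empty means OK."""
    problems = []
    if mode == "native":
        # Native mode only needs the selected tasks directory.
        expected = "tasks/" if use_skills else "tasks-no-skills/"
        if tasks_dir is None or not Path(tasks_dir).is_dir():
            problems.append(f"native mode needs an existing {expected} directory")
    elif mode == "harbor":
        if harbor_repo is None:
            problems.append("harbor mode needs a full SkillsBench repo root")
        else:
            root = Path(harbor_repo)
            # Assumption: libs/ and pyproject.toml sit at the repo root.
            if not (root / "libs").is_dir():
                problems.append("harbor repo is missing libs/")
            if not (root / "pyproject.toml").is_file():
                problems.append("harbor repo is missing pyproject.toml")
    else:
        raise ValueError(f"unknown mode: {mode!r}")
    return problems
```

Running it with the same paths you pass to A-Evolve helps separate a misconfigured override from a genuine task failure.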
The built package now bundles `seed_workspaces/skillbench`. If a packaged install still cannot find the seed workspace, rebuild and reinstall the wheel, then verify that `seed_workspaces/skillbench/manifest.yaml` is present in the wheel contents.
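
Because a wheel is a zip archive, the manifest check can be done with the standard library. In this minimal sketch, `wheel_has_seed_manifest` is a hypothetical helper; it matches on the path suffix because the package directory that prefixes `seed_workspaces/` inside the wheel is not stated here.

```python
import zipfile

SEED_MANIFEST = "seed_workspaces/skillbench/manifest.yaml"


def wheel_has_seed_manifest(wheel_path, suffix=SEED_MANIFEST):
    """Return True if any archive member ends with the manifest path."""
    with zipfile.ZipFile(wheel_path) as wheel:
        return any(name.endswith(suffix) for name in wheel.namelist())
```

Point it at the freshly built `.whl` file in your build output directory; a `False` result means the rebuild did not pick up the seed workspace.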