Quickstart¶

This guide takes you from a fresh install to a running research session.

1. Configure a provider¶

Run the interactive setup wizard once. It writes your provider, model, and API key to a user config so you don't repeat them on every run:

arbor setup

First run shortcut

If you start a run before configuring anything, Arbor detects the missing config in an interactive terminal and walks you through arbor setup automatically.

Prefer to do it by hand? Set environment variables instead:

AnthropicOpenAIOpenAI-compatible (LiteLLM)

export ANTHROPIC_API_KEY=sk-ant-...

export OPENAI_API_KEY=sk-...

export OPENAI_API_KEY=...            # your gateway key
export OPENAI_BASE_URL=https://your-gateway/v1

See Configuration for the full provider matrix.

2. Start a session¶

The way to use Arbor is to run arbor inside your project directory:

cd my_project
arbor

This opens an intake chat. You describe your goal in plain language; the intake agent reads your code and README, confirms the metric and baseline, helps you shape a plan, and then launches the study once you both agree. From there you stay in the same terminal — watching progress and steering the run with slash commands.

No project yet? Try the bundled example

For a no-API-budget, no-GPU, sub-second run you can watch end-to-end, use the examples/algotune_knn task — make a brute-force k-NN solver faster while matching the reference output. Run it outside your Arbor checkout so experiment worktrees don't touch that repo:

cp -r examples/algotune_knn /tmp/algotune_knn
cd /tmp/algotune_knn
git init -q && git add -A && git commit -qm baseline
arbor

Seed the goal up front

You can pass your objective as the first argument and still go through intake:

arbor "maximize dev score without changing eval or data"

Headless / scripted runs (CI)

To launch without the intake conversation — for benchmarks or CI — skip the chat with --yes and point at the project explicitly:

arbor run "improve held-out accuracy" \
  --yes \
  --yes-cwd /path/to/project \
  --config /path/to/project/research_config.yaml

For day-to-day use, prefer the interactive arbor above.

3. Watch it work¶

While a run is active you get three views:

Terminal dashboard — live status of the current cycle, the Idea Tree, and costs.
Read-only web monitor — auto-starts in your browser near port 8765 (disable with --no-webui, change with --webui-port).
REPORT.md — the final write-up, generated when the run finishes.

Inside the dashboard you can steer the run with slash commands such as /status, /tree, /evidence, /cost, /pause, and /resume. See the CLI reference.

4. Read the results¶

When the run completes, Arbor writes a REPORT.md and opens an optional read-only Q&A prompt so you can interrogate the finished study (disable with --no-followup). All artifacts — the Idea Tree, checkpoints, logs, and per-experiment branches — live under <project>/.arbor/sessions/<run_name>/.

Where to go next¶

Preparing a Benchmark

Wire up an eval command and protect your data so Arbor can iterate safely.

Preparing a Benchmark
How It Works

The arbor cycle, the Idea Tree, and held-out discipline.

How It Works