typedef-ai/ade-bench-plugin
3 stars · Last commit 2026-05-08
Claude Code plugin that generates ADE-Bench benchmark tasks from your own dbt project
README preview
# ade-bench-plugin A [Claude Code plugin](https://docs.anthropic.com/en/docs/claude-code) that generates [ADE-Bench](https://github.com/dbt-labs/ade-bench) benchmark tasks from your own dbt project. ## What it does Instead of relying on ADE-Bench's built-in sample projects, this plugin lets you benchmark AI agents against **your own** dbt models and data. It provides two skills: - **`plan-tasks`** (recommended) — Interactive pair-planning. Explores your project with you, reasons about what makes good benchmarks, and builds a task plan collaboratively before generating anything. - **`create-task`** — Automated pipeline. Scans your project, matches models against a pattern catalog, and generates tasks. Faster, but less tailored. ## Prerequisites - [Claude Code](https://docs.anthropic.com/en/docs/claude-code) installed - `git`, [`uv`](https://github.com/astral-sh/uv), and Docker available on your machine (for running the generated tasks) - A dbt project with either: - A DuckDB database file, or - A Snowflake account with credentials configured The plugin will install [ADE-Bench](https://github.com/dbt-labs/ade-bench) for you on first use — `plan-tasks` and `create-task` detect when it's missing and offer to set it up at `~/.ade-bench`. You can also install it explicitly: