typedef-ai/ade-bench-plugin

3 stars · Last commit 2026-05-08

Claude Code plugin that generates ADE-Bench benchmark tasks from your own dbt project

# ade-bench-plugin

A [Claude Code plugin](https://docs.anthropic.com/en/docs/claude-code) that generates [ADE-Bench](https://github.com/dbt-labs/ade-bench) benchmark tasks from your own dbt project.

## What it does

Instead of relying on ADE-Bench's built-in sample projects, this plugin lets you benchmark AI agents against **your own** dbt models and data. It provides two skills:

- **`plan-tasks`** (recommended) — Interactive pair-planning. Explores your project with you, reasons about what makes good benchmarks, and builds a task plan collaboratively before generating anything.
- **`create-task`** — Automated pipeline. Scans your project, matches models against a pattern catalog, and generates tasks. Faster, but less tailored.

## Prerequisites

- [Claude Code](https://docs.anthropic.com/en/docs/claude-code) installed
- `git`, [`uv`](https://github.com/astral-sh/uv), and Docker available on your machine (for running the generated tasks)
- A dbt project with either:
  - A DuckDB database file, or
  - A Snowflake account with credentials configured
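
Before the first run, it can help to confirm that the command-line prerequisites are actually on your `PATH`. The following is a minimal sketch, not part of the plugin: the helper name `missing_tools` and the executable names `git`, `uv`, and `docker` are assumptions for illustration. It only checks that each executable can be found; it does not verify that the Docker daemon is running or that your dbt project and database credentials are valid.

```python
import shutil

# Assumed executable names for the prerequisites listed above.
REQUIRED_TOOLS = ("git", "uv", "docker")


def missing_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return the names in `tools` that cannot be found on PATH.

    `which` is injectable so the check can be exercised without
    depending on what is installed on the current machine.
    """
    return [tool for tool in tools if which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing prerequisites: " + ", ".join(missing))
    else:
        print("All command-line prerequisites found on PATH.")
```

Saving this as a standalone script and running it with `python` prints either the missing tools or a confirmation line.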

The plugin will install [ADE-Bench](https://github.com/dbt-labs/ade-bench) for you on first use — `plan-tasks` and `create-task` detect when it's missing and offer to set it up at `~/.ade-bench`. You can also install it explicitly:
