Skip to content

Commit b3d3ddb

Browse files
committed
refactor(eval): rename /audit skill to /agent-eval
Renames the `.claude/skills/audit/` directory and all internal references to `agent-eval`, aligning the skill name with the `/agent-eval` command it invokes.
1 parent 1cd162a commit b3d3ddb

2 files changed

Lines changed: 4 additions & 4 deletions

File tree

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
2-
name: audit
3-
description: Benchmark CodeGraph retrieval quality on a real codebase by comparing agent behavior with vs without CodeGraph. Use when the user runs /audit or asks to test, benchmark, audit, or validate a codegraph version (the local dev build or a published npm version) against a language's repo.
2+
name: agent-eval
3+
description: Benchmark CodeGraph retrieval quality on a real codebase by comparing agent behavior with vs without CodeGraph. Use when the user runs /agent-eval or asks to test, benchmark, audit, or validate a codegraph version (the local dev build or a published npm version) against a language's repo.
44
---
55

66
# CodeGraph Quality Audit
@@ -32,7 +32,7 @@ user type a specific version (e.g. `0.7.10`). Map the answer to a VERSION token:
3232
- "Latest published" → `latest`
3333
- a typed version → that string (e.g. `0.7.10`)
3434

35-
**Step 2 — language.** Read `.claude/skills/audit/corpus.json`. Ask with
35+
**Step 2 — language.** Read `.claude/skills/agent-eval/corpus.json`. Ask with
3636
`AskUserQuestion` which language to test, listing the languages that have entries.
3737

3838
**Step 3 — repo.** From the chosen language's entries, ask which repo. Label each
Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
{
2-
"_comment": "Test corpus for /audit. Add entries freely. size: Small (<~150 files), Medium (~150-1500), Large (>~1500). 'question' is a representative architectural question that exercises cross-file understanding.",
2+
"_comment": "Test corpus for /agent-eval. Add entries freely. size: Small (<~150 files), Medium (~150-1500), Large (>~1500). 'question' is a representative architectural question that exercises cross-file understanding.",
33
"TypeScript": [
44
{ "name": "ky", "repo": "https://github.com/sindresorhus/ky", "size": "Small", "files": "~25", "question": "How does ky implement request retries and timeouts?" },
55
{ "name": "excalidraw", "repo": "https://github.com/excalidraw/excalidraw", "size": "Medium", "files": "~600", "question": "How does Excalidraw render and update canvas elements?" },

0 commit comments

Comments
 (0)