Agent Flow Plugin

Transform Claude Code into a multi-agent orchestrated system with verification gates.

Features

Specialized Agent Delegation: Route tasks to expert agents (Riko, Senku, Loid, Lawliet, Alphonse)
Mandatory Verification Gates: Stop hooks ensure tests pass before task completion
Domain Expertise: Skills provide guidance on task classification and verification
Cost-Aware Model Selection: Opus for exploration/planning, Sonnet for execution/review/verification
Knowledge Graph Integration: Graphify MCP server gives Riko/Senku/Lawliet structural codebase queries (blast-radius, communities, shortest path)
Personal Knowledge Base: Personal-kb MCP server surfaces cross-project prior decisions and patterns for Riko/Senku/Lawliet

Prerequisites

Requirement	Version	Purpose
Claude Code CLI	latest	Plugin host
Bash	4+	Hook scripts
`jq`	any	Plugin validation + hook scripts
`git`	any	Installation
Python 3.9+ with `graphifyy[mcp]`	optional	Knowledge graph queries (install via `pipx install 'graphifyy[mcp]'`)

Installation

Method 0: Claude Code Marketplace (Recommended)

/plugin marketplace add josix/agent-flow
/plugin install agent-flow@josix-plugins

Method 1: Local Plugin Directory (Development)

claude --plugin-dir /path/to/agent-flow

Method 2: Git Clone

git clone https://github.com/josix/agent-flow.git
cd agent-flow
claude --plugin-dir .

New to agent-flow? Walk through your first orchestration — a fresh-repo demo that exercises all 5 agents end-to-end.

Validating Your Installation

Before running your first orchestration, verify the plugin is wired up correctly:

bash scripts/validate-plugin.sh

The script runs 10 structural checks covering:

plugin.json and hooks/hooks.json are valid JSON
All agent, skill, and command frontmatter parse correctly
Hook scripts exist, are executable, and have valid syntax
Edge cases: missing jq, malformed hook inputs, env-var propagation

If all checks pass, you see: ✓ All tests passed.

Commands

/deep-dive

Gather comprehensive codebase context using parallel exploration agents. Creates a reusable context file that accelerates subsequent orchestration tasks.

/deep-dive                    # Full codebase exploration
/deep-dive --focus=src/auth   # Focus on specific path
/deep-dive --refresh          # Refresh existing context

Output: .claude/deep-dive.local.md - Ephemeral, session-scoped context file

Workflow:

Fire 5+ parallel Riko agents exploring different aspects (structure, conventions, anti-patterns, etc.)
Senku synthesizes findings into unified context
Compile output to structured markdown

Integration with /orchestrate:

/orchestrate --use-deep-dive Add user authentication

When used with --use-deep-dive, orchestration leverages existing context to skip redundant exploration.

/orchestrate

Coordinate complex multi-step tasks through the agent system. This command delegates to specialist agents in sequence:

Riko explores the codebase for context
Senku creates an implementation plan
Loid implements the changes
Lawliet reviews code quality
Alphonse runs verification gates

/orchestrate Add user authentication with JWT tokens
/orchestrate --use-deep-dive Add user profile page   # Use existing deep-dive context

Arguments:

--use-deep-dive: Use existing deep-dive context for accelerated exploration

Note: Planning and verification are handled by agents (Senku, Alphonse) within the orchestration workflow rather than as standalone commands. This prevents responsibility conflicts in multi-agent coordination.

/team-orchestrate

Execute complex tasks with parallel review and verification using Agent Teams. This command follows the same workflow as /orchestrate but runs Lawliet (review) and Alphonse (verification) concurrently after implementation, reducing wall-clock time by 30-40%.

/team-orchestrate Add user authentication with JWT tokens
/team-orchestrate --use-deep-dive Add user profile page
/team-orchestrate --force-sequential Add feature  # Fall back to sequential mode

Arguments:

--use-deep-dive: Use existing deep-dive context for accelerated exploration
--force-sequential: Force sequential execution instead of parallel review+verification

Workflow:

Riko explores the codebase (sequential)
Senku creates an implementation plan (sequential)
Loid implements the changes (sequential)
Lawliet + Alphonse run in parallel (team mode)
Results merged and processed

/agent-flow:analyze

Parse Claude Code session transcripts and live hook events to surface subagent behaviour, tool usage, token costs, and improvement opportunities. All data stays local in a SQLite database — no network calls, no dependencies beyond Python stdlib.

# Load all sessions and generate a report
bash scripts/analyze.sh load --all-sessions && bash scripts/analyze.sh report

# Report for a single session
bash scripts/analyze.sh report --session <session_id>

# List all sessions in the database
bash scripts/analyze.sh sessions

# Ad-hoc SQL against the store
bash scripts/analyze.sh sql "SELECT agent_type, tool_name, COUNT(*) n FROM events GROUP BY agent_type, tool_name ORDER BY n DESC"

# Prune sessions older than 30 days
bash scripts/analyze.sh retention --days 30

The report covers tool usage by agent, MCP/skill invocations, thinking effort, token spend, subagent dispatch rates, rejection rates, and automated improvement heuristics (high iteration rate, tool allowlist violations, missing MCP usage, model mismatches).

Output files (gitignored): .claude/observability/events.db, .claude/observability/report.md

For the complete subcommand reference (including label, export, and exporter configuration) see docs/guides/using-analyze.md.

Agents

Agent	Model	Purpose
Senku	Opus	Creates detailed implementation strategies
Riko	Opus	Fast codebase exploration
Loid	Sonnet	Implements code changes
Lawliet	Sonnet	Code quality assurance
Alphonse	Sonnet	Runs tests and validation

Hooks

UserPromptSubmit Hook (Prompt-based)

Analyzes user prompts for task clarity:

Evaluates if the request is well-defined
Applies prompt refinement when needed
Transforms vague requests into structured format (Goal, Description, Actions)

PreToolUse Hook (enforce-delegation.sh)

Provides delegation guidance: Reminds agents about proper delegation patterns when writing files.

Allowed silently: Writing to .senku/ directory (planning files) Allowed with reminder: Writing to other files (shows delegation guidance message)

Note: Agent tool restrictions are the primary enforcement mechanism - Riko, Senku, and Lawliet do not have Write/Edit tools in their definitions. This hook provides helpful context rather than blocking.

PreToolUse Hook (validate-changes.sh)

Validates file writes before execution to prevent:

Path traversal attacks (.. in paths)
Writes to sensitive files (.env, credentials, keys)
Writes to system paths (/etc, /usr, /bin)

PostToolUse Hook (Task verification reminder)

Context-aware verification guidance: After delegation via Task tool, provides agent-specific verification guidance.

Verification levels by agent type:

Riko (exploration): Accept findings, no code verification needed
Senku (planning): Review plan completeness
Loid (implementation): Full verification required (read files, run tests, check types)
Lawliet (review): Consider feedback
Alphonse (verification): Check test results

This ensures appropriate verification without unnecessary friction for non-implementation tasks.

PostToolUse Hook (validate-changes.sh)

Validates file writes after execution using the same guardrails as PreToolUse.

Stop Hook (verify-completion.sh)

Runs before task completion to verify:

Node.js Projects:

Tests: npm test (when package.json has test script)
TypeScript: npx tsc --noEmit (when tsconfig.json exists)

Python Projects:

Tests: pytest detection priority chain:
- Custom command (from .claude/test-command file)
- uv run pytest (when uv.lock present)
- Global pytest (when tests/ directory exists)
Import/collection errors: ImportError, ModuleNotFoundError, SyntaxError treated as fatal
Known failures: Only blocks on NEW failures not in .claude/known-test-failures list
Type checking: mypy . (requires both mypy command and mypy.ini file)

Configuration Files:

File	Purpose
`.claude/skip-test-verification`	Bypass all test verification (first line used as reason)
`.claude/known-test-failures`	List of expected failures (one test name per line)
`.claude/test-command`	Custom test command to run (first non-comment line used)

SessionStart Hook (load-project-context.sh)

Detects project type and sets environment:

Project type (nodejs, python, rust, go, java)
Test framework (jest, pytest, cargo-test, etc.)
Available tooling (TypeScript, ESLint, Ruff)

TeammateIdle Hook (teammate-idle-check.sh)

Validates teammate output quality using role-based criteria:

Input: JSON from stdin with teammate_role and teammate_output fields
Reviewer (Lawliet): Must contain verdict (APPROVED/NEEDS_CHANGES/BLOCKED) + static analysis evidence
Verifier (Alphonse): Must contain at least 2 verification gate results + command output
Other roles: Approved without specific checks
Output: Always exits 0; decision communicated via JSON decision field (approve or block) written to stdout

TaskCompleted Hook (task-completed-check.sh)

Validates task completion messages for concrete evidence:

Input: JSON from stdin with task_status and completion_message fields
Validation: Only for complete/done/finished tasks
Evidence checks: Message length >= 20 chars, file mentions, verification indicators, concrete actions, results/metrics
Output: Always exits 0; decision communicated via JSON decision field (approve or block) written to stdout

Skills

Skills are domain expertise modules that provide behavioral patterns and best practices. Each skill has an owner agent responsible for embodying it and consumer agents that reference it.

For the complete skill-agent mapping, see skills/skill-agent-mapping/SKILL.md.

Skill-Agent Relationships

Skill	Owner	Consumers
exploration-strategy	Riko	Senku, Loid
task-classification	Senku	Riko, Orchestrator
prompt-refinement	Senku	Orchestrator
verification-gates	Alphonse	Loid, Lawliet
agent-behavior-constraints	System	All
team-decision	Senku	Orchestrator
graphify-usage	Riko	Senku, Lawliet
personal-kb-usage	Riko	Senku, Lawliet

Task Classification

Guidance on routing tasks to appropriate agents:

Trivial → Direct handling
Exploratory → Explorer agent
Implementation → Executor + Verifier
Complex → Orchestrator

Verification Gates

Mandatory validation patterns:

Pre-commit gates (type check, lint)
Pre-complete gates (tests, build)
Override rules (when skipping is allowed)

Exploration Strategy

Parallel context gathering and search stop rules.

Prompt Refinement

Ambiguous request clarification and structured task specification.

Agent Behavior Constraints

Model routing, tool access permissions, and universal behavioral guardrails.

Architecture Principles

The "Subagents LIE" Principle

Never trust unverified output. Every significant change requires mandatory testing, linting, and type-checking before acceptance.

Orchestrator Discipline

Orchestrators delegate work rather than implementing directly. Specialists handle domain-specific tasks.

Cost-Aware Model Selection

Opus: Strategic/planning tasks - deep reasoning (Riko, Senku)
Sonnet: Execution/verification tasks - speed (Loid, Lawliet, Alphonse)

Documentation

For detailed documentation, see the docs/ directory:

Architecture: Overview, Team Orchestration
Concepts: Parallel Safety
Guides: Using Team-Orchestrate
Reference: Commands, Hooks, Skills, State Files

See also CHANGELOG.md for version history.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.claude-plugin		.claude-plugin
agents		agents
commands		commands
docs		docs
hooks		hooks
scripts		scripts
skills		skills
.gitignore		.gitignore
.mcp.json		.mcp.json
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
mkdocs.yml		mkdocs.yml

Folders and files

Latest commit

History

Repository files navigation

Agent Flow Plugin

Features

Prerequisites

Installation

Method 0: Claude Code Marketplace (Recommended)

Method 1: Local Plugin Directory (Development)

Method 2: Git Clone

Validating Your Installation

Commands

/deep-dive

/orchestrate

/team-orchestrate

/agent-flow:analyze

Agents

Hooks

UserPromptSubmit Hook (Prompt-based)

PreToolUse Hook (enforce-delegation.sh)

PreToolUse Hook (validate-changes.sh)

PostToolUse Hook (Task verification reminder)

PostToolUse Hook (validate-changes.sh)

Stop Hook (verify-completion.sh)

SessionStart Hook (load-project-context.sh)

TeammateIdle Hook (teammate-idle-check.sh)

TaskCompleted Hook (task-completed-check.sh)

Skills

Skill-Agent Relationships

Task Classification

Verification Gates

Exploration Strategy

Prompt Refinement

Agent Behavior Constraints

Architecture Principles

The "Subagents LIE" Principle

Orchestrator Discipline

Cost-Aware Model Selection

Documentation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages