StackMemory

Lossless, project-scoped memory for AI tools • v0.3.16

StackMemory is a production-ready memory runtime for AI coding tools that preserves full project context across sessions. With Phases 1-4 complete, it delivers:

✅ 89-98% faster task operations than manual tracking
✅ 10,000+ frame depth support with hierarchical organization
✅ Full Linear integration with bidirectional sync
✅ 20+ MCP tools for Claude Code
✅ Context persistence that survives /clear operations
✅ Two-tier storage system with local tiers and infinite remote storage
✅ Smart compression (LZ4/ZSTD) with 2.5-3.5x ratios
✅ Background migration with configurable triggers
✅ 296 tests passing with improved error handling
✅ npm v0.3.16 published with production-ready improvements

Instead of a linear chat log, StackMemory organizes memory as a call stack of scoped work (frames), with intelligent LLM-driven retrieval and team collaboration features.

Memory is storage. Context is a compiled view.

Why StackMemory exists

Development tools lose context between sessions:

Previous decisions aren't tracked
Constraints get forgotten
Changes lack history
Tool execution isn't recorded

StackMemory solves this by:

Storing all events, tool calls, and decisions
Smart retrieval of relevant context
Call stack organization (10,000+ frame depth)
Configurable importance scoring
Team collaboration through shared stacks

Core concepts (quick mental model)

Concept	Meaning
Project	One GitHub repo (initial scope)
Frame	A scoped unit of work (like a function call)
Call Stack	Nested frames; only the active path is "hot"
Event	Append-only record (message, tool call, decision)
Digest	Structured return value when a frame closes
Anchor	Pinned fact (DECISION, CONSTRAINT, INTERFACE)

Frames can span:

multiple chat turns
multiple tool calls
multiple sessions

Hosted vs Open Source

Hosted (default)

Cloud-backed memory runtime
Fast indexing + retrieval
Durable storage
Per-project pricing
Works out-of-the-box

Open-source local mirror

SQLite-based
Fully inspectable
Offline / air-gapped
Intentionally N versions behind
No sync, no org features

OSS is for trust and inspection. Hosted is for scale, performance, and teams.

How it integrates

StackMemory integrates as an MCP tool and is invoked on every interaction in:

Claude Code
compatible editors
future MCP-enabled tools

The editor never manages memory directly; it asks StackMemory for the context bundle.

Product Health Metrics

Current Status (v0.3.16)

Metric	Current	Target	Status
Test Coverage	80%	90%	🟡
Performance (p50)	TBD	<50ms	🔄
Documentation	60%	100%	🟡
Active Issues	13 high	0 high	🟡
Code Quality	296 tests	350+	✅
npm Downloads	Growing	1K+/week	🚀

Quality Score: 72/100

Formula: (Test Coverage × 0.3) + (Performance × 0.3) + (Documentation × 0.2) + (Issues Resolution × 0.2)

Next Sprint Priorities

[STA-289] Performance Optimization - Achieve SLA targets
[STA-291] Code Cleanup - Zero TODOs, 90% coverage

QuickStart

1. Hosted (Recommended)

Step 1: Create a project

stackmemory projects create \
  --repo https://github.com/org/repo

This creates a project-scoped memory space tied to the repo.

Step 2: Install StackMemory

npm install -g @stackmemoryai/stackmemory@0.3.16
# or latest
npm install -g @stackmemoryai/stackmemory@latest

Step 3: Setup Claude Code Integration (Automated)

# Automatic setup - configures MCP and session hooks
npm run claude:setup

This automatically:

Creates ~/.claude/stackmemory-mcp.json MCP configuration
Sets up session initialization hooks
Updates ~/.claude/config.json with StackMemory integration

Manual setup alternative:

Click to expand manual setup steps

Create MCP configuration:

mkdir -p ~/.claude
cat > ~/.claude/stackmemory-mcp.json << 'EOF'
{
  "mcpServers": {
    "stackmemory": {
      "command": "stackmemory",
      "args": ["mcp-server"],
      "env": { "NODE_ENV": "production" }
    }
  }
}
EOF

Update Claude config:

{
  "mcp": {
    "configFiles": ["~/.claude/stackmemory-mcp.json"]
  }
}

Claude Code sessions automatically capture tool calls, maintain context across sessions, and sync with Linear when configured.

Available MCP tools in Claude Code:

Tool	Description
`get_context`	Retrieve relevant context for current work
`add_decision`	Record a decision with rationale
`start_frame`	Begin a new context frame
`close_frame`	Close current frame with summary
`create_task`	Create a new task
`update_task_status`	Update task status
`get_active_tasks`	List active tasks (with filters)
`get_task_metrics`	Get task analytics
`linear_sync`	Sync with Linear
`linear_update_task`	Update Linear issue
`linear_get_tasks`	Get tasks from Linear

2. Open-Source Local Mode

Step 1: Clone

git clone https://github.com/stackmemory/stackmemory
cd stackmemory

Step 2: Run local MCP server

cargo run --bin stackmemory-mcp
# or
npm run dev

This creates:

.memory/
  └── memory.db   # SQLite

All project memory lives locally.

Step 3: Point your editor to local MCP

{
  "tools": {
    "stackmemory": {
      "command": "stackmemory-mcp",
      "args": ["--local"]
    }
  }
}

How it works

Each interaction: ingests events → updates indices → retrieves relevant context → returns sized bundle.

Example MCP response (simplified)

{
  "hot_stack": [
    { "frame": "Debug auth redirect", "constraints": [...] }
  ],
  "anchors": [
    { "type": "DECISION", "text": "Use SameSite=Lax cookies" }
  ],
  "relevant_digests": [
    { "frame": "Initial auth refactor", "summary": "..." }
  ],
  "pointers": [
    "s3://logs/auth-test-0421"
  ]
}

Storage & limits

Two-Tier Storage System (v0.3.15+)

StackMemory implements an intelligent two-tier storage architecture:

Local Storage Tiers

Young (<24h): Uncompressed, complete retention in memory/Redis
Mature (1-7d): LZ4 compression (~2.5x), selective retention
Old (7-30d): ZSTD compression (~3.5x), critical data only

Remote Storage

Archive (>30d): Infinite retention in S3 + TimeSeries DB
Migration: Automatic background migration based on age, size, and importance
Offline Queue: Persistent retry logic for failed uploads

Free tier (hosted)

1 project
Up to 2GB local storage
Up to 100GB retrieval egress / month

Paid tiers

Per-project pricing
Unlimited storage + retrieval
Team sharing
Org controls
Custom retention policies

No seat-based pricing.

Claude Code Integration

StackMemory can automatically save context when using Claude Code, so your AI assistant has access to previous context and decisions.

Quick Setup

# Add alias
echo 'alias claude="~/Dev/stackmemory/scripts/claude-code-wrapper.sh"' >> ~/.zshrc
source ~/.zshrc

# Use: claude (saves context on exit)

Integration Methods

# 1. Shell wrapper (recommended)
claude [--auto-sync] [--sync-interval=10]

# 2. Linear auto-sync daemon
./scripts/linear-auto-sync.sh start [interval]

# 3. Background daemon
./scripts/stackmemory-daemon.sh [interval] &

# 4. Git hooks
./scripts/setup-git-hooks.sh

Features: Auto-save on exit, Linear sync, runs only in StackMemory projects, configurable sync intervals.

RLM (Recursive Language Model) Orchestration

StackMemory includes an advanced RLM system that enables handling arbitrarily complex tasks through recursive decomposition and parallel execution using Claude Code's Task tool.

Key Features

Recursive Task Decomposition: Breaks complex tasks into manageable subtasks
Parallel Subagent Execution: Run multiple specialized agents concurrently
8 Specialized Agent Types: Planning, Code, Testing, Linting, Review, Improve, Context, Publish
Multi-Stage Review: Iterative improvement cycles with quality scoring (0-1 scale)
Automatic Test Generation: Unit, integration, and E2E test creation
Full Transparency: Complete execution tree visualization

Usage

# Basic usage
stackmemory skills rlm "Your complex task description"

# With options
stackmemory skills rlm "Refactor authentication system" \
  --max-parallel 8 \
  --review-stages 5 \
  --quality-threshold 0.9 \
  --test-mode all

# Examples
stackmemory skills rlm "Generate comprehensive tests for API endpoints"
stackmemory skills rlm "Refactor the entire authentication system to use JWT"
stackmemory skills rlm "Build, test, and publish version 2.0.0"

Configuration Options

Option	Description	Default
`--max-parallel`	Maximum concurrent subagents	5
`--max-recursion`	Maximum recursion depth	4
`--review-stages`	Number of review iterations	3
`--quality-threshold`	Target quality score (0-1)	0.85
`--test-mode`	Test generation mode (unit/integration/e2e/all)	all
`--verbose`	Show all recursive operations	false

How It Works

Task Decomposition: Planning agent analyzes the task and creates a dependency graph
Parallel Execution: Independent subtasks run concurrently up to the parallel limit
Review Cycle: Review agents assess quality, improve agents implement fixes
Test Generation: Testing agents create comprehensive test suites
Result Aggregation: All outputs are combined into a final deliverable

Note: RLM requires Claude Code Max plan for unlimited subagent execution. In development mode, it uses mock responses for testing.

Guarantees & Non-goals

Guarantees: Lossless storage, project isolation, survives session/model switches, inspectable local mirror.

Non-goals: Chat UI, vector DB replacement, tool runtime, prompt framework.

CLI Commands

# Core
stackmemory init                          # Initialize project
stackmemory status                        # Current status
stackmemory progress                      # Recent activity

# Tasks
stackmemory tasks list [--status pending] # List tasks
stackmemory task add "title" --priority high
stackmemory task done <id>

# Search & Logs
stackmemory search "query" [--tasks|--context]
stackmemory log [--follow] [--type task]

# Context
stackmemory context show [--verbose]
stackmemory context push "name" --type task
stackmemory context add decision "text"
stackmemory context pop [--all]

# Linear Integration
stackmemory linear setup                  # OAuth setup
stackmemory linear sync [--direction from_linear]
stackmemory linear auto-sync --start
stackmemory linear update ENG-123 --status done

# Storage Management
stackmemory storage status               # Show tier distribution
stackmemory storage migrate [--tier young] # Trigger migration
stackmemory storage cleanup --force      # Clean old data
stackmemory storage config --show        # Show configuration

# Analytics & Server
stackmemory analytics [--view|--port 3000]
stackmemory mcp-server [--port 3001]

Status

Hosted: Private beta
OSS mirror: Production ready
MCP integration: Stable
CLI: v0.3.16 - Full task, context, Linear, and storage management
Two-tier storage: Complete
Test Suite: 296 tests passing

Changelog

v0.3.16 (2026-01-15)

✅ Fixed critical error handling - getFrame() returns undefined instead of throwing
✅ Improved test coverage and fixed StackMemoryError constructor usage
✅ Removed dangerous secret-cleaning scripts from repository
✅ All tests passing, lint clean, build successful
✅ Published to npm with production-ready improvements

v0.3.15 (2026-01-14)

✅ Two-tier storage system implementation complete
✅ Smart compression with LZ4/ZSTD support
✅ Background migration with configurable triggers
✅ Improved Linear integration with bidirectional sync

Roadmap

Phase 4 (Completed): Two-tier storage system with local tiers and infinite remote storage Phase 5 (Next): PostgreSQL production adapter, enhanced team collaboration, advanced analytics
Phase 3: Team collaboration, shared stacks, frame handoff
Phase 4: Two-tier storage, enterprise features, cost optimization

Name		Name	Last commit message	Last commit date
Latest commit History 190 Commits
.claude		.claude
.github/workflows		.github/workflows
.husky		.husky
.opencode/command		.opencode/command
.ralph		.ralph
archive/linear-cleanup-2026		archive/linear-cleanup-2026
bin		bin
bjarne		bjarne
config		config
docs		docs
packages		packages
scripts		scripts
src		src
templates/claude-hooks		templates/claude-hooks
test-bjarne-complex		test-bjarne-complex
test-bjarne-simple		test-bjarne-simple
test-bjarne		test-bjarne
test-results		test-results
test-zeroshot		test-zeroshot
test		test
zeroshot-main		zeroshot-main
.claude-precommit		.claude-precommit
.dockerignore		.dockerignore
.env.example		.env.example
.env.railway		.env.railway
.eslintignore		.eslintignore
.gitignore		.gitignore
.lint-fix-log.json		.lint-fix-log.json
.node-version		.node-version
.npmignore		.npmignore
.npmrc		.npmrc
.nvmrc		.nvmrc
.prettierrc		.prettierrc
.puppeteerrc.json		.puppeteerrc.json
3-tier-data-handling-feature.md		3-tier-data-handling-feature.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
RALPH_INTEGRATION_SUMMARY.md		RALPH_INTEGRATION_SUMMARY.md
README.md		README.md
RELEASE_NOTES.md		RELEASE_NOTES.md
RLM_LINEAR_WORKFLOW.md		RLM_LINEAR_WORKFLOW.md
SPEC.md		SPEC.md
api-feature.md		api-feature.md
claude-api-wrapper.cjs		claude-api-wrapper.cjs
claude-api-wrapper.js		claude-api-wrapper.js
esbuild.config.js		esbuild.config.js
eslint.config.js		eslint.config.js
mcp_review_config.json		mcp_review_config.json
nixpacks.toml		nixpacks.toml
opencode.jsonc		opencode.jsonc
package-lock.json		package-lock.json
package.json		package.json
railway.json		railway.json
railway.toml		railway.toml
runway.yaml		runway.yaml
simple-feature.md		simple-feature.md
stackmemory.json		stackmemory.json
test-idea.md		test-idea.md
test-tier-migration.js		test-tier-migration.js
tool-comparison-results.md		tool-comparison-results.md
tsconfig.build.json		tsconfig.build.json
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Folders and files

Latest commit

History

Repository files navigation

StackMemory

Why StackMemory exists

Core concepts (quick mental model)

Hosted vs Open Source

Hosted (default)

Open-source local mirror

How it integrates

Product Health Metrics

Current Status (v0.3.16)

Quality Score: 72/100

Next Sprint Priorities

QuickStart

1. Hosted (Recommended)

Step 1: Create a project

Step 2: Install StackMemory

Step 3: Setup Claude Code Integration (Automated)

2. Open-Source Local Mode

Step 1: Clone

Step 2: Run local MCP server

Step 3: Point your editor to local MCP

How it works

Example MCP response (simplified)

Storage & limits

Two-Tier Storage System (v0.3.15+)

Local Storage Tiers

Remote Storage

Free tier (hosted)

Paid tiers

Claude Code Integration

Quick Setup

Integration Methods

RLM (Recursive Language Model) Orchestration

Key Features

Usage

Configuration Options

How It Works

Guarantees & Non-goals

CLI Commands

Status

Changelog

v0.3.16 (2026-01-15)

v0.3.15 (2026-01-14)

Roadmap

License

Additional Resources

ML System Design

Documentation

Husky fix successful

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages