Tutorial 3: Visualization Dashboard

Learn how to visualize your analysis results using ContextLab's web UI and API.

Overview

ContextLab provides multiple ways to visualize your results:

  1. Web Dashboard: Interactive SvelteKit UI
  2. CLI Visualizations: Text-based tables and charts
  3. REST API: Programmatic access to visualization data

Starting the Dashboard

Method 1: Local Development

# Terminal 1: Start API
make api
# or
uvicorn api.main:app --reload

# Terminal 2: Start Web UI
cd web
npm install
npm run dev

Visit http://localhost:5173

Method 2: Docker

# Build and run
docker-compose up

# Or individually
make docker-api
make docker-web

Dashboard Features

Home Page

The dashboard home shows:

  • Stats Cards: Total runs, chunks, and tokens
  • Recent Runs: Quick access to latest analyses
  • Feature Overview: Links to tutorials

Runs List

View all analysis runs at /runs:

  • Sortable table of runs
  • Filter by model, date, or tokens
  • Quick actions: View details, Delete

Run Detail Page

View detailed analysis at /runs/{run_id}:

1. Embedding Visualization

Interactive scatter plot showing:

  • 2D projection of embedding space (UMAP)
  • Points colored by redundancy score
  • Hover for chunk details

Interpretation:

  • Clusters indicate similar content
  • Isolated points are unique chunks
  • Color gradient shows redundancy (purple = high, yellow = low)

2. Token Timeline

Dual-axis chart showing:

  • Bar chart: Token distribution per chunk
  • Line chart: Salience scores over time

Interpretation:

  • Peaks in salience indicate important sections
  • Consistent token counts suggest uniform chunking
  • Dips may indicate section boundaries
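The same dual-axis view can be reproduced outside the browser with Matplotlib. A minimal sketch (the two input lists stand in for the per-chunk `tokens` and `salience` values you would pull from a run, as in the DataStore examples later in this tutorial):

```python
import matplotlib
matplotlib.use("Agg")  # headless-safe backend
import matplotlib.pyplot as plt

def plot_token_timeline(tokens, saliences, path="timeline.png"):
    """Bar chart of tokens per chunk with salience overlaid on a second y-axis."""
    fig, ax1 = plt.subplots(figsize=(8, 3))
    ax1.bar(range(len(tokens)), tokens, color="steelblue")
    ax1.set_xlabel("Chunk index")
    ax1.set_ylabel("Tokens", color="steelblue")
    ax2 = ax1.twinx()  # second y-axis sharing the same x-axis
    ax2.plot(range(len(saliences)), saliences, color="darkorange", marker="o")
    ax2.set_ylabel("Salience", color="darkorange")
    fig.tight_layout()
    fig.savefig(path)
    plt.close(fig)
    return path

plot_token_timeline([150, 90, 210, 120], [0.4, 0.8, 0.3, 0.6])
```

Peaks in the orange line against the blue bars make salient-but-short chunks easy to spot.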

3. Chunks Table

Searchable, sortable table of all chunks:

  • Click column headers to sort
  • Search by ID, text, or source
  • Color-coded salience/redundancy badges

CLI Visualization

For headless environments or quick checks:

List All Runs

contextlab viz

Output:

┌─────────┬───────────────┬────────┬────────┬──────────────────┐
│ Run ID  │ Model         │ Chunks │ Tokens │ Timestamp        │
├─────────┼───────────────┼────────┼────────┼──────────────────┤
│ a1b2c3d │ gpt-4o-mini   │ 45     │ 8234   │ 2025-01-15 10:30 │
│ e4f5g6h │ gpt-4         │ 23     │ 12450  │ 2025-01-14 15:20 │
└─────────┴───────────────┴────────┴────────┴──────────────────┘

View Run Details

contextlab viz <run_id> --headless

Output:

Run: a1b2c3d
├─ Model: gpt-4o-mini
├─ Chunks: 45
├─ Total Tokens: 8234
└─ Timestamp: 2025-01-15 10:30

Redundancy Distribution:
0.0-0.2: ████████████████ (20)
0.2-0.4: ██████████ (12)
0.4-0.6: █████ (7)
0.6-0.8: ███ (4)
0.8-1.0: ██ (2)
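The histogram above is just equal-width binning over [0, 1]. A minimal sketch that computes and renders the same buckets from a list of redundancy scores:

```python
def redundancy_histogram(scores, n_bins=5):
    """Count scores into equal-width bins over [0, 1], like the CLI bars."""
    counts = [0] * n_bins
    for s in scores:
        # Clamp 1.0 into the last bin so the top edge is inclusive
        idx = min(int(s * n_bins), n_bins - 1)
        counts[idx] += 1
    return counts

def render(counts):
    width = 1.0 / len(counts)
    for i, c in enumerate(counts):
        lo, hi = i * width, (i + 1) * width
        print(f"{lo:.1f}-{hi:.1f}: {'█' * c} ({c})")

render(redundancy_histogram([0.05, 0.1, 0.3, 0.55, 0.9, 0.95]))
```

A left-heavy histogram (most scores below 0.2) means little redundant content; mass on the right suggests the context is a candidate for compression.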

Top 5 Most Salient Chunks:
1. chunk_12: salience=0.892
   "Machine learning algorithms require..."

2. chunk_7: salience=0.765
   "Deep neural networks consist of..."

Using the REST API

Fetch Run List

curl http://localhost:8000/api/viz/runs?limit=10

Response:

{
  "runs": [
    {
      "run_id": "a1b2c3d",
      "model": "gpt-4o-mini",
      "num_chunks": 45,
      "total_tokens": 8234,
      "timestamp": "2025-01-15T10:30:00"
    }
  ]
}
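The same endpoint can be called from Python with only the standard library. A sketch (the field names follow the example response above; the API must be running locally):

```python
import json
from urllib.request import urlopen

API = "http://localhost:8000"

def fetch_runs(limit=10):
    """GET the run list and return the parsed 'runs' array."""
    with urlopen(f"{API}/api/viz/runs?limit={limit}", timeout=10) as resp:
        return json.load(resp)["runs"]

def summarize(runs):
    """One line per run, mirroring the CLI table."""
    return [
        f"{r['run_id']}  {r['model']}  chunks={r['num_chunks']}  tokens={r['total_tokens']}"
        for r in runs
    ]

# With the API running locally:
#   for line in summarize(fetch_runs()):
#       print(line)
```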

Fetch Run Details

curl http://localhost:8000/api/viz/runs/a1b2c3d

Fetch Embeddings

curl http://localhost:8000/api/viz/runs/a1b2c3d/embeddings

Response:

{
  "run_id": "a1b2c3d",
  "embeddings": [[0.1, 0.2, ...], [0.3, 0.4, ...]],
  "metadata": [
    {
      "id": "chunk_0",
      "tokens": 150,
      "salience": 0.65,
      "redundancy": 0.23
    }
  ]
}
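This payload maps directly onto the dashboard's scatter plot. A minimal sketch that splits it into plotting arrays, assuming the first two components of each embedding are the 2D projection:

```python
def to_scatter(payload):
    """Split an embeddings payload into x, y, and color (redundancy) arrays."""
    xs = [e[0] for e in payload["embeddings"]]
    ys = [e[1] for e in payload["embeddings"]]
    colors = [m["redundancy"] for m in payload["metadata"]]
    return xs, ys, colors

payload = {
    "run_id": "a1b2c3d",
    "embeddings": [[0.1, 0.2], [0.3, 0.4]],
    "metadata": [
        {"id": "chunk_0", "redundancy": 0.23},
        {"id": "chunk_1", "redundancy": 0.87},
    ],
}
xs, ys, colors = to_scatter(payload)
# Plot with matplotlib, e.g.:
#   plt.scatter(xs, ys, c=colors, cmap="viridis_r")
```

A reversed viridis colormap matches the dashboard's convention (purple = high redundancy, yellow = low).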

Custom Visualizations

Using Python + Matplotlib

import matplotlib.pyplot as plt
from contextlab.io.ds import DataStore

store = DataStore()
_, chunks = store.load_run("a1b2c3d")

# Salience distribution
saliences = [c.salience for c in chunks]
plt.hist(saliences, bins=20)
plt.xlabel("Salience")
plt.ylabel("Frequency")
plt.title("Salience Distribution")
plt.show()

Using Pandas

import pandas as pd
from contextlab.io.ds import DataStore

store = DataStore()
_, chunks = store.load_run("a1b2c3d")

# Create DataFrame
df = pd.DataFrame([
    {
        "id": c.id,
        "tokens": c.tokens,
        "salience": c.salience,
        "redundancy": c.redundancy,
        "source": c.source
    }
    for c in chunks
])

# Analyze
print(df.describe())
print("\nTop 10 by salience:")
print(df.nlargest(10, "salience"))

Exporting Data

# Export to CSV
df.to_csv("analysis_results.csv", index=False)

# Export to JSON
import json
with open("analysis_results.json", "w") as f:
    json.dump([c.model_dump() for c in chunks], f, indent=2)

Advanced: Real-time Monitoring

Create a monitoring dashboard that updates as new runs complete:

from contextlab.io.ds import DataStore
import time

store = DataStore()

def monitor_runs(interval=5):
    seen_ids = set()

    while True:
        runs = store.list_runs(limit=10)
        new_runs = [r for r in runs if r.run_id not in seen_ids]

        for run in new_runs:
            print(f"New run detected: {run.run_id}")
            print(f"  Chunks: {run.num_chunks}, Tokens: {run.total_tokens}")
            seen_ids.add(run.run_id)

        time.sleep(interval)

# Blocks until interrupted; run in a separate terminal, process, or thread
monitor_runs()

Troubleshooting

Web UI not loading

  1. Check API is running: curl http://localhost:8000/health
  2. Check web dev server: cd web && npm run dev
  3. Check browser console for errors

Empty visualizations

  1. Verify run exists: contextlab viz
  2. Check embeddings were generated: curl http://localhost:8000/api/viz/runs/{run_id}/embeddings
  3. Re-run the analysis with the --mock flag to test

API connection errors

  1. Check CORS settings in api/main.py
  2. Verify proxy config in web/vite.config.ts
  3. Check firewall settings
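If the browser blocks requests from the dev server, the API must explicitly allow the web origin. A sketch of what such a setting could look like in api/main.py, using FastAPI's CORSMiddleware (the exact origins and app setup in your project may differ):

```python
from fastapi import FastAPI
from fastapi.middleware.cors import CORSMiddleware

app = FastAPI()
app.add_middleware(
    CORSMiddleware,
    allow_origins=["http://localhost:5173"],  # SvelteKit dev server
    allow_methods=["*"],
    allow_headers=["*"],
)
```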

Best Practices

  1. Use the web UI for exploration: Interactive charts are best for understanding patterns
  2. Use CLI for automation: Integrate into scripts and pipelines
  3. Use API for integration: Build custom dashboards or export data
  4. Export important runs: Save visualizations as PNG/PDF before cleanup
  5. Monitor storage: Clean up old runs periodically

Next Steps

  • Explore API Reference
  • Build custom compression strategies
  • Integrate ContextLab into your LLM application