LearnFlow AI

Agentic learning system that turns any content into personalized, multi-modal learning materials.

Drop a URL. Get a study guide, flashcards, quizzes, and a "best-of" video with only the high-signal clips.

The Problem

Learning from YouTube videos and online content is slow and fragmented:

Find video
Watch entire thing (including fluff)
Copy URL to transcriber
Paste transcript to LLM
Wrestle with prompts
Copy output to audio generator
Wait for TTS
Listen
Take notes manually
Organize notes somewhere
Forget 80% in a week

The Solution

Drop URL → Click "Learn" → Done

LearnFlow handles everything:

Ingest: Download and transcribe with timestamps
Distill: Remove fluff, preserve every insight
Teach: Generate study guides, quizzes, flashcards, audio
Edit: Extract best clips, repeat at 2x for memorization
Store: RAG database for instant recall
Reinforce: Spaced repetition keeps it in your head

Features

Content Acquisition

YouTube URL input (batch supported)
Article and PDF processing
Automatic format detection

Lossless Distillation

Removes filler and hype
Preserves: examples, steps, definitions, edge cases
Timestamp citations back to source

Multi-Modal Output

Study guides (Markdown)
Flashcards (Anki-compatible)
Quizzes (auto-generated)
Audio narratives (ElevenLabs TTS)
Diagrams and mind maps

Video Editing

Extracts only high-signal clips
Scores by: information density, novelty, actionability
2x speed + 4x repetition (configurable)
Concatenates into single "best-of" video

Knowledge Management

RAG database (ChromaDB)
Semantic search across all learnings
"Re-saturate" queries: "What was that thing about X?"
Session history and categorization

Trust vs Validate Modes

Trust: Pure distillation (fast)
Validate: Research agent verifies claims, appends corrections

Project Structure

learnflow-ai/
├── docs/
│   ├── FEATURES.md           # Full feature specification
│   ├── GOALS.md              # Roadmap and success criteria
│   ├── architecture/
│   │   └── ARCHITECTURE.md   # System design
│   └── research/
│       └── RESEARCH-DIGEST.md # Curated research findings
├── src/
│   ├── agents/               # LangGraph agents
│   ├── pipelines/            # Processing pipelines
│   └── integrations/         # External service connectors
├── config/                   # Configuration files
└── scripts/                  # CLI tools

Tech Stack

Component	Technology
Orchestration	LangGraph
LLM	Claude / GPT-4
Transcription	Whisper (timestamped)
Video Download	yt-dlp
Video Editing	ffmpeg
TTS	ElevenLabs
RAG	ChromaDB
Storage	Local + S3

Quick Start

# Clone and set up environment
git clone https://github.com/Lucface/learnflow-ai.git
cd learnflow-ai

# Create Python environment
uv venv
source .venv/bin/activate

# Install dependencies (when available)
uv pip install -e .

Implementation in progress. See Goals & Roadmap for MVP timeline.

Documentation

Features - Complete feature specification
Goals & Roadmap - MVP definition and success criteria
Architecture - System design and data models
Research - Curated findings from related projects

Status

Phase: Research & Design Complete

Next: MVP 1 implementation (core pipeline)

Related Projects

Key repos that informed this design:

StudentTraineeCenter/edu-agent - Adaptive study platform
NirDiamant/GenAI_Agents - LangGraph patterns
jdepoix/youtube-transcript-api - Transcript extraction

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
docs		docs
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LearnFlow AI

The Problem

The Solution

Features

Content Acquisition

Lossless Distillation

Multi-Modal Output

Video Editing

Knowledge Management

Trust vs Validate Modes

Project Structure

Tech Stack

Quick Start

Documentation

Status

Related Projects

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

LearnFlow AI

The Problem

The Solution

Features

Content Acquisition

Lossless Distillation

Multi-Modal Output

Video Editing

Knowledge Management

Trust vs Validate Modes

Project Structure

Tech Stack

Quick Start

Documentation

Status

Related Projects

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages