Agentic learning system that turns any content into personalized, multi-modal learning materials.
Drop a URL. Get a study guide, flashcards, quizzes, and a "best-of" video with only the high-signal clips.
Learning from YouTube videos and online content is slow and fragmented:
- Find video
- Watch entire thing (including fluff)
- Copy URL to transcriber
- Paste transcript to LLM
- Wrestle with prompts
- Copy output to audio generator
- Wait for TTS
- Listen
- Take notes manually
- Organize notes somewhere
- Forget 80% in a week
Drop URL → Click "Learn" → Done
LearnFlow handles everything:
- Ingest: Download and transcribe with timestamps
- Distill: Remove fluff, preserve every insight
- Teach: Generate study guides, quizzes, flashcards, audio
- Edit: Extract best clips, repeat at 2x for memorization
- Store: RAG database for instant recall
- Reinforce: Spaced repetition keeps it in your head
- YouTube URL input (batch supported)
- Article and PDF processing
- Automatic format detection
- Removes filler and hype
- Preserves: examples, steps, definitions, edge cases
- Timestamp citations back to source
- Study guides (Markdown)
- Flashcards (Anki-compatible)
- Quizzes (auto-generated)
- Audio narratives (ElevenLabs TTS)
- Diagrams and mind maps
- Extracts only high-signal clips
- Scores by: information density, novelty, actionability
- 2x speed + 4x repetition (configurable)
- Concatenates into single "best-of" video
- RAG database (ChromaDB)
- Semantic search across all learnings
- "Re-saturate" queries: "What was that thing about X?"
- Session history and categorization
- Trust: Pure distillation (fast)
- Validate: Research agent verifies claims, appends corrections
learnflow-ai/
├── docs/
│ ├── FEATURES.md # Full feature specification
│ ├── GOALS.md # Roadmap and success criteria
│ ├── architecture/
│ │ └── ARCHITECTURE.md # System design
│ └── research/
│ └── RESEARCH-DIGEST.md # Curated research findings
├── src/
│ ├── agents/ # LangGraph agents
│ ├── pipelines/ # Processing pipelines
│ └── integrations/ # External service connectors
├── config/ # Configuration files
└── scripts/ # CLI tools
| Component | Technology |
|---|---|
| Orchestration | LangGraph |
| LLM | Claude / GPT-4 |
| Transcription | Whisper (timestamped) |
| Video Download | yt-dlp |
| Video Editing | ffmpeg |
| TTS | ElevenLabs |
| RAG | ChromaDB |
| Storage | Local + S3 |
# Clone and set up environment
git clone https://github.com/Lucface/learnflow-ai.git
cd learnflow-ai
# Create Python environment
uv venv
source .venv/bin/activate
# Install dependencies (when available)
uv pip install -e .Implementation in progress. See Goals & Roadmap for MVP timeline.
- Features - Complete feature specification
- Goals & Roadmap - MVP definition and success criteria
- Architecture - System design and data models
- Research - Curated findings from related projects
Phase: Research & Design Complete
Next: MVP 1 implementation (core pipeline)
Key repos that informed this design:
- StudentTraineeCenter/edu-agent - Adaptive study platform
- NirDiamant/GenAI_Agents - LangGraph patterns
- jdepoix/youtube-transcript-api - Transcript extraction