One-click Python tool that removes timestamps, duration markers, and blank lines from pasted YouTube transcripts.
When downloading long course transcripts, timestamps like 0:00 or 1:23:45 make the text hard to study. This script cleans it quickly into readable paragraphs.
- Removes timestamps (0:00, 1:23:45, etc.)
- Removes durations ("1 minute", "8 seconds", etc.)
- Removes blank lines and extra spaces
- Keeps paragraph structure
- Python 3
re(regular expressions)- File I/O
- Put your raw transcript in
raw_transcript.txt - Run:
- Turn into a CLI tool with arguments
- Support
.srtsubtitle files - Make a Streamlit web version