The Video Notation Schema mirrors a professional video production pipeline. Below is an overview of the key schema sections, their intended use, and practical guidance for integrating them into your workflow.
Establishes foundational information about the project.
Key Fields:
- title: Name of the project
- duration_seconds: Overall duration
- video_type: Format (e.g., advertisement, short film, educational)
- aspect_ratio, frame_rate, resolution: Technical specs
- language, target_audience, keywords: Contextual metadata
- accessibility: Includes captions, audio descriptions, flashing/motion warnings
Tip: Define this section first, since it sets the global context the AI model uses to interpret everything that follows.
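As a minimal sketch, the metadata block might look like the following when built in Python and serialized to JSON. The field names come from the list above; the wrapper key `metadata`, the value formats (such as `"16:9"` or `"1920x1080"`), and the nested shape of `accessibility` are assumptions for illustration, not taken from the schema itself.

```python
import json

# Illustrative metadata block. Field names follow the list above;
# the "metadata" wrapper key and every value are assumptions.
metadata = {
    "title": "Morning Coffee Ritual",
    "duration_seconds": 30,
    "video_type": "advertisement",
    "aspect_ratio": "16:9",
    "frame_rate": 24,
    "resolution": "1920x1080",
    "language": "en",
    "target_audience": "adults 25-40",
    "keywords": ["coffee", "morning", "lifestyle"],
    # Assumed shape: one boolean flag per accessibility feature.
    "accessibility": {
        "captions": True,
        "audio_descriptions": False,
        "flashing_warning": False,
        "motion_warning": False,
    },
}

document = {"metadata": metadata}

# Serialize to JSON text, ready to be saved or extended with further sections.
metadata_json = json.dumps(document, indent=2)
print(metadata_json)
```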
Defines consistent, reusable character profiles for appearance, voice, and behaviour.
Key Fields (per character):
- character_id, name, appearance, wardrobe
- age_range, gender, ethnicity_descriptor
- personality_traits, vocal_description, accent
Overrides: Scene-level details like wardrobe_override, action_details, or behaviour supplement or replace global character traits.
Tip: Define once, and reference throughout for consistent rendering.
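The define-once, override-per-scene idea can be sketched as a small merge step. This is an illustration under stated assumptions, not the schema's documented resolution logic: the helper `resolve_character` is hypothetical, and it assumes that a scene entry points at a profile through `character_id`, that `wardrobe_override` replaces `wardrobe`, and that other scene keys such as `action_details` supplement the profile.

```python
# Global character profile, defined once. Field names follow the list above;
# the values are invented for illustration.
characters = {
    "char_barista": {
        "character_id": "char_barista",
        "name": "Maya",
        "appearance": "short dark hair, warm smile",
        "wardrobe": "green apron over a white shirt",
        "personality_traits": ["friendly", "attentive"],
        "vocal_description": "soft, upbeat",
        "accent": "neutral",
    }
}


def resolve_character(scene_subject, characters):
    """Hypothetical helper: layer a scene-level entry on top of its global profile."""
    # Copy the profile so the global definition stays untouched.
    profile = dict(characters[scene_subject["character_id"]])
    for key, value in scene_subject.items():
        if key == "wardrobe_override":
            # Assumed rule: the override replaces the global wardrobe.
            profile["wardrobe"] = value
        else:
            # Assumed rule: any other scene key supplements the profile.
            profile[key] = value
    return profile


scene_subject = {
    "character_id": "char_barista",
    "wardrobe_override": "black evening uniform",
    "action_details": "pours latte art while greeting a customer",
}

resolved = resolve_character(scene_subject, characters)
```

Copying the profile before merging keeps the global definition reusable by later scenes, which is the point of defining a character only once.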
Reusable definitions that simplify complex projects.
Physical items used across scenes. Reference them in scenes with prop_id_reference.
Reusable audio (ambient, sound effects, music loops). Reference with audio_id_reference.
Non-character elements (e.g., animals, scenery). Referenced via subject_id_reference.
Predefined motion graphics (e.g., infographic, animated_logo). Customise with overrides.
Reusable lighting styles (e.g., golden_hour_window). Referenced with preset_id_reference.
Reusable emotional tones (e.g., calm_inviting_tone). Also referenced via preset_id_reference.
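A rough sketch of how such references could be resolved against the reusable definitions is shown here. The reference keys `audio_id_reference` and `preset_id_reference` come from the text above; the library section names (`audio`, `lighting_presets`), the `audio_id` and `preset_id` keys, and the `lookup` helper are assumptions made for the example.

```python
# Hypothetical library of reusable definitions. Section names and the
# "audio_id" / "preset_id" keys are assumptions, not confirmed schema fields.
library = {
    "audio": [
        {
            "audio_id": "cafe_ambience",
            "type": "ambient",
            "description": "low chatter, cups clinking",
        },
    ],
    "lighting_presets": [
        {
            "preset_id": "golden_hour_window",
            "description": "warm, low-angle light through a window",
        },
    ],
}


def lookup(definitions, id_key, wanted_id):
    """Return the definition whose id_key equals wanted_id, or raise KeyError."""
    for definition in definitions:
        if definition[id_key] == wanted_id:
            return definition
    raise KeyError(f"no definition with {id_key}={wanted_id!r}")


# A scene fragment that points at the library instead of repeating details.
scene = {
    "audio": {"audio_id_reference": "cafe_ambience"},
    "cinematography": {"preset_id_reference": "golden_hour_window"},
}

ambience = lookup(
    library["audio"], "audio_id", scene["audio"]["audio_id_reference"]
)
lighting = lookup(
    library["lighting_presets"],
    "preset_id",
    scene["cinematography"]["preset_id_reference"],
)
```

Failing loudly on an unknown id catches typos in references before the document is sent to a model.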
Sets overarching artistic direction for the video.
Key Subsections:
- shot: Camera settings (angle, motion, focus, lens)
- subject: Styling for background or unnamed subjects
- cinematography: Lighting/tone (can reference presets)
- audio: Ambient, music, dialogue defaults
- color_palette: Defines visual tone (hex codes or themes)
Tip: Define after characters to unify visual/auditory tone across scenes.
Defines an ordered array of narrative or visual moments.
Key Fields:
- scene_id, narrative_role, duration_seconds
- location, time_of_day, environment
- subjects: Characters/objects present and their actions
- visual_details: Props, motion graphics, timing breakdowns
- audio: Dialogue, ambient overrides, sound effects
- transition_to_next_scene: Style and logic of scene cuts
Overrides: Use shot_overrides, cinematography_overrides, and scene-specific settings to adapt global styles.
Tip: Compose scene-by-scene using previous definitions as a base layer.
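The base-layer idea can be illustrated with a small merge of scene overrides onto the global style. This is a sketch under assumptions: the `effective_style` helper is hypothetical, the wrapper name `global_style` and the sub-keys inside `shot` are invented for the example, and the merge rule (override keys replace matching global keys, everything else is inherited) is one plausible reading rather than documented behaviour.

```python
import copy

# Global defaults. "shot" and "cinematography" are named in the text above;
# the wrapper name and the individual values are illustrative.
global_style = {
    "shot": {"angle": "eye_level", "motion": "static", "lens": "35mm"},
    "cinematography": {"preset_id_reference": "golden_hour_window"},
}

# A scene that changes only the camera motion.
scene = {
    "scene_id": "scene_02",
    "narrative_role": "climax",
    "duration_seconds": 8,
    "shot_overrides": {"motion": "slow_push_in"},
    "cinematography_overrides": {},
}


def effective_style(global_style, scene):
    """Hypothetical helper: apply a scene's overrides on top of the global style."""
    # Deep copy so nested dicts in the global style are never mutated.
    style = copy.deepcopy(global_style)
    style["shot"].update(scene.get("shot_overrides", {}))
    style["cinematography"].update(scene.get("cinematography_overrides", {}))
    return style


style = effective_style(global_style, scene)
```

Here the scene inherits the angle, lens, and lighting preset from the global style and replaces only `motion`.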
Enables human-readable production and post-production notes.
Key Fields:
- direction_notes: Director-style creative instructions
- post_production: VFX, editing, sound mixing
- production_assets: Linked image/audio references or concepts
Tip: Use for coordination across creative teams.
- The JSON can be used as a prompt for any text-to-video AI model.
- Visual Studio Code + JSON/YAML plugins
- Online JSON Schema Validators
- Python: jsonschema, pydantic
- JavaScript/TypeScript: ajv, zod, joi
- Go: gojsonschema
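The libraries listed above handle full schema validation. As a lighter illustration using only the Python standard library, the sketch below checks that a document contains a set of top-level sections and then serializes it to the JSON text that would be passed to a model as the prompt. The section names in `REQUIRED_SECTIONS` and both helper functions are assumptions; the actual schema may name or require its sections differently.

```python
import json

# Assumed top-level section names; adjust to match the actual schema.
REQUIRED_SECTIONS = ("metadata", "characters", "global_style", "scenes")


def missing_sections(document):
    """Return the assumed required sections that are absent from the document."""
    return [name for name in REQUIRED_SECTIONS if name not in document]


def to_prompt(document):
    """Serialize the document to JSON text, refusing incomplete documents."""
    missing = missing_sections(document)
    if missing:
        raise ValueError(f"missing sections: {', '.join(missing)}")
    # ensure_ascii=False keeps non-English dialogue readable in the prompt.
    return json.dumps(document, indent=2, ensure_ascii=False)


document = {
    "metadata": {"title": "Morning Coffee Ritual", "duration_seconds": 30},
    "characters": [{"character_id": "char_barista", "name": "Maya"}],
    "global_style": {"shot": {"angle": "eye_level"}},
    "scenes": [{"scene_id": "scene_01", "duration_seconds": 30}],
}

prompt = to_prompt(document)
```

A check like this does not replace schema validation; it only catches a structurally incomplete document before it is sent.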
The Video Notation Schema provides a structured, extensible way to craft AI video prompts with precision and reusability. From granular camera work to production notes, it enables creators to produce consistent, high-quality outputs at scale.