AI Video Studio
Locally Powered AI

Generate videos, talking avatars, images, voiceovers, and audiobooks — all running locally on your hardware. 5+ video engines, 6 lip sync engines, 5 TTS voices, zero cloud dependency.

🎬 5+ Video Models

🗣️ 6 Lip Sync Engines

🔊 5 TTS Engines

📖 6 Export Formats

Content Creation

Generate video, images, speech, and music with AI — from a single prompt to a finished production.

🎬

AI Video Generation

Multiple AI engines for image-to-video and text-to-video with draft and premium quality presets.

Create Video →

🖼️

AI Image Generation

Text-to-image, img2img, inpainting, background removal, watermark removal, and AI face enhancement.

Generate Image →

🗣️

Talking Avatars & Lip Sync

6 lip sync engines with automatic cascade fallback for realistic talking head videos.

Create Avatar →

🔊

Text-to-Speech

5 text-to-speech engines with 20+ voices and voice cloning support.

Generate Speech →

🧠

AI Narrative Engine

AI-powered story parsing with automatic scene splitting and vision analysis for context-aware generation.

Write Narrative →

🎵

Video Stitching & Music

Multi-segment concatenation, audio normalization, and background music mixing for polished final output.

Browse Music →

Publishing & Documents

From manuscript to finished book or marketing campaign — write, compile, and publish.

📖

Book Studio

Upload PDF, DOCX, EPUB, TXT, RTF, or HTML. Scrivener-style binder, Quill rich editor, audiobook generation, and compile to 6 export formats.

Open Studio →

📊

UGC Video Templates

9 ready-made templates — Hands Holding, Unboxing, Bedroom Review, Car Review, Lifestyle, Studio Shot, Close-up, Rotation, Comparison.

Browse Templates →

Developer Tools

A full IDE and training dashboard built into the platform.

💻

Code Editor (IDE)

Full IDE with 4-panel layout, AI coding agent, file explorer, integrated terminal, and diff preview.

Open Editor →

🧪

Training Dashboard

Real-time ML metrics with live charts, fine-tuning pipelines, auto-recovery, and AI-generated training reports.

Open Dashboard →

Platform

⚡

Hardware Accelerated

GPU-accelerated generation with BF16/TF32 precision and torch.compile acceleration.

🔒

100% Local & Private

Powered by Ollama + ComfyUI. No cloud services, no data leaves your machine, no subscription required. Full control over your AI stack.

5+ Video Models

6 Lip Sync Engines

5 TTS Engines

9 UGC Templates

Local GPU Compute

The AI video generator built for video creators

📷Drop image, click to upload, or paste from clipboard

Narrative Mode

Describe the Motion

Describe the motion, camera movement, or action you want to see. The AI will animate your image based on this description.

Add more to queue multiple videos that run back-to-back. Each populated description becomes one separate video.

Character Dialogue (Optional)

Start talking at:seconds into video

The character will speak this text with lip-synced animation.

TTS Engine

Voice

Lip Sync

Add voice narration

Background Music:

Checking memory...

Backend

Negative Prompt

Optional. Sent directly to Wan as negative_prompt and merged with Taleclip's Wan stability negatives.

Use raw prompt exactly
Check this when you pasted a prompt from Tools > Prompt. Skips the second I2V rewrite pass and sends your positive prompt directly to Wan.

Direct backend runs Wan 2.2-I2V locally via Diffusers. Use 40 steps / 5.5 guidance for action fidelity.

Keep WAN 2.2 warm for the next generations
Leaves the native WAN worker loaded so back-to-back 5s videos skip cold startup. Do not click Free Memory while batching.

Context Preview — edit AI-generated per-segment prompts before generating

Your creations will appear here

Generation History

▼

The AI image generator for video creators

Text to Image Image Edit Enhance Photo Watermark Removal

Upscale images using AI enhancement

🖼️

Click or drag images here

You can select multiple images

Scale Factor

2x is faster, 4x provides higher resolution output.

Remove watermarks from images using AI inpainting

🖼️

Drop image, click to upload, or paste from clipboard

Detection Mode

AI will automatically detect and remove watermarks

Removal Intensity

Medium works best for most watermarks. Use Aggressive for stubborn marks.

Enhance after removal (AI upscale)

Image Prompt

📐

Your images will appear here

AI Voiceover

Create talking avatars or generate audio from text

1. Choose Avatar

Primary Actor

Select a presenter for this voiceover.

Realistic actor catalog

Loading avatars...

Upload Custom Actor

🎭

Drop face image here or click to browse

Best results: clear front-facing photo with white or transparent background

2. Enter Text to Speak

3. Voice Settings

TTS Engine

Voice

Lip Sync Method

Default engine: Good quality, runs locally, no watermarks

Avatar & Voice

Primary Actor

Not selected

Choose an avatar to preview the presenter.

Selected actor will be used for talking-avatar generation.

Tips for Best Results

Use a clear, front-facing photo
Good lighting improves lip sync
Keep text under 500 characters
Default engine works best for most cases

Text to Speech

Generate natural-sounding voiceovers with AI

Enter Your Text

0 characters

Voice Settings

TTS Engine

Voice

Speed

Audio Result

Tips

VibeVoice offers the most natural sound
Use punctuation for natural pauses
Preview voice before generating
Edge TTS has more voice options

Enhance Photo

Upscale images using AI enhancement

🖼️

Click or drag images here

You can select multiple images

Scale Factor

2x is faster, 4x provides higher resolution output.

Target Dimensions

KDP Template PDF (optional)

Upload a KDP cover template PDF to match its exact page dimensions. This overrides width/height/DPI and outputs a print-ready PDF with identical MediaBox points.

Preset Size

Width (px)

Height (px)

Padding Background Color

Transparent

Output DPI

600 DPI recommended for print-ready book covers.

Output Format

PNG for lossless, TIFF/PDF for print shops. Auto uses PNG for transparent, JPG otherwise.

Enhance image before resizing (recommended)

Creative Assets

Videos

Click to play. Select multiple to delete.

No videos yet

Edited Photos

Photos edited with ChronoEdit. Select multiple to delete.

No edited photos yet

Generated Images

AI-generated images from Text-to-Image. Select multiple to delete.

No generated images yet

Merge Videos

Drag videos to reorder. They will be seamlessly stitched together.

🎬

Click or drag videos here

Enhance Your Prompt

Your Prompt

Enhancement Style

Detailed Concise Technical Creative

LLM Uncensored mode rewrites prompts as clear Wan 2.2-I2V-A14B motion instructions.

Enhanced Prompt

Negative Prompt

Project Context Optional

Load Context File

e.g., CLAUDE.md, README.md

OR

Analyze Folder

Recent Enhancements

No enhancement history yet

GPU Memory:

Loading...

🤖

UGC Video Templates

Select a template style for your product video

Loading templates...

Generate Video

Video Duration

Longer videos require multiple generation passes

Product Images (1 image required)

📷 Image 1 Click to upload

Drag & drop images or click slots to upload

Product Description

Recent Generations

No videos generated yet. Select a template above to get started.

LoRA Training

Idle

Started --

Elapsed --

ETA --

▼ Training Content

Loading training data info...

0%

System Memory 0 GB

Loading...

GPU Utilization 0%

0 F

Training Loss --

Learning Rate --

Progress 0/0 ep

0%

Phase0/8

Epoch0/0

Step0/0

Training Speed -- step/s

No reports yet

📓

Select or create a notebook to get started.

No company selected

What lede are you scouting?

Model Sources

Deep Scout Use browser fallback Require citations Local-only mode Completion chime

Deep Scout performs a more thorough source-backed search. It may take longer, but it can return richer evidence, stronger validation, and better company context. Use it when accuracy and completeness matter more than speed.

Companies—

Sources—

Artifacts—

Needs review—

Spark stack—

Companies

Click a row to select that company for the Sources / Jobs / Artifacts tabs.

ID	Name	Ticker	Sector	Stage	Updated

Sources for selected company

Paste source

Source type Title URL Source date Pasted text

Ingest URL

URL * Source type Title (optional)

Uses the same fetcher as Spark Scout — local, no paid APIs. Failed fetches are stored with status failed so you can retry.

Upload file (TXT, MD, DOCX up to 8 MB)

PDF parsing is not enabled in this build (pypdf not installed). Convert to text first.

Run Spark Scout for this company

Query * Mode Max sources Freshness (days)

After completion, switch to the Spark Scout Runs tab to import individual sources with per-source type mapping.

	Citation	Type	Origin	Title	URL	Quality	Words	Age	Created

Scout Jobs

ID	Mode	Status	Started	Completed	Error

Artifacts

ID	Type	Title	Validation	Approval	Confidence	Citations	Provider

Settings

Spark stack health

Loading…

SEC ticker lookup

Uses public SEC EDGAR ticker→CIK mapping (no paid APIs). If the network is unreachable, the lookup reports it cleanly and does not block the rest of the workspace.

SEC submissions (selected company)

Audit Log

Filtered to the selected company when one is chosen, otherwise shows recent events for your account.

Time	Action	Scope	Details

CRM Dashboard

Export-only by default. No external email, no auto-push to CRMs. Every prospect's data is grounded in stored sources with citation IDs.

Total prospects—

High-fit—

Needs review—

Outreach ready—

Approved for export—

Exported leads—

CRM push dry runs—

Blocked by compliance—

Spark Scout runs—

Sources from Spark—

Prospect Pipeline

Prospects

	Company	Ticker	Sector	Stage	Fit	IR need	Confidence	Completeness	Spark sources	Outreach	Review	CRM export	Last activity	Next follow-up

Outreach review queue

Follow-up tasks

Title	Prospect	Due	Status	Notes

CRM export / sync status

Loading…

Phone Agent

Paid APIs used: no SIP/PSTN ready, not configured

BackendLoading

Active Calls0

Calls Today0

Leads0

Appointments0

Errors0

Runtime Status

Recent Calls

No calls loaded.

Browser Test Mode

No active call

Start a browser test call to begin.

STT status loads from runtime discovery.

TTS fallback status loads from runtime discovery.

Latency: not measured yet.

Transcript

Voicemail

Appointment Requests

Agent Identity

Agent name Used in the greeting and throughout the script. Example: Sally, Hannah, Marcus. Business name Used in the greeting. Example: Smart AI Coach. Voice Voice the agent uses. Save and start a new call to hear the change.

Smart AI Coach Prompt Greeting Qualification Questions Appointment Rules Escalation Rules Availability Windows LLM Model TTS Voice Realtime Model Realtime Voice Realtime Max Minutes OpenAI API Key Call-center background Background Volume

Leads

Create / Edit Appointment

Title Lead ID Starts At Ends At Notes

Appointments

Call History

Discovered Providers

Telephony

Browser Test ModeAvailable in v1

PSTN/SIPNot configured

AsteriskNot detected

Enable SIP/PSTN when ready Provider SIP Host SIP Port Transport SIP Username SIP Password Outbound From Number Inbound DID Asterisk Context Asterisk Test Extension Phone Agent Event URL

Asterisk pjsip.conf template

Load telephony config to generate template.

Asterisk extensions.conf template

Load telephony config to generate template.

Source

YAML / JSON / D2

No preview yet. Configure inputs above and click Generate.

Ready. Excalidraw / Draw.io export — future.

History

Generate

Create a new podcast from your audiobook library, narrated in your trained voice. Daily scheduled run targets 15 minutes; manual runs can be any length.

Idle

Max talking time (minutes) Uses ~165 words/minute to calculate the maximum script length.

Target word count If set, the script must land near this target. Cannot exceed max time.

Voice Chatterbox is the daily-cron default. XTTS works for short samples but goes robotic on full episodes. ElevenLabs is paid but is the only option that sounds like you over a full podcast.

Default: 2250-2750 words (~15 minutes).

History

All previously generated podcasts. Download individual episodes or batch them as a ZIP.

	Title	Generated	Topics	Duration	Size	Status	Actions

Loading podcast history...

🛡️ Sprite Generator

Generate pixel-art sprites locally on Spark using SDXL Turbo + PixelArtXL. Custom LoRA training pipeline supports fine-tuning on the FightLust style or any pack you upload.

🧠

LoRA Training

ninjaadv · SDXL base 1.0

— —

— waiting for status…

📊

Step

—

⏱️

Elapsed

—

⏳

ETA

—

📉

Loss

—

🎮

GPU

—

⚡

Power

—

Dataset —

Output —

📜 Live log tail

…

Generate a sprite

Preset

Custom prompt (overrides preset)

Seed (optional)

PixelArt LoRA strength — 1.00

Clean transparent background Keep subject as-is; remove surrounding noise.

Idle. Pick a preset or write a prompt.

Sprite Library

FightLust Zone Editor Layout Zone name biome 0 (grass) Exclusive enemies Ready.

v1.00.00

Scroll/+/− = zoom · Pan tool or hold Space + drag = pan · F = fit · 0 = 100% · Left-drag = paint / move marker

Click a tile to select it for painting.

Type to search everything placeable. Click a result to drop it.

V select · H pan · B brush · E erase · R rect · G bucket · I eyedropper · W water
Scroll/+/− zoom · Space+drag pan · F fit · 0 100%
Del remove selected · Ctrl+Z undo · Ctrl+Y redo · Ctrl+S save to game

—

Total tokens (24h)

—

Total calls (24h)

—

Spend (USD, 24h)

—

Spark uptime

none

Last error

Link Health

Feature URL Probes (cached 5 min)

Tokens by model

Calls by site

Calls over time

Recent calls

Time	Model	Source	Site	Tokens	Spend	Status
Loading…

SIEM

Security Information & Event Management — audit trail, alerts, and threat intelligence.

🏢 TALECLIP.LOCAL

Overview

Investigation

Alerts

Detection Rules

Audit Log

Retention

Visitor Map

IP Intelligence

Reports

Settings

Loading dashboard...

Event Timeline (24h)

Top IPs (24h)

Top Actions (24h)

Categories (24h)

Severity Distribution (24h)

Open Alerts

Event Search & Investigation

🔍

Use the filters above and click Search

Alert Instances

Detection Rules

SIEM Audit Trail

Immutable log of all SIEM configuration changes, rule updates, exports, and alert resolutions.

Data Retention Policies

Configure how long security events are retained per category. Events older than the retention period are automatically purged.

Public IPs only

Visitor World Map Locations are approximate — based on IP geolocation.

Summary

Top countries

Recent visitors

IP Intelligence

Security Reports

Downloadable PDF briefings for any selected IP. Reports include risk-score breakdown, network ownership, and recommended defensive actions.

Visitor Intelligence Settings

All settings are admin-only. API key VALUES are never displayed — only configured / not-configured indicators.

Entity

👤

Select a user object to view properties

Click a user in the tree to open their profile

📦 Backup TaleClip

Split-ZIP archive (10 GB parts max). Schedule: Tue / Thu / Sat at 02:00. Retention: latest 3 sets. Excludes LLM binaries, audio, video, generated media.

Loading…

🎨 Appearance & Themes

Choose the visual theme used across TaleClip. Dark and light mode are independent — every theme supports both.

Loading themes…

🎯 Branding Scraper

Extract colors, fonts, logos and design tokens from any public website. Download as JSON to feed into Claude's design-system tooling — Firecrawl-style.

🐑 ShepherdsPen

ShepherdsPen runs as a sibling app at http://0.0.0.0:8080. Use the controls below to start it, then open it in a new tab.

Checking…

—

YouTube Downloader

Paste a YouTube URL, choose a format, and get a direct download link. Processing happens on the server.

YouTube URL

Format

MP3 Audio only · 192 kbps WAV Audio only · uncompressed MP4 Video + audio · best quality MP4 1080p Video + audio · up to 1080p (falls back to best available)

Only download content you own, have permission to use, or are legally allowed to download.

Web

Public webpage extraction, media downloads, and full-site mirrors. Only download content you own or have permission to archive. Some platforms may restrict automated downloads.

AI Product Recreation Intelligence

Point at any AI SaaS site. Discover its services, infer its pipelines, generate a Taleclip recreation plan + a ready-to-paste Claude Code CLI prompt.

Target URL Depth Engine

Focus notes (optional) — narrow what to research

Saved sessions

No saved sessions yet.

Live engine log

Media Downloader

Image & video extraction from a public page.

URL

Images Videos Skip duplicates Include metadata JSON

Max files Max depth Request delay (ms) Concurrent

Live progress

Files found0

Downloaded0

Skipped0

Failed0

Estimated ZIP size0 B

Job history

Name	URL	Mode	Status	Found	Downloaded	Created	Size	Actions
No jobs yet.

Scraper settings

Default request delay (ms) Default concurrency Default max pages Default max file size (MB)

User-Agent string

Respect robots.txt Safe mode (SSRF + sensitive path guards)

Duplicate detection

Loading status…

Templates

Loading…

New template

Name Description HTML body

Use tokens. Signer tokens (signature, signature_name, signature_date) are filled by the recipient.

Send for Signature

Signing Requests

Loading…

💾 Space Saver

Disk usage analyzer + safe cleanup. Scan any allowed folder, see what's eating your space, drill in, and delete with a two-step trash-first confirmation.

Total capacity

—

Used

—

Free

—

Used %

—

Scan target

—

Folder / path

	Name	Size	%	Files	Folders	Modified	Type	Warning
No scan yet.

🌱1. Project Source ▾

Lovable project URL Optional. Used to enrich metadata + auto-detect cloud vs. non-cloud.

GitHub repository Required if you want a real build (we clone this).

Live site URL Used for Playwright visual capture.

Branch

Project type

GitHub token (optional — for private repos)

🗄️2. Backend & Supabase ▾

Supabase URL

Supabase project ref

Anon key (public) Safe to share — embedded in client builds.

Service role key

Supabase access token

DB password

Edge functions

Storage references

Redirect URLs

Production domain

Include schema export (requires DB password)

🛠️3. Build Settings ▾

Package manager

Node version

Install command (override)

Build command (override)

Output directory (override)

Hosting target

📸4. Capture & Fidelity ▾

Visual capture

Crawl depth

Max pages

Cleanup window (hours)

Include screenshots in ZIP Include source files in ZIP Generate deployment guide README

🤖5. AI Assistance ▾

AI engine Optional assist for migration notes, redirects, and config rewrites.

🗄️6. Database Migration (optional) ▾

Enable database migration

Source database is treated as read-only. No data is ever written to or modified on the source. Service role keys are never included in the downloadable ZIP.

Cross-engine migration (e.g. Postgres → MySQL) is best-effort and may require manual review.

Idle

🌐 Live capture waiting…

No page captured yet.

—

Logs Autoscroll

Logs will appear here once the job starts.

📦 Results

Job ID —

Auto-cleanup in —

Status Working files removed after window.

⬇ Download ZIP

Binder

No chapters yet.

📝

Select a chapter from the Binder to begin writing.

Draft

0 words 0 chars 0 no-space Saved

Inspector

Table of Contents

Metadata

Status -- Words 0 Last saved --

Reading Voice

Engine

Voice

Chapter Notes

Generation

Keep close to TOC

Detail

Min Words

Guardrails

Scope

Source Content

TXT, PDF, DOCX…

Style Sheet

Sermon → Paperback Book

Upload a sermon or lecture audio file. Taleclip transcribes it, then converts the transcript into a chaptered paperback manuscript with strict editorial guardrails (no fabrication, preserves the speaker's voice, no lecture artifacts). Export to Markdown, plain text, or 6×9 KDP-ready DOCX.

1. Upload audio 2. Transcribe 3. Review transcript 4. Generate book 5. Export

🎙️

Drop audio/video here, or click to browse

.mp4, .mkv, .mov, .webm, .avi, .mp3, .wav, .flac, .ogg, .m4a, .aac (max 500 MB)

Already have a transcript? Paste text instead

Print Cover Calculator

Calculate cover dimensions and generate print-ready templates for KDP paperback and hardcover books.

Binding Type

Interior Type

Paper Type

Reading Direction

Measurement Units

Interior Trim Size

Import Manuscript (optional)

📄

Drop .docx or .pdf to auto-detect page count

Page Count (enter your KDP or Word page count)

24–830

Cover Image Compositing ▼

Image Enhancement ▼

Cover Image to Enhance

Drag & drop cover image here or click to browse

Preset Size

Width (px)

Height (px)

KDP Template PDF (optional)

Upload a KDP cover template PDF to match its exact page dimensions. This overrides width/height/DPI and outputs a print-ready PDF.

Padding Background Color

Transparent

Output DPI

600 DPI recommended for print-ready book covers.

Output Format

PNG for lossless, TIFF/PDF for print shops. Auto uses PNG for transparent, JPG otherwise.

Enhance image before resizing (recommended)

Template Preview

📖

Enter book details and click Calculate to see template preview

Manuscript

0 words 0 chars

Filename format ?

Voice

Engine

Voice

Speed

Tone

Output

Split by structure Chapter Lesson Part

Project

Live Chunks

0 / 0

Chunks will appear here as they generate.

Audio Files

No audiobooks generated yet.

Voice Training

How Voice Training Works

Train the AI to speak in your voice for audiobook narration. There are two approaches — pick whichever fits your situation:

QUICK START (Zero-Shot Cloning)

Upload or record a voice sample (10+ seconds). The AI imitates your voice immediately — no training wait. Good enough for drafts. Uses Chatterbox engine.

BEST QUALITY (XTTS Fine-Tuning)

Upload 60+ seconds of audio, then train a custom model on your voice (takes 1–2 hours). The AI truly learns your voice — much more accurate and natural. Best for final audiobooks.

1 Upload Your Voice Samples

Upload .wav or .mp3 audio files of you speaking clearly. You can also click Record to record directly from your microphone. Tips for best results:

Use a quiet room — no background noise, music, or echo
Speak naturally at your normal pace and tone
For Quick Start: 10–30 seconds is enough
For XTTS Fine-Tuning: upload at least 60 seconds total (more is better, 2–5 minutes ideal)
Multiple short clips are fine — they get combined automatically

2a Quick Start: Guided Voice Training

Click the button below and you'll be shown 5 short passages to read aloud. The system records you reading each one, then builds a voice profile from those recordings. This is the fastest way to get a working voice clone.

After completing this, your voice will appear in the Voice dropdown on the Audiobook tab (Chatterbox engine).

2b Learn My Voice (Read Your Book)

Similar to Guided Training, but uses excerpts from your actual book as the reading material. Each round has 3 passages. The more rounds you do, the better the voice quality gets. This also adds to your voice sample data for XTTS training below.

3 XTTS Voice Fine-Tuning (Best Quality)

This trains a dedicated AI model on your voice data. Unlike zero-shot cloning (Steps 2a/2b), this actually learns the unique characteristics of your voice — pitch, cadence, tone, pronunciation. The result is a much more accurate and natural-sounding voice.

How to use XTTS Fine-Tuning:

Upload voice samples in Step 1 above (minimum 60 seconds total, 2–5 minutes recommended)
Click "Prepare Data" — this splits your audio into clean training chunks, normalizes volume, and creates a training dataset. Wait for it to finish.
Click "Train My Voice" — this starts the actual AI training. It runs for the number of epochs shown below (default 50). Training takes 1–2 hours depending on data size. You can leave this page and come back — training continues in the background.
When training completes, click "Test Voice" to hear a sample of the AI speaking in your trained voice.
Go to the Audiobook tab, select "XTTS (Fine-tuned)" from the Voice dropdown, and generate your audiobook.

Status Checking...

Training Duration

Advanced Settings

Epochs

Batch Size

Learning Rate

Visual Education

Convert documents into narrated educational videos

Source Documents

Drag & drop documents here, or click to browse

Multiple files supported — drop several at once, or drop additional batches to add more. Supports .docx, .pdf, .txt, .pptx, .md

— OR browse server folder —

Training Prompt / Script Guidance (optional)

Paste a super prompt or custom script to guide structure, tone, emphasis, and learning objectives. Uploaded documents remain the factual grounding context.

Sample prompts:

Leave blank to use Taleclip's default facilitator-focused workshop-prep prompt. Picking a sample above overwrites the textarea (you'll be asked to confirm if it already has text).

🎬 Video Length

How long should the finished video be?

Full length or min

Leave Full length checked to narrate the entire document (default). Or enter a target number of minutes and the transcript will be shortened (or expanded) to land near that runtime.

Settings

Project Title

TTS Engine (GPU-accelerated)

Script Engine (AI) Claude CLI analyzes documents deeper and produces richer transcripts

Voice

Output Quality

Mode

Visual Style

Training Presentation uses synchronized bullets, diagrams, checklists, and summary slides. Cinematic mode is best for concept explainers and transitions.

AI-generated B-roll imagery (optional)

Generate AI illustrations for points that have no relevant document figure. Off (recommended) = clean concept cards — faster, and better for learning retention.

Animated hero clips (optional)

Generate real motion video for your most important points instead of a still image. Needs the LTX warm server and adds several minutes per clip.

Number of hero animations

How many motion clips to generate (0 = none, up to 10). Every other point uses a still image.

Checking engines…

Tone

Technical Level

Brand Pack

Generate captions (SRT/VTT)

Split into short modules (recommended)

Output one short video per learning objective (~3–6 min each) instead of one long file. Microlearning improves retention and makes updates easier.

Add knowledge-check quizzes (recommended)

End each module with 2–3 multiple-choice questions drawn only from that module's narration. Active recall is the single biggest retention lever, and the quiz JSON is reused for LMS scoring.

Research topic first (find recent guides to ground the script)

Combine all documents into one video

▶ Pronunciation Guide — fix how TTS says specific words

Taleclip

Account Settings

Appearance

API Keys

System Memory

GPU Utilization

AI Video StudioLocally Powered AI

Content Creation

AI Video Generation

AI Image Generation

Talking Avatars & Lip Sync

Text-to-Speech

AI Narrative Engine

Video Stitching & Music

Publishing & Documents

Book Studio

UGC Video Templates

Developer Tools

Code Editor (IDE)

Training Dashboard

Platform

Hardware Accelerated

100% Local & Private

The AI video generator built for video creators

Creating your video...

Scenes

Your video is ready!

Queue: 0 of 0 complete

Completed Videos

Generation History

Original

Enhanced

Enhancement Options

Local Model & Pipeline

The AI image generator for video creators

Enhancement Complete!

Watermark Removed!

Generating your image...

Your Generated Image

AI Voiceover

1. Choose Avatar

2. Enter Text to Speak

3. Voice Settings

Primary Actor

Generating...

Result

Audio Result

Tips for Best Results

Transcribe Audio / Video

Transcript

Transcript-first Audio Cleanup

Text to Speech

Enter Your Text

Voice Settings

Audio Result

Tips

Enhance Photo

Enhancement Complete!

Creative Assets

Videos

Edited Photos

Generated Images

Merge Videos

Filters

Filters

AI Prompt Enhancer

Enhance Your Prompt

Project Context Optional

Recent Enhancements

Marketing

UGC Video Templates

Generate Video

Generating Video

Generation Complete

Recent Generations

LoRA Training

Source-grounded synthesis

Sources cited

Scout Trail ▾

Companies

AI Video Studio
Locally Powered AI

Voicemail

Appointment Requests