Generate videos, talking avatars, images, voiceovers, and audiobooks โ all running locally on your hardware. 5+ video engines, 6 lip sync engines, 5 TTS voices, zero cloud dependency.
๐ฌ5+Video Models
๐ฃ๏ธ6Lip Sync Engines
๐5TTS Engines
๐6Export Formats
Content Creation
Generate video, images, speech, and music with AI โ from a single prompt to a finished production.
๐ฌ
AI Video Generation
Multiple AI engines for image-to-video and text-to-video with draft and premium quality presets.
Create Video โ
๐ผ๏ธ
AI Image Generation
Text-to-image, img2img, inpainting, background removal, watermark removal, and AI face enhancement.
Generate Image โ
๐ฃ๏ธ
Talking Avatars & Lip Sync
6 lip sync engines with automatic cascade fallback for realistic talking head videos.
Create Avatar โ
๐
Text-to-Speech
5 text-to-speech engines with 20+ voices and voice cloning support.
Generate Speech โ
๐ง
AI Narrative Engine
AI-powered story parsing with automatic scene splitting and vision analysis for context-aware generation.
Write Narrative โ
๐ต
Video Stitching & Music
Multi-segment concatenation, audio normalization, and background music mixing for polished final output.
Browse Music โ
Publishing & Documents
From manuscript to finished book or marketing campaign โ write, compile, and publish.
๐
Book Studio
Upload PDF, DOCX, EPUB, TXT, RTF, or HTML. Scrivener-style binder, Quill rich editor, audiobook generation, and compile to 6 export formats.
Open Studio โ
๐
UGC Video Templates
9 ready-made templates โ Hands Holding, Unboxing, Bedroom Review, Car Review, Lifestyle, Studio Shot, Close-up, Rotation, Comparison.
Browse Templates โ
Developer Tools
A full IDE and training dashboard built into the platform.
๐ป
Code Editor (IDE)
Full IDE with 4-panel layout, AI coding agent, file explorer, integrated terminal, and diff preview.
Open Editor โ
๐งช
Training Dashboard
Real-time ML metrics with live charts, fine-tuning pipelines, auto-recovery, and AI-generated training reports.
Open Dashboard โ
Platform
โก
Hardware Accelerated
GPU-accelerated generation with BF16/TF32 precision and torch.compile acceleration.
๐
100% Local & Private
Powered by Ollama + ComfyUI. No cloud services, no data leaves your machine, no subscription required. Full control over your AI stack.
5+Video Models
6Lip Sync Engines
5TTS Engines
9UGC Templates
LocalGPU Compute
The AI video generator built for video creators
๐ทDrop image, click to upload, or paste from clipboard
Describe the motion, camera movement, or action you want to see. The AI will animate your image based on this description.
๐ Import Script
Supports: "Visuals:", "Dialog (Lip-sync):", "Narration:", scene markers, and "---" separators.
๐พ Saved Scene Sets
Save multiple manual playbooks and load them later. Stored locally in your browser.
๐ค Dialogue Voice:
Scene 1~5 sec clip
sec into scene
Each scene = one ~5 second video segment. Check ๐ค to add lip-synced dialogue to a scene.
seconds into video
The character will speak this text with lip-synced animation.
Fast text-to-video generation. Describe your scene and hit Execute.
Checking memory...
Context Preview โ edit AI-generated per-segment prompts before generating · Evaluate โ copy current settings to clipboard
Upload a video or audio file to generate a text transcription using Whisper AI.
๐๏ธ
Drop files or a folder, or click to browse ยท pick a folder
Supports .mp4, .mkv, .mov, .webm, .avi, .mp3, .wav, .flac, .ogg, .m4a, .aac (max 500 MB each). Folders are recursed and files are sorted intelligently (natural order, grouped by sub-folder).
Queue(0)
Uploading...0%
Transcribing...
Progress0%
Transcript
Transcript-first Audio Cleanup
Import audio or video, transcribe with word-level timestamps, then delete filler words, pauses and
repeats directly from the transcript โ the audio is cut in sync. Export a cleaned file when done.
Text to Speech
Generate natural-sounding voiceovers with AI
Enter Your Text
0 characters
Voice Settings
Generating...
Audio Result
Tips
VibeVoice offers the most natural sound
Use punctuation for natural pauses
Preview voice before generating
Edge TTS has more voice options
Enhance Photo
Upscale images using AI enhancement
๐ผ๏ธ
Click or drag images here
You can select multiple images
Original size:
After 2x:
Output DPI: 300
2x is faster, 4x provides higher resolution output.
Upload a KDP cover template PDF to match its exact page dimensions. This overrides width/height/DPI and outputs a print-ready PDF with identical MediaBox points.
600 DPI recommended for print-ready book covers.
PNG for lossless, TIFF/PDF for print shops. Auto uses PNG for transparent, JPG otherwise.
Enhancement Complete!
Creative Assets
Videos
Click to play. Select multiple to delete.
No videos yet
Edited Photos
Photos edited with ChronoEdit. Select multiple to delete.
No edited photos yet
Generated Images
AI-generated images from Text-to-Image. Select multiple to delete.
No generated images yet
Merge Videos
Drag videos to reorder. They will be seamlessly stitched together.
๐ฌ
Click or drag videos here
0 videosTotal: 0s
Loading music...
๐ต
No music files. Add .mp3/.wav to assets/music/
Filters
⌘K
Loading sound effects...
๐
No sound effects. Add .mp3/.wav to assets/sound_effects/
Filters
⌘K
AI Prompt Enhancer
Enhance your prompts with context-aware AI
Enhance Your Prompt
Project Context Optional
No file selected
e.g., CLAUDE.md, README.md
OR
๐
Ready
--
No project analyzed yet
Project Summary
--
Recent Enhancements
No enhancement history yet
Marketing
GPU Memory:
Loading...
๐ค
UGC Video Templates
Select a template style for your product video
๐ฌ
No template selected
Loading templates...
Generate Video
Longer videos require multiple generation passes
๐ทImage 1Click to upload
Drag & drop images or click slots to upload
Generating Video
Job:
Starting...
Generation Complete
Recent Generations
No videos generated yet. Select a template above to get started.
LoRA Training
Idle
Started--
Elapsed--
ETA--
▼Training Content
Loading training data info...
0%
System Memory0 GB
Loading...
GPU Utilization0%
0 F
Training Loss--
Learning Rate--
Progress0/0 ep
0%
Phase0/8
Epoch0/0
Step0/0
Training Speed-- step/s
No reports yet
0 selected
>
Idle
๐
Select or create a notebook to get started.
Phone Agent
Paid APIs used: noSIP/PSTN ready, not configured
BackendLoading
Active Calls0
Calls Today0
Leads0
Appointments0
Errors0
Runtime Status
Recent Calls
No calls loaded.
Browser Test Mode
No active call
Start a browser test call to begin.
STT status loads from runtime discovery.
TTS fallback status loads from runtime discovery.
Latency: not measured yet.
Transcript
Voicemail 0 unread
Appointment Requests 0 new
Agent Identity
Leads
Create / Edit Appointment
Appointments
Call History
Discovered Providers
Telephony
SIP/PSTN ready, not configured
Browser Test ModeAvailable in v1
PSTN/SIPNot configured
AsteriskNot detected
Asterisk pjsip.conf template
Load telephony config to generate template.
Asterisk extensions.conf template
Load telephony config to generate template.
Podcast
SchedulerLoading...
Generate
Create a new podcast from your audiobook library, narrated in your trained voice. Daily scheduled run targets 15 minutes; manual runs can be any length.
Idle
Uses ~165 words/minute to calculate the maximum script length.
If set, the script must land near this target. Cannot exceed max time.
Default: 2250-2750 words (~15 minutes).
Starting...0%
โ
Generation failed
Live System Telemetry
Memory and GPU usage updating in real time while the job runs.
โ Live
System Memory
โ GBโ total
โ GB
GPU Utilization
โ %โ VRAM
100%
History
All previously generated podcasts. Download individual episodes or batch them as a ZIP.
0 selected
Title
Generated
Topics
Duration
Size
Status
Actions
๐๏ธ
No podcasts yet
Click "Generate Podcast Now" or wait for the next scheduled run.
Loading podcast history...
Web
Public webpage extraction, media downloads, and full-site mirrors.
Only download content you own or have permission to archive.
Some platforms may restrict automated downloads.
โ๏ธ
This tool does not bypass logins, paywalls, CAPTCHAs, or
other access controls. Respect robots.txt, platform
terms, and copyright.
Smart Scraper
Analyse a public page and extract structured fields.
Detected content
Custom selectors
Pagination
Export format
Preview
Run Analyze Page to preview detected content.
Media Downloader
Image & video extraction from a public page.
โน๏ธInstagram only works for fully public pages that load without a login wall.
We do not use cookies, credentials, or bypass techniques.
Live progress
Files found0
Downloaded0
Skipped0
Failed0
Estimated ZIP size0 B
Full Site Downloader
SiteSucker-style mirror of a public site for offline browsing.
Crawl report
Pages crawled0
Assets downloaded0
Errors0
Total downloaded0 B
Job history
Name
URL
Mode
Status
Found
Downloaded
Created
Size
Actions
No jobs yet.
Scraper settings
Book Writer
Write books chapter-by-chapter or generate professional audiobooks with AI voices.
Binder
No chapters yet.
Uploadingโฆ
๐
Select a chapter from the Binder to begin writing.
Draft
0 words0 chars0 no-spaceSaved
Inspector
Table of Contents
Metadata
Status--Words0Last saved--
Reading Voice
Chapter Notes
Generation
Generatingโฆ
Guardrails
Source Content
TXT, PDF, DOCXโฆ
Style Sheet
Compile Manuscript
Compile For
Contents
Section Layouts
Configure how each section type appears in the compiled output.
Separators
Front Matter
Back Matter
Page Settings
These settings apply to PDF output only.
Metadata
Typography
Preview
Click "Refresh Preview" to see the compiled structure.
Choose Workflow Structure
Select how your book is organized before importing.
This sets the naming for each level of your outline.
Name your three hierarchy levels:
Humanize Content
Editorial rewrite for clarity, flow, and natural voice. No new facts added.
0 / 10
Humanizingโฆ
Sending to editor modelโฆ
Long chapters may take 2โ5 minutes with large models.
Review Changes
Confirm to overwrite the editor. Cancel to discard.
Quality Evaluation
Audit without modifying content.
Evaluatingโฆ
Reword Selection
Rewrite only the highlighted text using your instructions. The rest of the chapter is untouched.
Rewritingโฆ
Sending to modelโฆ
Review Rewrite
Sermon โ Paperback Book
Upload a sermon or lecture audio file. Taleclip transcribes it, then converts the transcript into a chaptered paperback manuscript with strict editorial guardrails (no fabrication, preserves the speaker's voice, no lecture artifacts). Export to Markdown, plain text, or 6ร9 KDP-ready DOCX.
0 words โ edit inline before generating if needed.
Book metadata (remembered across sessions)
Targets โ optional hints to the outliner
Prayers โ only included when checked
Queuedโฆ
Progress0%
Manuscript
Print Cover Calculator
Calculate cover dimensions and generate print-ready templates for KDP paperback and hardcover books.
Hardcover calculations are approximate and may not match KDP exactly. Wrap (0.51") and hinge (0.4") are official KDP values. Spine width and full cover dimensions are derived estimates. Use KDP Cover Calculator for final submission accuracy.
Add your text, choose a voice, then click Generate Audiobook.
Live Chunks
0 / 0
Chunks will appear here as they generate.
Audio Files
No audiobooks generated yet.
Voice Training
How Voice Training Works
Train the AI to speak in your voice for audiobook narration. There are two approaches — pick whichever fits your situation:
QUICK START (Zero-Shot Cloning)
Upload or record a voice sample (10+ seconds). The AI imitates your voice immediately — no training wait. Good enough for drafts. Uses Chatterbox engine.
BEST QUALITY (XTTS Fine-Tuning)
Upload 60+ seconds of audio, then train a custom model on your voice (takes 1–2 hours). The AI truly learns your voice — much more accurate and natural. Best for final audiobooks.
1
Upload .wav or .mp3 audio files of you speaking clearly. You can also click Record to record directly from your microphone. Tips for best results:
Use a quiet room — no background noise, music, or echo
Speak naturally at your normal pace and tone
For Quick Start: 10–30 seconds is enough
For XTTS Fine-Tuning: upload at least 60 seconds total (more is better, 2–5 minutes ideal)
Multiple short clips are fine — they get combined automatically
2a
Click the button below and you'll be shown 5 short passages to read aloud. The system records you reading each one, then builds a voice profile from those recordings. This is the fastest way to get a working voice clone.
After completing this, your voice will appear in the Voice dropdown on the Audiobook tab (Chatterbox engine).
2b
Similar to Guided Training, but uses excerpts from your actual book as the reading material. Each round has 3 passages. The more rounds you do, the better the voice quality gets. This also adds to your voice sample data for XTTS training below.
3
This trains a dedicated AI model on your voice data. Unlike zero-shot cloning (Steps 2a/2b), this actually learns the unique characteristics of your voice — pitch, cadence, tone, pronunciation. The result is a much more accurate and natural-sounding voice.
Click "Prepare Data" — this splits your audio into clean training chunks, normalizes volume, and creates a training dataset. Wait for it to finish.
Click "Train My Voice" — this starts the actual AI training. It runs for the number of epochs shown below (default 50). Training takes 1–2 hours depending on data size. You can leave this page and come back — training continues in the background.
When training completes, click "Test Voice" to hear a sample of the AI speaking in your trained voice.
Go to the Audiobook tab, select "XTTS (Fine-tuned)" from the Voice dropdown, and generate your audiobook.
StatusChecking...
0%Elapsed: 0:00
0%
Epoch 0/0Loss: โETA: calculating...
Advanced Settings
Learn My Voice
Excerpt 1 of 3
Loading...
Read the excerpt above clearly and naturally, as if narrating your audiobook.
✓
Round Complete!
Your voice profile has been enhanced.
Voice Training
Passage 1 of 5
Warm-Up: Natural Speech
Loading...
Read the text above clearly and naturally. Speak at your normal pace.
✓
All Passages Recorded!
Name your voice profile:
Visual Education
Convert documents into narrated educational videos
Source Documents
Drag & drop documents here, or click to browse
Supports .docx, .pdf, .txt, .pptx
— OR browse server folder —
Settings
Claude CLI analyzes documents deeper and produces richer transcripts
2 min
โถPronunciation Guideโ fix how TTS says specific words
Ingest
Script
TTS
Render
Stitch
0%
โถ
Review & Edit Narration Scripts
Edit the generated scripts below, then click "Generate Video" to produce the final output.
This avoids the browser file picker entirely, so you wonโt see an โUploadโ button. You browse inside this modal and click Select This Folder (our OK button).
Admin shares: C$, D$. Or use a custom Windows share name.
Enter host + share, then click โConnect & Browseโ.
๐ง Build Your UNC Path (2 Simple Steps):
How to find your Windows IP:
1. Press Win+R โ type cmd โ press Enter
2. Type ipconfig โ press Enter
3. Look for "IPv4 Address" (e.g., 192.168.1.100)
How to copy the path:
1. Open File Explorer on Windows
2. Navigate to your folder (e.g., D:\ExamsCatalyst\Core)
3. Click in the address bar at the top
4. Copy the path (Ctrl+C) and paste it here (Ctrl+V)
Fill in the fields above to generate path...
โ Example with your D:\ExamsCatalyst\Core folder:
You enter:
Step 1: 192.168.1.100 (your Windows IP)
Step 2: D:\ExamsCatalyst\Core (from File Explorer)
System builds: \\192.168.1.100\D$\ExamsCatalyst\Core
How to find your Windows path:
1. Open File Explorer on Windows
2. Navigate to your project folder
3. Copy the path from address bar (e.g., C:\Users\YourName\Projects\MyProject)
4. Replace C: with \\YOUR_WINDOWS_IP\C$
Examples:
โข Using IP: \\192.168.1.100\C$\Users\YourName\Projects\MyProject
โข Using hostname: \\DESKTOP-ABC\C$\Users\YourName\Projects