Model Library

Lip Sync & Talking Video Models

Choose the best model for dubbing, avatar videos, or text‑driven generation.

16 AI video and lip sync models in one catalog

Covering video dubbing, talking avatars, audio-driven video, and text-to-video

Open a model page, preview use cases, then launch the workspace directly

Choose the right model faster

Short answer: this catalog groups every LipsyncX model by workflow so users can quickly find the best fit for dubbing, talking-head video, audio-driven generation, or text-to-video creation.

Each model page now follows the same product-style structure: simple hero, live workspace, examples, use cases, specs, and FAQ.

Video to Video

Transform existing footage with new speech and motion.

7 models

Audio to Video

Drive realistic lip sync videos from speech or music audio.

5 models

Text to Video

Generate talking videos directly from text prompts.

1 models

Browse by workflow

Category 1

Video to Video

Transform existing footage with new speech and motion.

Category 2

Audio to Video

Drive realistic lip sync videos from speech or music audio.

Category 3

Text to Video

Generate talking videos directly from text prompts.

Category 4

Dubbing

Models optimized for multilingual voice dubbing workflows.

Category 5

New

Latest models recently added to the catalog.

Why this catalog works better

Users can start from the workflow they already understand instead of decoding technical model names first.

Each card leads to a focused product page with a live creation area, example output, and quick answers before the user commits time or credits.

That makes the catalog more useful for SEO and better for conversion because discovery and action now happen in the same flow.

Video to Video

Transform existing footage with new speech and motion.

7 models

Sync Lipsync 2.0

Popular

Balanced quality and speed for general lip‑sync dubbing.

VideoAudioVideo

Sync Lipsync 2 Pro

Higher facial detail and realism for close‑ups.

VideoAudioVideo

Sync React‑1

Emotion‑aware sync with subtle facial expression control.

VideoAudioVideo

Sync Lipsync 1.9

Legacy, lightweight option for fast processing.

VideoAudioVideo

LatentSync

Diffusion‑based sync with strong temporal consistency.

VideoAudioVideo

PixVerse LipSync

Reliable video lip sync with fast turnaround.

VideoAudioVideo

VEED LipSync

Clean, straightforward lip sync for standard formats.

VideoAudioVideo

Audio to Video

Drive realistic lip sync videos from speech or music audio.

5 models

LongCat Multi‑Avatar

Stable talking heads for longer content and multi‑speaker scenes.

ImageAudioVideo

LongCat Single‑Avatar

Consistent identity for single‑speaker narration.

ImageAudioVideo

OmniHuman

Turn a single photo and audio into a lip‑synced digital human video.

ImageAudioVideo

LTX 2.3

New

High-quality audio-to-video generation for stylized talking clips.

Audio URLImage URL (optional)Video file

Kling LipSync (Audio‑to‑Video)

Audio‑driven lip sync with high precision.

ImageAudioVideo

Text to Video

Generate talking videos directly from text prompts.

1 models

Kling LipSync (Text‑to‑Video)

Generate lip‑synced video directly from a script.

TextVideo

Dubbing

Models optimized for multilingual voice dubbing workflows.

2 models

Dubbing

Localized output with synced lip movement.

VideoAudio/TextVideo

ElevenLabs Dubbing

Dub videos with language detection and voice selection.

VideoVideo

New

Latest models recently added to the catalog.

1 models

Seedance 2.0

New

New multi‑shot video model with strong consistency.

Text/AudioVideo