LTX 2.3

Default

LTX 2.3 Audio to Video

Turn speech into a stylized talking clip with synchronized lip motion.

Use audio as the timing backbone, optionally add a first-frame image, and steer the result with a concise prompt. This is the workflow already wired into LipsyncX quick create.

Input

Build your clip

Step 1 of 4

Step 1

Upload required audio

2-20s, uploaded automatically.

Step 2

Optional first-frame image

Skip it if prompt defines the look.

Step 3

Prompt and guidance

Keep it visual and concise.

Prompt

Without an image, the prompt defines the scene and visual direction.

Guidance scaleRange 1-50

Without an image, prompt quality and guidance scale determine the first-frame look.

Run

Ready to generate

5 credits / sec

Audio is requiredPrompt required without imageGuidance 1-50

Result

Preview

The flagship LTX 2.3 promo clip with fast cuts and cinematic motion.

LTX 2.3

Upload audio to estimate cost

Generate as soon as your inputs are ready.

Preview

LTX 2.3 Audio to Video

Input on the left, generated result on the right, using official reference material adapted into our page system.

Input

Official example first frame from the workflow demo.

Generated

Why This Path

Pick the right LTX workflow faster

Each mode exists for a different starting point: speech, prompt, still frame, continuation, or a local retake.

Audio drives pacing, lip motion, and phrasing.

Optional first-frame image gives you tighter visual control.

Short voice-led clips are fast to test and easy to iterate.

This is the workflow currently connected to our generation flow.

Production Fit

Use Cases

Where this workflow is most useful inside a real content pipeline.

Talking promos

Turn a recorded line into a polished social clip.

Avatar experiments

Test voice-led hosts before building a bigger workflow.

Audio-first explainers

Start from speech, then add visual direction only where needed.

FAQ

Frequently Asked Questions

Short answers for the practical questions people usually ask when choosing a workflow.

Do I need an image?

No. Audio is required, while the first-frame image is optional.

What gives me the most control?

A clean voice track, a strong first-frame image, and a concise visual prompt.

Is this the page with the live tool?

Yes. This is the only LTX workflow page currently wired to quick create.