Kling Video v1 Pro (AI Avatar) is a generative video model that creates an avatar-style video from two key inputs: a background image URL and an audio URL. You can optionally add a prompt to guide expressions, tone, and on-screen actions. This makes it well suited to developers adding AI avatar video generation, talking-head video, or audio-driven video synthesis to apps, internal tools, or creative pipelines.
Because the output is anchored to your provided image and timed to your audio, the model is especially strong at producing consistent scenes and predictable pacing. That makes it a good fit for product demos, narration, announcements, and short-form content where you control the script.
image_url as the visual base for stable composition.
audio_url to drive timing and overall performance/ambience.
prompt to nudge facial expression, mood, gestures, and intent.
A sharp, high-resolution image_url to reduce blur and preserve detail.
A clear audio_url to improve perceived sync and clarity.
Is Kling Video v1 Pro (AI Avatar) a text-to-video model?
It’s primarily image + audio to video, with an optional prompt for behavior and tone.
What inputs are required?
image_url and audio_url are required. prompt is optional.
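As a rough illustration of the required and optional inputs, the sketch below assembles an input payload in Python. Only the field names image_url, audio_url, and prompt come from this page. The function names, the URL check, and the example URLs are hypothetical, and the sketch does not show how the payload is submitted, since that depends on the API provider you use.

```python
from typing import Optional
from urllib.parse import urlparse


def _require_http_url(name: str, value: str) -> str:
    """Return value unchanged if it looks like an absolute http(s) URL."""
    parsed = urlparse(value)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"{name} must be an absolute http(s) URL, got {value!r}")
    return value


def build_avatar_request(image_url: str, audio_url: str,
                         prompt: Optional[str] = None) -> dict:
    """Hypothetical helper: two required URLs plus an optional prompt."""
    payload = {
        "image_url": _require_http_url("image_url", image_url),
        "audio_url": _require_http_url("audio_url", audio_url),
    }
    # Include the prompt only when the caller supplied non-empty direction.
    if prompt and prompt.strip():
        payload["prompt"] = prompt.strip()
    return payload


# Example with placeholder URLs (assumptions, not real assets).
example = build_avatar_request(
    "https://example.com/presenter.png",
    "https://example.com/narration.mp3",
    prompt="confident, friendly tone; gentle smile; small hand gestures",
)
```

Leaving prompt out of the payload when it is empty mirrors its optional status. Whether a given endpoint accepts an empty string instead is not stated here.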
How do I get the best quality output?
Use a sharp, high-resolution image_url and a clear audio_url with consistent volume and minimal background noise.
What should I put in the prompt?
Direction for expressions and actions (e.g., “confident, friendly tone; gentle smile; small hand gestures”).
How is it different from other AI video generators?
It’s optimized for audio-driven avatar performance anchored to your provided image, giving stable visuals and predictable timing.