// Minimal request to the Kling V1 Pro AI Avatar endpoint using axios.
const axios = require('axios');

const api_key = "YOUR API-KEY"; // replace with your own API key
const url = "https://api.segmind.com/v1/kling-v1-pro-ai-avatar";

// Request body: image URL, audio URL, and a direction prompt.
const data = {
  "image_url": "https://segmind-resources.s3.amazonaws.com/input/209f6f09-7fc6-47c0-8b6e-128ae75db915-wan-720-ip.png",
  "audio_url": "https://segmind-resources.s3.amazonaws.com/input/82538587-def4-4990-9bad-525255b0adf8-kling-pro-ai-avatar.mp3",
  "prompt": "Create a friendly AI avatar greeting."
};

(async function() {
  try {
    const response = await axios.post(url, data, { headers: { 'x-api-key': api_key } });
    console.log(response.data);
  } catch (error) {
    // error.response is undefined for network-level failures, so fall back to the message.
    console.error('Error:', error.response ? error.response.data : error.message);
  }
})();
image_url: URL of the image used as the video background. Use diverse themes for creative or professional looks.
audio_url: URL of the audio used as the background sound. Opt for soothing or energetic tracks as needed.
prompt: Direction prompt for the avatar. Use detailed prompts for specific outcomes.
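The three inputs above can be checked before a request is sent. The following is a minimal sketch, not part of any Segmind SDK: `buildAvatarPayload` is a hypothetical helper, and the http/https restriction and the non-empty prompt rule are assumptions rather than documented API requirements.

```javascript
// Hypothetical helper: validates the inputs and returns an object shaped
// like the request body in the example (image_url, audio_url, prompt).
function buildAvatarPayload({ imageUrl, audioUrl, prompt }) {
  for (const [name, value] of [['imageUrl', imageUrl], ['audioUrl', audioUrl]]) {
    let parsed;
    try {
      parsed = new URL(value); // throws on malformed or missing input
    } catch {
      throw new Error(`${name} is not a valid URL: ${value}`);
    }
    // Assumption: inputs are fetched over HTTP(S) only.
    if (parsed.protocol !== 'http:' && parsed.protocol !== 'https:') {
      throw new Error(`${name} must use http or https`);
    }
  }
  // Assumption: an empty prompt is treated as a caller error.
  if (typeof prompt !== 'string' || prompt.trim() === '') {
    throw new Error('prompt must be a non-empty string');
  }
  return { image_url: imageUrl, audio_url: audioUrl, prompt: prompt.trim() };
}
```

The returned object could then be passed as the request body in place of a hand-written `data` literal.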
To track your credit usage, inspect the response headers of each API call. The x-remaining-credits header reports the number of credits remaining in your account. Monitor this value to avoid disruptions in your API usage.
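As an illustration of reading that header, here is a minimal sketch. Both function names and the threshold of 10 are hypothetical. The sketch assumes the header value is numeric and that header names arrive lower-cased, as they do on an axios response object in Node.js.

```javascript
// Hypothetical helper: extracts the remaining-credit count from a headers
// object. Returns null when the header is missing or not numeric.
function remainingCredits(headers) {
  const raw = headers ? headers['x-remaining-credits'] : undefined;
  if (raw === undefined || raw === null || raw === '') return null;
  const value = Number(raw); // assumption: the header carries a plain number
  return Number.isFinite(value) ? value : null;
}

// Hypothetical check: true when the balance is known and below the threshold.
function isRunningLow(headers, threshold = 10) {
  const credits = remainingCredits(headers);
  return credits !== null && credits < threshold;
}
```

With axios, this could be called as `remainingCredits(response.headers)` once the POST request resolves, for example to log a warning when `isRunningLow` returns true.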
Edited by Segmind Team on October 12, 2025.
Kwaivgi Kling V1 AI Avatar Pro is a premium AI model that merges audio and visual data to produce lifelike avatars from static images. It is powered via the WaveSpeedAI platform API and creates precisely synced audio-video avatars with natural, convincing facial expressions that avoid an artificial look. The model can be a valuable asset for content creators, developers, digital marketers, and other creative professionals who need immersive videos.
How do I optimize image and audio inputs for the best results? Use clear, high-resolution images and clean, high-quality audio recordings. Well-defined facial features in the image and clear speech in the audio give the best results.
Can I customize the avatar's expressions and movements? Yes. Detailed prompts let you control the avatar's behavior, expressions, and overall presentation style.
What file formats are supported for input? The model accepts standard image formats (via URL) for visuals and common audio formats (via URL) for sound input.
How does it compare to other avatar generation models? Kwaivgi Kling V1 AI Avatar Pro stands out for its accurate lip-sync, natural facial expressions, and seamless API integration.
What's the recommended prompt structure for optimal results? A prompt should give a clear direction (e.g., "Create a professional greeting"), add emotional context, and specify the presentation style or behavior you want the avatar to adopt.
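One way to apply this structure is to assemble the prompt from its three parts. The sketch below is illustrative only: `buildPrompt` and its field names are hypothetical, and joining the parts as separate sentences is an assumption, not a format the model is documented to require.

```javascript
// Hypothetical helper: combines direction, emotional context, and
// presentation style into a single prompt string.
function buildPrompt({ direction, emotion, style }) {
  if (typeof direction !== 'string' || direction.trim() === '') {
    throw new Error('direction is required');
  }
  const parts = [direction, emotion, style]
    // Skip optional parts that were omitted or left blank.
    .filter((part) => typeof part === 'string' && part.trim() !== '')
    // Trim each part and drop trailing periods so joining adds exactly one.
    .map((part) => part.trim().replace(/[.\s]+$/, ''));
  return parts.join('. ') + '.';
}

const prompt = buildPrompt({
  direction: 'Create a professional greeting',
  emotion: 'Warm and confident tone',
  style: 'Speak directly to the camera with subtle head movement',
});
console.log(prompt);
```

The resulting string can be used as the `prompt` field of the request body.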