Kling 2.6 Pro is an advanced image-to-video generative AI model that turns a single still image plus a text prompt into a short, cinematic clip—with integrated, native audio. Provide an image_url as the visual starting frame and describe the scene in prompt; the model generates smooth motion, coherent visuals, and synchronized sound design (voice-like audio cues, ambience, and effects) to match the on-screen action.
It’s designed for teams that need fast, production-ready video generation via REST API, especially when the output should feel like a complete “scene” rather than silent B-roll.
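As a rough illustration of the inputs described above, the sketch below assembles a JSON request body from image_url, prompt, and generate_audio. The function name build_request_body is hypothetical, the rule that both image_url and prompt must be non-empty is an assumption, and the endpoint URL, authentication, and exact schema are not given on this page, so nothing is actually sent.

```python
import json


def build_request_body(image_url: str, prompt: str, generate_audio: bool = True) -> str:
    """Serialize a minimal request body using only field names mentioned on this page."""
    # Assumption: both fields are treated as required in this sketch.
    if not image_url or not prompt:
        raise ValueError("image_url and prompt are both required in this sketch")
    body = {
        "image_url": image_url,            # still image used as the starting frame
        "prompt": prompt,                  # text description of the scene
        "generate_audio": generate_audio,  # ask for native audio alongside the video
    }
    return json.dumps(body)


if __name__ == "__main__":
    # example.com is a placeholder URL, not a real asset.
    print(build_request_body("https://example.com/still.png",
                             "A lighthouse at dusk, waves crashing below"))
```

The resulting string could be passed as the body of an HTTP POST once the real endpoint and credentials are known.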
- Image-to-video from a single starting frame (image_url)
- Native audio via generate_audio for richer, immersive clips
- 5- or 10-second clips (duration)
- prompt and negative_prompt for precision
- cfg_scale (0–1) to balance prompt adherence vs. natural motion
- 16:9, 9:16, 1:1 (aspect_ratio) for web, social, and product
- mode="pro" for polished outputs

- Use negative_prompt to prevent artifacts: “no distortion, no jitter, no clutter”.
- Adjust cfg_scale (0–1) to trade prompt adherence against natural motion.
- Set aspect_ratio early (e.g., 16:9 cinematic, 9:16 social) to avoid reframing.
- Set generate_audio=true when you want the scene to feel complete (ambience + SFX).

Is Kling 2.6 Pro text-to-video?
It’s primarily image-to-video: you provide image_url as the visual anchor, together with a detailed prompt.
Does it generate audio automatically?
Yes—set generate_audio to true to include native audio alongside the video.
What video lengths are supported?
duration accepts "5" or "10" (seconds).
How is it different from other image-to-video models?
Its standout feature is native audio generation combined with coherent, cinematic motion from a single image.
What parameters should I tweak first for best results?
Start with prompt + negative_prompt, then adjust cfg_scale, duration, and aspect_ratio.
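Before tweaking these parameters, it can help to check them client-side against the values listed on this page. The following is a minimal sketch, not the service's own validation: validate_params is a hypothetical helper, and it only checks keys that are present because the page does not state any defaults. Whether the API also accepts numeric durations is not stated here, so the value is compared as a string.

```python
ALLOWED_DURATIONS = {"5", "10"}                  # seconds, quoted as strings on this page
ALLOWED_ASPECT_RATIOS = {"16:9", "9:16", "1:1"}


def validate_params(params: dict) -> list:
    """Return a list of problems; an empty list means every present key passed."""
    problems = []
    if "duration" in params and str(params["duration"]) not in ALLOWED_DURATIONS:
        problems.append('duration must be "5" or "10"')
    if "aspect_ratio" in params and params["aspect_ratio"] not in ALLOWED_ASPECT_RATIOS:
        problems.append("aspect_ratio must be 16:9, 9:16, or 1:1")
    # cfg_scale is documented here as a value in the range 0 to 1.
    if "cfg_scale" in params and not 0.0 <= float(params["cfg_scale"]) <= 1.0:
        problems.append("cfg_scale must be between 0 and 1")
    # "pro" is the only mode this page mentions.
    if "mode" in params and params["mode"] != "pro":
        problems.append('mode must be "pro"')
    return problems


if __name__ == "__main__":
    good = {"duration": "10", "aspect_ratio": "9:16", "cfg_scale": 0.5, "mode": "pro"}
    bad = {"duration": "7", "cfg_scale": 1.5}
    print(validate_params(good))
    print(validate_params(bad))
```

Returning a list of messages rather than raising on the first failure lets a caller report every issue in a single pass.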
What mode should I use?
Use mode="pro" (the available option) for high-quality, polished outputs.