POST

javascript

const axios = require('axios');
const FormData = require('form-data');
const fs = require('fs');

const api_key = "YOUR API-KEY";
const url = "https://api.segmind.com/v1/multi-image-kontext-max";

const reqBody = {
  "seed": 42,
  "prompt": "put the green dress on the woman while maintaining the pose of the woman as it is",
  "aspect_ratio": "1:1",
  "input_image_1": "https://segmind-resources.s3.amazonaws.com/output/9cb479d3-5c5f-4d5d-a782-972acbc42598-c1.jpg",
  "input_image_2": "https://segmind-resources.s3.amazonaws.com/output/79feee7b-d09f-4bde-bec3-c8e8a8703f04-d2.png",
  "output_format": "jpg",
  "safety_tolerance": 1
};

(async function() {
    try {
        const formData = new FormData();

        // Append each request field as a form field
        for (const key in reqBody) {
            if (Object.prototype.hasOwnProperty.call(reqBody, key)) {
                formData.append(key, reqBody[key]);
            }
        }

        // Images are given as URLs here; convert local images to Base64 before appending if necessary

        const response = await axios.post(url, formData, {
            headers: {
                'x-api-key': api_key,
                ...formData.getHeaders()
            },
            // The response body is binary image data (image/jpeg), so request it as raw bytes
            responseType: 'arraybuffer'
        });
        // output.jpg is an arbitrary file name chosen for this example
        fs.writeFileSync('output.jpg', response.data);
        console.log('Saved output.jpg');
    } catch (error) {
        // With responseType 'arraybuffer', error bodies also arrive as bytes; decode them for logging
        const detail = error.response ? Buffer.from(error.response.data).toString('utf8') : error.message;
        console.error('Error:', detail);
    }
})();
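
The example mentions converting images to Base64 where necessary. A minimal sketch of that step for a local file follows. It assumes the endpoint accepts a Base64 string in place of an image URL, which the sample only hints at; imageFileToBase64 is a hypothetical helper name, not part of any library.

```javascript
const fs = require('fs');

// Hypothetical helper: read a local image file and return its bytes as a Base64 string.
// Assumption: the API accepts this string wherever an image URL is accepted.
function imageFileToBase64(filePath) {
  return fs.readFileSync(filePath).toString('base64');
}
```

With this in place, a field such as input_image_1 could be set to imageFileToBase64('./my-photo.jpg') (an illustrative path) before the form fields are appended.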

RESPONSE

image/jpeg

HTTP Response Codes

200 - OK: Image generated

401 - Unauthorized: User authentication failed

404 - Not Found: The requested URL does not exist

405 - Method Not Allowed: The requested HTTP method is not allowed

406 - Not Acceptable: Not enough credits

500 - Server Error: Server had some issue with processing
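
One way to use these codes in client code is a small lookup that turns a status into a readable message. This is only a sketch: the descriptions are copied from the list above, and describeStatus is a hypothetical helper name.

```javascript
// Status descriptions copied from the response code list above.
const STATUS_MESSAGES = {
  200: 'Image generated',
  401: 'User authentication failed',
  404: 'The requested URL does not exist',
  405: 'The requested HTTP method is not allowed',
  406: 'Not enough credits',
  500: 'Server had some issue with processing',
};

// Hypothetical helper: map a status code to a readable message,
// falling back to a generic message for codes not listed above.
function describeStatus(status) {
  return STATUS_MESSAGES[status] || `Unexpected status ${status}`;
}
```

Inside the catch block of an axios call, describeStatus(error.response.status) could be logged when error.response is present.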

Attributes

seed int ( default: 42 )

Sets seed for reproducibility.

prompt str *

Describes the prompt for image transformation.

aspect_ratio enum:str ( default: 16:9 )

Sets output aspect ratio. Use '16:9' for wide images.

Allowed values include: match_input_image, 1:1, 16:9, 9:16, 4:3, 3:2, 21:9, 9:21

input_image_1 str *

First image for transformation.

input_image_2 str *

Second image for transformation.

output_format enum:str ( default: jpg )

Sets the output format.

Allowed values: jpg, png

safety_tolerance int ( default: 1 )

Controls content safety level. Use 1 for moderate strictness.

min : 0,

max : 2
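
The constraints in this attribute list can be checked on the client before sending a request. The sketch below covers only what the list states (required fields marked with *, integer seed, safety_tolerance between 0 and 2); validateRequest is a hypothetical helper, and the server may apply further rules not shown here.

```javascript
// Hypothetical helper: return a list of problems found in a request body,
// based only on the constraints stated in the attribute list above.
function validateRequest(body) {
  const errors = [];
  // Fields marked with * are required strings.
  for (const key of ['prompt', 'input_image_1', 'input_image_2']) {
    if (typeof body[key] !== 'string' || body[key].length === 0) {
      errors.push(`${key} is required and must be a non-empty string`);
    }
  }
  // seed is optional (default 42) but must be an integer when given.
  if (body.seed !== undefined && !Number.isInteger(body.seed)) {
    errors.push('seed must be an integer');
  }
  // safety_tolerance is optional (default 1) with min 0 and max 2.
  const tol = body.safety_tolerance;
  if (tol !== undefined && (!Number.isInteger(tol) || tol < 0 || tol > 2)) {
    errors.push('safety_tolerance must be an integer between 0 and 2');
  }
  return errors;
}
```

An empty array means the body passed these checks; otherwise each entry names the field that needs attention.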

To track your credit usage, inspect the response headers of each API call. The x-remaining-credits header reports the number of credits remaining in your account. Monitor this value to avoid disruptions in your API usage.
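
A minimal sketch of reading that header from an axios response follows. It assumes the header value is numeric, which the text implies but does not state; remainingCredits is a hypothetical helper name.

```javascript
// Hypothetical helper: extract the remaining credit count from response headers.
// axios exposes response header names in lower case.
// Returns null when the header is missing or not numeric.
function remainingCredits(headers) {
  const raw = headers['x-remaining-credits'];
  if (raw === undefined || raw === null) return null;
  const value = Number(raw);
  return Number.isNaN(value) ? null : value;
}
```

After a successful call, remainingCredits(response.headers) could be logged or compared against a threshold of your choosing.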

FLUX.1 Kontext [max] – Image Generation and Editing Model

What is FLUX.1 Kontext [max]?

FLUX.1 Kontext [max] is an AI image generation and editing model from Black Forest Labs. Built on a multimodal transformer-diffusion architecture, it turns text prompts and one or more input images into photorealistic visuals with integrated typography. Developers, creators, and product managers rely on its fast inference for branding, editorial design, social media, and similar work, without extensive prompt tuning.

Key Features

• Multimodal Transformer-Diffusion
– Deep text understanding meets diffusion-based image synthesis for lifelike results.
• Dynamic Style Transfer
– Blend textures, colors, and forms from two inputs (input_image_1, input_image_2) in a single pass.
• Native Typography Integration
– Auto-place and style headlines, captions, and logos within generated imagery.
• Robust Prompt Comprehension
– Handles complex instructions (“A futuristic cityscape at dusk with neon typography”) out of the box.
• Aspect Ratio & Format Control
– Supports common ratios (1:1, 16:9, 9:16, 4:3, 21:9) and output_format choices (jpg, png).
• Reproducibility & Safety
– Set a seed (default 42) for consistent outputs and adjust safety_tolerance (0–2) to meet compliance needs.

Best Use Cases

Branding & Logo Creation: Generate cohesive brand assets with on-canvas typography.
Editorial & Magazine Layouts: Craft high-resolution visuals aligned with article styles.
Social Media Campaigns: Produce platform-specific formats (e.g., 9:16 for Stories, 1:1 for feeds).
Creative Prototyping: Rapidly iterate between hyperrealistic, retro, and avant-garde aesthetics.
Commercial Storytelling: Enhance product mockups, advertisements, and packaging design.

Prompt Tips and Output Quality

• Be Descriptive: Include setting, materials, lighting, and perspective.
• Specify Typography: Add font style descriptors (“bold serif”, “neon cursive”) for precise text integration.
• Use Technical Tags: Combine natural language with tags like #retro or #neon to hint style.
• Adjust Aspect Ratio Early: Choose aspect_ratio to match final medium (print, web, mobile).
• Control Reproducibility: Use seed to lock randomness and regenerate identical outputs.
• Match Format to Deliverables: Select jpg for smaller files or png when transparency is needed.
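
The tips above can be applied mechanically by assembling a prompt from its descriptive parts. The sketch below is one possible approach, not a required format: buildPrompt and its field names are hypothetical, and the model accepts any free-form prompt string.

```javascript
// Hypothetical helper: join the descriptive parts of a prompt, skipping any left empty,
// and append optional style tags such as #retro.
function buildPrompt({ subject, setting, materials, lighting, perspective, typography, tags = [] }) {
  const parts = [subject, setting, materials, lighting, perspective, typography]
    .filter((part) => typeof part === 'string' && part.trim().length > 0)
    .map((part) => part.trim());
  const tagText = tags.map((tag) => `#${tag}`).join(' ');
  return [parts.join(', '), tagText].filter(Boolean).join(' ');
}
```

The returned string can be used as the prompt field of the request body.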

FAQs

Q: How do I get the most photorealistic images?
A: Provide detailed prompts with lighting, camera angle, materials, and supply high-quality input images.

Q: Can I merge two source images?
A: Yes. Use input_image_1 and input_image_2 together; the model blends them via style-transfer techniques.

Q: Which aspect ratios are available?
A: Options include match_input_image, 1:1, 16:9, 9:16, 4:3, 3:2, 21:9, 9:21, and more. Choose based on your target platform.

Q: How is consistency maintained across runs?
A: Set the integer seed parameter; identical seeds yield reproducible results.

Q: Do I need lengthy prompt engineering?
A: No. FLUX.1 Kontext [max] excels at interpreting nuanced prompts with minimal iteration.
