POST

javascript

const axios = require('axios');


const api_key = "YOUR API-KEY";
const url = "https://api.segmind.com/v1/gpt-image-1.5-edit";

const data = {
  "prompt": "A photorealistic wide drone shot of a colossal man (exact face/body from the reference) casually sitting across a London street, one knee raised, hand resting. He wears a navy overcoat, knit sweater, dark trousers, boots, and a minimalist beanie. Tiny cars, buses, bikes, and pedestrians move around him, with classic London red-brick buildings, black lamps, and cobblestone streets dwarfed by his size. Soft overcast London daylight highlights wet pavement.",
  "image_urls": "https://segmind-resources.s3.amazonaws.com/input/92a8e420-2d12-48c0-97f5-73c6c4820c34-black-man-image.jpeg",
  "size": "auto",
  "quality": "high",
  "background": "opaque",
  "output_compression": 100,
  "output_format": "png",
  "moderation": "auto"
};

(async function() {
    try {
        const response = await axios.post(url, data, { headers: { 'x-api-key': api_key } });
        console.log(response.data);
    } catch (error) {
        console.error('Error:', error.response.data);
    }
})();

RESPONSE

image/jpeg

HTTP Response Codes

200 - OKImage Generated

401 - UnauthorizedUser authentication failed

404 - Not FoundThe requested URL does not exist

405 - Method Not AllowedThe requested HTTP method is not allowed

406 - Not AcceptableNot enough credits

500 - Server ErrorServer had some issue with processing

Attributes

promptstr * Affects Pricing

Text prompt guides image generation. Example: 'A cyberpunk city skyline at night.'

image_urlsstr *

Links to reference images.

maskstr *

Image to be used as a base. Leave null to upload later.

sizeenum:str ( default: auto ) Affects Pricing

Choose image resolution. 'auto' balances speed and quality.

Allowed values:

qualityenum:str ( default: high ) Affects Pricing

Sets visual quality. 'auto' balances detail and performance.

Allowed values:

backgroundenum:str ( default: opaque )

Background type for the image. Transparent is useful for overlays.

Allowed values:

output_compressionint ( default: 95 )

Defines output image compression level. Use 100 for best quality.

output_formatenum:str ( default: png )

Specifies the output image format. Use 'png' for high-quality needs.

Allowed values:

moderationenum:str ( default: auto )

Sets moderation strictness. 'low' relaxes content restrictions.

Allowed values:

To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.

GPT Image 1.5 Edit: AI-Powered Image Editing Model

What is GPT Image 1.5 Edit?

GPT Image 1.5 Edit is OpenAI's multimodal AI model that transforms image editing workflows through natural language instructions. Instead of manual pixel manipulation, developers can now describe desired changes in plain English—"replace the blue sky with a sunset" or "remove the background and make it transparent"—and the model executes these edits with professional precision. Built for production environments, it offers zero-latency cold starts and REST API integration, making it ideal for high-volume creative pipelines, e-commerce platforms, and design automation tools.

Key Features

Natural Language Editing: Modify images using text prompts without complex photo editing software
Advanced Inpainting & Outpainting: Fill, extend, or remove image regions with context-aware generation
Multi-Image Style Consistency: Reference multiple images to maintain brand guidelines and visual coherence
Mask-Based Precision: Apply edits to specific areas using optional mask images for surgical accuracy
Flexible Output Control: Choose from multiple resolutions (1024x1024, 1536x1024, 1024x1536, auto), quality levels, and formats (PNG, JPEG, WebP)
Transparent Background Support: Generate overlay-ready assets for compositing and design workflows
Production-Ready API: Fast response times with configurable compression (1-100) and moderation settings

Best Use Cases

E-commerce & Product Photography: Swap backgrounds, retouch product images, adjust lighting, and generate lifestyle shots at scale.

Marketing & Content Creation: Edit campaign visuals, personalize ad creatives, and maintain brand consistency across multiple assets.

UI/UX Design: Rapidly prototype interface variations, edit mockups, and adjust visual elements without leaving design workflows.

Photography Post-Processing: Remove unwanted objects, extend image boundaries, adjust colors, and perform complex retouching operations.

Social Media Management: Adapt content for different platforms, create variations, and apply quick edits to archival photography.

Prompt Tips and Output Quality

Be Specific and Descriptive: Instead of "make it better," write "increase brightness by 20%, add warm color grading, and soften shadows."

Leverage Quality Parameters: Use quality: high for final deliverables; quality: auto balances speed and detail for iterative work.

Combine Masks with Prompts: Upload mask images to isolate edit zones—perfect for changing specific objects while preserving surroundings.

Reference Images for Style Transfer: Include multiple image_urls to guide aesthetic consistency, especially for brand-compliant edits.

Resolution Strategy: Choose size: auto for the model to determine optimal resolution, or specify exact dimensions for precise output requirements.

Format Selection: Use PNG for transparency needs and maximum quality; JPEG/WebP for smaller file sizes in web applications.

FAQs

How is GPT Image 1.5 Edit different from text-to-image models?
Unlike generative models that create images from scratch, GPT Image 1.5 Edit modifies existing images based on your instructions, preserving context and composition while applying targeted changes.

What parameters should I adjust for best results?
Start with quality: high and output_compression: 100 for professional work. Lower compression (80-95) reduces file size for web use. Use background: transparent when generating assets for compositing.

Can I edit multiple areas in a single API call?
Yes, through detailed prompts or mask images. Describe all changes in one comprehensive prompt (e.g., "remove person on left, change sky to sunset, brighten foreground"), or use masks to target specific regions.

Does the model support batch processing?
The REST API accepts individual requests. For batch editing, implement parallel API calls in your application logic to process multiple images simultaneously.

What image formats are supported as input?
The model accepts standard web formats via image_urls parameter (PNG, JPEG, WebP). Provide publicly accessible URLs for both base images and optional masks.

Is GPT Image 1.5 Edit suitable for automated workflows?
Absolutely. The zero-cold-start architecture and REST API integration make it perfect for real-time editing pipelines, content management systems, and automated design tools requiring consistent, fast responses.

Popular Models

SDXL Img2Img SDXL Img2Img is used for text-guided image-to-image translation. This model uses the weights from Stable Diffusion to generate new images from an input image using StableDiffusionImg2ImgPipeline from diffusers

Faceswap V2 Take a picture/gif and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training

SDXL Inpaint This model is capable of generating photo-realistic images given any text input, with the extra capability of inpainting the pictures by using a mask

Stable Diffusion XL 1.0 The SDXL model is the official upgrade to the v1.5 model. The model is released as open-source software