HiDream-I1 (Fast) Free Serverless API

POST

javascript

const axios = require('axios');
const FormData = require('form-data');


const api_key = "YOUR API-KEY";
const url = "https://api.segmind.com/v1/hidream-l1-fast";

const reqBody = {
  "seed": -1,
  "prompt": "a cute panda holding a sign that says \"Keep Calm and Keep Building\"",
  "model_type": "fast",
  "resolution": "1024 × 1024 (Square)",
  "speed_mode": "Lightly Juiced 🍊 (more consistent)",
  "output_format": "webp",
  "output_quality": 100
};

(async function() {
    try {
        const formData = new FormData();
        
        // Append regular fields
        for (const key in reqBody) {
            if (reqBody.hasOwnProperty(key)) {
                formData.append(key, reqBody[key]);
            }
        }

        // Convert and append images as Base64 if necessary
        
        
        const response = await axios.post(url, formData, {
            headers: {
                'x-api-key': api_key,
                ...formData.getHeaders()
            }
        });
        console.log(response.data);
    } catch (error) {
        console.error('Error:', error.response ? error.response.data : error.message);
    }
})();

RESPONSE

image/jpeg

HTTP Response Codes

200 - OKImage Generated

401 - UnauthorizedUser authentication failed

404 - Not FoundThe requested URL does not exist

405 - Method Not AllowedThe requested HTTP method is not allowed

406 - Not AcceptableNot enough credits

500 - Server ErrorServer had some issue with processing

Attributes

seedint ( default: -1 )

Seed (-1 for random)

promptstr *

Prompt

model_typeenum:str ( default: fast )

An enumeration.

Allowed values:

resolutionenum:str ( default: 1024 × 1024 (Square) )

Image resolution

Allowed values:

speed_modeenum:str ( default: Lightly Juiced 🍊 (more consistent) )

Quality vs Speed

Allowed values:

output_formatenum:str ( default: webp )

Output format.

Allowed values:

output_qualityint ( default: 100 )

Output image quality (for jpg and webp)

min : 1,

max : 100

To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.

Overview: HiDream-I1

HiDream-I1 is a state-of-the-art, open-source text-to-image model built for exceptional image generation quality, accurate prompt adherence, and broad commercial usability. It's designed for creators, developers, and researchers looking for high performance without licensing constraints.

Key Features

| Feature | Description | |-------------------------------|-------------| | Superior Image Quality | Consistently produces high-fidelity images across styles—photorealistic, cartoon, concept art, and more. Scores highly on the HPS v2.1 benchmark, which aligns with human aesthetic preferences. Great at rendering text within images. | | Best-in-Class Prompt Following | Achieves top-tier scores on GenEval and DPG benchmarks. Outperforms all other open-source models in prompt accuracy, ensuring precise visual outputs from user instructions. | | Open Source (MIT License) | Freely available for personal, academic, and commercial use. Ideal for developers and startups seeking to integrate a powerful model without licensing headaches. | | Commercial-Ready | Outputs can be used for business applications like product mockups, ads, UI/UX design, and content creation, without additional licensing requirements. | | Multiple Versions Available | Choose from: • Full – highest quality • Dev – quality-performance balance • Fast – optimized for real-time use |

Technical Highlights

| Component | Details | |------------------|---------| | Architecture | Based on Mixture of Experts (MoE) using a Diffusion Transformer (DiT) backbone for modular and efficient processing. | | Text Encoders | Integrates multiple encoders for richer semantic understanding: • OpenCLIP • OpenAI CLIP • T5-XXL • Llama-3.1-8B-Instruct | | Routing | Uses dynamic routing to selectively activate expert pathways based on the input prompt, boosting both quality and efficiency. |

Ideal Use Cases

Concept art and storyboarding
Product photography and eCommerce mockups
Graphic design and editorial images
Game asset creation
UI/UX prototyping with text-in-image requirements
Research and experimentation in generative AI

Popular Models

SDXL Img2Img SDXL Img2Img is used for text-guided image-to-image translation. This model uses the weights from Stable Diffusion to generate new images from an input image using StableDiffusionImg2ImgPipeline from diffusers

illusion-diffusion-hq Monster Labs QrCode ControlNet on top of SD Realistic Vision v5.1

Faceswap V2 Take a picture/gif and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training

Faceswap Take a picture/gif and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training