POST

javascript

const axios = require('axios');


const api_key = "YOUR API-KEY";
const url = "https://api.segmind.com/v1/chroma";

const data = {
  "prompt": "Close-up portrait of a young knight in shining armor, holding a sword, set against a medieval castle background, dramatic lighting.",
  "negative_prompt": "low quality, ugly, deformed, blurry, bad anatomy, distorted, unrealistic",
  "width": 1024,
  "height": 1024,
  "aspect_ratio": "1:1 square 1024x1024",
  "cfg": 7,
  "sampler_name": "euler",
  "scheduler": "beta",
  "steps": 40,
  "seed": 123456789,
  "samples": 1,
  "image_format": "png",
  "image_quality": 95,
  "base64": false
};

(async function() {
    try {
        const response = await axios.post(url, data, { headers: { 'x-api-key': api_key } });
        console.log(response.data);
    } catch (error) {
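        // error.response is undefined on network failures, so fall back to the error message
        console.error('Error:', error.response ? error.response.data : error.message);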
    }
})();

RESPONSE

image/jpeg
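
Because the response is an image rather than JSON, the client has to treat the body as binary data. The following is a minimal sketch, assuming the endpoint returns raw image bytes when base64 is false (as the image/jpeg content type suggests). The helper names extensionFor and saveImageResponse are hypothetical and not part of the API.

```javascript
const fs = require('fs');
const path = require('path');

// Map a Content-Type header value to a file extension (hypothetical helper).
function extensionFor(contentType) {
  const type = String(contentType || '').split(';')[0].trim().toLowerCase();
  const known = { 'image/jpeg': 'jpg', 'image/png': 'png', 'image/webp': 'webp' };
  return known[type] || 'bin';
}

// Write the body of an axios-style response ({ data, headers }) to disk.
// Assumes the request was sent with { responseType: 'arraybuffer' } so that
// response.data holds raw bytes instead of a decoded string.
function saveImageResponse(response, basePath) {
  const ext = extensionFor(response.headers['content-type']);
  const filePath = path.resolve(`${basePath}.${ext}`);
  fs.writeFileSync(filePath, Buffer.from(response.data));
  return filePath;
}
```

With axios, the request options would then include responseType: 'arraybuffer' next to the x-api-key header, and the resolved response would be passed to saveImageResponse(response, 'knight').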

HTTP Response Codes

200 - OK: Image generated

401 - Unauthorized: User authentication failed

404 - Not Found: The requested URL does not exist

405 - Method Not Allowed: The requested HTTP method is not allowed

406 - Not Acceptable: Not enough credits

500 - Server Error: Server had some issue with processing
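
A client can translate these codes into actionable messages. The sketch below copies the meanings from the list above; the describeStatus helper and the choice to treat only 500 as retryable are assumptions made for illustration.

```javascript
// Status codes and meanings as listed in this section.
const STATUS_MESSAGES = {
  200: 'Image generated',
  401: 'User authentication failed',
  404: 'The requested URL does not exist',
  405: 'The requested HTTP method is not allowed',
  406: 'Not enough credits',
  500: 'Server had some issue with processing',
};

// Summarize a status code. Treating only 500 as retryable is an assumption,
// not documented API behavior.
function describeStatus(status) {
  return {
    ok: status === 200,
    retryable: status === 500,
    message: STATUS_MESSAGES[status] || `Unexpected status ${status}`,
  };
}
```

In an axios catch block, error.response.status (when error.response is defined) can be passed to describeStatus to decide whether to retry or to surface the message.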

Attributes

prompt: str (required)

Describes the scene to generate. Specific, detailed prompts yield richer images.

negative_prompt: str (default: low quality, ugly, deformed, blurry, bad anatomy, distorted, unrealistic)

Lists elements to exclude from the image, which keeps the focus clear. Useful for a professional look.

width: int (default: 1024)

Defines the image width in pixels; adjust for different display needs.

min: 768, max: 2048

height: int (default: 1024)

Sets the image height in pixels; balance it with the width to get the intended aspect ratio.

min: 768, max: 2048

aspect_ratio: enum:str (default: 1:1 square 1024x1024)

Selects the image shape; the square default fits social media platforms well.

cfg: float (default: 7)

Controls how closely the image follows the prompt; higher values mean stricter adherence.

min: 1, max: 20

sampler_name: enum:str (default: euler)

Selects the sampling method; 'euler' balances quality and speed.

scheduler: enum:str (default: beta)

Selects the noise schedule; 'beta' gives smooth transitions.

steps: int (default: 40)

Sets the number of denoising steps; more steps give finer detail.

min: 10, max: 75

seed: int (default: 123456789)

Fixes the random seed; set it for reproducible results.

samples: int (default: 1)

Sets the number of images to generate; increase it for more options.

image_format: enum:str (default: png)

Selects the output format; 'png' offers high quality.

image_quality: int (default: 95)

Sets the output image quality; the default of 95 preserves fine detail.

min: 1, max: 100

base64: bool (default: 1)

Returns the image as a base64 string; useful for embedding.
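
The documented ranges can be checked on the client before a request is sent, which avoids spending a round trip on an invalid payload. This is a sketch only: the numeric limits are copied from the attribute list above, while validateParams and its error wording are hypothetical.

```javascript
// Numeric limits copied from the attribute list; the rest is illustrative.
const LIMITS = {
  width: [768, 2048],
  height: [768, 2048],
  cfg: [1, 20],
  steps: [10, 75],
  image_quality: [1, 100],
};

// Return a list of human-readable problems; an empty list means the
// payload passed these basic checks.
function validateParams(params) {
  const errors = [];
  if (typeof params.prompt !== 'string' || params.prompt.trim() === '') {
    errors.push('prompt is required');
  }
  for (const [name, [min, max]] of Object.entries(LIMITS)) {
    const value = params[name];
    if (value === undefined) continue; // omitted, so the API default applies
    if (typeof value !== 'number' || Number.isNaN(value) || value < min || value > max) {
      errors.push(`${name} must be a number between ${min} and ${max}`);
    }
  }
  return errors;
}
```

Calling validateParams(data) on the request body before axios.post gives a chance to fix out-of-range values locally.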

To track your credit usage, inspect the response headers of each API call. The x-remaining-credits header reports the number of credits remaining in your account. Monitor this value to avoid disruptions in your API usage.
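
Reading that header from an axios-style response might look like the sketch below. The header name comes from the paragraph above; the helper names and the threshold of 10 credits are hypothetical.

```javascript
// Read the x-remaining-credits header from an axios-style response.
// Returns null when the header is missing or not numeric.
function remainingCredits(response) {
  const raw = response.headers['x-remaining-credits'];
  const credits = Number.parseFloat(raw);
  return Number.isNaN(credits) ? null : credits;
}

// Warn when the balance falls below a threshold. The default of 10 is an
// arbitrary illustrative value; pick one that suits your usage.
function warnIfLow(response, threshold = 10) {
  const credits = remainingCredits(response);
  if (credits !== null && credits < threshold) {
    console.warn(`Only ${credits} credits left`);
    return true;
  }
  return false;
}
```

Calling warnIfLow(response) after each successful request keeps the check close to where the credits are spent.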

Discovering the Power of the Chroma Model

Chroma is an open-source, 8.9-billion-parameter text-to-image model based on the FLUX.1-schnell architecture. It generates detailed, high-fidelity images from simple text prompts, and because it is open source, it leaves broad room for experimentation and creative freedom.

For developers, Chroma's efficient and stable architecture makes it practical to automate and streamline image workflows. Integrated into an existing pipeline through custom scripts or the API, the model can generate diverse visual assets at scale. Because it is open source, developers can also fine-tune it on their own datasets to build solutions tailored to specific business needs.

Creators such as artists and designers can shorten project timelines by using Chroma for rapid prototyping and asset generation. Artists can turn ideas described in natural language into vivid concepts, while marketing teams can generate unique campaign visuals instead of relying on stock images.

For executives, Chroma offers the potential to reduce the costs of traditional design processes and to improve ROI through original visual content. Its support for community-driven research also enables ongoing improvement and benchmarking within the diffusion model landscape.

In summary, Chroma is a versatile tool for text-to-image generation. With careful prompt engineering and quality control, users can work more creatively and efficiently across many domains.

Getting the most out of Chroma starts with prompt engineering and with parameters that match your creative goals. The following guidelines help you generate high-quality images across a range of use cases.

Prompt Engineering
• Be specific and descriptive: “A futuristic city skyline at sunset with neon reflections” yields richer results than “city.”
• Use style cues: mention artists, mediums, lighting, or color palettes (for example, “in the style of Impressionist oil painting”).
• Employ negative prompts to filter out unwanted artifacts (“low quality, blurry, deformed, unrealistic”).
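
The three tips above can be combined when assembling a request body. In this sketch, buildPrompt is a hypothetical helper; the prompt text and negative terms are the examples quoted in the list.

```javascript
// Join a subject with optional style cues into one comma-separated prompt.
function buildPrompt(subject, styleCues = []) {
  return [subject, ...styleCues]
    .map((part) => part.trim())
    .filter(Boolean) // drop empty fragments
    .join(', ');
}

const payload = {
  prompt: buildPrompt('A futuristic city skyline at sunset with neon reflections', [
    'in the style of Impressionist oil painting',
  ]),
  negative_prompt: ['low quality', 'blurry', 'deformed', 'unrealistic'].join(', '),
};
```

Keeping the subject and the style cues separate makes it easy to swap styles while iterating on the same scene.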

Core Parameters
• Width/Height: Choose values between 768 and 2048 px. For social media, a square (1:1, 1024×1024) works well; for portraits, try 896×1152 (7:9, close to 3:4); for landscapes, 1344×768 (7:4, close to 16:9).
• CFG Scale: Balances creativity vs. prompt fidelity. Set 5–7 for artistic exploration, 8–12 for photorealism, and up to 15 for maximum adherence on precise concepts.
• Steps: Number of denoising iterations. 20–30 for quick drafts, 40–50 for balanced detail, 60–75 for ultra-fine rendering.
• Sampler:
– “euler” or “euler_a” for speed and good quality
– “heun” or “lms” for smoother results
– “dpmpp_2s_a” or “dpmpp_sde” for highest fidelity
• Scheduler: “karras” and “beta” both give smooth noise scheduling; “exponential” can yield more stylized textures.
• Seed: Fix a seed for reproducible outputs, or leave blank for random variation.
• Samples: Increase to 3–5 to explore variations in one batch.
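
When picking a width and height, it can help to confirm which aspect ratio a resolution reduces to and whether it stays within the documented 768 to 2048 px range. The helpers below are an illustrative sketch and are not part of the API.

```javascript
// Greatest common divisor via Euclid's algorithm.
function gcd(a, b) {
  return b === 0 ? a : gcd(b, a % b);
}

// Reduce width x height to its simplest ratio, e.g. 1024x1024 -> "1:1".
function aspectRatio(width, height) {
  const divisor = gcd(width, height);
  return `${width / divisor}:${height / divisor}`;
}

// Both dimensions must stay within the documented 768-2048 px range.
function inRange(width, height) {
  return [width, height].every((value) => value >= 768 && value <= 2048);
}
```

For the resolutions suggested above, aspectRatio(896, 1152) returns "7:9" and aspectRatio(1344, 768) returns "7:4", which are close to, but not exactly, 3:4 and 16:9.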

Use-Case Recommendations

Photorealism (e-commerce products, architecture):
– Resolution: 1024×1024
– CFG: 10–12
– Steps: 50–60
– Sampler: dpmpp_2s_a, Scheduler: karras
Illustrative Art (comics, concept art):
– Resolution: 896×1152 (7:9, close to 3:4)
– CFG: 7–9
– Steps: 30–40
– Sampler: heun, Scheduler: exponential
Quick Prototyping (storyboards, mood boards):
– Resolution: 768×768
– CFG: 5–6
– Steps: 20–25
– Sampler: euler, Scheduler: beta
High-Detail Fine Art (prints, posters):
– Resolution: 2048×2048
– CFG: 12–15
– Steps: 60–75
– Sampler: dpmpp_sde, Scheduler: karras
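
These recommendations can be captured as reusable presets. The sketch below transcribes the values listed above; where the text gives a range, one value inside it is picked arbitrarily. PRESETS and buildRequest are hypothetical names, and the sampler and scheduler strings are taken from this guide on the assumption that the API accepts them.

```javascript
// Presets transcribed from the use-case recommendations. cfg and steps are
// arbitrary picks from inside each recommended range.
const PRESETS = {
  photorealism: { width: 1024, height: 1024, cfg: 11, steps: 55, sampler_name: 'dpmpp_2s_a', scheduler: 'karras' },
  illustrative: { width: 896, height: 1152, cfg: 8, steps: 35, sampler_name: 'heun', scheduler: 'exponential' },
  prototyping: { width: 768, height: 768, cfg: 5, steps: 20, sampler_name: 'euler', scheduler: 'beta' },
  fineArt: { width: 2048, height: 2048, cfg: 13, steps: 70, sampler_name: 'dpmpp_sde', scheduler: 'karras' },
};

// Build a request body from a preset; explicit overrides win over the preset.
function buildRequest(prompt, presetName, overrides = {}) {
  const preset = PRESETS[presetName];
  if (!preset) {
    throw new Error(`Unknown preset: ${presetName}`);
  }
  return { prompt, ...preset, ...overrides };
}
```

For example, buildRequest('a red sneaker on a white background', 'photorealism', { seed: 42 }) produces a body that can be passed directly to axios.post.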

Workflow Tips

• Iterate: start with a strong core prompt and refine with additional details or negative terms.
• Batch generation: use multiple samples to compare styles and pick the best.
• Post-processing: minor color correction or upscaling can polish final assets.
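
One way to iterate while keeping every variation reproducible is to generate one request body per seed, so that a favorite result can be regenerated later from its seed alone. The seedSweep helper is a hypothetical sketch, and forcing samples to 1 per request is a design choice made here so that each seed maps to exactly one image.

```javascript
// Produce one request body per consecutive seed, starting at startSeed.
// samples is set to 1 so every image corresponds to a single known seed.
function seedSweep(base, startSeed, count) {
  return Array.from({ length: count }, (_, i) => ({
    ...base,
    seed: startSeed + i,
    samples: 1,
  }));
}
```

Each body returned by seedSweep({ prompt: '...' }, 1000, 4) can be sent as a separate request, and the seed of the preferred image can then be reused with a refined prompt.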

By fine-tuning these parameters and iterating on your text prompts, you’ll unlock Chroma’s full creative power and produce visuals tailored to any project.
