POST

javascript

const axios = require('axios');

const fs = require('fs');
const path = require('path');

async function toB64(imgPath) {
    const data = fs.readFileSync(path.resolve(imgPath));
    return Buffer.from(data).toString('base64');
}

const api_key = "YOUR API-KEY";
const url = "https://api.segmind.com/v1/sd3-med-tile";

const data = {
  "prompt": "Anime style illustration of a girl wearing a suit. A moon in sky. In the background we see a big rain approaching.",
  "negative_prompt": "low quality,less details",
  "image": "toB64('https://segmind-sd-models.s3.amazonaws.com/display_images/sd3m-controlnet/sd3-tile.jpg')",
  "num_inference_steps": 20,
  "guidance_scale": 7,
  "seed": 698845,
  "samples": 1,
  "strength": 0.8,
  "sampler": "dpmpp_2m",
  "scheduler": "sgm_uniform",
  "base64": false
};

(async function() {
    try {
        const response = await axios.post(url, data, { headers: { 'x-api-key': api_key } });
        console.log(response.data);
    } catch (error) {
        console.error('Error:', error.response.data);
    }
})();

RESPONSE

image/jpeg

HTTP Response Codes

200 - OKImage Generated

401 - UnauthorizedUser authentication failed

404 - Not FoundThe requested URL does not exist

405 - Method Not AllowedThe requested HTTP method is not allowed

406 - Not AcceptableNot enough credits

500 - Server ErrorServer had some issue with processing

Attributes

promptstr *

Text prompt for image generation

negative_promptstr ( default: low quality,less details )

Negative text prompt to avoid certain qualities

imageimage *

Input image

num_inference_stepsint ( default: 20 )

Number of inference steps for image generation

min : 1,

max : 100

guidance_scalefloat ( default: 7 )

Guidance scale for image generation

min : 1,

max : 20

seedint ( default: 698845 )

Seed for random number generation

samplesint ( default: 1 )

Number of samples to generate

strengthfloat ( default: 0.8 )

Strength of the image transformation

min : 0,

max : 1

samplerenum:str ( default: dpmpp_2m )

Sampler for the image generation process

Allowed values:

schedulerenum:str ( default: sgm_uniform )

Scheduler for the image generation process

Allowed values:

base64bool ( default: 1 )

Base64 encoding of the output image

To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.

Stable Diffusion 3 (SD3) Tile ControlNet

SD3 Medium Tile ControlNet is an advanced deep learning model designed for generating detailed images based on textual prompts and tile-based input images. By using tiling techniques, this model can create coherent large-scale images with a high level of detail and consistency. SD3 Medium Tile ControlNet is ideal for scenarios requiring expansive and detailed visual outputs.

How to Use the Model?

Input Prompts: Provide a textual description of the desired image in the "Prompt" field.
Input Image: Upload an image to guide the generation process.
Negative Prompts: Indicate elements to exclude from the generation.
Inference Steps: Set the number of steps for the model to refine the image. More steps typically result in higher quality.
Strength: Adjust this to control how strongly the input image influences the generated output. Higher values will make the output more similar to the input tiles.
Seed: Define a seed value for reproducibility. Randomly generate seeds if consistency is not required.
Guidance Scale: Adjusts how closely the generated image follows the prompt. Higher values ensure the image aligns closely with the prompt.

How to Fine-Tune Outputs?

Fine-tuning the outputs can be achieved by adjusting several parameters:

Inference Steps: Increasing the number of steps (e.g., from 20 to 50) can generate finer details but at the cost of longer processing times.
Strength: Adjust the strength to control the influence of the input image. For minor adjustments, vary between 0.6 to 0.9. Lower values provide more creative freedom to the model.
Guidance Scale: Typically between 7 and 15. Use higher values for strict adherence to prompts and lower values for more abstract results.
Sampler: Different samplers (e.g., ddim, p_sampler) can affect the generation style and speed. Experiment with these to find the optimal balance for your use case.

Use Cases

SD3 Medium Tile ControlNet can be effectively used for various applications, such as:

Architectural Visualization: Generate detailed floorplans, facades, and landscape designs from textual descriptions and input tiles.
Game Design: Create expansive and coherent game maps and environments.
Graphic Design: Produce large-format graphics and posters with consistent detailing.
Marketing Materials: Develop intricate and high-quality visual content for marketing campaigns.

Popular Models

SDXL Controlnet SDXL ControlNet gives unprecedented control over text-to-image generation. SDXL ControlNet models Introduces the concept of conditioning inputs, which provide additional information to guide the image generation process

Faceswap V2 Take a picture/gif and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training

SDXL Inpaint This model is capable of generating photo-realistic images given any text input, with the extra capability of inpainting the pictures by using a mask

Codeformer CodeFormer is a robust face restoration algorithm for old photos or AI-generated faces.