1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
const axios = require('axios');
const fs = require('fs');
const path = require('path');
async function toB64(imgPath) {
const data = fs.readFileSync(path.resolve(imgPath));
return Buffer.from(data).toString('base64');
}
const api_key = "YOUR API-KEY";
const url = "https://api.segmind.com/v1/sd3-med-canny";
const data = {
"prompt": "Anime style illustration of a girl wearing a suit. A moon in sky. In the background we see a big rain approaching.",
"negative_prompt": "low quality,less details",
"image": "toB64('https://segmind-sd-models.s3.amazonaws.com/display_images/sd3m-controlnet/sd3-canny-ip.jpg')",
"num_inference_steps": 20,
"guidance_scale": 7,
"seed": 698845,
"samples": 1,
"strength": 0.8,
"sampler": "dpmpp_2m",
"scheduler": "sgm_uniform",
"base64": false
};
(async function() {
try {
const response = await axios.post(url, data, { headers: { 'x-api-key': api_key } });
console.log(response.data);
} catch (error) {
console.error('Error:', error.response.data);
}
})();
Text prompt for image generation
Negative text prompt to avoid certain qualities
Input image
Number of inference steps for image generation
min : 1,
max : 100
Guidance scale for image generation
min : 1,
max : 20
Seed for random number generation
Number of samples to generate
Strength of the image transformation
min : 0,
max : 1
Sampler for the image generation process
Allowed values:
Scheduler for the image generation process
Allowed values:
Base64 encoding of the output image
To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.
Stable Diffusion 3 (SD3) Medium Canny ControlNet generates high-quality images based on textual prompts and input images. It utilizes Canny edge detection to provide fine-grained control over the generated outputs. SD3 ControlNet enhances image coherence and detail, making it a powerful tool for various image generation tasks.
Input Prompts: Provide a textual description of the desired image in the "Prompt" field.
Input Image: Optionally, upload an image to guide the generation process.
Negative Prompts: Indicate elements to exclude from the generation.
Inference Steps: Set the number of steps for the model to refine the image. More steps typically result in higher quality.
Strength: Use this to blend the input image with the generated output. Values close to 1 emphasize the input image more.
Seed: Define a seed value for reproducibility. Randomly generate seeds if consistency is not required.
Guidance Scale: Adjusts how closely the generated image follows the prompt. Higher values ensure the image aligns closely with the prompt.
Fine-tuning the outputs can be achieved by adjusting several parameters:
Inference Steps: Increasing the number of steps (e.g., from 20 to 50) can generate finer details but at the cost of longer processing times.
Strength: Adjust the strength to control the influence of the input image. For minor adjustments, vary between 0.6 to 0.9. Lower values provide more creative freedom to the model.
Guidance Scale: Typically between 7 and 15. Use higher values for strict adherence to prompts and lower values for more abstract results.
Sampler: Different samplers (e.g., ddim, p_sampler) can affect the generation style and speed. Experiment with these to find the optimal balance for your use case.
SD3 Medium Canny ControlNet is versatile and can be applied to numerous scenarios:
Artistic Image Creation: Generate unique artwork based on textual prompts and rough sketches.
Design Prototyping: Quickly produce visual prototypes for product designs.
Story Illustration: Create coherent images to accompany literary descriptions.
Advertising: Generate tailored visuals for marketing materials.