Click or Drag-n-Drop
PNG, JPG or GIF, Up-to 2048 x 2048 px
Click or Drag-n-Drop
PNG, JPG or GIF, Up-to 2048 x 2048 px
Click or Drag-n-Drop
PNG, JPG or GIF, Up-to 2048 x 2048 px

Edited by Segmind Team on November 27, 2025.
Qwen-Image-Edit-2509-Photous is an advanced AI model that has been perfected using Qwen/Qwen-Image-Edit-2509. It is a major upgrade in the realm of image-to-image models as it addresses and solves the daunting task of producing cohesive and lifelike group photos from separate individual portraits. This variant ensures stable facial features and seamless scene integration, which was often lacking in the original Qwen model, which would produce visual results with inconsistent characters in multi-image compositions. Qwen-Image-Edit-2509-Photous is developed using diffusers and LoRA adapters, making it useful for creators and developers focused on building photo editing software, social media applications, and AI-driven platforms for visual storytelling. Additionally, it is capable of incorporating a subtle vintage touch through artistic grain and nostalgic tones while maintaining the clarity and structure of the original visuals.
group_photo LoRA and supports additional custom LoRA URLs.Effective Prompt Structure: The model responds best to natural language describing mood, setting, and composition; therefore, using descriptive scene-setting prompts like "Create a group photo with friends smiling under a sunset" renders precise and high-quality results instead of prompts that contain technical instructions.
Image Input Best Practices: The model accepts 1-3 images: use all three inputs for richer group compositions. Also, upload high-resolution, well-lit portraits with clear facial features while ensuring consistent lighting and orientation across input portraits for cohesive results across multiple iterations.
Parameter Optimization:
match_input_image to preserve original dimensions or select 16:9 for cinematic group shots.-1 for creative variation.group_photo LoRA, then experiment with custom LoRA URLs for style variations.Camera Angles: Include "Rotate the camera for a dynamic angle" in prompts for portrait-style edits to add depth and professional framing.
Is Qwen-Image-Edit-2509-Photous open-source?
The base Qwen/Qwen-Image-Edit-2509 model is open-source, but this fine-tuned variant (valiantcat/Qwen-Image-Edit-2509-photous) is hosted on Segmind's API platform; you can access it via API without local installation.
How does it differ from the original Qwen-Edit-2509?
Qwen-Edit-2509 struggles with character consistency in multi-image composites. On the other hand, Qwen-Image-Edit-2509-Photous is fine-tuned to address that limitation, adding specialized training for facial coherence and vintage photo effects optimized for group scenes.
What parameters should I tweak for the best results?
Focus on three essential parameters:
Can I use more than 3 input images?
At present, Qwen-Image-Edit-2509-Photous can only accept 3 simultaneous inputs (image_1, image_2, image_3). For larger groups, you can composite outputs iteratively or use batch processing with strategic input rotation.
What's the difference between JPEG, PNG, and WebP outputs?
JPEG offers the smallest file sizes for photographic content; PNG supports transparency (useful for overlays); WebP provides superior compression with high quality, making it ideal for web applications.
How do custom LoRA URLs work?
After selecting the base group_photo LoRA, add lora_2_url or lora_3_url parameters pointing to Hugging Face LoRA checkpoint URLs. These layers add additional stylistic effects like specific film stocks or artistic filters on top of the group photo optimization.