Advanced AI image generator with precise text rendering and editing, excelling in complex Chinese and English text layouts.
Gen Qwen Image is an advanced AI image generator designed for content creators and professionals needing precise text rendering and image editing. It leverages a 20 billion parameter Multimodal Diffusion Transformer (MMDiT) model, delivering exceptional quality particularly in complex Chinese and English text generation.
Gen Qwen Image is a 20 billion parameter AI image generation model that specializes in high-fidelity text rendering and image editing, particularly for complex Chinese and English text, based on the MMDiT architecture.
It supports proper stroke order, multi-line text layout, and paragraph-level semantic coherence, overcoming prior limitations in generating accurate Chinese characters within images.
Yes, it is released under the Apache 2.0 license, fully permitting commercial use with access to enhanced features suitable for professional content creation.
Gen Qwen Image can generate complex layouts including mixed-language multi-line text blocks and paragraphs, maintaining semantic and visual consistency.
It enables precise modifications to parts of generated images without affecting the rest, preserving overall coherence and detail.