简介
随着 Qwen-Image-2512 的发布,许多创作者都在问:它与广泛使用的 Z-Image Turbo 相比表现如何?具体来说,哪个模型在处理复杂指令、特定文本渲染和复杂细节方面更好?
为了找出答案,我们在 720x1280(竖屏)分辨率下,使用相同的种子(seed)和提示词(prompts)进行了直接的 A/B 测试。
测试工作流:
- 平台:zimage.run
- 设置: 模型 A:Qwen-Image-2512 模型 B:Z-Image Turbo 分辨率:720x1280
- 为什么选择这个工具:它托管了这两个模型,并允许免费、免登录测试,这使得复现这些结果变得非常容易。
5 组测试提示词与结果
以下是用于本次对比的确切提示词。您可以复制它们来亲自验证结果。 (注:左图 = Qwen-Image-2512,右图 = Z-Image Turbo)
1. 小丑 (纹理与光影)
关注点:物理皮肤纹理(干裂的妆容)和戏剧性的光影。
`An ultra-detailed, hyper-realistic extreme close-up portrait of The Joker. The frame is filled with his face in a tense three-quarter profile, capturing a moment of unsettling stillness. His skin is a grotesque canvas: a thick layer of caked, smeared white makeup cracks like dry earth, revealing sallow, scarred skin beneath. Crazed streaks of smudged red lipstick stretch far beyond his lips into a permanent, manic grimace. Toxic green hair, oily and unkempt, frames his face. The eyes are the focal point---hollow, dark-rimmed, and gleaming with a volatile mix of calculated madness and raw, chilling mirth. Every pore, every flake of peeling makeup, and the subtle, menacing tension in his jaw muscles are rendered in microscopic detail. Dramatic, chiaroscuro lighting from a single source casts deep shadows across his features, creating extreme contrast and amplifying the sinister, iconic atmosphere. Shot on a phantom high-speed camera, 8K resolution, with the texture and impact of a key film still from a psychological thriller. `

2. 网红与特定文本 (文本渲染)
关注点 :在霓虹灯牌上生成特定的文本字符串 [Qwen-Image-2512]。
`A stunning, intimate editorial portrait focused on the charismatic face of a 21-year-old blonde social media influencer. She flashes a playful, knowing smile while confidently pointing a manicured finger directly towards the sleek, glowing neon sign bearing the text "[Qwen-Image-2512]". Soft, directional natural light from a large window washes over her, creating a high-contrast interplay of light and shadow that sculpts her flawless features, sparkling eyes, and textured blonde hair. The atmosphere is modern, vibrant, and stylish, with a shallow depth of field that renders the chic, minimalist urban loft background into a soft, creamy bokeh, ensuring all focus remains on her engaging expression and the luminous sign. `

3. 蒸汽朋克大都市 (场景复杂度)
关注点:复杂竖屏场景中的细节密度与构图。
`A breathtaking cinematic masterpiece, ultra-wide panorama of a vast, multi-layered steampunk metropolis nestled within a colossal mountain canyon at sunrise. The city is a vertical labyrinth: towering Neo-Victorian spires with glowing clockwork faces, mid-level residential districts of brass and stained glass connected by buzzing aerial trams, and bustling lower streets where steam-carriages navigate cobblestone roads. The sky is dominated by a fleet of majestic brass-and-wood airships with canvas wings, some docking at skyscraper-sized clockwork towers, others departing alongside smaller personal ornithopters. Countless copper pipes and vents emit plumes of steam, catching the brilliant golden-hour light which creates long, dramatic shadows and glints off countless gears, glass domes, and polished brass. Victorian-clad citizens crowd grand plazas, market stalls, and intricate bridge networks, full of life. In the foreground, a massive, slowly-turning central gear and a cascading waterfall turned into a steam-powered generator add dynamic scale. The atmosphere is thick with hopeful industry, mist, and sunbeams, hyper-detailed, 8K, epic sense of scale and wonder. `

4. 宿舍房间 (氛围)
关注点:室内光线与特定物体(床、书桌)的摆放。
`A close-up, dynamic selfie of a 20-year-old American college student with long, flowing hair and a model's poised, athletic figure. She has a bright, confident smile and expressive eyes, capturing a moment of lively charm. She wears a casual yet stylish outfit, like a fitted university sweatshirt slipped off one shoulder. The photo is taken in a classic American dorm room: behind her, a cozy loft bed with school-branded blankets is visible, alongside a desk cluttered with textbooks, a laptop, and a poster-covered wall featuring a university flag or souvenir. Sunlight streams warmly through a nearby window, casting soft, natural light that highlights her features and the vibrant, youthful atmosphere. The image is sharp, clear, and full of life, embodying the authentic, energetic spirit of campus life. `

5. 新艺术运动 (风格迁移)
关注点:对阿尔丰斯·穆夏(Alphonse Mucha)艺术风格的遵循。
`A graceful Art Nouveau depiction of a "Winter Goddess." Flowing, organic lines frame intricate patterns of frost-kissed pine branches, holly berries, and delicate snowflakes woven into her hair and gown. Silver leaf accents glimmer like ice against a muted wintry palette of frosted blues, deep evergreen, and soft pearl white. In the style of Alphonse Mucha, the composition is highly decorative and ornamental, evoking the serene yet majestic beauty of a snow-blanketed forest. `

结论
基于这 5 项在 720x1280 分辨率下的测试,两个模型的对比结果如下:
- 指令遵循度: Qwen-Image-2512 倾向于更加**直白且粗砺(Hardcore)**。在小丑(Joker)的测试中,它严格遵循了"像干裂土地一样开裂"的指令,生成了高度纹理化、近乎发自内心的真实结果。Z-Image Turbo 虽然遵循了指令,但应用了一层美学柔化(磨皮),导致外观更加干净但质感稍逊。
- 文本渲染: 两个模型都成功理解了生成文本的请求。Z-Image Turbo 在霓虹灯牌上生成了清晰、连贯的字符,创造了令人信服的视觉效果。然而,Qwen-Image-2512 展现了更高的精确度,准确地拼写了特定的字符串,并按要求包含了标点符号。
- 画面丰富度: 在诸如蒸汽朋克大都市等复杂场景中,Qwen-Image-2512 在竖幅画面中填充了高密度的信息(纹理、背景齿轮)。Z-Image Turbo 优先考虑平衡的构图,通常会简化背景元素以保持焦点清晰。
亲自尝试: 您可以免费测试这两个模型(无需登录),看看哪个更适合您的风格: 👉 https://zimage.run