<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Releases @ Draw Things]]></title><description><![CDATA[Talks about Draw Things releases.]]></description><link>https://releases.drawthings.ai</link><image><url>https://substackcdn.com/image/fetch/$s_!z_ZD!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f0328e-c1ab-411f-9649-a4dfd7c935b3_1024x1024.png</url><title>Releases @ Draw Things</title><link>https://releases.drawthings.ai</link></image><generator>Substack</generator><lastBuildDate>Thu, 09 Apr 2026 20:30:31 GMT</lastBuildDate><atom:link href="https://releases.drawthings.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Engineering @ Draw Things]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[releasesdrawthings@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[releasesdrawthings@substack.com]]></itunes:email><itunes:name><![CDATA[Authors of Draw Things]]></itunes:name></itunes:owner><itunes:author><![CDATA[Authors of Draw Things]]></itunes:author><googleplay:owner><![CDATA[releasesdrawthings@substack.com]]></googleplay:owner><googleplay:email><![CDATA[releasesdrawthings@substack.com]]></googleplay:email><googleplay:author><![CDATA[Authors of Draw Things]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Introducing MediaGenerationKit: Hybrid Media Generation for Production Apps]]></title><description><![CDATA[A Swift package for integrating open-weight image and video models, with the flexibility to run locally or through the cloud.]]></description><link>https://releases.drawthings.ai/p/introducing-mediagenerationkit-hybrid</link><guid isPermaLink="false">https://releases.drawthings.ai/p/introducing-mediagenerationkit-hybrid</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Wed, 01 Apr 2026 23:11:33 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!z_ZD!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f0328e-c1ab-411f-9649-a4dfd7c935b3_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Draw Things app started as a local-only solution for running image and video generation tasks efficiently on Apple Silicon. Over the years, we have focused on the quality of generation, the efficiency of inference, and, more recently, on how to enable people to <a href="https://releases.drawthings.ai/p/privacy-pass-and-draw-things-cloud">safely and securely offload their generation tasks</a> to our Cloud when their devices are too weak.</p><p>There are many providers nowadays offering model inference as a service. Most of them provide TypeScript SDKs, and some of them provide Swift packages. None of them offer hybrid local and remote generation support, which is increasingly essential for customers trying to balance reach, quality, and cost.</p><p>Today, we are releasing a preview of MediaGenerationKit, a Swift package that allows you to integrate the latest open-weight media generation models into your app. You can choose to run them locally, host inference stack on your own Cloud or through our Cloud. For model selection, you can directly use a Hugging Face link to reference a model, or use one of our internal model identifiers. It supports the same comprehensive generation configurations that the Draw Things app supports.</p><p>For a simple example, it is really just a few lines of code:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;swift&quot;,&quot;nodeId&quot;:&quot;bf649896-4cdc-4bfa-a4f0-5faac50cf916&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-swift">import Foundation
import MediaGenerationKit

@main
struct ExampleApp {
  static func main() async throws {
    try await MediaGenerationEnvironment.default.ensure(
      "flux_2_klein_4b_q8p.ckpt"
    )

    var pipeline = try await MediaGenerationPipeline.fromPretrained(
      "flux_2_klein_4b_q8p.ckpt",
      backend: .local
    )

    pipeline.configuration.width = 1024
    pipeline.configuration.height = 1024
    pipeline.configuration.steps = 4

    let results = try await pipeline.generate(
      prompt: "a cat in studio lighting",
      negativePrompt: ""
    )

    try results[0].write(
      to: URL(fileURLWithPath: "/tmp/cat.png"),
      type: .png
    )
  }
}</code></pre></div><p>Unlike <a href="https://releases.drawthings.ai/p/draw-things-cli-local-media-generation">draw-things-cli</a>, MediaGenerationKit allows you to offload generation tasks either to your local gRPC server or to our Cloud Compute backend. For our Cloud Compute backend, you can request an API key with your existing Draw Things+ account, or purchase one, at <a href="https://api.drawthings.ai/dashboard">api.drawthings.ai/dashboard</a>. The default API key supports up to 20 generation tasks per month for the Free tier and up to 200 generation tasks per month for the Draw Things+ tier. Beyond that, you would use the pay-as-you-go API flow.</p><p>You can integrate MediaGenerationKit today using Swift Package Manager:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;swift&quot;,&quot;nodeId&quot;:&quot;4aaef7ae-59a4-485e-9c9e-dac29140aed0&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-swift">import PackageDescription

let package = Package(
  name: "MyApp",
  dependencies: [
    .package(
      url: "https://github.com/drawthingsai/media-generation-kit.git",
      revision: "37e0b70092ad1a0c1d4c6b24f16e17b73f1b1fb3"
    )
  ],
  targets: [
    .executableTarget(
      name: "MyApp",
      dependencies: [
        .product(name: "MediaGenerationKit", package: "media-generation-kit")
      ]
    )
  ]
)</code></pre></div><p>To learn more about the project, read the README at <a href="https://github.com/drawthingsai/media-generation-kit">https://github.com/drawthingsai/media-generation-kit</a></p>]]></content:encoded></item><item><title><![CDATA[Metal Quantized Attention: pulling M5 Max ahead with Int8 matrix multiplication]]></title><description><![CDATA[M5 Max is the biggest AI performance jump we have seen on Apple Silicon. Our latest release pushes real-world performance further with Metal Quantized Attention and fused Int8 matrix multiplication.]]></description><link>https://releases.drawthings.ai/p/metal-quantized-attention-pulling</link><guid isPermaLink="false">https://releases.drawthings.ai/p/metal-quantized-attention-pulling</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Wed, 01 Apr 2026 00:52:38 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!s4TM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Release of the M5 Max generated a great deal of goodwill in the AI community. We measured a 3.3&#215; speed-up over the M4 Max out of the box, a level of performance we have not seen before on Apple Silicon.</p><p>At the same time, if you look at <a href="https://images.nvidia.com/aem-dam/Solutions/geforce/blackwell/nvidia-rtx-blackwell-gpu-architecture.pdf">datasheets</a> of their direct competitors, especially for the RTX 5080 Mobile, the picture is more nuanced. For AI workloads such as LLM prefilling and image generation, the RTX 5080 Mobile still advertises roughly 320 TFLOPs of peak compute. By comparison, our most efficient FP16 matrix multiplication shader on the M5 Max delivers about 60 TFLOPs. Part of that gap is a necessary trade-off for a thinner, quieter, and more elegant design. Another part is that the advertised ~320 TFLOPs figure for the RTX 5080 Mobile is based on FP8 matrix multiplication, which is a lower-precision format than FP16.</p><p>With <a href="https://apps.apple.com/us/app/draw-things-ai-generation/id6444050820">Draw Things v1.20260330.0</a>, we are releasing Metal Quantized Attention to help close that gap. In real-world AI workloads such as image and video generation, our shaders now regularly reach around 110 TFLOPs on M5 Max. That brings Apple Silicon closer than ever to the RTX 5080 Mobile in practical terms, assuming thermal limits allow that device to sustain its peak performance.</p><h3>Metal Quantized Attention</h3><p>Metal Quantized Attention is our implementation of a low-bit attention operator written in Metal compute shaders. It takes FP16 or BF16 query, key, and value tensors, performs online quantization to Int8, and carries out the attention computation almost entirely in Int8. Queries and keys use row-group-wise scale quantization, while values use row-wise affine quantization.</p><p>In practice, this delivers a 1.24&#215; to 1.41&#215; speed-up over our Metal Flash Attention shader, which is already our fastest attention shader on M5 with Neural Accelerators.</p><h3>Int8 Matrix Multiplication with Row-wise Scales</h3><p>We also did a careful analysis of the quality impact of Int8 math with row-wise scales and found it to be an acceptable trade-off relative to the full-precision base model. In our testing, it is comparable to, and often better than, our 6-bit palettized quantization scheme.</p><p>The fused shader handles row-wise dynamic activation quantization, Int8-to-Int8 matrix multiplication, and post-accumulation dequantization together. End to end, our dynamic quantization + Int8 matrix multiplication + dequantization fused kernel is about 1.61&#215; to 1.87&#215; faster than our FP16 baseline.</p><h3>End-to-End Speed-up</h3><p>In v1.20260330.0, we enabled Metal Quantized Attention by default on M5 devices. You can still turn it off in Machine Settings. This feature applies across all models. Int8 matrix multiplication applies only to the 8-bit S models we recently added.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!s4TM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!s4TM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png 424w, https://substackcdn.com/image/fetch/$s_!s4TM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png 848w, https://substackcdn.com/image/fetch/$s_!s4TM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png 1272w, https://substackcdn.com/image/fetch/$s_!s4TM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!s4TM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png" width="1456" height="680" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:680,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!s4TM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png 424w, https://substackcdn.com/image/fetch/$s_!s4TM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png 848w, https://substackcdn.com/image/fetch/$s_!s4TM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png 1272w, https://substackcdn.com/image/fetch/$s_!s4TM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b7ff009-e756-4bcd-8a4b-0b04f9c1a482_2048x956.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>End to end, these changes deliver a 1.19&#215; to 1.76&#215; speed-up over v1.20260323.0. For a single sampling step alone, excluding model loading, text encoding, image decoding, and model lowering, the improvement ranges from 1.43&#215; to 1.95&#215;.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Gn4V!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Gn4V!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png 424w, https://substackcdn.com/image/fetch/$s_!Gn4V!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png 848w, https://substackcdn.com/image/fetch/$s_!Gn4V!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png 1272w, https://substackcdn.com/image/fetch/$s_!Gn4V!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Gn4V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png" width="1456" height="680" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:680,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Gn4V!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png 424w, https://substackcdn.com/image/fetch/$s_!Gn4V!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png 848w, https://substackcdn.com/image/fetch/$s_!Gn4V!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png 1272w, https://substackcdn.com/image/fetch/$s_!Gn4V!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1f41711-d475-46e9-9aba-91b5ad859c6d_2048x956.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>Pulling Ahead over Other Implementations</h3><p>Metal Quantized Attention and fused Int8 matrix multiplication push our implementation well ahead of other Apple Silicon implementations for the same task. In our testing, we have seen performance ranging from 1.24&#215; to 2.57&#215; faster than the next best alternatives, including <a href="http://github.com/filipstrand/mflux">mflux</a>, <a href="https://ollama.com/blog/image-generation">Ollama</a>, <a href="https://github.com/antirez/iris.c">iris.c</a>, and <a href="https://github.com/Blaizzy/mlx-video">mlx-video</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!NYxp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!NYxp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png 424w, https://substackcdn.com/image/fetch/$s_!NYxp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png 848w, https://substackcdn.com/image/fetch/$s_!NYxp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png 1272w, https://substackcdn.com/image/fetch/$s_!NYxp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!NYxp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png" width="1456" height="841" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:841,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!NYxp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png 424w, https://substackcdn.com/image/fetch/$s_!NYxp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png 848w, https://substackcdn.com/image/fetch/$s_!NYxp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png 1272w, https://substackcdn.com/image/fetch/$s_!NYxp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcc80ac94-d1a1-4c43-b0eb-f1b0474ed664_1770x1022.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As always, our shader work is <a href="https://github.com/liuliu/ccv/tree/unstable/lib/nnc/mfa">publicly available</a> under the FreeBSD 3-Clause License, and our inference stack is <a href="https://github.com/drawthingsai/draw-things-community">publicly available</a> under the LGPLv3 license. For local media generation tasks with your agent, you can also use <code>draw-things-cli</code>.</p>]]></content:encoded></item><item><title><![CDATA[Draw Things Test Set: a status update]]></title><description><![CDATA[Today&#8217;s image models are powerful, but where do they still break? In building the Draw Things Test Set, we found both recurring failure modes and a few surprises.]]></description><link>https://releases.drawthings.ai/p/draw-things-test-set-a-status-update</link><guid isPermaLink="false">https://releases.drawthings.ai/p/draw-things-test-set-a-status-update</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Thu, 26 Mar 2026 22:11:01 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!C2xx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbae24a37-d510-4fb7-a411-6affb201daea_768x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The new generation of image models, circa 2025, can do a remarkable range of things. They can modify elements within an image, generate photorealistic scenes, change style and lighting, and make many Photoshop-like tasks surprisingly easy. But where exactly are the boundaries of these models? Where do they fail, and how do they fail?</p><p>Draw Things Test Set is our attempt to answer those questions.</p><p>A few guiding principles shaped this Test Set and made it interesting to a broader audience, which is why we want to share an early status update.</p><ol><li><p>We focused on practical use cases. People use Draw Things for all kinds of work, both professional and recreational. We want the Test Set to reflect those real-world uses, and to provide guardrails so that future model development does not regress on the capabilities people actually care about.</p></li><li><p>We focused on differentiating model capabilities in a way that clearly shows the progress made over the past few years. Rather than nitpicking aesthetics, we want the differences between models to be obvious, without depending on highly trained human taste to decide which result is only marginally better.</p></li><li><p>We wanted to map where model capabilities could go next, and what remains missing from the current generation. Ideally, the next wave of models will begin to close some of these gaps.</p></li></ol><p>We ran all models on our own inference stack, which gives each one a fair playground. That allows us to fix parameters, keep results reproducible, and optimize each setup appropriately for the model being tested.</p><p>We plan to release the Calibrated Test Set publicly while keeping the full Test Set private to avoid saturation. Today, I want to share some of the findings that emerged during its construction. While I will highlight a few failure cases from closed-source models, those results may not be reproducible on your end. For models available in the Draw Things app, however, the results are reproducible. For each prompt, we select the best result across multiple trials, spanning different aspect ratios (1:1, 2:3, 3:2, 3:4, and 4:3) and resolutions.</p><h3>1. Anatomy is still an issue</h3><p>Contrary to common belief, anatomy is still a problem for the current generation of models, especially in complex scenes with conflicting instructions.</p><p><em>Prompt: A wide-angle realistic photo of a woman holding a steady yoga headstand on a mat. A grey tabby cat is curled up sleeping on her vertically upturned feet, while a small child in the foreground is carefully stacking a precarious tower of three wooden toy blocks on the woman&#8217;s horizontal stomach. The scene requires a clear void between the woman&#8217;s torso and the floor, with distinct points of contact for both the cat and the blocks.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bae24a37-d510-4fb7-a411-6affb201daea_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2c3e7746-0688-49d3-9723-388d08941935_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/20f4d525-11e9-492b-b729-844f7a8d80e4_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5c527f40-b1cb-489b-857a-da6d11014cd9_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/085723ec-05cd-49d2-95eb-8d671452e2e0_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a48f7a07-9374-4915-b2d6-2fce8d90f07f_2752x1536.png&quot;}],&quot;caption&quot;:&quot;1. FLUX.2 [dev]; 2. FLUX.2 [klein] 9B; 3. FLUX.2 [klein] 9B Base; 4. Z-Image Turbo; 5. Qwen Image 2512; 6. Nano Banana 2.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/261069dd-61eb-4370-97e5-1c8699475046_1456x964.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p>Even in simpler cases, larger models still tend to hold an advantage over smaller ones. Distilled models in particular can show more anatomical issues than their base counterparts.</p><p><em>Prompt: A person throwing a red ball to a dog in a sunny park. The ball is in mid-air between the person&#8217;s extended arm and the jumping dog.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/92713c36-75f1-43f6-a99b-85e496f3527a_896x1152.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f53abb2b-01b5-45c1-809b-32992446da88_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1a7dbf9a-375f-4364-9cf8-180b104f204c_1152x896.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d495dd60-98b4-4cd4-9243-a60688a909b8_1024x1024.png&quot;}],&quot;caption&quot;:&quot;1. FLUX.2 [dev]; 2. FLUX.2 [klein] 9B; 3. FLUX.2 [klein] 9B Base; 4. FLUX.2 [klein] 4B Base.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0bdbca2f-df28-492a-9a3b-419fa70996fd_1456x1456.png&quot;}},&quot;isEditorNode&quot;:true}"></div><h3>2. Knowledge distribution is uneven</h3><p>Diffusion Transformer models arguably contain much more world knowledge than earlier U-Net-based ones, and pairing them with a real LLM certainly helps. But the knowledge distribution is still uneven, and the gaps between models remain obvious.</p><p><em>Prompt: A high-resolution product photo of a 1998 Apple iMac G3 in Bondi Blue. The shot emphasizes the translucent pinstriped plastic casing, clearly revealing the internal metal CRT shielding and vacuum tube components through the shell. It includes the matching hockey puck circular mouse and the translucent keyboard. The active screen displays the Mac OS 8 desktop with classic Platinum-style icons.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9b479257-720a-4a2b-91ef-a719b3ddc836_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3de1ea7d-7554-4fe7-8b33-566409293632_1024x1024.png&quot;}],&quot;caption&quot;:&quot;FLUX.2 [dev] obviously know what Mac and the OS should look like while Qwen Image 2512 knew neither.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f46b2aa0-35ba-4070-8505-f21046014e26_1456x720.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p><em>Prompt: A cinematic low-angle shot of a Lockheed SR-71 Blackbird on a wet tarmac at dawn. The image must capture the corrugated, ribbed texture of the matte-black titanium skin. The focus is on the Pratt and Whitney J58 engine nacelles, featuring the sharp, conical movable intake spikes. Small puddles of red JP-7 fuel are visible leaking onto the ground beneath the fuselage, reflecting the Skunk Works logo on the tail fins.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cf5bf66f-8c13-40c0-b890-f57307080ad5_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bb4477ed-d895-43e8-bc08-3e8156bd23af_768x1536.png&quot;}],&quot;caption&quot;:&quot;FLUX.2 [dev] knew very little about SR-71 while Qwen Image 2512 in this case, have the jet engine done correctly.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e8cc3477-164e-4598-bb40-48eb78d4280a_1456x720.png&quot;}},&quot;isEditorNode&quot;:true}"></div><h3>3. Physics and math are still challenging</h3><p>Models appear to have some understanding of physics, but that understanding is still shallow.</p><p><em>Prompt: A realistic photograph of exactly 7 sheep standing on a green field, each sheep fully and clearly visible and individually countable.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3cb497a3-a588-4c0f-8c9b-22b9cfcc02be_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/17b122cd-0ce0-4ec3-9408-e2cee46960b2_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/787647c7-b166-40f6-84e4-4bd0a80e27bd_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0a820da6-01e8-478e-bebf-a1454f38f1c6_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4bb874bc-2bf7-47cf-8f21-97cfce543f96_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4e3580c0-7b64-400b-8d19-ac8e34cc9a01_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dd47264a-eb35-4f9c-a9c7-b679ede7b8c1_768x1536.png&quot;}],&quot;caption&quot;:&quot;1. FLUX.2 [dev]; 2. FLUX.2 [klein] 9B; 3. FLUX.2 [klein] 9B Base; 4. FLUX.2 [klein] 4B; 5. FLUX.2 [klein] 4B Base; 6. Z-Image Turbo; 7. Qwen Image 2512.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7b4ae7b9-5c76-47bd-a753-d2441f0589e5_1456x1946.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p><em>Prompt: A ball bouncing, shown at three points in its trajectory: the moment of impact with the ground (slightly compressed), mid-bounce upward, and at the peak of the bounce. Motion blur should be appropriate for each stage.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f7f2fcaa-a55d-4d35-899c-e7b80be379ac_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a1a6e00a-4f52-465e-8ab2-4779961359b5_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/751d074e-f1cd-468a-a231-4bb6e59ddd3b_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f2d596b-3054-4de0-9b17-c5ca09c4d1fb_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e0149dc8-87c4-4688-b234-2530466a7438_1152x896.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f3761de3-6a77-4603-88d2-1a210be0ce8c_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3f94f931-a4d5-4542-92c7-7b471e56e1cf_1536x768.png&quot;}],&quot;caption&quot;:&quot;1. FLUX.2 [dev]; 2. FLUX.2 [klein] 9B; 3. FLUX.2 [klein] 9B Base; 4. FLUX.2 [klein] 4B; 5. FLUX.2 [klein] 4B Base; 6. Z-Image Turbo; 7. Qwen Image 2512.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/af985767-e52e-4a13-b49c-95b05b5817f1_1456x1946.png&quot;}},&quot;isEditorNode&quot;:true}"></div><h3>4. Common sense is hard</h3><p>By &#8220;common sense,&#8221; we mean the things left unsaid but still necessary for the image to make sense.</p><p><em>Prompt: The exact moment of collision between two cars in an intersection. One red sedan T-boning a blue SUV. The image must show: both front airbags deploying, broken glass mid-shatter, the deformation of metal, and appropriate motion blur on debris.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/afc95063-fded-48c6-8cc2-cac0efa20353_1152x896.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/351c6d96-fc5b-4db7-ac98-aa66f923292e_896x1152.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/652f7b7d-94f3-48d0-a652-33741a7bbc62_1152x896.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c38cf558-7ad6-4fd4-ba40-8751879231be_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bf8e2cd8-84dd-4a5a-8b41-cfcb02762f86_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4bdd04ed-d1d2-4cdd-91a3-3ed255462f33_1152x896.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7bfa9838-1d3d-4c16-8014-52ecbad4b5c3_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/885a66a0-8bfa-4a37-9f57-c5791ebc2762_2816x1536.png&quot;}],&quot;caption&quot;:&quot;1. FLUX.2 [dev]; 2. FLUX.2 [klein] 9B; 3. FLUX.2 [klein] 9B Base; 4. FLUX.2 [klein] 4B; 5. FLUX.2 [klein] 4B Base; 6. Z-Image Turbo; 7. Qwen Image 2512; 8. Nano Banana 2.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/05a0d1d9-8286-40e3-ad4c-cbb60c0a4f3b_1456x1700.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p><em>Prompt: A wine glass that has just been knocked off a table, captured 0.5 seconds after leaving the edge. The glass should be tilted at a realistic angle for its falling trajectory, with wine beginning to spill out following gravity but not yet scattered.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7c7e2ab6-c1f1-41e5-a3fd-4a90872a2291_896x1152.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7653c699-3d92-4b05-b324-1c4f8c9cf8a2_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/31de881e-a13b-4b16-8c7a-a890eb3dc04d_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c6d53de9-b218-47f5-95bf-f186c6d08dab_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/75275a7a-41f7-47a6-8213-6856ab33bad0_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9ddf70f6-cc9d-4caa-88a0-95c545865a5c_896x1152.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8406ff84-18a3-479d-9256-99593111421e_768x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/66d85102-af0f-42fd-b48c-9d75611594ef_2816x1536.png&quot;}],&quot;caption&quot;:&quot;1. FLUX.2 [dev]; 2. FLUX.2 [klein] 9B; 3. FLUX.2 [klein] 9B Base; 4. FLUX.2 [klein] 4B; 5. FLUX.2 [klein] 4B Base; 6. Z-Image Turbo; 7. Qwen Image 2512; 8. Nano Banana 2.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/43d9602e-148c-4c35-bd1b-f9b1293f0ff0_1456x1700.png&quot;}},&quot;isEditorNode&quot;:true}"></div><h3>5. LLMs are unreliable raters</h3><p>It is tempting to use an LLM as a judge to automate model evaluation. In practice, that is unreliable.</p><p>An LLM often makes the same mistake the image model makes, which can artificially inflate some models&#8217; scores more than others. That is one of the main reasons we want to release our calibrated Test Set: it is rated by humans, which gives us a better sense of the actual error bounds.</p><p><em>Prompt: A photo of a classic red British telephone box K6 model on a London street corner. The box should feature the distinctive crown emblem at the top and the correct multi-pane glass windows.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/08f861c2-7118-48e7-a00d-4217d91e6340_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bf7d26ed-b2fc-473f-a5c6-c75f242fc224_1024x1024.png&quot;}],&quot;caption&quot;:&quot;LLM rated both 8 but missed many issues in FLUX.2 [klein] 9B's result (text, crown position) while Z-Image Turbo's minor issues are much harder to spot.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6cb5a60f-e242-4555-88f8-711a376d6090_1456x720.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p><em>Prompt: A realistic, high-shutter-speed photo of a ballet dancer performing a graceful high jete leap directly over a sleeping golden retriever. In the background, an artist at an easel is sketching the exact scene onto a canvas. The dancer&#8217;s shadow must fall realistically across the dog&#8217;s fur, and the sketch on the canvas must visibly mirror the dancer&#8217;s pose.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/41342976-cf77-463d-ba48-e4151c919121_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/077e54c9-9fa8-4d95-93bc-6afc02864f60_1024x1024.png&quot;}],&quot;caption&quot;:&quot;FLUX.2 [dev] has obvious consistency issues (dancer's facing is wrong on the picture) but LLM rated it 9. Qwen Image 2512 did better but rated 6.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e0a0f32d-d9fc-4a4a-85da-10aac78c23ce_1456x720.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p><em>Prompt: A composite architectural visualization. The top section shows a photorealistic 3D cutaway view of the ground floor of a modern house, revealing a furnished living room (with sofa and TV) and a kitchen (with an island) separated by a partial wall. This 3D model hovers directly above a matching 2D architectural blueprint floor plan of the same level with perfectly aligned layout.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f80fe833-63f1-4d07-bc2b-03a41bc1b5a3_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d1f3faa1-fcde-4338-8051-d7874fe853ad_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a5cf2232-4f4b-444d-b9ce-ab37dee9095c_2816x1536.png&quot;}],&quot;caption&quot;:&quot;1. FLUX.2 [klein] 9B Base; 2. Qwen Image 2512. LLM rated both at 8, but one is obviously better than the other. 3. Nano Banana 2.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6422023c-0116-4b02-a7ec-87afba24e3cb_1456x474.png&quot;}},&quot;isEditorNode&quot;:true}"></div><h3>6. Style transfer and relighting are surprisingly good, even with small models</h3><p>When most of the heavy lifting is already done, especially semantic preservation, even small models can perform well on edits that are less semantic in nature. In some cases, these edits are even more physically plausible than the models&#8217; image-generation counterparts.</p><p><em>Prompt: Relight the entire scene to place them outdoors around a campfire at night. The only light source should be the warm, flickering orange glow of the fire coming from below their faces. This should create strong upward-casting shadows (under brows, noses, and chins) and illuminate the texture of their clothing from beneath, while keeping the background pitch black.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b82072ce-c47d-43ec-a5bd-fedd636c6281_1408x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d5a7f85-d893-40ee-a440-3ad5e3486268_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0093e446-6f35-4eb9-b44c-7428ae8e5d5e_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/acf800ab-73e9-43a8-8946-db9880ece3ee_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b4c57117-e18f-4c3f-8d08-b88a41135bb0_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2290648f-a91f-4119-823b-a12d1fc19db4_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a40be4f5-b1ee-4f55-bfef-e14a01ccee52_1536x768.png&quot;}],&quot;caption&quot;:&quot;1. Input; 2. FLUX.2 [dev]; 3. FLUX.2 [klein] 9B; 4. FLUX.2 [klein] 9B Base; 5. FLUX.2 [klein] 4B; 6. FLUX.2 [klein] 4B Base; 7. Qwen Image Edit 2511.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/77968a3e-be43-4248-ac6f-eac9c6e57f07_1456x1946.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p><em>Prompt: Change the lighting to warm golden hour sunset, with orange tones and long shadows.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d36462a3-0c9a-41f0-9d75-d90f5549f82e_2816x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/19813e4e-cc88-41fa-b025-baa2a50af34d_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/68c6d92d-0797-4015-8d9c-bac3fa0cf12a_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6154e6f9-a51d-4945-b222-0c342cb04567_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0d81c1eb-f84c-46c8-8519-c22561b60c80_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c4706079-af1c-44fa-93ae-cffd89133f37_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/02203ff8-e9d8-4a70-891e-f39aa097b818_1536x768.png&quot;}],&quot;caption&quot;:&quot;1. Input; 2. FLUX.2 [dev]; 3. FLUX.2 [klein] 9B; 4. FLUX.2 [klein] 9B Base; 5. FLUX.2 [klein] 4B; 6. FLUX.2 [klein] 4B Base; 7. Qwen Image Edit 2511.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/62ed4082-4b94-4604-8a05-3ea7db23eaf2_1456x1946.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p><em>Prompt: Edit the statue&#8217;s feet to be wearing detailed, modern high-top basketball sneakers. The sneakers must appear to be carved entirely from the same block of aged marble, matching the stone texture, color, and weathering of the rest of the statue.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c46771a-e984-46b1-a26b-a3e55c1a2781_2816x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/021621d8-8b2c-4ed3-90f3-c0ed800b30f5_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0d7dec0a-47f1-40c2-83a4-e2dec19921bb_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/447a35b8-4014-44ac-a06a-80fa735758f9_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bf9b308f-ddcd-44c5-8502-fc791f874daa_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1fd199fc-1c2c-4dc1-8352-9ec2ea93a07b_1536x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f01fa819-75bd-45aa-ab14-83cfcadeb646_1536x768.png&quot;}],&quot;caption&quot;:&quot;1. Input; 2. FLUX.2 [dev]; 3. FLUX.2 [klein] 9B; 4. FLUX.2 [klein] 9B Base; 5. FLUX.2 [klein] 4B; 6. FLUX.2 [klein] 4B Base; 7. Qwen Image Edit 2511.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d77b5e1-35b5-4d25-ad6d-5d02d70a6dbd_1456x1946.png&quot;}},&quot;isEditorNode&quot;:true}"></div><h3>7. Editing models are not reasoners</h3><p>The current generation of editing models is unable to perform even simple reasoning tasks. Some models, such as Qwen Image Edit 2511, do show more initiative, but they are still far from where they need to be.</p><p><em>Prompt: Fill in the dates correctly.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7148dc11-4a80-4932-9e3c-43ddc948d157_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/89de07ff-68f7-4851-8713-1d1bfc68750c_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f8fc1e5f-c6bb-4730-8c80-242f46543172_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e1d5d8f0-3570-4e9d-b16d-6fdf27e080d0_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c6258d3c-6155-4fbe-a0bb-2717d74fc8c4_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3e88ad9d-0962-4680-96e5-614cc0ba1da8_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/259adadf-365f-4056-be28-f4cf32c8bff5_1024x1024.png&quot;}],&quot;caption&quot;:&quot;1. Input; 2. FLUX.2 [dev]; 3. FLUX.2 [klein] 9B; 4. FLUX.2 [klein] 9B Base; 5. FLUX.2 [klein] 4B; 6. FLUX.2 [klein] 4B Base; 7. Qwen Image Edit 2511.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/92577023-7a44-45b1-b9bb-45b87228d9e6_1456x1946.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p><em>Prompt: Draw the next moon phase.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/816e7ff2-c55c-47a9-8f57-97964d929c63_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6fb6c9b2-393e-44fb-81bc-b142eb363f62_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/63654f57-724b-46f4-a18a-8797a62cdd12_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ba7d706d-a308-4e00-85ab-bf726d185247_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1771702d-0a4b-45d9-9831-b2111161938a_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e1306ab7-6d3c-42e6-bd4a-da020c886703_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/674ff009-a4af-4e47-890f-3778ffcd6714_1024x1024.png&quot;}],&quot;caption&quot;:&quot;1. Input; 2. FLUX.2 [dev]; 3. FLUX.2 [klein] 9B; 4. FLUX.2 [klein] 9B Base; 5. FLUX.2 [klein] 4B; 6. FLUX.2 [klein] 4B Base; 7. Qwen Image Edit 2511.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a6b7f4e9-253e-4031-a99a-3dd45ee98593_1456x1946.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p><em>Prompt: Draw the solution path from Start to Finish.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9de74eed-448f-49e2-83a1-3672517a8984_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0c275d67-1fa5-4ebf-b4ea-220f5f56af13_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eb9efc3a-b854-4254-8656-859965c129e0_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d19b676-0bf2-4011-8174-5c706dfd0c08_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/961ec467-8637-4764-95a2-67ddfeb15481_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/564da43d-5e41-46a3-b95a-52623e96f7da_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f878fe6c-377c-41da-81ce-4fbac51a9502_1024x1024.png&quot;}],&quot;caption&quot;:&quot;1. Input; 2. FLUX.2 [dev]; 3. FLUX.2 [klein] 9B; 4. FLUX.2 [klein] 9B Base; 5. FLUX.2 [klein] 4B; 6. FLUX.2 [klein] 4B Base; 7. Qwen Image Edit 2511.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6fa41f40-8fd5-42b2-abb6-38c30094982e_1456x1946.png&quot;}},&quot;isEditorNode&quot;:true}"></div><div><hr></div><p>We are still actively working through the Test Set, so this is only a status update on what we have seen so far. If this research direction is interesting to you, feel free to contact us at <a href="mailto: info@drawthings.ai">info@drawthings.ai</a></p>]]></content:encoded></item><item><title><![CDATA[draw-things-cli: local media generation from command-line on your Mac]]></title><description><![CDATA[draw-things-cli is our rebooted image generation CLI: fast, scriptable, and powered by Draw Things&#8217; custom inference stack. It is also the fastest image generation tool on macOS.]]></description><link>https://releases.drawthings.ai/p/draw-things-cli-local-media-generation</link><guid isPermaLink="false">https://releases.drawthings.ai/p/draw-things-cli-local-media-generation</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Wed, 25 Mar 2026 21:03:12 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!RpQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>CLI is getting a new lease on life in the age of agentic AI. But for us, this release is actually long overdue.</p><p>From the beginning, we have had an internal command-line tool for running image generation. It has been invaluable for quickly debugging different parts of our pipeline.</p><p>Over time, however, as the app gained more features, it became more convenient to debug them directly inside the app. As a result, the command-line tool gradually fell out of maintenance.</p><p>Our public repository, <a href="https://github.com/drawthingsai/draw-things-community">draw-things-community</a>, maintains our core inference stack under the GPLv3 license. We regularly build and release <code>gRPCServerCLI</code> for both NVIDIA platforms and macOS from this public source repository.</p><p><code>draw-things-cli</code> is a reboot of that command-line tool. It is vibe-coded, which makes it straightforward to implement many of the command-line niceties you would expect: shell auto-completion, direct image output in the terminal, automatic model downloads when needed, and recommended settings from the community repository when available. Powered by <a href="https://apps.apple.com/us/app/draw-things-ai-generation/id6444050820">Draw Things</a>&#8217; custom inference stack, it is also the fastest image generation tool on macOS. It is fully open-source, under GPL-v3 license.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RpQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RpQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png 424w, https://substackcdn.com/image/fetch/$s_!RpQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png 848w, https://substackcdn.com/image/fetch/$s_!RpQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png 1272w, https://substackcdn.com/image/fetch/$s_!RpQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RpQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png" width="1456" height="1195" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1195,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3871159,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://releases.drawthings.ai/i/192127290?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RpQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png 424w, https://substackcdn.com/image/fetch/$s_!RpQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png 848w, https://substackcdn.com/image/fetch/$s_!RpQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png 1272w, https://substackcdn.com/image/fetch/$s_!RpQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9275b53-6d9b-4944-8a73-7a248944ec31_2174x1784.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To install <code>draw-things-cli</code> today, you can use Homebrew or build it from source. To install it with Homebrew:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;bash&quot;,&quot;nodeId&quot;:&quot;1ddc7043-d45f-4a79-a3f8-75e770caa083&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-bash">brew install drawthingsai/draw-things/draw-things-cli</code></pre></div>]]></content:encoded></item><item><title><![CDATA[Introducing Lightning Draft: interactive image generation on M5 Max]]></title><description><![CDATA[M5 Max, which shipped in early March, delivers breakthrough AI performance, with more than a 3.3&#215; speed-up over M4 Max.]]></description><link>https://releases.drawthings.ai/p/introducing-lightning-draft-interactive</link><guid isPermaLink="false">https://releases.drawthings.ai/p/introducing-lightning-draft-interactive</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Tue, 24 Mar 2026 17:42:47 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/135e355e-596e-4307-a23b-b488bbb80a65_2508x1838.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>M5 Max, which shipped in early March, delivers breakthrough AI performance, with more than a 3.3&#215; speed-up over M4 Max. This is the kind of previously unseen performance we are now seeing on Apple Silicon.</p><p>Interactive image generation and editing are not new. But existing approaches come with compromises. You either have to rely on older, heavily distilled models such as Stable Diffusion 1.5 or SDXL, reduce resolution or step count, or run on cloud hardware such as H100s at hourly rates.</p><p>In v1.20260323.0, Draw Things introduces Lightning Draft. With Lightning Draft enabled on M5 Max, you can run state-of-the-art models such as FLUX.2 [klein] 4B, 9B, or Z-Image Turbo at their recommended settings with interactive latency of around or under one second locally. This makes prompt iteration dramatically easier and opens up new forms of creative exploration that were not practical before.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;1ec0cf2e-b456-4427-b8e9-a9b9315dfe07&quot;,&quot;duration&quot;:null}"></div><p>In the same v1.20260323.0 release, we also continued shipping performance improvements for M5 chips. With a clearer understanding of the Neural Accelerators&#8217; performance characteristics, we improved our MatMul kernel by roughly another 10% on M5 for large matrices. In practice, that translates into about a 2% to 5% speed gain for larger models such as FLUX.2 [dev] and LTX-2.3 22B.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cyOd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cyOd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png 424w, https://substackcdn.com/image/fetch/$s_!cyOd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png 848w, https://substackcdn.com/image/fetch/$s_!cyOd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png 1272w, https://substackcdn.com/image/fetch/$s_!cyOd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cyOd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png" width="1456" height="537" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:537,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cyOd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png 424w, https://substackcdn.com/image/fetch/$s_!cyOd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png 848w, https://substackcdn.com/image/fetch/$s_!cyOd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png 1272w, https://substackcdn.com/image/fetch/$s_!cyOd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F332eafa8-47e5-4ecf-abdd-472f07ac38d0_1492x550.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Use Recommended Settings with 6-bit variant (slower on M5 Max than Exact variant); take the second run time.</figcaption></figure></div><p>Today marks the first day of our release week. More updates are coming throughout the week. Stay tuned!</p>]]></content:encoded></item><item><title><![CDATA[Quantify Z Image Turbo efficiency gains]]></title><description><![CDATA[Z Image Turbo, with the Draw Things implementation, delivers the fastest state-of-the-art image generation experience on Apple devices.]]></description><link>https://releases.drawthings.ai/p/quantify-z-image-turbo-efficiency</link><guid isPermaLink="false">https://releases.drawthings.ai/p/quantify-z-image-turbo-efficiency</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Tue, 09 Dec 2025 20:40:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!iM9B!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><a href="https://huggingface.co/Tongyi-MAI/Z-Image-Turbo">Z Image Turbo</a> was released at the end of November by Alibaba Tongyi Lab, was quickly praised for its small size, strong prompt adherence, and photo-realism.</p><p><a href="https://drawthings.ai">Draw Things</a> is promised to be the most efficient way to run generative media on your Apple devices. When Z Image Turbo arrived, we knew we had to support it and rigorously quantify its performance and quality claims.</p><p>Z Image Turbo is a 6B-parameter DiT model that borrows many of the most successful tweaks from DiT models developed over the years. Our 6-bit&#8211;quantized variant, fully loaded into RAM, occupies only ~4 GiB, making it an ideal always-on sidekick for your design tasks. This stands in stark contrast to the ~11 GiB required by <a href="https://releases.drawthings.ai/p/introducing-qwen-image-support">Qwen Image</a>.</p><p>It is also a step-distilled model out of the box, so no separate Lightning LoRA is needed.</p><p>To properly evaluate Z Image Turbo, we first invested in an efficient implementation. Across a wide range of Apple devices, the Draw Things implementation is up to 54% faster than other available implementations.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iM9B!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iM9B!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png 424w, https://substackcdn.com/image/fetch/$s_!iM9B!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png 848w, https://substackcdn.com/image/fetch/$s_!iM9B!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!iM9B!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iM9B!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png" width="1188" height="2048" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2048,&quot;width&quot;:1188,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iM9B!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png 424w, https://substackcdn.com/image/fetch/$s_!iM9B!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png 848w, https://substackcdn.com/image/fetch/$s_!iM9B!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!iM9B!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F576ba4d4-dbab-4fc4-9f3d-6c7641ddbef9_1188x2048.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>An apples-to-apples comparison shows that Z Image Turbo is not dramatically faster than previous models on a per-step basis. It delivers a 1.01&#215;&#8211;1.23&#215; improvement over <a href="https://huggingface.co/black-forest-labs/FLUX.1-dev">FLUX.1 [dev]</a>, a 1.31&#215;&#8211;1.41&#215; improvement over <a href="https://huggingface.co/Qwen/Qwen-Image">Qwen Image</a>, and a 1.31&#215;&#8211;1.45&#215; improvement over <a href="https://huggingface.co/HiDream-ai/HiDream-I1-Fast">HiDream [fast]</a>, all measured per step.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Rnra!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Rnra!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png 424w, https://substackcdn.com/image/fetch/$s_!Rnra!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png 848w, https://substackcdn.com/image/fetch/$s_!Rnra!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png 1272w, https://substackcdn.com/image/fetch/$s_!Rnra!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Rnra!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png" width="1456" height="1167" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1167,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Rnra!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png 424w, https://substackcdn.com/image/fetch/$s_!Rnra!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png 848w, https://substackcdn.com/image/fetch/$s_!Rnra!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png 1272w, https://substackcdn.com/image/fetch/$s_!Rnra!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa8350f94-a145-4409-b0b2-61373da71a96_1492x1196.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>However, Z Image Turbo&#8217;s versatility and low RAM requirement make it excellent for rapid iteration and, in practice, it often feels faster to use.</p><p><em>Prompt: 35mm analogue full-body portrait of a beautiful woman wearing black sheer dress, catwalking in a busy market, soft colour grading, infinity cove, shadows, kodak, contax t2</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9eb795be-30e9-4edb-a38b-dd453a7ddfe9_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3f906510-043d-4d37-b4cc-06e16f6e4738_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3c6ec8be-9961-4d24-8e34-0c3742cc5ff7_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ee4f3374-f137-4c48-bc28-df078c0690a0_1024x1024.png&quot;}],&quot;caption&quot;:&quot;Same initial noise and prompt over 2, 4, 6, 8 steps. The similarity in composition allows iterative workflow to first \&quot;draft\&quot; the prompt then \&quot;refine\&quot; the image.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7c0f85ac-feb9-48b5-8511-87f9006d7d4a_1456x1456.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p>Like <a href="https://releases.drawthings.ai/i/170984339/the-resolutions">Qwen Image</a> and other recently released models, it also handles a wider range of resolutions more gracefully.</p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8858da3a-0bc1-450e-b761-8751aa9aa918_512x512.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7c1d1672-8dc7-41fc-b175-b3bd95b93531_768x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/25b6445b-f241-4a24-b4df-7bbc2fd09055_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1476724b-d036-4038-b74a-212d455c3f9e_1280x1280.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a8381112-59c5-46fa-b861-d21058727b88_1536x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/04d16962-606b-473e-a473-43743ec0f2d1_2048x2048.png&quot;}],&quot;caption&quot;:&quot;From left to right: 512x512, 768x768, 1024x1024, 1280x1280, 1536x1536, 2048x2048.&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fe0e26d4-7926-4985-acae-5c36ffb3092b_1456x964.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p>There are more cases we would like to share when we added more recent released models to our app, stay tuned!</p><p></p>]]></content:encoded></item><item><title><![CDATA[Metal FlashAttention v2.5 w/ Neural Accelerators: delivering breakthrough performance on the Apple M5 chip]]></title><description><![CDATA[Draw Things brings breakthrough 4.6&#215; performance gains to Apple silicon with Metal FlashAttention v2.5 w/ Neural Accelerators.]]></description><link>https://releases.drawthings.ai/p/metal-flashattention-v25-w-neural</link><guid isPermaLink="false">https://releases.drawthings.ai/p/metal-flashattention-v25-w-neural</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Mon, 10 Nov 2025 21:42:16 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!H3cd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><a href="https://apps.apple.com/us/app/draw-things-ai-generation/id6444050820">Draw Things</a> is the fastest way to generate images or videos with your Apple silicon, locally and privately. Our work on <a href="https://engineering.drawthings.ai/p/metal-flashattention-2-0-pushing-forward-on-device-inference-training-on-apple-silicon-fe8aac1ab23c">Metal FlashAttention</a> has been the bedrock of this claim.</p><p>The recent release of <a href="https://www.apple.com/newsroom/2025/10/apple-introduces-the-powerful-new-ipad-pro-with-the-m5-chip/">M5 debuts Draw Things with 3.5&#215; performance improvements over M4.</a> This was achieved through Apple&#8217;s MPSGraph API, first <a href="https://releases.drawthings.ai/p/iphone-17-pro-doubles-ai-performance">enabled in our 1.20250912.0 release</a>. While MPSGraph API adequately implements Neural Accelerators support, it lacks the fine-grained memory and performance optimizations that we have developed through our ongoing Metal FlashAttention research.</p><p><a href="https://apps.apple.com/us/app/draw-things-ai-generation/id6444050820">Version 1.20251107.1</a> is our first release containing the preview version of Metal FlashAttention v2.5 w/ Neural Accelerators. It delivers up to 4.6&#215; performance improvements on M5 over M4<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. With adequate cooling<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>, it sometimes outperforms M2 Max and narrows the gap to M3 Ultra. Its efficient memory management allows 5-second, 480p (448&#215;768) video generation (with Wan 2.2 A14B models) on an M5 iPad with 16GiB of RAM.</p><p>We will discuss more Metal FlashAttention v2.5 w/ Neural Accelerators implementation details in a separate post on the <a href="https://engineering.drawthings.ai/">Engineering@Draw Things</a> channel, stay tuned!</p><h3>Performance</h3><p>Metal FlashAttention v2.5 w/ Neural Accelerators delivers the fastest performance on Apple&#8217;s non-Pro M-series chips, often rivaling or exceeding previous Max M-series chips. With several large image-generation models (<a href="https://huggingface.co/black-forest-labs/FLUX.1-schnell">FLUX.1 [schnell],</a> a 12B-parameter model; <a href="https://huggingface.co/Qwen/Qwen-Image">Qwen Image,</a> a 20B-parameter model; and <a href="https://huggingface.co/HiDream-ai/HiDream-I1-Fast">HiDream,</a> a 17B-parameter model), the M5 iPad can now generate high-resolution images in under a minute.</p><p>On A19 Pro, it improves on <a href="https://releases.drawthings.ai/p/iphone-17-pro-doubles-ai-performance">our previously reported numbers</a> (with MPSGraph API), and now rivals M3 Pro (18c) performance.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wRro!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wRro!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png 424w, https://substackcdn.com/image/fetch/$s_!wRro!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png 848w, https://substackcdn.com/image/fetch/$s_!wRro!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png 1272w, https://substackcdn.com/image/fetch/$s_!wRro!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wRro!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png" width="1456" height="943" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4264105d-9d64-4998-835e-f6612df9437d_1492x966.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:943,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wRro!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png 424w, https://substackcdn.com/image/fetch/$s_!wRro!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png 848w, https://substackcdn.com/image/fetch/$s_!wRro!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png 1272w, https://substackcdn.com/image/fetch/$s_!wRro!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4264105d-9d64-4998-835e-f6612df9437d_1492x966.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">2-Step Distilled Generation, End-to-End Time Spent (including loading model from SSD). This is updated from previous numbers.</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dbBD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dbBD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png 424w, https://substackcdn.com/image/fetch/$s_!dbBD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png 848w, https://substackcdn.com/image/fetch/$s_!dbBD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png 1272w, https://substackcdn.com/image/fetch/$s_!dbBD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dbBD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png" width="1456" height="943" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:943,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dbBD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png 424w, https://substackcdn.com/image/fetch/$s_!dbBD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png 848w, https://substackcdn.com/image/fetch/$s_!dbBD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png 1272w, https://substackcdn.com/image/fetch/$s_!dbBD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa20189cf-02e3-46d8-9d7b-ad1765fe6c55_1492x966.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">2-Step Distilled Generation, Time Spent at 1 Sampling Step (excluding text encoding, image decoding or model loading). This is updated from previous numbers.</figcaption></figure></div><p>On M5 iPad, we observe 3.6&#215; to 5.5&#215; raw performance improvements over M4 iPad, and 3.3&#215; to 4.6&#215; end-to-end improvements. In the mid-range, M5 iPad performs as well as M2 Max<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. At the high end, M5 iPad often runs about 80% slower than M3 Ultra (60c).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!H3cd!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!H3cd!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png 424w, https://substackcdn.com/image/fetch/$s_!H3cd!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png 848w, https://substackcdn.com/image/fetch/$s_!H3cd!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!H3cd!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!H3cd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png" width="1456" height="2007" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2007,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!H3cd!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png 424w, https://substackcdn.com/image/fetch/$s_!H3cd!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png 848w, https://substackcdn.com/image/fetch/$s_!H3cd!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!H3cd!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4d9eb7c-0a8c-48f9-a4e0-a4f7e4df646a_1486x2048.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">4-Step Distilled Generation, End-to-End Time Spent (including loading model from SSD).</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!e75h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!e75h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png 424w, https://substackcdn.com/image/fetch/$s_!e75h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png 848w, https://substackcdn.com/image/fetch/$s_!e75h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!e75h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!e75h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png" width="1456" height="2007" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:2007,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!e75h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png 424w, https://substackcdn.com/image/fetch/$s_!e75h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png 848w, https://substackcdn.com/image/fetch/$s_!e75h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!e75h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd80bb07-4e5f-435c-b50e-88a17bfd6ea6_1486x2048.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">4-Step Distilled Generation, Time Spent at 1 Sampling Step (excluding text encoding, image decoding or model loading).</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!KKt7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KKt7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png 424w, https://substackcdn.com/image/fetch/$s_!KKt7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png 848w, https://substackcdn.com/image/fetch/$s_!KKt7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png 1272w, https://substackcdn.com/image/fetch/$s_!KKt7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KKt7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png" width="1456" height="529" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:529,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KKt7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png 424w, https://substackcdn.com/image/fetch/$s_!KKt7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png 848w, https://substackcdn.com/image/fetch/$s_!KKt7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png 1272w, https://substackcdn.com/image/fetch/$s_!KKt7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff396f81c-d71a-462c-8610-f6a2029f0994_1492x542.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">6-Step Distilled Generation, End-to-End Time Spent (including loading model from SSD). Significant time was spent on VAE decoding (~180s on iPad), which MFA v2.5 is not yet optimized for (3D convolution kernels).</figcaption></figure></div><h3>Support</h3><p>Metal FlashAttention v2.5 w/ Neural Accelerators leverages the Neural Accelerators across all key operators &#8212; including matrix multiplication, attention, and segmented matrix multiplication, the latter being essential for Mixture-of-Experts models.</p><h3>Availability &amp; Limitations</h3><p>The source code of Metal FlashAttention v2.5 w/ Neural Accelerators is available at <a href="https://github.com/liuliu/ccv/tree/unstable/lib/nnc/mfa/v2">https://github.com/liuliu/ccv/tree/unstable/lib/nnc/mfa/v2</a>. Draw Things <a href="https://apps.apple.com/us/app/draw-things-ai-generation/id6444050820">1.20251107.1</a> integrates the full implementation of Metal FlashAttention 2.5 w/ Neural Accelerators. It is currently released as a <em>preview</em> because there are still performance cliffs around odd attention sequence lengths and large head dimensions. BF16 support is turned off due to unsolved bugs. The neural accelerators-enabled shaders also take a significantly longer time to specialize (often 10s or more for the first generation). We expect to resolve these issues in future updates.</p><p><em>Edit (20251118): 1.20251117.1 now supports BF16 as well as binary artifacts cache for neural accelerators-enabled shaders.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>The end-to-end result was obtained on an M5 iPad with the 1.20251107.1 build and iPadOS 26.1, compared against an M4 iPad with the same build and iPadOS 26.1, for 1280&#215;1280 FLUX.1 [schnell] (5-bit) 4-step generation. For all measurements below, we ran the generation twice and took the second measurement.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>We used an ice pad underneath the iPad for maximum cooling performance.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>The end-to-end result was obtained on a Mac Studio with an M2 Max (38c) chip, using the 1.20251107.1 build and macOS 15.7.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>The end-to-end result was obtained on a Mac Studio with an M3 Ultra (60c) chip, using the 1.20251107.1 build and macOS 15.6.1.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Introducing Boost: a new way for flexible media generation with Cloud Compute]]></title><description><![CDATA[Boost lets Draw Things+ creators exceed default Compute Unit limits &#8212; enabling higher resolutions, longer steps, and more complex generations in the cloud.]]></description><link>https://releases.drawthings.ai/p/introducing-boost-a-new-way-for-flexible</link><guid isPermaLink="false">https://releases.drawthings.ai/p/introducing-boost-a-new-way-for-flexible</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Thu, 09 Oct 2025 17:28:26 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!z_ZD!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f0328e-c1ab-411f-9649-a4dfd7c935b3_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Local generations with Draw Things have always offered <strong>unparalleled flexibility</strong> &#8212; you can generate at any resolution, with any number of steps or LoRAs. People love that freedom.</p><p>In <strong>February</strong>, we introduced <strong>Cloud Compute</strong>, built on top of our Server-Offload feature. It lets you tweak all the same parameters available in local generation &#8212; but now, the heavy lifting happens on our high-performance servers. This feature is available to both <strong>Community</strong> members and <strong>Draw Things+</strong> customers, differing only in their <strong>Compute Unit</strong> allotment and the <strong><a href="https://releases.drawthings.ai/p/privacy-pass-and-draw-things-cloud">Privacy Pass</a></strong> guarantee.</p><h3>What Are Compute Units?</h3><p>Compute Units represent the <strong>complexity</strong> of a media generation task. They help us ensure fair server usage across the community by limiting how long each generation can run.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4B9n!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4B9n!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png 424w, https://substackcdn.com/image/fetch/$s_!4B9n!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png 848w, https://substackcdn.com/image/fetch/$s_!4B9n!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png 1272w, https://substackcdn.com/image/fetch/$s_!4B9n!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4B9n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png" width="584" height="270" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:270,&quot;width&quot;:584,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:80986,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://releases.drawthings.ai/i/175725135?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4B9n!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png 424w, https://substackcdn.com/image/fetch/$s_!4B9n!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png 848w, https://substackcdn.com/image/fetch/$s_!4B9n!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png 1272w, https://substackcdn.com/image/fetch/$s_!4B9n!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6127c335-4650-4fc7-baa6-ca302362a12d_584x270.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A Qwen Image Edit 2509 generation task at 768x1152 with 30-steps, 4 CFG and 2 reference images.</figcaption></figure></div><p>While this system keeps things fair, it can sometimes limit creative freedom &#8212; and we wanted to change that.</p><h3>Introducing Boost</h3><p>Version <strong><a href="https://apps.apple.com/us/app/draw-things-ai-generation/id6444050820">1.20251007.2</a></strong> introduces <strong>Boost</strong>, a new way to push beyond the default Compute Unit limits for Draw Things+ customers.</p><p>A <strong>Boost</strong> is a one-time token that allows you to exceed your standard Compute Unit limit. It&#8217;s only consumed when your generation completes successfully. Each Boost is <strong>60,000 Compute Units</strong>, and you can <strong>stack</strong> multiple Boosts together &#8212; for example, three Boosts raise your upper limit to <strong>180,000 Compute Units</strong>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!fgpQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!fgpQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png 424w, https://substackcdn.com/image/fetch/$s_!fgpQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png 848w, https://substackcdn.com/image/fetch/$s_!fgpQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png 1272w, https://substackcdn.com/image/fetch/$s_!fgpQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!fgpQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png" width="570" height="242" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:242,&quot;width&quot;:570,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:72149,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://releases.drawthings.ai/i/175725135?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!fgpQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png 424w, https://substackcdn.com/image/fetch/$s_!fgpQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png 848w, https://substackcdn.com/image/fetch/$s_!fgpQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png 1272w, https://substackcdn.com/image/fetch/$s_!fgpQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F708a6f3f-fc39-4fcc-ae10-56e3ff3f02c8_570x242.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Upper limit changed to 180,000 Compute Units when 3 Boosts applied.</figcaption></figure></div><h3>Availability</h3><p>Boosts are available to <strong>Draw Things+</strong> customers. We&#8217;ll begin issuing <strong>one-time free Boosts</strong> to our Draw Things+ customers over the coming days and weeks.</p><p>Because of how Boosts are tracked and verified, <strong>Boosts and Privacy Pass cannot be used together</strong> at this time.</p>]]></content:encoded></item><item><title><![CDATA[iPhone 17 Pro doubles AI performance for the next wave of generative models]]></title><description><![CDATA[With new Neural Accelerator, the iPhone 17 Pro delivers a 2&#215; leap in inference speed. FLUX.1 completes in under 35 seconds, while larger 20B models like Qwen Image finish just over 45&#8212;all on-device.]]></description><link>https://releases.drawthings.ai/p/iphone-17-pro-doubles-ai-performance</link><guid isPermaLink="false">https://releases.drawthings.ai/p/iphone-17-pro-doubles-ai-performance</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Fri, 19 Sep 2025 22:02:55 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!BQCO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Apple Neural Engine has long been a cornerstone of Apple&#8217;s SoC, enabling fast and efficient on-device inference. It works well for smaller models, and Draw Things supports it with the Stable Diffusion v1.5 series at 512&#215;512 resolution. But server-grade large-scale models&#8212;<a href="https://huggingface.co/black-forest-labs/FLUX.1-dev">FLUX.1</a>, <a href="https://huggingface.co/Qwen/Qwen-Image">Qwen Image</a>, <a href="https://huggingface.co/tencent/HunyuanVideo">Hunyuan Video</a>, <a href="https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B">Wan 2.2</a>&#8212;the ones we often talk about with more than 10 billion parameters&#8212;are a different beast. Unfortunately, the ANE falls short when handling them.</p><p>The newly released iPhone 17 Pro features the powerful A19 Pro SoC with new GPU Neural Accelerators. Combined with other GPU improvements, it delivers exceptional performance for its size. For compute-bound tasks such as diffusion-based neural networks, Draw Things has seen a remarkable 2&#215; generational improvement across many state-of-the-art models.</p><p>On iPhone 17 Pro, the <a href="https://huggingface.co/black-forest-labs/FLUX.1-schnell">FLUX.1 [schnell]</a> / <a href="https://huggingface.co/black-forest-labs/FLUX.1-dev">[dev]</a> models at 768&#215;768 run at ~10s per step, finishing 2-step inference in <strong>under 35 seconds</strong>. Even the latest 20B-parameter models, such as <a href="https://huggingface.co/Qwen/Qwen-Image">Qwen Image</a>, run at ~13s per step, completing 2-step inference just <strong>over 45 seconds</strong> at 768&#215;768. At larger resolutions, efficiency shines even more: 1024&#215;1024 inference takes just <strong>over 50 seconds</strong> for the FLUX series and just <strong>around 65 seconds</strong> for the Qwen Image series&#8212;truly delivering MacBook Pro&#8211;like performance.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BQCO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BQCO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png 424w, https://substackcdn.com/image/fetch/$s_!BQCO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png 848w, https://substackcdn.com/image/fetch/$s_!BQCO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png 1272w, https://substackcdn.com/image/fetch/$s_!BQCO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BQCO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png" width="1456" height="900" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:900,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BQCO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png 424w, https://substackcdn.com/image/fetch/$s_!BQCO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png 848w, https://substackcdn.com/image/fetch/$s_!BQCO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png 1272w, https://substackcdn.com/image/fetch/$s_!BQCO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6b8b939b-53df-41a9-8e75-27048dc5101a_1492x922.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Qwen Image with Lightning LoRA, 2-Step Distilled Generation, End-to-End Time Spent (including loading model from SSD).</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!m2oW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!m2oW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png 424w, https://substackcdn.com/image/fetch/$s_!m2oW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png 848w, https://substackcdn.com/image/fetch/$s_!m2oW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png 1272w, https://substackcdn.com/image/fetch/$s_!m2oW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!m2oW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png" width="1456" height="900" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:900,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!m2oW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png 424w, https://substackcdn.com/image/fetch/$s_!m2oW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png 848w, https://substackcdn.com/image/fetch/$s_!m2oW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png 1272w, https://substackcdn.com/image/fetch/$s_!m2oW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F896c7fe7-7a03-44b5-af79-aa204dc5d09b_1492x922.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Qwen Image with Lightning LoRA, 2-Step Distilled Generation, Time Spent at 1 Sampling Step (excluding text encoding, image decoding or model loading).</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wPXA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wPXA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png 424w, https://substackcdn.com/image/fetch/$s_!wPXA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png 848w, https://substackcdn.com/image/fetch/$s_!wPXA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png 1272w, https://substackcdn.com/image/fetch/$s_!wPXA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wPXA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png" width="1456" height="900" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:900,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wPXA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png 424w, https://substackcdn.com/image/fetch/$s_!wPXA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png 848w, https://substackcdn.com/image/fetch/$s_!wPXA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png 1272w, https://substackcdn.com/image/fetch/$s_!wPXA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd8082b21-0b7f-4360-9ccd-071d5963cf21_1492x922.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">FLUX.1 [schnell], 2-Step Distilled Generation, End-to-End Time Spent (including loading model from SSD).</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ox2W!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ox2W!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png 424w, https://substackcdn.com/image/fetch/$s_!ox2W!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png 848w, https://substackcdn.com/image/fetch/$s_!ox2W!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png 1272w, https://substackcdn.com/image/fetch/$s_!ox2W!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ox2W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png" width="1456" height="900" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:900,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ox2W!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png 424w, https://substackcdn.com/image/fetch/$s_!ox2W!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png 848w, https://substackcdn.com/image/fetch/$s_!ox2W!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png 1272w, https://substackcdn.com/image/fetch/$s_!ox2W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa1941db7-ed2b-4d26-af41-e12904c4de6c_1492x922.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">FLUX.1 [schnell], 2-Step Distilled Generation, Time Spent at 1 Sampling Step (excluding text encoding, image decoding or model loading).</figcaption></figure></div><p><a href="https://apps.apple.com/us/app/draw-things-ai-generation/id6444050820">Draw Things</a> remains focused on delivering the best media generation models&#8212;locally, <a href="https://releases.drawthings.ai/p/privacy-pass-and-draw-things-cloud">privately</a>, and <a href="https://engineering.drawthings.ai/p/optimizing-qwen-image-for-edge-devices">efficiently</a>. Stay tuned and subscribe for more exciting updates!</p><h4>Notes</h4><p>The iPhone 16 Pro numbers reflect typical daily usage without additional cooling (such as fans or ice packs). At a room temperature of 70&#176;F, thermal throttling begins after about a minute, reducing performance for 1024&#215;1024 and 1280&#215;1280 generations. In contrast, the iPhone 17 Pro manages thermals far better, maintaining stronger sustained performance over longer runs.</p>]]></content:encoded></item><item><title><![CDATA[Privacy Pass and Draw Things+ Cloud Compute]]></title><description><![CDATA[Draw Things+ Cloud Compute now provides not just privacy-first, but privacy-guaranteed Cloud Compute for media generation tasks.]]></description><link>https://releases.drawthings.ai/p/privacy-pass-and-draw-things-cloud</link><guid isPermaLink="false">https://releases.drawthings.ai/p/privacy-pass-and-draw-things-cloud</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Wed, 20 Aug 2025 15:55:15 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!z_ZD!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F88f0328e-c1ab-411f-9649-a4dfd7c935b3_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Draw Things is an app that runs your art generation tasks offline, directly on your device. Still, there are many reasons why you might prefer running art generation on a server, including:</p><ul><li><p>Avoiding excessive on-device battery usage;</p></li><li><p>Avoiding large local disk usage from downloaded models;</p></li><li><p>Running larger models that would otherwise be slow or impractical on-device.</p></li></ul><p>However, running art generation on a server often comes with hidden trade-offs that customers may not fully realize:</p><ul><li><p>Their art generations may be subject to DMCA takedowns&#8212;even outside the U.S. or within fair use&#8212;with little recourse (an imbalance between copyright claims and fair-use defenses).</p></li><li><p>The text&#8211;image pairs they generate can be used as preference data to train better models, or even sold to third parties, often without explicit acknowledgment (most users unknowingly sign away these rights during onboarding).</p></li><li><p>So-called &#8220;privacy-first&#8221; claims by service providers are often weak: at best, just policies that can be changed later, and at worst, unenforceable promises quickly broken in the name of &#8220;move fast and break things.&#8221;</p></li></ul><h3>Draw Things+ Cloud Compute</h3><p>Recognizing the real benefits for our customers, Draw Things introduced <strong>Cloud Compute</strong> earlier this year, building on our publicly released <strong>Server-Offload</strong> feature.</p><p>The Server-Offload feature allows customers to host art generation servers that the Draw Things app can connect to&#8212;either on a powerful local computer (Mac or Linux with NVIDIA GPUs) or a remotely hosted machine from a bare-metal provider. It is fully open-source at <a href="https://github.com/drawthingsai/draw-things-community">https://github.com/drawthingsai/draw-things-community</a> and supports every feature the app can run locally.</p><p>Cloud Compute builds directly on Server-Offload. Our GPU servers are hosted by reputable bare-metal providers and run the <em>exact same</em> Docker image we publish publicly at <a href="https://hub.docker.com/u/drawthingsai">https://hub.docker.com/u/drawthingsai</a>. The authentication front-end also runs on our open-source code, hosted by reputable cloud providers.</p><p>By design, Server-Offload / Cloud Compute handles requests <strong>ephemerally in RAM only</strong>, ensuring your data cannot be stored or inspected later. <a href="https://ollama.com/turbo">Many services who claimed privacy-first</a> would be happily stop here. But not us.</p><h3>Privacy Pass</h3><p>To turn privacy promises into privacy guarantees, we looked at several approaches:</p><ul><li><p><strong>Homomorphic encryption</strong>: promising, but impractical without hardware support and still niche in the crypto world;</p></li><li><p><strong>Transparency logs</strong>: immutable, third-party (or distributed) logs tracking what software runs in the cloud;</p></li><li><p><strong>Hardware attestation</strong>: verifying, via trusted hardware, that the declared software is actually running&#8212;often paired with transparency logs;</p></li><li><p><strong>Security audits</strong>: external firms reviewing code and practices to verify that claims match reality.</p></li></ul><p>Inspired by <a href="https://help.kagi.com/kagi/privacy/privacy-pass.html">Kagi Search&#8217;s Privacy Pass</a>, we chose to implement <a href="https://privacypass.github.io/">Privacy Pass</a> as our first step towards providing concrete privacy guarantees to our Cloud Compute customers.</p><p>Privacy Pass ensures that even someone with full access to our backend cannot link a specific art generation to a specific customer.</p><p>Let&#8217;s go over this step-by-step. To effectively prevent abuse while maintaining service quality, we ask customers to sign in with Apple before using Cloud Compute. This enables standard controls: rate-limiting, authentication, prioritization for Draw Things+ users, and more.</p><p>Normally, the Draw Things app communicates with our backend in two steps:</p><ol><li><p>Request our authentication endpoint to sign a generation request payload, using a locally saved user ID token (so we can attach queue prioritization and rate-limit data).</p></li><li><p>Send the signed token to our GPU servers to generate the art.</p></li></ol><p>The problem: the same signed token flows from authentication to GPU servers. In principle, this allows someone to link a user (from authentication endpoint) to their generated art (from GPU servers).</p><p>With Privacy Pass, this becomes a three-step process:</p><ol><li><p>Request a Privacy Pass from our backend, using the user ID token locally (we can count issued passes and enforce limits).</p></li><li><p>Use the <strong>unwrapped</strong> Privacy Pass (different from the one seen by our backend) to request a signed generation payload from the authentication endpoint.</p></li><li><p>Send the signed token to GPU servers for art generation.</p></li></ol><p>Because the privacy pass you get in step 1 is different from privacy pass you use for step 2, our backend cannot connect a specific art generation to a specific user. Anonymity is guaranteed.</p><h3>Availability</h3><p>Privacy Pass is currently available only to <strong>Draw Things+ customers</strong>. Each Draw Things+ customer can issue up to <strong>500 Privacy Passes per day</strong>. If that limit is exceeded, the system falls back to standard authentication without Privacy Pass.</p><p>You can also disable Privacy Pass from <em>Machine Settings</em>. If the <strong>shield icon</strong> is shown, Privacy Pass is active.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jA1a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jA1a!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png 424w, https://substackcdn.com/image/fetch/$s_!jA1a!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png 848w, https://substackcdn.com/image/fetch/$s_!jA1a!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png 1272w, https://substackcdn.com/image/fetch/$s_!jA1a!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jA1a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png" width="570" height="184" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:184,&quot;width&quot;:570,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:48754,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://releases.drawthings.ai/i/171301915?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jA1a!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png 424w, https://substackcdn.com/image/fetch/$s_!jA1a!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png 848w, https://substackcdn.com/image/fetch/$s_!jA1a!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png 1272w, https://substackcdn.com/image/fetch/$s_!jA1a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc881a8d0-5c5a-4781-b488-e4ba85390245_570x184.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p></p><p></p>]]></content:encoded></item><item><title><![CDATA[Introducing Qwen Image support]]></title><description><![CDATA[The most powerful open-source image generation model now is broadly available across the Apple ecosystem.]]></description><link>https://releases.drawthings.ai/p/introducing-qwen-image-support</link><guid isPermaLink="false">https://releases.drawthings.ai/p/introducing-qwen-image-support</guid><dc:creator><![CDATA[Authors of Draw Things]]></dc:creator><pubDate>Fri, 15 Aug 2025 17:45:29 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!w6wR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e1a93c4-dbf6-4eb1-822b-66a142ae9b67_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><a href="https://qwenlm.github.io/blog/qwen-image/">Qwen Image</a> is the most powerful open-source image generation model to date, released by the Qwen team at Alibaba. We&#8217;ve been working hard to support it across the Apple ecosystem, and we&#8217;re happy to announce that it is now broadly available through the Draw Things app. From iPhone to Mac, Apple devices released within the past five years can run this state-of-the-art model directly on-device.</p><h3>The Model</h3><p>Draw Things provides Qwen Image 1.0 in the following variants:</p><ol><li><p><strong>8-bit quantized model</strong> &#8212; ~16 GiB peak runtime VRAM; suitable for devices with 24 GiB or more total RAM.</p></li><li><p><strong>6-bit quantized model</strong> &#8212; ~11 GiB peak runtime VRAM; suitable for devices with 16 GiB or more total RAM.</p></li><li><p><strong>FP16 model</strong> &#8212; ~30 GiB peak runtime VRAM; suitable for devices with 48 GiB or more total RAM.</p></li><li><p><strong>BF16 model</strong> &#8212; ~30 GiB peak runtime VRAM; suitable for M3 and later devices with 48 GiB or more total RAM.</p></li></ol><p>The quantization schemes have been meticulously tested to match the reference implementation with virtually no perceptible loss in quality. On devices with less than 16 GiB total RAM, Draw Things intelligently offloads partial weights, cutting peak runtime VRAM requirements further by more than 50% without speed penalties.</p><p><em>Prompt: A coffee shop entrance features a chalkboard sign reading "Qwen Coffee &#128522; $2 per cup," with a neon light beside it displaying "&#36890;&#20041;&#21315;&#38382;". Next to it hangs a poster showing a beautiful Chinese woman, and beneath the poster is written "&#960;&#8776;3.1415926-53589793-23846264-33832795-02384197".</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e1a93c4-dbf6-4eb1-822b-66a142ae9b67_1536x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f60deb39-8eb6-48d3-8c4d-6e8fc00434e8_1536x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/14d055fe-ad4c-44fe-9709-b827c8c4837d_1536x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2c1bb581-a9fa-45f6-9ab4-4dec7de2a49c_1536x1024.png&quot;}],&quot;caption&quot;:&quot;6-bit, 8-bit, FP16, BF16&quot;,&quot;alt&quot;:&quot;A coffee shop entrance features a chalkboard sign reading \&quot;Qwen Coffee &#128522; $2 per cup,\&quot; with a neon light beside it displaying \&quot;&#36890;&#20041;&#21315;&#38382;\&quot;. Next to it hangs a poster showing a beautiful Chinese woman, and beneath the poster is written \&quot;&#960;&#8776;3.1415926-53589793-23846264-33832795-02384197\&quot;.&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2b4a3c00-b3ad-4ff1-8eec-547194efee9a_1456x1456.png&quot;}},&quot;isEditorNode&quot;:true}"></div><h3>The Speed</h3><p>In general, Qwen Image runs about 10% slower than FLUX.1 in apple-to-apple comparisons. For example, on iPhone 16 Pro, a 2-step generation (with 4-step Lightning LoRA) at 768&#215;768 takes roughly 100 seconds.</p><h3>Superior Prompt Adherence</h3><p>Qwen Image excels at prompt adherence. It can separate complex entity descriptions more accurately than HiDream-I1 in our tests, and far better than FLUX.1.</p><p>For fairness, cross-model comparisons use Qwen Image 1.0 (6-bit) with 8-Step Lightning LoRA and Text Guidance set to 1.0, comparable to other CFG-distilled models (e.g., HiDream I1 [dev], FLUX.1 Krea [dev]).</p><p><em>Prompt: a smiling indian man with a google t-shirt next to a frowning asian man with a shirt saying nexus at a meeting table facing each other, Ultra HD, 4k, cinematic composition.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f49d0d2f-2db4-4c44-8f38-caabf1299cb8_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/db13accc-a37c-4007-a8cf-22079aae1926_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3260469f-d1b8-4fb5-b406-7d47ac4ca5b4_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/81cb358b-621d-4a0d-9949-73a2df6a4e8f_1024x1024.png&quot;}],&quot;caption&quot;:&quot;FLUX.1 [dev], FLUX.1 Krea [dev], HiDream I1 [dev], Qwen Image w/ Lightning&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/95bbd57e-e59c-4b33-a75d-3d396ded023f_1456x1456.png&quot;}},&quot;isEditorNode&quot;:true}"></div><ul><li><p><strong>FLUX.1 [dev]</strong> &#8212; fails to separate facial expressions.</p></li><li><p><strong>FLUX.1 Krea [dev]</strong> &#8212; separates expressions, but composition doesn&#8217;t follow the prompt (&#8220;facing each other&#8221;).</p></li><li><p><strong>HiDream I1 [dev]</strong> &#8212; captures subtler expressions but less precise. Google logo color is wrong.</p></li><li><p><strong>Qwen Image</strong> &#8212; succeeds at both expression and composition.</p></li></ul><h3>The Resolutions</h3><p>Qwen Image kept composition well from small resolutions (512x512) to large resolutions (2048x2048). That means for less powerful devices, you can do smaller resolution image generations without compromises beyond resolutions.</p><p><em>Prompt: 35mm analogue full-body portrait of a beautiful woman wearing black sheer dress, catwalking in a busy market, soft colour grading, infinity cove, shadows, kodak, contax t2</em></p><p>From left to right: 512x512, 768x768, 1024x1024, 1280x1280, 1536x1536, 2048x2048</p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e83b7d0-cb7b-45ab-ac6a-49f19f953fcf_512x512.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f34f1bfb-37df-4b18-86ac-9e0cb9fd56f0_768x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a50071ef-2dcc-478e-9e03-d22adc57e89e_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/32accf10-2491-4c65-aeb0-d8915402e42f_1280x1280.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fbaf1879-0a4a-4e48-bc52-486e1859736e_1536x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/973f6a95-2ff0-4b80-9df1-06e4802b4158_2048x2048.png&quot;}],&quot;caption&quot;:&quot;FLUX.1 [dev]&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/557bda44-a75e-4782-9c1f-545ca148e279_1456x964.png&quot;}},&quot;isEditorNode&quot;:true}"></div><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/82a0479c-6c4c-40b4-9312-1cbfaedaea06_512x512.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/47ca52d2-cb33-42c6-8177-d1a9ec4239b0_768x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dbe8b8c9-5f21-4f4f-94d1-c388b123aa1c_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eed4a786-6cb5-4afd-b2ba-23c51320f575_1280x1280.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/29b62ec2-12ee-4b2f-82cd-2895998ecce8_1536x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/60199eea-eb81-4dd5-a422-5386d3058439_2048x2048.png&quot;}],&quot;caption&quot;:&quot;FLUX.1 Krea [dev]&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c38312da-7dd9-49a4-ac2e-f46e020a17b6_1456x964.png&quot;}},&quot;isEditorNode&quot;:true}"></div><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/03154257-01f1-48ac-bc4d-7b8c4043a049_512x512.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a55c3e43-34f6-46f0-b741-0bd04cd0cbba_768x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c8488f4b-de3d-46f9-8d00-39ef07863fdc_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7adf3162-eafb-49b6-a944-69a8432a913e_1280x1280.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f8d0be7e-0e2b-4d3d-8327-a2f4b16b3116_1536x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/21502cf3-c39f-40a2-a5a1-f56ba1335bc3_2048x2048.png&quot;}],&quot;caption&quot;:&quot;HiDream I1 [dev]&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ac45eaf4-3783-4291-ad1a-d678674f50d8_1456x964.png&quot;}},&quot;isEditorNode&quot;:true}"></div><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1a247f3f-c057-411d-8d64-9668f5fe824a_512x512.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/948b4c76-00cb-4f02-9858-b139b5eb32a1_768x768.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b965f579-97fe-48ba-a312-3260d5d665e5_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/99405d31-2217-4620-9eef-dec047f89413_1280x1280.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5a4abb50-b15e-45ed-9fd4-a6da58930041_1536x1536.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bb5160c3-4b4b-4480-bb51-51e05c9333d0_2048x2048.png&quot;}],&quot;caption&quot;:&quot;Qwen Image w/ Lightning&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/33c1d891-68a4-4e14-a68a-076aaf4ceaba_1456x964.png&quot;}},&quot;isEditorNode&quot;:true}"></div><ul><li><p><strong>FLUX.1 [dev]</strong> &#8212; loses quality at 1536&#215;1536 and degrades comprehension at 2048&#215;2048.</p></li><li><p><strong>FLUX.1 Krea [dev]</strong> &#8212; maintains quality but fails on composition at 2048&#215;2048.</p></li><li><p><strong>HiDream I1 [dev]</strong> &#8212; struggles at 512&#215;512.</p></li><li><p><strong>Qwen Image</strong> &#8212; minor prompt confusion at low resolutions (&#8220;catwalk&#8221; misread as &#8220;cat ears&#8221;), but holds composition and quality even at 2048&#215;2048.</p></li></ul><h3>The Hero</h3><p>Qwen Image 1.0 excels in &#8220;wall of text&#8221; situations.</p><p><em>Prompt: Ultra HD, 4k, cinematic composition. A photograph of an anthropomorphic polar bear in a navy suit with a red bow-tie, in front of a blackboard, in what appears to be a college class-room. On the blackboard, colorful chalks are used to write "Two households, both alike in dignity,<br>In fair Verona, where we lay our scene,<br>From ancient grudge break to new mutiny,<br>Where civil blood makes civil hands unclean.<br>From forth the fatal loins of these two foes<br>A pair of star-cross'd lovers take their life;<br>Whose misadventured piteous overthrows<br>Do with their death bury their parents' strife.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5a964fac-2e09-43d5-a6e9-03e8b0d9c6cc_1536x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/14143187-dcbe-49c0-af72-b6b43522cdc1_1536x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8c156d02-70bd-443c-9f55-6fb0cb9f9a77_1536x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab95408a-f0b9-4542-bd10-e4e84755b765_1536x1024.png&quot;}],&quot;caption&quot;:&quot;FLUX.1 [dev], FLUX.1 Krea [dev], HiDream I1 [dev], Qwen Image w/ Lightning&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b9e87775-e69d-4567-b39b-405a873c926d_1456x1456.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p>Qwen Image 1.0 still produces some spelling errors, but among the four models tested, it comes closest to accuracy.</p><h3>The Bad</h3><p>While Qwen Image 1.0 demonstrates exceptional prompt adherence, it can be unpredictable aesthetically. It supports a wide range of styles, but with a simple prompt, it sometimes defaults to random stylistic preferences. Adjustments are often needed for precise aesthetics.</p><p><em>Prompt: A baby yoda wearing a halloween pajama, holding a sign says "Qwen Image 1.0 &#10084;&#65039; Draw Things".</em></p><p>The prompt trapped the model into 3D / illustration style, requiring targeted prompt tuning to switch styles.</p><p><em>Adjusted prompt: A amateur photograph of baby yoda wearing a halloween pajama, holding a sign says "Qwen Image 1.0 &#10084;&#65039; Draw Things". Home video.</em></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0266328d-6d21-4c2c-8c44-3db38fb2aecc_1024x1024.png&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1025dca1-525f-4beb-98e8-a56a53181669_1024x1024.png&quot;}],&quot;caption&quot;:&quot;Original, Adjusted&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c577f1e-6f8a-4fe7-89c4-b3914a07d15a_1456x720.png&quot;}},&quot;isEditorNode&quot;:true}"></div><h3>Engineering Note</h3><p>Qwen Image model is a 60-layer MMDiT model. As we&#8217;ve discussed in <a href="https://engineering.drawthings.ai/p/bf16-and-image-generation-models-803cf0515bee">BF16 and Image Generation Models</a>, deep MMDiT architectures produce progressively larger activations during training. For Qwen Image, the final layers reach activation ranges around 60 million, which requires BF16 for proper support. To make the model usable in FP16, activations must be scaled down in more places. We&#8217;ll publish a separate write-up detailing our findings.</p>]]></content:encoded></item></channel></rss>