Slow generation #14

Closed
KintCark opened this issue May 22, 2025 · 1 comment

@KintCark

Why is the generation speed so slow with SDXL? I'm trying to use Playground 2.5, but it takes forever to get a picture, and then it gives me a black image, so I waited all that time for nothing. Will you be making better optimizations soon? By the way, there is no supported APK for the 8 Gen 1; it only lists 8 Gen 1+. I tried using it and it just crashes when loading a quantized model.

@rmatif
Owner

rmatif commented May 22, 2025

@KintCark

There's nothing I can do about the speed — it’s not related to the Flutter/Dart side. It has to do with the ggml library, and the inference is slow on mobile due to the massive compute workload. That’s also why you don’t see many local diffusion inference apps around.

As for CPU performance, ggml is already fairly well optimized. I might revisit the KleidiAI microkernels, but in my last tests they didn't perform well with more than 4 threads.
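
If you want to check thread scaling on your own device, here is a small generic probe (a standalone sketch of mine, not ggml or KleidiAI code) that times a naive matrix-vector product at different OpenMP thread counts. On most mobile SoCs the timings stop improving after a few threads because memory bandwidth, not compute, becomes the bottleneck:

```c
/* Generic thread-scaling probe (a standalone sketch, unrelated to ggml's internals).
 * Build with: cc -O2 -fopenmp probe.c -o probe */
#include <stdio.h>
#include <omp.h>

#define N 4096

static float a[N][N], x[N], y[N];

int main(void) {
    for (int i = 0; i < N; i++) {
        x[i] = 1.0f;
        for (int j = 0; j < N; j++) a[i][j] = 0.5f;
    }

    for (int t = 1; t <= 8; t *= 2) {
        omp_set_num_threads(t);
        double t0 = omp_get_wtime();
        #pragma omp parallel for
        for (int i = 0; i < N; i++) {
            float acc = 0.0f;
            for (int j = 0; j < N; j++) acc += a[i][j] * x[j];
            y[i] = acc;
        }
        /* If timings stop improving past a few threads, the kernel is
         * memory-bandwidth bound, so extra threads (or fancier microkernels)
         * won't buy much. */
        printf("%d thread(s): %.2f ms\n", t, (omp_get_wtime() - t0) * 1e3);
    }
    printf("checksum: %f\n", y[0]); /* reference y so the loop isn't optimized away */
    return 0;
}
```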

I’m doing my best to improve performance by adding an OpenCL backend:
leejet/stable-diffusion.cpp#680
and also by tuning Vulkan:
ggml-org/llama.cpp#13483
Still, it's tough to beat the current CPU performance.

The best path forward is to use distilled models that can converge in a single step. I’ve already added some:
leejet/stable-diffusion.cpp#675
and plan to include more in the future.

I tested the app on a Snapdragon 8 Gen 3 and it works fine. Some vendors may not support custom CPU instructions, so I recommend sticking with the generic APK. On this device, I get around 8s/it for SD1.5, which is decent — you can get good results in under a minute.
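
To put that number in context, here is a rough back-of-the-envelope cost model (a plain-C sketch, not the app's actual code), using the ~8 s/it figure above; note that whether one iteration includes one or two UNet passes depends on whether CFG is enabled:

```c
/* Back-of-the-envelope generation-time estimate: total ≈ steps * seconds_per_iteration.
 * The 8.0 s/it figure is the SD1.5 timing reported above for a Snapdragon 8 Gen 3. */
#include <stdio.h>

static double gen_seconds(int steps, double sec_per_it) {
    return steps * sec_per_it;
}

int main(void) {
    printf("4-step distilled run:  ~%.0f s\n", gen_seconds(4, 8.0));  /* ~32 s, under a minute */
    printf("20-step standard run: ~%.0f s\n", gen_seconds(20, 8.0));  /* ~160 s */
    return 0;
}
```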

The black-image output is caused by SDXL's VAE producing NaNs in FP16. As mentioned in a previous issue, Playground 2.5 also needs a custom scheduler anyway.
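
For what it's worth, the failure mode is easy to reproduce in isolation: FP16 tops out around 65504, the SDXL VAE is known to produce activations beyond that range, so values overflow to inf, later turn into NaN, and the decoded image comes out black. A minimal sketch, assuming a compiler with _Float16 support (recent clang/GCC):

```c
/* Minimal illustration of FP16 overflow turning into NaN (a standalone sketch,
 * assuming _Float16 support; the actual NaNs happen inside the VAE decode). */
#include <stdio.h>

int main(void) {
    float big_activation = 70000.0f;        /* beyond the FP16 max of ~65504 */
    _Float16 h = (_Float16)big_activation;  /* overflows to +inf in FP16 */
    float back = (float)h;
    printf("fp32 %.1f -> fp16 -> fp32 %f\n", big_activation, back);  /* prints inf */
    printf("inf - inf = %f\n", back - back);                         /* prints nan */
    return 0;
}
```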

For now, I recommend sticking to distilled models with CFG-free sampling.

@rmatif rmatif closed this as completed May 22, 2025