Replies: 1 comment
-
Custom usage and debugging is not an issue of the diffusers library itself so moving this to discussions. If you want to do multiple generations in parallel on a single GPU, create different pipeline instances using the same internal components ( |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Describe the bug
When I test the mit-han-lab/nunchaku-flux.1-kontext-dev model, it runs normally in a non-concurrent scenario, but throws an error when I try to run it with concurrent requests.
My GPU is a single RTX 4090D.
How can I enable multi-concurrency support on a single GPU?
Thank you in advance for your help.
Here is my error message:
[2025-08-08 17:14:50.242] [info] Initializing QuantizedFluxModel on device 0
[2025-08-08 17:14:50.382] [info] Loading partial weights from pytorch
[2025-08-08 17:14:51.445] [info] Done.
Injecting quantized module
Loading checkpoint shards: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:00<00:00, 99.47it/s]
Loading pipeline components...: 57%|████████████████████████████████████████████████████████████████████████████████████████▌ | 4/7 [00:00<00:00, 28.54it/s]You set
add_prefix_space
. The tokenizer needs to be converted from the slow tokenizersLoading pipeline components...: 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 7/7 [00:00<00:00, 19.02it/s]
Generation
height
andwidth
have been adjusted to 752 and 1360 to fit the model requirements.Generation
height
andwidth
have been adjusted to 880 and 1168 to fit the model requirements.43%|███████████████████████████████████████████████████████████████████████████████▎ | 12/28 [00:17<00:23, 1.45s/it]
57%|█████████████████████████████████████████████████████████████████████████████████████████████████████████▋ | 16/28 [00:18<00:13, 1.17s/it]
处理图像时出错: index 29 is out of bounds for dimension 0 with size 29
处理图像时出错: index 29 is out of bounds for dimension 0 with size 29
Reproduction
Logs
System Info
~/FLUX.1-Kontext-Dev-nunchaku# diffusers-cli env
Copy-and-paste the text below in your GitHub issue and FILL OUT the two last points.
Who can help?
No response
Beta Was this translation helpful? Give feedback.
All reactions