@TiagoSantos81

* an i7 + 3070 8 GB laptop can go from ~34.5 ips with batchSize=10 to ~37 ips with batchSize=32, with slight VRAM offloading to RAM
* the get-prompt class needs to be implemented first, so that the first batch in line works as the usual warm-up
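
For context, the figures above work out to a gain of roughly 7%. Below is a minimal sketch of how such a comparison could be timed, with the first batch treated as an untimed warm-up. It is not the code from this PR: `measure_ips`, `relative_gain`, and the `generate` callable are hypothetical names introduced for illustration, and "ips" is assumed to mean images per second.

```python
import time
from typing import Callable


def measure_ips(
    generate: Callable[[int], None],   # hypothetical: runs one batch of the given size
    batch_size: int,
    n_batches: int = 5,
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Return images per second over n_batches, excluding one warm-up batch."""
    generate(batch_size)               # warm-up batch, deliberately not timed
    start = clock()
    for _ in range(n_batches):
        generate(batch_size)
    elapsed = clock() - start
    return (batch_size * n_batches) / elapsed


def relative_gain(before: float, after: float) -> float:
    """Percentage change from `before` to `after`."""
    return (after - before) / before * 100.0


if __name__ == "__main__":
    # Figures quoted in the PR description: ~34.5 ips -> ~37 ips.
    print(f"{relative_gain(34.5, 37.0):.1f}% faster")  # prints "7.2% faster"
```

The `clock` parameter exists only so the timing can be swapped out in tests; in normal use the default `time.perf_counter` is enough.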
@TiagoSantos81 changed the title from "[maxperf] allow arbitrary batch sizes for better trials" to "[maxperf] allow arbitrary batch sizes for better trials and various minor fixes" on Dec 7, 2023
@TiagoSantos81 changed the title from "[maxperf] allow arbitrary batch sizes for better trials and various minor fixes" to "[maxperf] [artspew] allow arbitrary batch sizes for better trials and various minor fixes" on Dec 7, 2023