Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
15 changes: 15 additions & 0 deletions speed-bench/m5_max_q2q4_imatrix.csv
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
ctx_tokens,prefill_tokens,prefill_tps,gen_tokens,gen_tps,kvcache_bytes
2048,2048,413.85,128,34.42,52184460
18432,16384,405.31,128,28.42,277693836
34816,16384,374.49,128,27.75,503203212
51200,16384,333.84,128,26.79,728712588
67584,16384,298.66,128,25.75,954221964
83968,16384,269.69,128,25.43,1179731340
100352,16384,248.99,128,24.36,1405240716
116736,16384,230.49,128,23.63,1630750092
133120,16384,215.12,128,22.37,1856259468
149504,16384,198.15,128,21.70,2081768844
165888,16384,187.32,128,20.72,2307278220
182272,16384,176.49,128,20.16,2532787596
198656,16384,165.14,128,19.54,2758296972
200000,1344,157.02,128,19.37,2776775308
52 changes: 52 additions & 0 deletions speed-bench/m5_max_q2q4_imatrix_ts.svg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.