Skip to content
Discussion options

You must be logged in to vote

Hey :) This is due to a major update in our methodology released in around November 2025, see this blog post. We now account for more recent GPUs, batching and we consider a more efficient backend (vLLM), leading to lower approximations, but that we believe closer to reality.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@tgilormini-source
Comment options

Answer selected by samuelrince
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants