-
Couldn't load subscription status.
- Fork 29
LLM pipeline implementation #1040
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
…re pipeline cannot handle an input size larger than the max prefill size
|
MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅ |
…lemented performance benchmark for LLM pipeline
…y input and issue_query only handles output tokens
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
.
|
This PR should resolve the issue with the iOS build: #1064 |
Thanks a ton! I'll look into the windows issue. |
…s calculated per instruction not per sample
|
@freedomtan to test the app (and the accuracy of tinyMMLU). |
|
for performance:
for accuracy:
|
|
@farook-edev please share link to all assets used for LLM benchmark, TFRecords for datasets and models |
|
confirmed that I can run the apk (https://github.com/mlcommons/mobile_app_open/actions/runs/18638579937/artifacts/4313148049) on Pixel 10. However, it seems there are no ttft and tokens/s information in The following is the |
You should be able to find them here. |
|
@farook-edev please help to have meaningful numbers in the @anhappdev: we always update to have the latest LoadGen version when we have a new release. Please do it. Surely, we should test it carefully. |
|
Disable the C++ exception handling when building Eigen for iOS. |
|
@mohitmundhragithub: please provide the link to the discussed mlperf_client testing method. |
|
prompts currently in client: https://github.com/mlcommons/mlperf_client_dev/tree/main/data/llama3/prompts current media document for 1.5 submission. Describes all aspects of the benchmark: https://docs.google.com/document/d/1QUWJa-iKyXznzco3trM6P-UOlvN33sKV3Yp7MyQCzrs/edit?pli=1&tab=t.0
Performance sheet (contains all the formula used): |
|
Note on samples:
|
|




No description provided.