-
Notifications
You must be signed in to change notification settings - Fork 9
Open
Description
Hello, thank you for sharing with the open source community. @ranonrkm @jianc99
I ran the result of Token per second based on Baseline.
But the result of running according to this command ENABLE_INTRA_NODE_COMM=1 torchrun --standalone --nproc_per_node=8 tests/SnapKV/selfspec_benchmark.py --model checkpoints/meta-llama/Meta-Llama-3.1-8B/model.pth --model_name meta-llama/Meta-Llama-3.1-8B --rank_group 0 1 2 3 4 5 6 7 --gamma 3 --B 64 --prefix_len 16032 --max_len 16128 --draft_budget 257 --benchmark --compile is as shown below.

Could you tell me how to view this result? How to compare with Baseline?
Metadata
Metadata
Assignees
Labels
No labels