Open
Description
Hi, congrats on the great job you're doing in this repo. I set out with a similar goal of implementing a hackable, high-performance inference and post-training package in JAX, and then found this repository. It seems that most of the ideas I had in mind are already implemented here. I wonder if you have any comparative benchmarks of inference and training performance with the PyTorch ecosystem as the baseline, e.g., vLLM for inference and Unsloth for post-training. I know that the true value proposition might be performance on TPUs, but it would be great if we could achieve comparable performance with the highly optimized PyTorch ecosystem on GPUs as well.
yiyousong