-
Notifications
You must be signed in to change notification settings - Fork 717
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature] Add docs for pass in token ids directly #2661
Comments
The --skip-tokenizer-init flag allows direct input of token IDs instead of text prompts.
|
Also clarify the naming when skip_init_tokenizer is on, prompt -> prompt token |
@shuaills Also, check whether these two parameters are conflicted. Add an example on:
I think we should change this: https://github.com/sgl-project/sglang/blob/main/test/srt/test_engine_token_ids.py And add examples in: https://github.com/sgl-project/sglang/tree/main/examples/runtime And change docs accordingly. |
Also, please take special consideration of special tokens and chat template. I assume that: give a string
We should have: A1 == A2 == A3 Also, the sampling parameter may introduce some randomness for Maybe you can give link to this in the unit tests. Also, do not remove any of the current test cases in |
Thanks for the clarification. Sounds good. |
|
Be careful about the chat template and special tokens. |
What's the usage of special tokens and chat templates. |
@shuaills And keep care of multi-modal models. |
@shuaills Also, here is something. In vllm, there is a strange parameter called I am using this right now, too tedious: input_token_id_list = [list(output["input_ids"]) for output in outputs]
output_token_id_list = [
list(output["output_ids"]) + [eos_token_id] if list(output["output_ids"])[-1] != eos_token_id
else list(output["output_ids"])
for output in outputs
] |
Checklist
Motivation
In most of RLHF frameworks, the prompts are pre-tokenized when data processing, so they can directly pass in token ids to the sglang engine rather than the prompts. So we should add docs on how to do this and how to get tokens directly.
Related resources
No such.
The text was updated successfully, but these errors were encountered: