[Feature Request] Prompt lookup decoding

Hi TRT-LLM team, 

I think prompt lookup decoding could be beneficial for **Summarization**, **Context-QA**, and **Multi-turn chat**.

The technique is similar to speculative decoding, but instead of using a draft model, it uses string matching against the prompt to generate candidate token sequences.

Github : [Prompt-lookup-decoding](https://github.com/apoorvumang/prompt-lookup-decoding/tree/main)
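
To make the request concrete, here is a minimal sketch of the candidate-generation step as I understand it. This is plain Python for illustration only, not TRT-LLM code and not the linked repo's exact implementation; the function name `find_candidate_tokens` and the parameters `max_ngram_size` and `num_pred_tokens` are my own assumptions. The idea is to take the last few tokens generated so far, look for an earlier occurrence of the same n-gram in the sequence, and propose the tokens that followed it as the draft.

```python
from typing import List, Sequence


def find_candidate_tokens(
    tokens: Sequence[int],
    max_ngram_size: int = 3,   # hypothetical default, longest suffix to try
    num_pred_tokens: int = 10, # hypothetical default, draft length to propose
) -> List[int]:
    """Propose draft tokens by matching the trailing n-gram earlier in `tokens`.

    `tokens` is the prompt plus everything generated so far. Returns an empty
    list when no earlier match exists.
    """
    n_tokens = len(tokens)
    # Prefer longer n-grams first, since a longer match is a stronger signal.
    for ngram_size in range(max_ngram_size, 0, -1):
        if n_tokens <= ngram_size:
            continue
        suffix = list(tokens[-ngram_size:])
        # Scan earlier windows only; the last window is the suffix itself.
        for start in range(n_tokens - ngram_size):
            if list(tokens[start:start + ngram_size]) == suffix:
                cont_start = start + ngram_size
                # Tokens that followed the earlier occurrence become the draft.
                return list(tokens[cont_start:cont_start + num_pred_tokens])
    return []


if __name__ == "__main__":
    # The trailing bigram (2, 3) also appears near the start, followed by 4, 5, 6.
    print(find_candidate_tokens([1, 2, 3, 4, 5, 6, 2, 3], 3, 3))
```

As with regular speculative decoding, the proposed tokens would still need to be verified by the target model in a single forward pass, with only the accepted prefix kept, so the output should not change; only the draft source does.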

[Feature Request] Prompt lookup decoding #311

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature Request] Prompt lookup decoding #311

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions