-
Notifications
You must be signed in to change notification settings - Fork 11
Open
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers
Description
Summary
Implement and finalize src/speculators/models/mlp.py to support MLP-based speculator models for speculative decoding algorithms utilizing combined token/embedding speculators.
Reference
Acceptance Criteria
Classes and Test Cases
- Implement
MLPSpeculatorConfigandMLPSpeculatorfollowing the example insrc/speculators/models/eagle.py. - Ensure compatibility with
SpeculatorModelConfig.from_pretrainedandSpeculatorModel.from_pretrained. - Implement comprehensive test cases following the examples in:
tests/unit/models/test_eagle_config.pytests/unit/models/test_eagle_model.py
MLPSpeculatorConfig
- Include all relevant hyperparameters expected to change or be configured to construct a working MLP Speculator model as defined in the referenced paper.
MLPSpeculator
- Correctly create the required MLP-based architecture from a given
MLPSpeculatorConfig. - Enable loading and saving of weights.
- Integrate seamlessly with the existing system.
TokenProposal Functionality
- Implement any missing
TokenProposalmethods or functionality as defined in the paper, or expand current implementations as needed.
Out of Scope (Future Targets)
- Implement a functioning forward pass for
SpeculatorModelcompatible with training flows. - Implement a functioning generate pass for
SpeculatorModelcompatible with generation flows. - Create an Algorithm factory to handle preconfigured hyperparameters for the desired supported algorithms.
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers