Added changes for incorporating /rerank support through Huggingface TEI #9311
+807,091
−806,922
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
The code currently runs and passes test, the only thing that needs testing is an actual locally running instance of TEI, to get results from.
Title
Implementing
/rerank
functionRelevant issues
Fixes #8372
Pre-Submission checklist
Please complete all items before asking a LiteLLM maintainer to review your PR
tests/litellm/
directory, Adding at least 1 test is a hard requirement - see detailsmake test-unit
)[https://docs.litellm.ai/docs/extras/contributing_code]Type
🆕 New Feature
Changes
Challenges:
rand_distr
,half
used bycandle
, its a bit difficult to setup a TEI instance to test this code against for API calls.half::bf16: SampleUniform
is not satisfied huggingface/candle#2805, so assuming it will take time to propagate, this is the current state of testing things.make unit-test
didnt run, due to lack of bedrock credentials, which shoudln't have been an issue, but not sure what caused it as I wasnt working with it.but in the meanwhile here are the files that have major changes
litellm/llms/huggingface/rerank/handler.py
litellm/llms/huggingface/rerank/transformations.py
tests/litellm/rerank_api/test_rerank_hf.py
litellm/rerank_api/main.py