fix(llm): cost metrics calculation for unsupported litellm prefix #6022
Conversation
if cost is None:
    cost = float(
        _hidden_params.get('additional_headers', {}).get(
            'llm_provider-x-litellm-response-cost', 0.0
        )
    )
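The fallback above can be sketched in isolation. This is a minimal, self-contained illustration, not the actual OpenHands code: the function name `resolve_cost` and the shape of `_hidden_params` are assumptions for demonstration; the header key is the one discussed in this thread.

```python
def resolve_cost(cost, _hidden_params):
    """Hypothetical sketch: if the primary cost calculation returned None,
    fall back to the cost the LiteLLM proxy reports in its response headers."""
    if cost is None:
        cost = float(
            _hidden_params.get('additional_headers', {}).get(
                'llm_provider-x-litellm-response-cost', 0.0
            )
        )
    return cost

# Header present: the string value is parsed as a float.
resolve_cost(None, {'additional_headers': {'llm_provider-x-litellm-response-cost': '0.0123'}})
# Header absent: falls back to 0.0 rather than raising.
resolve_cost(None, {})
```

Note the `0.0` default: a proxy that doesn't send the header yields a zero cost instead of a `TypeError` from `float(None)`.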
Is this the header name, or just `x-litellm-response-cost`?
https://github.com/BerriAI/litellm/blob/3230c902c9d9c4c933773e946b744965573a14f9/litellm/proxy/proxy_server.py#L738
yeah i think this is the actual key in that dict 🤔
Strange! 😅
OK, let's go with it
Looks reasonable, thanks!
(I think at some point we could move this to the logging mixin or something; we seem to have more and more code unrelated to the main purpose of this class to keep track of in the same place 😅)
End-user friendly description of the problem this fixes or functionality that this introduces
Give a summary of what the PR does, explaining any non-trivial design decisions
Currently, our metric cost calculation doesn't work if the model is served through a LiteLLM proxy.
This PR reads the `llm_provider-x-litellm-response-cost` header from the litellm proxy response (https://docs.litellm.ai/docs/proxy/cost_tracking#how-to-track-spend-with-litellmlitellm_proxy/) and handles the unsupported `litellm_proxy` prefix for cost -- so as long as the model name is the same, it will get the cost correctly.
Link of any specific issues this addresses
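For illustration, handling the unsupported prefix could look like stripping `litellm_proxy/` from the model name before the cost lookup, so the lookup sees the underlying model name. This is a hypothetical sketch of that idea, not the PR's actual implementation; `strip_proxy_prefix` is an invented helper name.

```python
def strip_proxy_prefix(model: str) -> str:
    """Hypothetical helper: remove the 'litellm_proxy/' prefix, which the
    cost lookup does not recognize, leaving the underlying model name."""
    prefix = 'litellm_proxy/'
    if model.startswith(prefix):
        return model[len(prefix):]
    return model

# Proxied model name is reduced to the plain model name.
strip_proxy_prefix('litellm_proxy/gpt-4o')
# Names without the prefix pass through unchanged.
strip_proxy_prefix('gpt-4o')
```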
To run this PR locally, use the following command: