Optimize AI Bridge's JSON handling for llm payloads

The sdks that AI Bridge currently uses to interact with LLM clients and servers makes some trade-offs that work well for clients, but not for a gateway.

To scale bridge, we need to reduce the CPU and memory footprint of JSON handling in bridge.

See [here](https://www.notion.so/coderhq/AI-Bridge-Pre-GA-Scale-tests-2c5d579be5928088b565d15dd8bdea41?source=copy_link) for more information.

Pull requests already exist to address some of these issues:
* https://github.com/anthropics/anthropic-sdk-go/pull/262
* https://app.graphite.com/github/pr/coder/aibridge/102/perf-only-marshal-once

Both of these also need to be applied to OpenAI.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimize AI Bridge's JSON handling for llm payloads #1236

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Optimize AI Bridge's JSON handling for llm payloads #1236

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions