
Conversation

@chitalian
Contributor

We should rely on the model provider returning the usage tokens. Calculating usage tokens on our CPU hurts performance and is not accurate, especially with more complex modalities, function calling, etc.

Since we are prioritizing the gateway moving forward, reducing this extra layer of complexity is helpful.

@vercel

vercel bot commented Oct 8, 2025

The latest updates on your projects. Learn more about Vercel for GitHub.

| Project | Deployment | Preview | Comments | Updated (UTC) |
| --- | --- | --- | --- | --- |
| helicone | Ready | Preview | Comment | Nov 21, 2025 6:01pm |
| helicone-bifrost | Ready | Preview | Comment | Nov 21, 2025 6:01pm |
| helicone-eu | Ready | Preview | Comment | Nov 21, 2025 6:01pm |

Contributor

greptile-apps bot left a comment

Greptile Overview

Summary

This PR removes local tokenization dependencies from the Helicone codebase in favor of relying on LLM provider-returned token usage data. The changes eliminate CPU-intensive token counting that was impacting performance and accuracy, particularly for complex scenarios involving function calls and multimodal inputs.

The PR deletes key tokenization files including tokenCounter.ts, gptWorker.ts, and tokenRouter.ts, while removing dependencies on @anthropic-ai/tokenizer, js-tiktoken, and tiktoken from package.json. Body processors for OpenAI, Anthropic, Vercel, and Llama streams have been simplified to rely exclusively on provider-supplied usage data rather than calculating tokens locally.

This architectural shift aligns with Helicone's strategic focus on their AI Gateway, reducing system complexity and computational overhead. The changes maintain backward compatibility by gracefully handling cases where providers don't return usage data, typically returning -1 values with informative error messages directing users to enable stream usage in their provider settings.
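
A minimal sketch of what that graceful fallback could look like in a body processor, assuming an OpenAI-style `usage` object; the function and field names below are illustrative, not Jawn's actual processor API:

```typescript
// Illustrative only: usageFromProvider and the return shape are assumptions,
// not Helicone's real types.
interface ProviderUsage {
  prompt_tokens?: number;
  completion_tokens?: number;
  total_tokens?: number;
}

function usageFromProvider(body: { usage?: ProviderUsage }) {
  const usage = body.usage;
  if (!usage) {
    // No local tokenization fallback anymore: report -1 and point users at
    // enabling stream usage on the provider side.
    return {
      promptTokens: -1,
      completionTokens: -1,
      totalTokens: -1,
      heliconeError:
        "counting tokens not supported, please see https://docs.helicone.ai/use-cases/enable-stream-usage",
    };
  }
  const prompt = usage.prompt_tokens ?? 0;
  const completion = usage.completion_tokens ?? 0;
  return {
    promptTokens: prompt,
    completionTokens: completion,
    totalTokens: usage.total_tokens ?? prompt + completion,
  };
}
```

Returning -1 sentinels rather than throwing keeps the usage fields populated, which is what makes the backward-compatibility claim above hold when a provider omits usage.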

Important Files Changed

Changed Files

| Filename | Score | Overview |
| --- | --- | --- |
| valhalla/jawn/src/lib/shared/bodyProcessors/openAIStreamProcessor.ts | 4/5 | Removes manual token counting and relies entirely on OpenAI provider usage data with proper fallback handling |
| valhalla/jawn/src/lib/shared/bodyProcessors/anthropicStreamBodyProcessor.ts | 4/5 | Simplifies processor by removing complex model-specific tokenization logic and relying on provider tokens |
| valhalla/jawn/src/lib/tokens/gptWorker.js | 4/5 | Complete deletion of Node.js entry point for TypeScript tokenization worker |
| valhalla/jawn/package.json | 4/5 | Removes three tokenization library dependencies while keeping gpt-tokenizer for potential other uses |
| valhalla/jawn/src/lib/tokens/tokenCounter.ts | 4/5 | Complete removal of all local token counting functions and tokenizer initialization |
| valhalla/jawn/src/lib/shared/bodyProcessors/vercelStreamProcessor.ts | 4/5 | Eliminates 66-line fallback token calculation block and relies solely on stream-provided usage |
| valhalla/jawn/src/lib/shared/bodyProcessors/anthropicBodyProcessor.ts | 4/5 | Removes legacy token counting for older Claude models and uses only Anthropic-provided usage data |
| valhalla/jawn/src/lib/shared/bodyProcessors/llamaStreamBodyProcessor.ts | 4/5 | Removes tokenizer fallback logic and simplifies to use only provider-returned metrics |
| valhalla/jawn/src/lib/routers/tokenRouter.ts | 1/5 | Critical issue: Router deleted but still imported in index.ts, will cause runtime errors |
| valhalla/jawn/src/lib/tokens/gptWorker.ts | 4/5 | Removes GPT tokenizer worker thread implementation to eliminate computational overhead |

Confidence score: 2/5

  • This PR introduces a breaking change that will likely cause immediate runtime failures due to missing tokenRouter import
  • Score significantly lowered due to incomplete cleanup leaving dangling imports that will crash the application
  • Pay close attention to valhalla/jawn/src/lib/routers/tokenRouter.ts deletion and any files importing it


Additional Comments (2)

  1. valhalla/jawn/src/lib/shared/bodyProcessors/anthropicStreamBodyProcessor.ts, line 237-313 (link)

    style: Dead code - these unused functions should be removed entirely instead of keeping them 'for reference'.

  2. valhalla/jawn/src/lib/routers/tokenRouter.ts, line 1 (link)

    logic: Critical issue: The main server file still imports this deleted tokenRouter (line 20 in index.ts) and uses it (line 171). This will cause a runtime error when the server starts. The import and usage need to be removed from index.ts.

10 files reviewed, 3 comments


Comment on lines 22 to 26
totalTokens:
parsedResponseBody?.usage?.input_tokens +
parsedResponseBody?.usage?.output_tokens +
parsedResponseBody?.usage?.cache_creation_input_tokens +
parsedResponseBody?.usage?.cache_read_input_tokens,
Contributor

style: Consider using nullish coalescing (?? 0) for cache token values to handle undefined values safely in the addition

Prompt To Fix With AI
This is a comment left during a code review.
Path: valhalla/jawn/src/lib/shared/bodyProcessors/anthropicBodyProcessor.ts
Line: 22:26

Comment:
**style:** Consider using nullish coalescing (`?? 0`) for cache token values to handle undefined values safely in the addition

How can I resolve this? If you propose a fix, please make it concise.
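
One way the quoted lines 22 to 26 could look with the suggested coalescing applied, assuming the same optional-chained `usage` shape as in the diff above:

```typescript
// Cache token fields can be undefined on models without prompt caching,
// so default each operand to 0 before adding.
totalTokens:
  (parsedResponseBody?.usage?.input_tokens ?? 0) +
  (parsedResponseBody?.usage?.output_tokens ?? 0) +
  (parsedResponseBody?.usage?.cache_creation_input_tokens ?? 0) +
  (parsedResponseBody?.usage?.cache_read_input_tokens ?? 0),
```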

@claude
Contributor

claude bot commented Oct 8, 2025

Claude finished @chitalian's task.


Code Review Complete

Score: 8/10 - Good to merge with minor suggestions

Summary:
This PR successfully removes local tokenization dependencies to rely on LLM provider-returned token usage data. The approach is sound for performance and accuracy, especially for complex scenarios like function calls and multimodal inputs. The implementation is clean with proper fallback handling.

Analysis

What was removed:

  • ✅ Deleted @anthropic-ai/tokenizer, js-tiktoken, tiktoken dependencies
  • ✅ Removed tokenCounter.ts, gptWorker.ts/js, tokenRouter.ts
  • ✅ Cleaned up index.ts imports properly (Greptile's alert was outdated)
  • ✅ Simplified all body processors to rely on provider usage data

What was improved:

  • ✅ Better error handling with informative messages directing users to documentation
  • ✅ Consistent fallback behavior across all processors
  • ✅ Reduced computational overhead and complexity
  • ✅ More accurate token counts from authoritative sources

Minor Issues & Suggestions

1. Null Safety in Anthropic Body Processor (anthropicBodyProcessor.ts:26-27)

// Consider using nullish coalescing for safer addition
totalTokens: (parsedResponseBody.input_tokens ?? 0),
promptTokens: (parsedResponseBody.input_tokens ?? 0),

2. Error Message Consistency
The OpenAI stream processor provides a helpful error message:

helicone_error: "counting tokens not supported, please see https://docs.helicone.ai/use-cases/enable-stream-usage"

Consider adding similar informative errors in other processors when usage data is missing.
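
One lightweight way to keep that message consistent would be a shared constant the processors import; the identifier below is an assumption, not an existing Helicone symbol:

```typescript
// Hypothetical shared constant; each body processor would reference this
// instead of repeating the literal string when provider usage is absent.
export const STREAM_USAGE_NOT_SUPPORTED =
  "counting tokens not supported, please see https://docs.helicone.ai/use-cases/enable-stream-usage";
```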

3. Dead Code Removal (Found in original file)
In anthropicStreamBodyProcessor.ts, there appear to be unused function remnants around lines 237-313 that should be removed entirely rather than kept "for reference".

Security & Performance Assessment

✅ Security: No security concerns identified. The change actually improves security by:

  • Reducing attack surface (fewer dependencies)
  • Relying on authoritative provider data instead of local calculations

✅ Performance: Significant performance improvement by:

  • Eliminating CPU-intensive tokenization
  • Reducing memory usage from tokenizer libraries
  • Faster response processing

✅ Accuracy: Improved accuracy by:

  • Using provider-calculated tokens (especially for function calls, multimodal content)
  • Eliminating estimation errors from local tokenization

Breaking Changes Assessment

Low Risk: The change maintains backward compatibility:

  • Same API surface for usage data
  • Graceful fallback when usage data unavailable (-1 values with helpful errors)
  • Existing integrations should continue working

Testing Recommendations

While not blocking, consider testing:

  1. Stream responses from different providers (OpenAI, Anthropic, Vercel)
  2. Non-stream responses with and without usage data
  3. Error scenarios when providers don't return usage information
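
A hedged sketch of the third scenario as a Jest test; `ExampleStreamProcessor` is a stand-in, not one of Jawn's real body processors, whose interfaces may differ:

```typescript
import { describe, expect, it } from "@jest/globals";

// Stand-in processor mirroring the provider-usage-only behavior: it falls back
// to -1 counts when the response body carries no usage block.
class ExampleStreamProcessor {
  parse(body: { usage?: { prompt_tokens: number; completion_tokens: number } }) {
    if (!body.usage) {
      return { promptTokens: -1, completionTokens: -1, totalTokens: -1 };
    }
    const { prompt_tokens, completion_tokens } = body.usage;
    return {
      promptTokens: prompt_tokens,
      completionTokens: completion_tokens,
      totalTokens: prompt_tokens + completion_tokens,
    };
  }
}

describe("provider response without usage data", () => {
  it("returns -1 counts instead of tokenizing locally", () => {
    const result = new ExampleStreamProcessor().parse({});
    expect(result.totalTokens).toBe(-1);
  });

  it("passes provider-reported usage through untouched", () => {
    const result = new ExampleStreamProcessor().parse({
      usage: { prompt_tokens: 12, completion_tokens: 3 },
    });
    expect(result.totalTokens).toBe(15);
  });
});
```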

Suggestions Summary:

  • Minor: Add nullish coalescing in anthropicBodyProcessor.ts:26-27
  • Cleanup: Remove any remaining dead code from stream processors
  • Enhancement: Add consistent error messages across all processors when usage data is missing

Overall Assessment: This is a well-executed refactor that meaningfully improves performance and accuracy while maintaining API compatibility. The strategic shift toward provider-supplied usage data aligns well with the AI Gateway focus.

chitalian and others added 2 commits November 20, 2025 10:47
Resolved conflicts:
- .claude/settings.local.json: Kept main's version
- anthropicBodyProcessor.ts: Merged logic from main and fixed unsafe arithmetic with nullish coalescing
- Auto-generated files (swagger.json, routes.ts, jawnTypes): Accepted main's versions
- yarn.lock: Accepted main's version

Also fixed unsafe arithmetic operations in anthropicBodyProcessor.ts by using nullish coalescing (?? 0) for cache token calculations.
- Remove tokenRouter import and usage from index.ts (breaking import fix)
- Fix unsafe arithmetic in anthropicStreamBodyProcessor.ts by using nullish coalescing (?? 0)
- Remove dead code functions (recursivelyConsolidateAnthropicListForClaude3 and recursivelyConsolidateAnthropic)
- Remove heliconeCalculated flag since we're relying on provider tokens

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <[email protected]>