
@ASuresh0524 (Collaborator)

Fixes the issues from #154

The merge conflict in `embedding_compute.py` has been resolved, preserving all token-limit fixes while integrating the batch-processing improvements from PR #152:

Combined Features:

  • Token-aware truncation with `tiktoken` support
  • Modern `/api/embed` endpoint with true batch processing
  • Enhanced error detection for token-limit violations
  • Model token-limit registry for safety
  • All performance improvements from batch processing
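A minimal sketch of what the token-aware truncation could look like. The function name, the `MODEL_TOKEN_LIMITS` registry, and the limit values here are hypothetical illustrations, not code from `embedding_compute.py`; a real implementation would plug in `tiktoken`'s `encode`/`decode` in place of the stand-in tokenizer.

```python
from typing import Callable, List

# Illustrative per-model token-limit registry (values are assumptions).
MODEL_TOKEN_LIMITS = {
    "nomic-embed-text": 8192,
    "all-minilm": 512,
}

def truncate_to_token_limit(
    text: str,
    model: str,
    encode: Callable[[str], List],
    decode: Callable[[List], str],
) -> str:
    """Return `text` truncated so its token count fits the model's limit."""
    limit = MODEL_TOKEN_LIMITS.get(model, 512)  # conservative fallback
    tokens = encode(text)
    if len(tokens) <= limit:
        return text
    return decode(tokens[:limit])

# In practice the tokenizer would come from tiktoken, e.g.:
#   enc = tiktoken.get_encoding("cl100k_base")
#   truncate_to_token_limit(text, model, enc.encode, enc.decode)
# A whitespace tokenizer stands in here so the sketch is self-contained.
short = truncate_to_token_limit("a b c", "all-minilm", str.split, " ".join)
long = truncate_to_token_limit("tok " * 600, "all-minilm", str.split, " ".join)
```

With the whitespace stand-in, `short` passes through unchanged while `long` is cut to the model's 512-token budget before it ever reaches the embedding endpoint.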

The PR is now ready for review and provides the complete solution for issue #153.

Checklist

  • Tests pass (`uv run pytest`)
  • Code formatted (`ruff format` and `ruff check`)
  • Pre-commit hooks pass (`pre-commit run --all-files`)
