Skip to content

Adapt Sentence Transformer model to BertForSeqClassification #4462

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

caronzh03
Copy link

@caronzh03 caronzh03 commented May 20, 2025

  • Change BertPooler to use mean pooling
  • Retrofit a ST bert model to HF's BertForSeqClassification
  • Print logits during inference

@caronzh03 caronzh03 force-pushed the caron-sentence-bert branch from d6d4a38 to 38800a0 Compare May 20, 2025 00:02
@poweiw poweiw added Community want to contribute PRs initiated from Community triaged Issue has been triaged by maintainers Generic Runtime General operational aspects of TRTLLM execution not in other categories. labels Jun 5, 2025
@caronzh03 caronzh03 requested review from a team as code owners June 13, 2025 18:41
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Community want to contribute PRs initiated from Community Generic Runtime General operational aspects of TRTLLM execution not in other categories. triaged Issue has been triaged by maintainers
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants