feat: add stop string matching #969
Conversation
Summary of Changes
Hello @blueswhen, I'm Gemini Code Assist[^1]! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!
This pull request adds a significant new feature to the text generation process: the ability to stop generation based on specific string patterns, not just token IDs. It involves modifications across several core components to store, detect, and handle these stop strings, ultimately providing more granular control over the output and ensuring that the final generated text is clean of the stop sequences themselves.
Highlights
- Stop String Matching Implementation: Introduced a new mechanism to detect and handle user-defined stop strings within the generated text, allowing for more precise control over text generation termination.
- Enhanced Sampling Parameters: The `SamplingParams` object has been extended to store and manage the string representations of stop sequences, alongside their token ID counterparts, enabling string-based matching.
- Dynamic Tokenizer Integration: The inference backend now dynamically loads the tokenizer when stop string matching is enabled, allowing it to decode generated token IDs into strings for comparison against the specified stop strings.
- Configurable Feature: The entire stop string matching functionality can be enabled or disabled via the `ENABLE_STOP_STRING_MATCH` environment variable, providing flexibility for deployment and testing.
- Post-Generation Stop String Trimming: After generation, if a stop string caused the termination, the matched stop string is automatically removed from the end of the final generated text to ensure clean output.
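Taken together, the highlights describe a decode-then-match-then-trim flow. A minimal sketch of that idea (the helper below is a hypothetical illustration, not the PR's actual implementation; the real code first decodes a tail window of token IDs with the tokenizer before matching):

```python
def check_and_trim_stop_string(tail_str, stop_sequences_str):
    """Return tail_str with a matched stop string trimmed off, or None if no match.

    Simplified sketch: in the PR, tail_str would come from decoding the most
    recently generated token IDs, and stop_sequences_str from SamplingParams.
    """
    for stop_str in stop_sequences_str:
        if tail_str.endswith(stop_str):
            # Trim the stop string so the final output is clean.
            return tail_str[: -len(stop_str)]
    return None
```

For example, `check_and_trim_stop_string("Hello User:", ["User:"])` returns `"Hello "`, while an unmatched tail returns `None` and generation continues.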
Using Gemini Code Assist
The full guide for Gemini Code Assist can be found on our documentation page; here are some quick tips.
Invoking Gemini
You can request assistance from Gemini at any point in your pull request by creating an issue comment (i.e. a comment on the pull request page) using either `/gemini <command>` or `@gemini-code-assist <command>`. Below is a summary of the supported commands.
| Feature | Command | Description |
|---|---|---|
| Code Review | `/gemini review` | Performs a code review for the current pull request in its current state. |
| Pull Request Summary | `/gemini summary` | Provides a summary of the current pull request in its current state. |
| Comment | `@gemini-code-assist` | Responds in comments when explicitly tagged, both in issue comments and review comments. |
| Help | `/gemini help` | Displays a list of available commands. |
Customization
To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by adding files to a `.gemini/` folder in the base of the repository. Detailed instructions can be found here.
Limitations & Feedback
Gemini Code Assist is currently in preview and may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments to provide feedback.
You can also get AI-powered code generation, chat, and code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.
Footnotes

[^1]: Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.
Code Review
This pull request introduces stop string matching. I've identified a high-severity issue in how stop sequences are processed, a high-severity issue in the matching logic that might cause premature termination, and a medium-severity suggestion to improve correctness. Overall, this is a solid contribution and with these fixes, it will be even better.
```diff
 if isinstance(stop_sequences, str):
     stop_sequences = [stop_sequences]
 for group_idx in range(self.size):
-    self.groups[group_idx].initialize(groups[group_idx])
+    self.groups[group_idx].initialize(groups[group_idx], stop_sequences[group_idx])
```
There's a potential issue in this loop. If `stop_sequences` contains a list of token IDs (e.g., `[1, 2, 3]`), it will be used as the `sequence_str` argument to `StopSequence.initialize`, which expects a string. This can lead to incorrect behavior or a crash. Ensure that only strings are passed as `sequence_str`.
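One way to enforce this (a sketch under the reviewer's assumption that mixed string/token-ID inputs are possible; the helper name is hypothetical, not the PR's code):

```python
def normalize_stop_sequences(stop_sequences):
    """Normalize user-supplied stop sequences to a list of strings.

    Entries that are lists of token IDs (e.g. [1, 2, 3]) are filtered out
    here, since string matching applies only to str entries; token-ID
    sequences would be handled by the existing token-based path.
    """
    if isinstance(stop_sequences, str):
        stop_sequences = [stop_sequences]
    return [s for s in stop_sequences if isinstance(s, str)]
```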
```python
]
tail_str = tokenizer.decode(tail_token_ids, skip_special_tokens=False)
for stop_str in self.stop_sequences_str:
    if stop_str in tail_str:
```
Using `in` to check for the stop string can lead to premature termination. If the stop string is "User:" and the model generates "Assistant: Hello! User: Hi!", the generation would stop prematurely. Use `tail_str.endswith(stop_str)` to match only at the end.
```diff
-if stop_str in tail_str:
+if tail_str.endswith(stop_str):
```
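The difference is easy to demonstrate in isolation (a standalone illustration of the reviewer's point, not code from the PR):

```python
tail_str = "Assistant: Hello! User: Hi!"
stop_str = "User:"

# `in` matches anywhere in the tail, so generation would stop here
# even though the model has already continued past the stop string.
assert stop_str in tail_str

# `endswith` only matches when the stop string actually terminates the output.
assert not tail_str.endswith(stop_str)
assert "Assistant: Hello! User:".endswith(stop_str)
```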
lightllm/server/api_openai.py
Outdated
```diff
@@ -518,9 +522,22 @@ async def _collect_generation_results(generator, request: CompletionRequest, pro
     finish_reason = finish_status.get_finish_reason()
     prompt_tokens = metadata["prompt_tokens"]

+    # Handle stop sequence trimming
+    final_text = "".join(final_output)
+    if finish_reason == "stop" and hasattr(sampling_params, "stop_sequences") and sampling_params.stop_sequences:
```
The condition `hasattr(sampling_params, "stop_sequences") and sampling_params.stop_sequences` is always true. It's better to explicitly check whether any stop sequences are defined by looking at the `size` attribute.
```diff
-if finish_reason == "stop" and hasattr(sampling_params, "stop_sequences") and sampling_params.stop_sequences:
+if finish_reason == "stop" and sampling_params.stop_sequences.size > 0:
```
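The pitfall here: `hasattr` is true whenever the attribute exists, and a plain container object without `__bool__` or `__len__` is truthy even when empty, so the original guard never filters anything. A minimal illustration (the `StopSequenceGroups` class below is a hypothetical stand-in, not the PR's actual class):

```python
class StopSequenceGroups:
    """Hypothetical stand-in for the container stored on SamplingParams."""

    def __init__(self, size=0):
        self.size = size


empty = StopSequenceGroups(size=0)

# A plain object is truthy by default, so the original
# `hasattr(...) and sampling_params.stop_sequences` guard passes even when empty.
assert hasattr(empty, "size") and bool(empty)

# Checking the size attribute correctly distinguishes the empty case.
assert not empty.size > 0
assert StopSequenceGroups(size=2).size > 0
```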
Force-pushed from b93b511 to b9f3add.
Force-pushed from b9f3add to 50a1663.
No description provided.