# Fix: OpenAI API Error "messages with role 'tool' must be a response to a preceding message with 'tool_calls'"

## Problem Description
In the `async-context-compression` filter, chat history can be trimmed or summarized when the conversation grows. If the retained tail starts in the middle of a native tool-calling sequence, the next request may begin with a `tool` message whose triggering `assistant` message is no longer present.

That produces the OpenAI API error:
`"messages with role 'tool' must be a response to a preceding message with 'tool_calls'"`

## Root Cause
History compression boundaries were not fully aware of atomic tool-call chains. A valid chain may include:

1. An `assistant` message with `tool_calls`
2. One or more `tool` messages
3. An optional `assistant` follow-up that consumes the tool results

If truncation happens inside that chain, the request sent to the model becomes invalid.
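As a minimal sketch of the failure (message contents here are hypothetical), a naively trimmed tail can look like this:

```python
# Hypothetical history after a naive trim: the assistant message that
# issued the tool_calls was dropped, so the retained tail now begins
# with an orphaned `tool` message. Sending this to the API triggers
# the 400 error quoted above.
trimmed_tail = [
    {"role": "tool", "tool_call_id": "call_1", "content": "18°C"},
    {"role": "assistant", "content": "It's 18°C in Paris."},
    {"role": "user", "content": "And tomorrow?"},
]

# No preceding assistant message carries `tool_calls`:
print(trimmed_tail[0]["role"])  # → tool
```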
## Solution: Atomic Boundary Alignment
The fix groups tool-call sequences into atomic units and aligns trim boundaries to those groups.

### 1. `_get_atomic_groups()`
This helper groups message indices into units that must be kept or dropped together. It explicitly recognizes native tool-calling patterns such as:

- `assistant(tool_calls)`
- `tool`
- `assistant` follow-up response

Conceptually, it treats the whole sequence as one atomic block instead of independent messages.

```python
from typing import Dict, List  # module-level import

def _get_atomic_groups(self, messages: List[Dict]) -> List[List[int]]:
    groups = []
    current_group = []

    for i, msg in enumerate(messages):
        role = msg.get("role")
        has_tool_calls = bool(msg.get("tool_calls"))

        if role == "assistant" and has_tool_calls:
            # An assistant tool-call message opens a new atomic group.
            if current_group:
                groups.append(current_group)
            current_group = [i]
        elif role == "tool":
            if not current_group:
                # Already-orphaned tool message: keep it as its own group.
                groups.append([i])
            else:
                current_group.append(i)
        elif (
            role == "assistant"
            and current_group
            and messages[current_group[-1]].get("role") == "tool"
        ):
            # Assistant follow-up consuming the tool results closes the group.
            current_group.append(i)
            groups.append(current_group)
            current_group = []
        else:
            # Any other message ends the open group and stands alone.
            if current_group:
                groups.append(current_group)
                current_group = []
            groups.append([i])

    if current_group:
        groups.append(current_group)

    return groups
```
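As a standalone illustration, the same grouping logic can be run as a module-level function on a hypothetical sample history (names and contents below are invented for the example):

```python
from typing import Dict, List

def get_atomic_groups(messages: List[Dict]) -> List[List[int]]:
    # Standalone sketch of the method above, for illustration only.
    groups, current = [], []
    for i, msg in enumerate(messages):
        role = msg.get("role")
        if role == "assistant" and msg.get("tool_calls"):
            if current:
                groups.append(current)
            current = [i]  # tool-call message opens a new group
        elif role == "tool":
            if current:
                current.append(i)
            else:
                groups.append([i])  # orphaned tool message: its own group
        elif role == "assistant" and current and messages[current[-1]].get("role") == "tool":
            current.append(i)  # follow-up closes the group
            groups.append(current)
            current = []
        else:
            if current:
                groups.append(current)
                current = []
            groups.append([i])
    if current:
        groups.append(current)
    return groups

history = [
    {"role": "user", "content": "What's the weather in Paris?"},
    {"role": "assistant", "tool_calls": [{"id": "call_1"}]},
    {"role": "tool", "tool_call_id": "call_1", "content": "18°C"},
    {"role": "assistant", "content": "It's 18°C in Paris."},
    {"role": "user", "content": "Thanks!"},
]
print(get_atomic_groups(history))  # → [[0], [1, 2, 3], [4]]
```

The tool-call chain at indices 1–3 comes back as a single group, so a trim boundary can never split it.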

### 2. `_align_tail_start_to_atomic_boundary()`
This helper checks whether a proposed trim point falls inside one of those atomic groups. If it does, the start index is moved backward to the beginning of that group.

```python
def _align_tail_start_to_atomic_boundary(
    self, messages: List[Dict], raw_start_index: int, protected_prefix: int
) -> int:
    # Never trim into the protected prefix.
    aligned_start = max(raw_start_index, protected_prefix)

    if aligned_start <= protected_prefix or aligned_start >= len(messages):
        return aligned_start

    # Group only the trimmable region; group indices are local to it.
    trimmable = messages[protected_prefix:]
    local_start = aligned_start - protected_prefix

    for group in self._get_atomic_groups(trimmable):
        group_start = group[0]
        group_end = group[-1] + 1

        if local_start == group_start:
            # The trim point already sits on a group boundary.
            return aligned_start

        if group_start < local_start < group_end:
            # The trim point falls inside a group: pull it back to the start.
            return protected_prefix + group_start

    return aligned_start
```
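The core of the alignment can be sketched as a standalone function (simplified for illustration: it ignores `protected_prefix` and takes precomputed groups; `align_start` is an invented name, not part of the filter):

```python
from typing import List

def align_start(groups: List[List[int]], raw_start: int) -> int:
    # If the proposed trim point falls strictly inside an atomic group,
    # pull it back to that group's first index; otherwise keep it.
    for group in groups:
        if group[0] < raw_start <= group[-1]:
            return group[0]
    return raw_start

# Groups for: [user, assistant(tool_calls), tool, assistant, user]
groups = [[0], [1, 2, 3], [4]]
print(align_start(groups, 2))  # → 1 (index 2 is mid tool-call chain)
print(align_start(groups, 4))  # → 4 (already on a group boundary)
```

Moving backward rather than forward is the safer direction: it retains a little more context instead of silently dropping the tail of a tool-call chain.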

### 3. Applied to Tail Retention and Summary Progress
The aligned boundary is now used when rebuilding the retained tail and when calculating how much history can be summarized safely.

Example from the current implementation:

```python
raw_start_index = max(compressed_count, effective_keep_first)
start_index = self._align_tail_start_to_atomic_boundary(
    messages, raw_start_index, effective_keep_first
)
tail_messages = messages[start_index:]
```

And during summary progress calculation:

```python
raw_target_compressed_count = max(0, len(messages) - self.valves.keep_last)
target_compressed_count = self._align_tail_start_to_atomic_boundary(
    messages, raw_target_compressed_count, effective_keep_first
)
```

## Verification Results
- **First compression boundary**: When history first crosses the compression threshold, the retained tail no longer starts inside a tool-call block.
- **Complex sessions**: Real-world sessions with 30+ messages, multiple tool calls, and failed calls remained stable during background summarization.
- **Regression behavior**: The filter now prefers a valid boundary even if that means retaining slightly more context than a naive raw slice would.
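The first verification point can also be stated as a simple invariant (a hedged sketch; `tail_is_valid` is an illustrative helper, not part of the filter):

```python
from typing import Dict, List

def tail_is_valid(tail: List[Dict]) -> bool:
    # A retained tail must not begin with a `tool` message, because its
    # triggering assistant `tool_calls` message would lie outside the tail.
    return not (tail and tail[0].get("role") == "tool")

print(tail_is_valid([{"role": "user", "content": "hi"}]))           # → True
print(tail_is_valid([{"role": "tool", "tool_call_id": "call_1"}]))  # → False
```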

## Conclusion
The fix prevents orphaned `tool` messages by making history trimming and summary progress aware of atomic tool-call groups. This eliminates the 400 error during long conversations and background compression.