
Conversation

@IgorWarzocha
Owner

Summary

Add first-class support for ZAI/GLM models (GLM-4.7) as a new provider with an Anthropic-compatible API.

Features

  • Add zai-coding-plan provider with the GLM-4.7 model
  • Automatic Anthropic SDK selection (fixes the models.dev cache picking the wrong SDK)
  • Thinking variants selectable via Ctrl+T, with a default budget of 8k tokens (see the sketch after this list):
    • none: disabled
    • low: 4k tokens
    • medium: 12k tokens
    • high: 24k tokens
    • max: 64k tokens
  • Add a GLM-specific system prompt
  • Add ZAI to the auth menu for easy API key configuration
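
A minimal TypeScript sketch of what the thinking options and variant table might look like; glmDefaultOptions and glmVariants are the names used in models.ts further down, but the exact shape of the thinking config shown here is an assumption, not the actual code:

    // Sketch only: the budgets come from the PR description; the thinking
    // object shape is assumed from the Anthropic SDK's provider options.
    const glmDefaultOptions = {
      thinking: { type: "enabled", budgetTokens: 8_000 },
    }

    const glmVariants = {
      none: { thinking: { type: "disabled" } },
      low: { thinking: { type: "enabled", budgetTokens: 4_000 } },
      medium: { thinking: { type: "enabled", budgetTokens: 12_000 } },
      high: { thinking: { type: "enabled", budgetTokens: 24_000 } },
      max: { thinking: { type: "enabled", budgetTokens: 64_000 } },
    }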

Technical Changes

  • models.ts: Override npm to @ai-sdk/anthropic and inject thinking config for zai-coding-plan
  • provider.ts: Merge model variants from ModelsDev into provider variants
  • auth.ts: Add zai-coding-plan to auth menu with helpful prompts
  • system.ts: Add GLM prompt selection (a sketch follows this list)
  • glm.txt: Comprehensive system prompt for GLM models
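
For the system.ts item above, a minimal sketch of the routing, assuming the prompt files live next to the code; the function name, file layout, and substring check are illustrative, not the actual change:

    import { readFile } from "node:fs/promises"
    import { join } from "node:path"

    // Hypothetical routing: GLM models get the dedicated glm.txt prompt,
    // everything else falls through to an existing default prompt.
    export async function systemPromptFor(modelID: string): Promise<string> {
      const file = modelID.toLowerCase().includes("glm") ? "glm.txt" : "default.txt"
      return readFile(join(import.meta.dirname, "prompt", file), "utf8")
    }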

Files Changed

  • packages/opencode/src/cli/cmd/auth.ts
  • packages/opencode/src/provider/models.ts
  • packages/opencode/src/provider/provider.ts
  • packages/opencode/src/session/index.ts
  • packages/opencode/src/session/prompt/glm.txt
  • packages/opencode/src/session/system.ts

- Add GLM system prompt (session/prompt/glm.txt)
- Update session/system.ts to route GLM models to the new prompt
- Update session/index.ts to extract token usage from metadata (fixes ZAI/Anthropic token counting)
- Add zai-coding-plan to provider priority list
- Add GLM hint to auth selection
- Add specific prompt instructions for ZAI API key
- Add zai-coding-plan to CUSTOM_LOADERS in provider.ts with correct baseURL
- Inject default zai-coding-plan provider definition in models.ts for CLI visibility
- Implement rigorous system prompt for GLM models
- Add 'Ambition vs Precision' heuristic from Codex
- Add detailed workflow from Beast/Anthropic
- Add strict XML constraints for steerability (no emojis, no logs)
- Align with agent-architect skill best practices
- Override npm to @ai-sdk/anthropic for zai-coding-plan (fixes SDK selection from cache)
- Add default thinking options (budgetTokens: 8000) for glm-4.7
- Add thinking variants: none, low (4k), medium (12k), high (24k), max (64k)
- Merge model variants from ModelsDev into provider variants (see the sketch after this list)
- Minor cleanup: remove redundant type casts in session
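
A sketch of the variant-merge step referenced above, under the assumption that variants already set on the provider should win over the ModelsDev defaults; all names and types here are illustrative:

    type Variants = Record<string, Record<string, unknown>>
    interface Model { variants?: Variants }
    interface Provider { models: Record<string, Model> }

    // Illustrative only: copy variants from the ModelsDev database into the
    // provider config, keeping any variant the provider already defines.
    function mergeVariants(database: Provider, provider: Provider): void {
      for (const [modelID, dbModel] of Object.entries(database.models)) {
        const model = provider.models[modelID]
        if (!model) continue
        model.variants = { ...dbModel.variants, ...model.variants }
      }
    }
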
@gemini-code-assist

Summary of Changes

Hello @IgorWarzocha, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly expands the platform's capabilities by integrating ZAI/GLM models as a new, fully supported provider. It ensures seamless interaction with these models by configuring the appropriate SDK, offering flexible "thinking" token budget options, and providing a specialized system prompt for optimal performance. Additionally, the changes streamline the user experience for setting up ZAI API keys within the application.

Highlights

  • New ZAI/GLM Provider: Introduced first-class support for ZAI/GLM models, specifically GLM-4.7, with an Anthropic-compatible API.
  • Automatic SDK Selection: Implemented automatic selection of the @ai-sdk/anthropic SDK for ZAI/GLM, resolving potential caching issues.
  • Configurable Thinking Variants: Added multiple "thinking" token budget options (none, low, medium, high, max) for GLM models, selectable via Ctrl+T.
  • Dedicated System Prompt: Integrated a specific system prompt (glm.txt) tailored for GLM models to guide their behavior.
  • Auth Menu Integration: Added ZAI to the authentication menu for straightforward API key configuration with helpful prompts.

@gemini-code-assist bot left a comment

Code Review

This pull request adds support for ZAI/GLM models as a new provider. The changes are comprehensive, covering authentication, provider configuration, token usage calculation, and system prompts. The implementation looks solid, particularly the fixes for token counting in session/index.ts and the variant merging logic in provider/provider.ts. I have one suggestion in packages/opencode/src/provider/models.ts to refactor the logic for configuring the new provider, which will improve maintainability and fix a potential logic gap.

Comment on lines 133 to 172
    if (parsed["zai-coding-plan"]) {
      parsed["zai-coding-plan"].npm = "@ai-sdk/anthropic"
      // Inject thinking options and variants for glm-4.7 if it exists in cache
      if (parsed["zai-coding-plan"].models["glm-4.7"]) {
        parsed["zai-coding-plan"].models["glm-4.7"].options = {
          ...parsed["zai-coding-plan"].models["glm-4.7"].options,
          ...glmDefaultOptions,
        }
        parsed["zai-coding-plan"].models["glm-4.7"].variants = glmVariants
      }
    } else {
      parsed["zai-coding-plan"] = {
        id: "zai-coding-plan",
        name: "ZAI (GLM)",
        env: [],
        npm: "@ai-sdk/anthropic",
        models: {
          "glm-4.7": {
            id: "glm-4.7",
            name: "GLM 4.7",
            release_date: "2025-11-24",
            attachment: true,
            reasoning: true,
            temperature: true,
            tool_call: true,
            interleaved: true,
            cost: {
              input: 0,
              output: 0,
            },
            limit: {
              context: 200000,
              output: 128000,
            },
            options: glmDefaultOptions,
            variants: glmVariants,
          },
        },
      }
    }


Severity: medium

The current logic for ensuring the zai-coding-plan provider is correctly configured has some room for improvement:

  1. Code Duplication: The entire definition for the glm-4.7 model is duplicated in the else block. This makes the code harder to maintain, as any updates to the model definition would need to be made in two places.
  2. Potential Logic Flaw: If models.dev returns a zai-coding-plan provider but without the glm-4.7 model, your current if block will not add it. The model configuration is only injected if parsed["zai-coding-plan"].models["glm-4.7"] already exists. The else block, which does create the model, is never reached.

A more robust approach is to ensure the provider and model exist, creating them if necessary, and then applying the overrides. This avoids duplication and fixes the logic gap.

    const providerId = "zai-coding-plan";
    const modelId = "glm-4.7";

    const provider = (parsed[providerId] ??= {
      id: providerId,
      name: "ZAI (GLM)",
      env: [],
      models: {},
    });
    provider.npm = "@ai-sdk/anthropic";

    const model = (provider.models[modelId] ??= {
      id: modelId,
      name: "GLM 4.7",
      release_date: "2025-11-24",
      attachment: true,
      reasoning: true,
      temperature: true,
      tool_call: true,
      interleaved: true,
      cost: { input: 0, output: 0 },
      limit: { context: 200000, output: 128000 },
      options: {},
      variants: {},
    });

    model.options = { ...model.options, ...glmDefaultOptions };
    model.variants = glmVariants;
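
One note on the suggested fix: ??= (nullish coalescing assignment) assigns only when the left-hand side is null or undefined, and the parenthesized expression evaluates to the final value, so each (x ??= fallback) acts as create-or-fetch in a single step. A standalone illustration:

    const cache: Record<string, number[]> = {}
    // Creates the array on first access, reuses it on every later access.
    const bucket = (cache["glm"] ??= [])
    bucket.push(1)
    console.log(cache["glm"]) // [1]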

Address code review feedback:
- Use ??= to reduce duplication
- Fix edge case where provider exists but model is missing
- Cleaner, more maintainable code structure
@IgorWarzocha
Owner Author

@codex triple check if the PR looks sound after the changes

@chatgpt-codex-connector

Codex Review: Didn't find any major issues. Can't wait for the next one!


@IgorWarzocha IgorWarzocha changed the base branch from dev to howcode January 3, 2026 10:23
@IgorWarzocha
Owner Author

/opencode verify all of this is sound. Check for DRY, no excessive console logs and comments. I know for a fact it does something to models.dev.json or whatever it is called, so the changes seem to persist even when switching to the upstream release build. Can you confirm/deny? And will the models.dev cache get refreshed when they update the remote database?

@opencode-agent

opencode-agent bot commented Jan 3, 2026

Analysis Complete

✅ DRY Principle

Code is DRY - uses ??= operator to only create provider/model if missing. No unnecessary duplication.

✅ Console Logs

Minimal and appropriate - only in refresh() at models.ts:166-168 for debugging cache refresh.

✅ Comments

Minimal - only 2 lines (models.ts:92-93) explaining ZAI Anthropic compatibility.

🔍 models.dev.json Behavior

DENIED: Changes do NOT persist to cache file

How it works:

  1. get() calls refresh() (models.ts:79)
  2. refresh() only writes the cache file after a successful fetch from the remote API (models.ts:179: if (result && result.ok)); see the sketch after this list
  3. ZAI/GLM modifications happen in-memory on parsed object (models.ts:134-158) after reading cache
  4. Modifications are NOT written back to cache file
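
A minimal sketch of the guard described in step 2, assuming Bun's file API and the models.dev endpoint; the cache path and surrounding structure are assumptions:

    // Hypothetical shape of ModelsDev.refresh(): the cache file is only
    // rewritten when the remote fetch succeeds, so the in-memory ZAI/GLM
    // overrides never leak into it.
    async function refresh(cachePath: string): Promise<void> {
      const result = await fetch("https://models.dev/api.json").catch(() => undefined)
      if (result && result.ok) {
        await Bun.write(cachePath, await result.text())
      }
    }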

Result:

  • ✅ Switching to the upstream release build: the cache is refetched from remote and carries no fork modifications (the fork's overrides are applied in memory on each load, never written to disk)
  • ✅ No persistent cache contamination

✅ Cache Refresh

Yes, cache refreshes automatically

Line 183: setInterval(() => ModelsDev.refresh(), 60 * 1000 * 60).unref()

Refreshes every hour. When remote models.dev updates, next refresh cycle pulls new data.


Verdict: Code is sound. Runtime-only modifications, no file pollution, automatic refresh works correctly.

opencode session  |  github run
