[codex] docs: add inference model task-fit guide #5527

Open
HOYALIM wants to merge 3 commits into NVIDIA:main from HOYALIM:codex/issue-4755-task-fit-docs

@HOYALIM HOYALIM commented Jun 17, 2026

Contributor

Summary

  • Adds a per-model task-fit guide to the Inference Options page so users can choose models by task type, latency, tool-use fit, context-window fit, and relative cost.
  • Keeps exact context-window limits and pricing delegated to the provider catalog because those values change outside NemoClaw releases.
  • Adds a focused docs regression test that ensures the guide exists and covers curated onboarding model IDs.

Validation

  • python3 scripts/generate-platform-docs.py --check
  • NODE_PATH=/Users/holim/code/NemoClaw/node_modules /Users/holim/code/NemoClaw/node_modules/.bin/vitest run test/docs-copyable-command-blocks.test.ts test/inference-options-docs.test.ts

Fixes #4755

Summary by CodeRabbit

  • Documentation

    • Added a “Model Task-Fit Guide” section to the inference options docs, with a starter table that links curated model IDs to recommended task types and includes qualitative comparisons for latency, tool-use quality, context-window fit, and relative cost.
  • Tests

    • Added automated validation to ensure the “Model Task-Fit Guide” subsection contains the expected table structure, covers all curated model IDs, and excludes placeholder text.

copy-pr-bot Bot commented Jun 17, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

coderabbitai Bot commented Jun 17, 2026

Contributor

Review Change Stack

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: 2bb5cf34-439d-4784-b473-e276235b97da

📥 Commits

Reviewing files that changed from the base of the PR and between 829b4c3 and 8e706c3.

📒 Files selected for processing (1)
  • test/inference-options-docs.test.ts
🚧 Files skipped from review as they are similar to previous changes (1)
  • test/inference-options-docs.test.ts

📝 Walkthrough

Adds a "Model Task-Fit Guide" section to docs/inference/inference-options.mdx containing a qualitative table mapping curated model IDs to task types, relative latency, tool-use quality, context-window fit, and relative cost. A new Vitest test validates the section's presence, completeness, and model coverage against CLOUD_MODEL_OPTIONS from the inference config.

Changes

Model Task-Fit Guide

| Layer / File(s) | Summary |
|---|---|
| Model Task-Fit Guide docs table (`docs/inference/inference-options.mdx`) | Inserts a new "Model Task-Fit Guide" section with a Markdown table pairing curated model IDs to recommended task types and qualitative latency, tool-use, context-window, and cost attributes. |
| Documentation validation test (`test/inference-options-docs.test.ts`) | Adds a Vitest test that extracts the new subsection from the MDX file, dynamically reads expected model IDs from CLOUD_MODEL_OPTIONS via TypeScript AST parsing, and asserts table structure, absence of placeholder text, the authoritative-provider statement, and full model ID coverage. |
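
The checks described for the validation test (extract the subsection, confirm table rows cover every expected model ID, reject placeholder text) can be sketched roughly as follows. This is a minimal illustration rather than the PR's actual test code: the helper names, the heading levels, the table columns, and the `vendor/model-*` IDs are all hypothetical, and the expected IDs are passed in directly instead of being read from `CLOUD_MODEL_OPTIONS`.

```typescript
// Sketch only: the heading text matches the PR, but the heading levels,
// table columns, helper names, and model IDs below are hypothetical.

/** Returns the lines from `heading` up to the next heading of the same or a higher level. */
function extractSection(markdown: string, heading: string): string {
  const lines = markdown.split("\n");
  const start = lines.findIndex(
    (line) => /^#{1,6}\s/.test(line) && line.replace(/^#{1,6}\s+/, "").trim() === heading,
  );
  if (start === -1) throw new Error(`Section not found: ${heading}`);
  const level = (lines[start].match(/^#+/) as RegExpMatchArray)[0].length;
  let end = lines.length;
  for (let i = start + 1; i < lines.length; i++) {
    const next = lines[i].match(/^(#{1,6})\s/);
    if (next && next[1].length <= level) {
      end = i;
      break;
    }
  }
  return lines.slice(start, end).join("\n");
}

/** Reads the first cell of each table body row, with backticks stripped. */
function firstColumnCells(section: string): string[] {
  return section
    .split("\n")
    .filter((line) => line.trim().startsWith("|"))
    .slice(2) // skip the header row and the |---| separator row
    .map((line) => line.split("|")[1].trim().replace(/`/g, ""));
}

/** Lists expected IDs that have no row in the table. */
function missingModelIds(section: string, expectedIds: string[]): string[] {
  const documented = firstColumnCells(section);
  return expectedIds.filter((id) => documented.indexOf(id) === -1);
}

/** True when the section still contains a TBD or TODO marker. */
function hasPlaceholder(section: string): boolean {
  return /\b(TBD|TODO)\b/.test(section);
}

const sampleDoc = [
  "## Provider Options",
  "Intro text.",
  "### Model Task-Fit Guide",
  "| Model ID | Best for | Latency |",
  "|---|---|---|",
  "| `vendor/model-a` | Coding | Low |",
  "| `vendor/model-b` | Long documents | Medium |",
  "## Next Section",
  "Other text.",
].join("\n");

const guide = extractSection(sampleDoc, "Model Task-Fit Guide");
// vendor/model-c has no row, so it is reported as missing.
console.log(missingModelIds(guide, ["vendor/model-a", "vendor/model-c"]));
console.log(hasPlaceholder(guide));
```

Comparing whole first-column cells, rather than searching the section for a substring, avoids a false pass when one model ID is a prefix of another.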

Estimated code review effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Poem

🐇 A table appeared on the inference page,
Each model assigned to its proper stage.
No TBD found, no TODO in sight,
The test checks each row to confirm it's right.
Latency, context, cost — all aligned,
A task-fit guide for the developer's mind! 🥕

🚥 Pre-merge checks | ✅ 5
✅ Passed checks (5 passed)
| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title accurately describes the main change: adding a model task-fit guide to the inference documentation. |
| Linked Issues check | ✅ Passed | The PR fully addresses issue #4755 by adding the per-model task-fit comparison table and a documentation regression test ensuring completeness. |
| Out of Scope Changes check | ✅ Passed | All changes directly support the linked issue: documentation additions for the task-fit guide and a validation test for coverage of curated models. |
| Docstring Coverage | ✅ Passed | Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%. |

@HOYALIM HOYALIM force-pushed the codex/issue-4755-task-fit-docs branch from 2e4efbc to fa25f95 Compare June 17, 2026 06:41
@HOYALIM HOYALIM marked this pull request as ready for review June 17, 2026 06:42
Copilot AI review requested due to automatic review settings June 17, 2026 06:42

Copilot AI left a comment

Contributor

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Adds and validates a new “Model Task-Fit Guide” section in the inference options docs to keep curated onboarding model guidance up to date.

Changes:

  • Added a “Model Task-Fit Guide” comparison table to docs/inference/inference-options.mdx.
  • Added a Vitest test that ensures the section exists, has no TBD/TODO markers, and includes rows for curated onboarding models plus select additional model IDs.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

| File | Description |
|---|---|
| test/inference-options-docs.test.ts | Adds a docs integrity test that locates the new section and asserts required table content/model rows. |
| docs/inference/inference-options.mdx | Introduces the new model task-fit comparison table and supporting guidance text. |

Comment thread test/inference-options-docs.test.ts Outdated
Comment on lines +8 to +10
const repoRoot = path.dirname(import.meta.dirname);
const inferenceOptionsPath = path.join(repoRoot, "docs", "inference", "inference-options.mdx");
const inferenceConfigPath = path.join(repoRoot, "src", "lib", "inference", "config.ts");
Comment thread test/inference-options-docs.test.ts Outdated
Comment on lines +20 to +25
function readCuratedNvidiaModelIds(): string[] {
  const config = fs.readFileSync(inferenceConfigPath, "utf8");
  const match = config.match(/export const CLOUD_MODEL_OPTIONS = \[(?<body>[\s\S]*?)\];/);
  expect(match?.groups?.body).toBeTruthy();
  return [...match!.groups!.body.matchAll(/id:\s*"([^"]+)"/g)].map((entry) => entry[1]);
}
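
This thread is marked outdated, and the walkthrough above says the current revision reads the IDs via TypeScript AST parsing instead. The page does not preserve the reviewer's comment text, so the sketch below only illustrates how the quoted patterns behave on inline sample strings. It is a pure variant written for illustration (no file read, no `expect`, a numbered capture group in place of the named one); the `extractModelIds` name, the sample config, the `ModelOption` type name, and the model IDs are hypothetical.

```typescript
// Sketch only: a pure variant of the quoted helper that takes source text
// directly. The sample config and model IDs are hypothetical.
function extractModelIds(configSource: string): string[] {
  // Same declaration pattern as the quoted code, with a numbered group.
  const match = configSource.match(/export const CLOUD_MODEL_OPTIONS = \[([\s\S]*?)\];/);
  if (!match) return [];
  const body = match[1];
  const ids: string[] = [];
  const idPattern = /id:\s*"([^"]+)"/g;
  let entry: RegExpExecArray | null;
  while ((entry = idPattern.exec(body)) !== null) {
    ids.push(entry[1]);
  }
  return ids;
}

const plainConfig = [
  "export const CLOUD_MODEL_OPTIONS = [",
  '  { id: "vendor/model-a", label: "Model A" },',
  '  { id: "vendor/model-b", label: "Model B" },',
  "];",
].join("\n");

// The same declaration with a type annotation no longer matches the literal
// "CLOUD_MODEL_OPTIONS = [" text, so the pattern finds nothing.
const annotatedConfig = plainConfig.replace(
  "CLOUD_MODEL_OPTIONS =",
  "CLOUD_MODEL_OPTIONS: ModelOption[] =",
);

console.log(extractModelIds(plainConfig)); // both sample IDs
console.log(extractModelIds(annotatedConfig)); // empty array
```

A text pattern of this kind depends on the exact formatting of the declaration, which is one plausible reason to prefer parsing the file's syntax tree.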
Comment thread test/inference-options-docs.test.ts Outdated
return markdown.slice(start, end);
}

function readCuratedNvidiaModelIds(): string[] {
@HOYALIM HOYALIM force-pushed the codex/issue-4755-task-fit-docs branch from fa25f95 to 704865e Compare June 17, 2026 06:48

@coderabbitai coderabbitai Bot left a comment

Contributor

🧹 Nitpick comments (1)
docs/inference/inference-options.mdx (1)

100-100: 💤 Low value

Consider replacing "Very large context" with a stronger descriptor.

The phrase "Very large context" appears on lines 100 and 103. Per the coding guidelines, avoid weak intensifiers like "very" in documentation. Since this table lacks specific numeric context-window values (by design, per PR objectives), consider alternatives such as:

  • Largest context
  • Extensive context
  • Maximum context

These provide the same meaningful distinction from the "Large context" tier without relying on the weak intensifier "very."
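
A check for this kind of wording could be automated along the following lines. This is a hedged sketch, not part of the PR or of any documented CodeRabbit rule: the `findWeakIntensifiers` helper is hypothetical, only "very" comes from the review comment, and the other list entries and the sample rows are illustrative assumptions.

```typescript
// Sketch only: "very" comes from the review comment; the other entries
// and the sample rows are illustrative assumptions.
const WEAK_INTENSIFIERS = ["very", "really", "extremely"];

interface IntensifierHit {
  line: number; // 1-based line number within the input text
  word: string; // matched intensifier, lowercased
}

function findWeakIntensifiers(
  markdown: string,
  words: string[] = WEAK_INTENSIFIERS,
): IntensifierHit[] {
  // Word boundaries keep "everyday" from matching "very".
  const pattern = new RegExp(`\\b(?:${words.join("|")})\\b`, "gi");
  const hits: IntensifierHit[] = [];
  markdown.split("\n").forEach((text, index) => {
    for (const word of text.match(pattern) || []) {
      hits.push({ line: index + 1, word: word.toLowerCase() });
    }
  });
  return hits;
}

const sampleRows = [
  "| `vendor/model-a` | Very large context |",
  "| `vendor/model-b` | Large context |",
  "| `vendor/model-c` | Suits everyday chat |",
].join("\n");

// Only the first row is flagged.
console.log(findWeakIntensifiers(sampleRows));
```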

Also applies to: 103-103

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@docs/inference/inference-options.mdx` at line 100, The table contains the
weak intensifier "very" in the context window descriptions listed as "Very large
context" which appears multiple times in the inference options table. Replace
all instances of "Very large context" with a stronger descriptor such as
"Largest context", "Extensive context", or "Maximum context" to align with
documentation guidelines that avoid weak intensifiers while maintaining the same
meaningful distinction from the "Large context" tier.

Source: Coding guidelines

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Enterprise

Run ID: a8065e5f-755f-456b-add4-7748b09eb023

📥 Commits

Reviewing files that changed from the base of the PR and between fa25f95 and 704865e.

📒 Files selected for processing (2)
  • docs/inference/inference-options.mdx
  • test/inference-options-docs.test.ts

Development

Successfully merging this pull request may close these issues.

[All Platforms][Docs] inference-options page lacks per-model task-fit comparison

2 participants