feat(inference): allow setting custom inference timeout by pentschev · Pull Request #672 · NVIDIA/OpenShell

pentschev · 2026-03-30T07:13:53Z

Summary

Makes the inference routing timeout configurable via openshell inference set --timeout <secs> and openshell inference update --timeout <secs>, replacing the hardcoded 60-second timeout (60 seconds remains the default). Timeout changes propagate dynamically to running sandboxes within the route refresh interval (~5 seconds) without requiring sandbox recreation.

The timeout was observed while running OpenCode on a complex build task on a DGX Spark running nemotron-3-super:120b via Ollama. This feature allows longer-running tasks to succeed.
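The hot-reload behavior in the summary can be pictured with a minimal sketch. It assumes the sandbox compares a revision hash of the route bundle on each refresh tick. The `RouteConfig` and `RouteCache` types, their fields, and the use of `DefaultHasher` are illustrative assumptions, not the actual OpenShell code.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Hypothetical, simplified stand-in for one route in the inference bundle.
#[derive(Hash, Clone)]
struct RouteConfig {
    model: String,
    endpoint: String,
    timeout_secs: u64,
}

/// Revision over all routes. Because `timeout_secs` is part of the hashed
/// data, changing only the timeout produces a different revision.
fn bundle_revision(routes: &[RouteConfig]) -> u64 {
    let mut hasher = DefaultHasher::new();
    routes.hash(&mut hasher);
    hasher.finish()
}

/// Hypothetical sandbox-side cache that swaps routes when the revision changes.
struct RouteCache {
    revision: u64,
    routes: Vec<RouteConfig>,
}

impl RouteCache {
    fn new(routes: Vec<RouteConfig>) -> Self {
        let revision = bundle_revision(&routes);
        Self { revision, routes }
    }

    /// Called on every refresh tick; returns true if the cache was replaced.
    fn refresh(&mut self, latest: Vec<RouteConfig>) -> bool {
        let revision = bundle_revision(&latest);
        if revision == self.revision {
            return false; // nothing changed, keep the cached routes
        }
        self.revision = revision;
        self.routes = latest;
        true
    }
}
```

If the timeout were left out of the hashed data, a timeout-only change would keep the same revision and a cache like this one would never pick it up.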

Related Issue

Closes #641

Changes

Add timeout_secs field to ClusterInferenceConfig, SetClusterInferenceRequest, SetClusterInferenceResponse, GetClusterInferenceResponse, and ResolvedRoute proto messages
Add timeout field (Duration) to the router's ResolvedRoute struct with a DEFAULT_ROUTE_TIMEOUT of 60 seconds
Remove the global reqwest::Client timeout; apply per-request .timeout(route.timeout) in backend.rs
Thread timeout_secs through server persistence (upsert_cluster_inference_route, build_cluster_inference_config, bundle resolution)
Map proto timeout_secs to router ResolvedRoute.timeout in the sandbox's bundle_to_resolved_routes()
Include timeout_secs in the bundle revision hash so timeout changes trigger route cache refreshes in running sandboxes
Add --timeout CLI flag to inference set (default 0, which selects the 60-second default) and inference update (optional)
Update docs/inference/configure.md with timeout usage and hot-reload behavior
Update architecture/inference-routing.md with per-request timeout semantics, proto field additions, and CLI surface
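The mapping from the proto `timeout_secs` field to the router's `Duration` timeout, including the "0 means default" convention listed above, might look roughly like the following. This is a sketch only: the struct shapes are trimmed-down stand-ins, the `u64` field type is an assumption, and `resolve_timeout` and `to_resolved_route` are hypothetical helper names.

```rust
use std::time::Duration;

/// Default named in the PR description: 60 seconds.
const DEFAULT_ROUTE_TIMEOUT: Duration = Duration::from_secs(60);

/// Hypothetical, trimmed-down view of the proto route message.
struct ProtoRoute {
    model: String,
    timeout_secs: u64, // assumed type; 0 means "use the default"
}

/// Hypothetical, trimmed-down router-side route.
#[derive(Debug, PartialEq)]
struct ResolvedRoute {
    model: String,
    timeout: Duration,
}

/// Treat 0 as "unset" and fall back to the 60-second default.
fn resolve_timeout(timeout_secs: u64) -> Duration {
    if timeout_secs == 0 {
        DEFAULT_ROUTE_TIMEOUT
    } else {
        Duration::from_secs(timeout_secs)
    }
}

/// Convert one proto route into the router's representation.
fn to_resolved_route(proto: &ProtoRoute) -> ResolvedRoute {
    ResolvedRoute {
        model: proto.model.clone(),
        timeout: resolve_timeout(proto.timeout_secs),
    }
}
```

With a per-route `Duration` available, each outgoing request can carry its own deadline (reqwest's `RequestBuilder::timeout` accepts a `Duration`), which is what removing the client-wide timeout makes possible.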

Testing

mise run pre-commit passes
Unit tests added/updated
E2E tests added/updated (if applicable)

Checklist

Follows Conventional Commits
Commits are signed off (DCO)
Architecture docs updated (if applicable)

github-actions · 2026-03-30T07:14:07Z

All contributors have signed the DCO ✍️ ✅
Posted by the DCO Assistant Lite bot.

pentschev · 2026-03-30T07:14:39Z

I have read the DCO document and I hereby sign the DCO.

pentschev · 2026-03-30T07:16:21Z

recheck

johntmyers · 2026-03-30T14:43:42Z

Hi, thank you. Please address the failing branch checks. AGENTS.md describes this:

## Pre-commit

- Run `mise run pre-commit` before committing.
- Install the git hook when working locally: `mise generate git-pre-commit --write --task=pre-commit`

pentschev · 2026-03-30T16:24:21Z

Sorry, that was my mistake; it should be fixed now. Could you check again, @johntmyers?

pentschev added 3 commits March 29, 2026 03:26

feat(inference): add timeout

243d59f

feat(inference): fix dynamic timeout change

dcd1cb5

feat(inference): update docs

d1c90d7

pentschev requested a review from a team as a code owner March 30, 2026 07:13

johntmyers self-assigned this Mar 30, 2026

feat(inference): fix formatting

479d951

johntmyers added the test:e2e label (Requires end-to-end coverage) Mar 30, 2026

This was referenced Mar 30, 2026

fix(ci): skip docs preview deploy for fork PRs #679

Merged

fix(ci): enable e2e for fork PRs via pull_request_target #680

Closed

pentschev wants to merge 4 commits into NVIDIA:main from pentschev:inference-timeout
