
Block vLLM/SGLang serve on non-Linux with clear error #966

Closed

alfredoclarifai wants to merge 2 commits into cli-improvement from cli-improvement-vllm-platform-check

Conversation

@alfredoclarifai
Contributor

Summary

  • Add early platform check when serving models that use vLLM or SGLang toolkits
  • On macOS/Windows, these engines crash deep in C extensions with opaque AttributeError tracebacks
  • Now fails fast with a clear message: what's wrong and what to do instead (cloud deploy or Ollama)
  • Applied to both serve paths (API-connected and --grpc)

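The guard described above can be sketched roughly as follows. This is a minimal, self-contained sketch, not the PR's exact code: `check_serve_platform` is a hypothetical helper name, and `dependencies`/`UserError` stand in for the CLI's own parsed requirements and error type.

```python
import platform


class UserError(Exception):
    """Stand-in for the CLI's user-facing error type."""


def check_serve_platform(dependencies, toolkit_provider):
    """Fail fast when a vLLM/SGLang model is served on a non-Linux host."""
    if platform.system() != "Linux":
        for engine in ("vllm", "sglang"):
            if engine in dependencies or toolkit_provider == engine:
                raise UserError(
                    f"{engine} is not supported on {platform.system()}. "
                    "It requires a Linux environment with GPU access.\n"
                    "  Use 'clarifai model deploy .' to run on cloud GPU, "
                    "or switch to the Ollama or LM Studio toolkit for local serving."
                )
```

Because the check only reads `platform.system()` and the already-parsed dependency set, it is cheap enough to run unconditionally at the top of both serve paths.
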
Test plan

  • On macOS, run clarifai model serve . in a vLLM model directory — should get a clear error instead of a C-extension traceback
  • On macOS, run clarifai model serve --grpc in an SGLang model directory — same clear error
  • On Linux, verify serve still works normally (platform check passes)

🤖 Generated with Claude Code

alfredoclarifai and others added 2 commits February 26, 2026 10:06
vLLM and SGLang only support Linux with GPU access. On macOS/Windows,
they crash deep in C extensions with opaque errors. This adds an early
platform check in both serve paths (API-connected and --grpc) to fail
fast with an actionable message suggesting cloud deploy or Ollama.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Copilot AI left a comment


Pull request overview

Adds an early OS/platform guard in the CLI model serving paths to fail fast (with a clearer UserError) when attempting to serve vLLM/SGLang-based models on non-Linux platforms.

Changes:

  • Add non-Linux detection for vLLM/SGLang during clarifai model serve (API-connected) validation.
  • Add similar detection during clarifai model serve --grpc validation.
  • Reuse toolkit.provider (when present) alongside requirements.txt inspection to detect the engine.
Comments suppressed due to low confidence (1)

clarifai/cli/model.py:1404

  • serve_cmd introduces toolkit_provider = config.get('toolkit', {}).get('provider'), but later in the same validation block the LM Studio branch still re-reads config.get('toolkit', {}).get('provider') instead of using toolkit_provider. Using the cached variable consistently would avoid duplicate lookups, keep the checks uniform, and reduce the chance of future drift between branches.
    toolkit_provider = config.get('toolkit', {}).get('provider')
    if _platform.system() != "Linux":
        for engine in ('vllm', 'sglang'):
            if engine in dependencies or toolkit_provider == engine:
                raise UserError(
                    f"{engine} is not supported on {_platform.system()}. It requires a Linux environment with GPU access.\n"
                    "  Use 'clarifai model deploy .' to run on cloud GPU, or switch to the Ollama or LM Studio toolkit for local serving."
                )

    if "ollama" in dependencies or toolkit_provider == 'ollama':

Comment on lines +1396 to +1402
if _platform.system() != "Linux":
    for engine in ('vllm', 'sglang'):
        if engine in dependencies or toolkit_provider == engine:
            raise UserError(
                f"{engine} is not supported on {_platform.system()}. It requires a Linux environment with GPU access.\n"
                "  Use 'clarifai model deploy .' to run on cloud GPU, or switch to the Ollama or LM Studio toolkit for local serving."
            )

Copilot AI Feb 27, 2026


This adds new platform-specific behavior (raising UserError on non-Linux when vLLM/SGLang is detected), but the existing CLI tests for clarifai model serve don’t appear to cover these branches. Adding unit tests that mock platform.system() (and set up requirements.txt/toolkit.provider for vllm/sglang) would prevent regressions and should cover both API-connected serve and the --grpc path.
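Such a regression test could look roughly like this. It is a sketch only: `validate_serve` is a hypothetical stand-in for the validation block under test, since the real suite would exercise the actual clarifai CLI entry points with platform.system() patched.

```python
import platform
from unittest import mock


class UserError(Exception):
    """Stand-in for the CLI's user-facing error type."""


def validate_serve(dependencies, toolkit_provider):
    # Hypothetical stand-in for the serve-path validation block.
    if platform.system() != "Linux":
        for engine in ("vllm", "sglang"):
            if engine in dependencies or toolkit_provider == engine:
                raise UserError(f"{engine} is not supported on {platform.system()}.")


def test_vllm_blocked_on_macos():
    # Simulate macOS without needing a real Mac.
    with mock.patch("platform.system", return_value="Darwin"):
        try:
            validate_serve({"vllm"}, None)
            assert False, "expected UserError"
        except UserError:
            pass


def test_serve_allowed_on_linux():
    # On Linux the guard must be a no-op.
    with mock.patch("platform.system", return_value="Linux"):
        validate_serve({"vllm"}, None)  # should not raise
```

Mirrored tests with toolkit_provider set to "sglang" (and for the --grpc path) would cover the remaining branches.
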

Comment on lines +1116 to +1125
import platform as _platform

toolkit_provider = config.get('toolkit', {}).get('provider')
if _platform.system() != "Linux":
    for engine in ('vllm', 'sglang'):
        if engine in dependencies or toolkit_provider == engine:
            raise UserError(
                f"{engine} is not supported on {_platform.system()}. It requires a Linux environment with GPU access.\n"
                "  Use 'clarifai model deploy .' to run on cloud GPU, or switch to the Ollama or LM Studio toolkit for local serving."
            )

Copilot AI Feb 27, 2026


In _run_local_grpc, the non-Linux guard for vLLM/SGLang only runs inside if mode not in ("container", "env"). That means clarifai model serve --grpc --mode env|container on macOS/Windows will skip this early check and can still hit the same deep C-extension failures this PR is trying to avoid. Consider moving the platform/toolkit check outside the mode gate (using toolkit_provider and/or parsing requirements regardless of mode) so the behavior is consistent across all --grpc modes.
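One way to restructure this along the lines the comment suggests is to hoist the guard ahead of the mode branch. The sketch below is a simplification, not the real _run_local_grpc: the function body, the `run_local_grpc` name, and the config accessors are stand-ins assumed for illustration.

```python
import platform as _platform


class UserError(Exception):
    """Stand-in for the CLI's user-facing error type."""


def _check_engine_platform(dependencies, toolkit_provider):
    """Non-Linux guard for vLLM/SGLang, shared by all --grpc modes."""
    if _platform.system() != "Linux":
        for engine in ("vllm", "sglang"):
            if engine in dependencies or toolkit_provider == engine:
                raise UserError(f"{engine} is not supported on {_platform.system()}.")


def run_local_grpc(config, dependencies, mode):
    # Run the guard before branching on mode, so --mode env/container
    # also fail fast on macOS/Windows instead of hitting C-extension errors.
    _check_engine_platform(dependencies, config.get("toolkit", {}).get("provider"))
    if mode in ("container", "env"):
        return f"prepare {mode} serving"
    return "serve in-process"
```

Placing the check before the `mode` branch keeps the behavior identical for the default mode while extending the fail-fast path to env and container modes.
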

@luv-bansal
Contributor

@alfredoclarifai I have included these improvements in my branch, so closing this PR

@luv-bansal luv-bansal closed this Mar 1, 2026