[FIX] Use actual model name for Azure OpenAI cost tracking#1805
pk-zipstack wants to merge 3 commits into main
Conversation
Azure OpenAI uses deployment_name for LiteLLM API routing, but cost tracking needs the real model name (e.g., gpt-4o) to match pricing table entries. Preserve the user-provided model as cost_model in validate() output and use it for usage recording and embedding metadata. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
No actionable comments were generated in the recent review. 🎉
Summary by CodeRabbit
Walkthrough
Capture user-provided model names as a separate cost_model during Azure parameter validation, store cost_model on Embedding/LLM instances (removing it from kwargs), and propagate cost_model into usage/logging while deployment_name continues to be used for routing.
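The capture-and-propagate step in the walkthrough can be sketched as follows. This is a hypothetical simplification: the standalone function shape and the `validated` dict are assumptions, while the `model`/`deployment_name`/`cost_model` keys and the `azure/` prefix follow the PR description.

```python
from typing import Any


def validate(params: dict[str, Any]) -> dict[str, Any]:
    """Normalize Azure params: route by deployment, keep model for costs."""
    validated = dict(params)
    # Capture the user-provided model before it is overwritten below.
    original_model = validated.get("model")  # e.g. "gpt-4o"

    # Routing: LiteLLM expects "azure/<deployment_name>".
    deployment = validated["deployment_name"]  # e.g. "my-gpt4-prod"
    validated["model"] = f"azure/{deployment}"

    # Cost tracking: preserve the real model name so it matches the
    # pricing table, which is keyed like "azure/gpt-4o".
    if original_model:
        validated["cost_model"] = f"azure/{original_model}"
    return validated


result = validate({"model": "gpt-4o", "deployment_name": "my-gpt4-prod"})
# result["model"] == "azure/my-gpt4-prod"  (routing)
# result["cost_model"] == "azure/gpt-4o"   (pricing lookup)
```

When the `model` field is empty, `cost_model` is simply not set, matching the fallback behavior described later in the PR.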
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant User
    participant Adapter as AzureAdapter
    participant Embedding
    participant LLM
    participant Usage as UsageLogger
    User->>Adapter: provide model (may be deployment_name)
    Adapter->>Adapter: normalize deployment_name<br/>capture original model -> cost_model
    Adapter->>Embedding: pass kwargs (with cost_model)
    Embedding->>Embedding: pop and store cost_model as _cost_model
    User->>LLM: request completion
    LLM->>LLM: use deployment_name for routing<br/>use _cost_model (or fallback) for logging
    LLM->>Usage: emit usage with model = cost_model
```
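Following the diagram, a minimal stand-in for the LLM side shows how routing and cost logging diverge. The class below is illustrative only (real signatures differ), but the pop from kwargs into `self._cost_model` and the `_record_usage()` fallback mirror the PR description.

```python
from typing import Any


class LLM:
    """Simplified stand-in for the SDK's LLM wrapper."""

    def __init__(self, **kwargs: Any) -> None:
        self.kwargs = kwargs
        # Remove cost_model before kwargs ever reach litellm, so litellm
        # never sees an unknown parameter.
        self._cost_model = self.kwargs.pop("cost_model", None)

    def complete(self, prompt: str) -> str:
        routing_model = self.kwargs["model"]  # e.g. "azure/my-gpt4-prod"
        # A litellm completion call using routing_model would go here.
        self._record_usage()
        return f"completed via {routing_model}"

    def _record_usage(self) -> None:
        # Non-Azure adapters never set cost_model, so fall back to the
        # routing model name, preserving existing behavior.
        usage_model = self._cost_model or self.kwargs["model"]
        print(f"usage recorded against {usage_model}")
```

With `model="azure/my-gpt4-prod"` and `cost_model="azure/gpt-4o"`, the API call routes through the deployment while usage is recorded against the pricing-table key.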
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~20 minutes
🚥 Pre-merge checks: ✅ 2 passed, ❌ 1 failed (1 warning)
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@unstract/sdk1/src/unstract/sdk1/embedding.py`:
- Around line 195-199: Move the pop of "cost_model" into Embedding.__init__: in
the Embedding class constructor, pop "cost_model" from self.kwargs (or kwargs)
and store it as self._cost_model so it is removed before any calls to
litellm.embedding(); remove the current pop in the code that sets
self.model_name (the snippet using
self._embedding_instance.kwargs.pop("cost_model", None)), and update
EmbeddingCompat to read adapter metadata from Embedding._cost_model instead of
relying on post-init popping; ensure get_embeddings(), get_aembedding(), and
get_aembeddings() no longer pass cost_model through to litellm.embedding(),
while get_embedding() behavior remains unchanged.
ℹ️ Review info
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Cache: Disabled due to Reviews > Disable Cache setting
Knowledge base: Disabled due to Reviews -> Disable Knowledge Base setting
📒 Files selected for processing (3)
- unstract/sdk1/src/unstract/sdk1/adapters/base1.py
- unstract/sdk1/src/unstract/sdk1/embedding.py
- unstract/sdk1/src/unstract/sdk1/llm.py
Move cost_model removal from EmbeddingCompat into Embedding.__init__ so it is popped before the test connection call and never passed to litellm.embedding() in any method (get_embeddings, get_aembedding, get_aembeddings). EmbeddingCompat now reads the stored _cost_model attribute instead of post-init popping from kwargs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Test Results Summary
Runner Tests - Full Report
SDK1 Tests - Full Report



What
- Preserve the user-provided `model` field (actual model name like `gpt-4o`) as `cost_model` in the Azure OpenAI adapter validation output
- Use `cost_model` for usage recording and embedding metadata instead of the deployment name

Why
- Azure OpenAI uses `deployment_name` (e.g., `my-gpt4-prod`) as the `model` parameter for LiteLLM routing (`azure/my-gpt4-prod`), which is correct for API calls
- Cost tracking matches `model_name` against a pricing table keyed by actual model names (e.g., `azure/gpt-4o`)
- When `model_name` is `azure/my-gpt4-prod`, no pricing entry matches and costs are not tracked

How
- `base1.py` — `AzureOpenAILLMParameters.validate()`: Capture the original `model` value before `validate_model()` overwrites it with `deployment_name`. Add `cost_model` (with `azure/` prefix) to the validated output dict when a model name was provided
- `base1.py` — `AzureOpenAIEmbeddingParameters.validate()`: Same pattern for embedding adapters
- `llm.py` — `__init__()`: Pop `cost_model` from `self.kwargs` into `self._cost_model` so it doesn't get passed to litellm
- `llm.py` — `complete()`, `stream_complete()`, `acomplete()`: Pop `cost_model` from re-validated `completion_kwargs` before passing to litellm. Use `self._cost_model` (falling back to `self.kwargs["model"]`) in `_record_usage()` calls
- `embedding.py` — `EmbeddingCompat.__init__()`: Pop `cost_model` from kwargs and use it for `self.model_name`, falling back to the routing model name

Can this PR break any existing features. If yes, please list possible items. If no, please explain why. (PS: Admins do not merge the PR without this section filled)
No. Other adapters do not emit `cost_model` in their `validate()` output, so `self._cost_model` is `None` and `self.kwargs["model"]` is used as before — zero impact. For Azure adapters, when the `model` field is empty, `cost_model` is not set and behavior falls back to current (deployment name used for cost). The `cost_model` key is always popped before passing kwargs to litellm, so no unknown parameter errors.

Database Migrations
Env Config
Relevant Docs
Related Issues or PRs
Dependencies Versions
Notes on Testing
Tested with `model=gpt-4o` and `deployment_name=my-deploy`: usage is recorded as `gpt-4o` (matching the pricing table) instead of `my-deploy`

Screenshots
N/A
Checklist
I have read and understood the Contribution Guidelines.
🤖 Generated with Claude Code