Quotas
Quotas let administrators set cost limits on LLM usage. A quota definition sets the limit and period, and a quota mapping applies it to a specific context — at the ticket level, workspace level, or across the entire platform. When a limit is reached, the agent stops processing and reports the overage. All limits reset at UTC midnight.
Quota Definitions
Go to AI Admin → Access Control → Quota → Quota Definition. This page lists all quota policies, showing their name, period type, and USD limit.

Add a Quota Definition
Click + Add. Fill in the fields:
Name — a unique identifier for the quota policy
Period Type — how frequently the limit resets (e.g. Daily)
Limit (USD) — the maximum spend allowed within the period
Click Create.

Quota Mappings
Click the Quota Mapping tab. Mappings bind a quota definition to a context and an Apply to value, determining which activity the limit applies to.

Add a Quota Mapping
Click + Add. Fill in the fields:
Name — a label for this mapping
Quota — the quota definition to apply
Context — whether this applies at the Workspace or Platform level
Target Workspace — the specific workspace (when Context is Workspace)
Apply to — what the limit tracks: Ticket, Workspace, or All
Ticket Quota
When Apply to is set to Ticket, the quota limit applies per individual ticket. Each ticket has its own spend counter — one ticket reaching the limit does not affect other tickets in the same workspace.

To verify, create a ticket in the target workspace and run a request.

Once the ticket's cumulative cost hits the defined limit, the agent stops and returns an error showing the ticket ID, the amount used, and when the limit resets.

Because the quota is scoped to the individual ticket, a new ticket created in the same workspace starts with a fresh counter and continues working normally.

Workspace Quota
When Apply to is set to Workspace, the quota limit applies to all activity within a specific workspace. The cumulative spend across all tickets in that workspace counts toward a single shared limit.

Create a ticket and run a request in the workspace.

Once the workspace's cumulative spend reaches the limit, the agent blocks all new requests within that workspace, reporting the workspace name, total used, and reset time.

Switching to a different workspace bypasses the limit entirely — the quota is scoped to the specific workspace it was mapped to, so other workspaces are unaffected.

A ticket in the new workspace runs successfully with its own independent cost counter.

Platform Quota
When Context is set to Platform, the quota applies across all workspaces on the platform. Set Apply to to All to enforce it globally regardless of workspace.

Once the platform-wide limit is reached, all agent requests across every workspace are blocked, regardless of which workspace or ticket they originate from.

Last updated
Was this helpful?

