Cascading policy
Policies flow through a hierarchy:| Level | Example rule |
|---|---|
| Org | budget · max $50k / mo |
| Dept | model · only gpt-5 |
| App | rate · 60 RPM |
| Key | scope · read-only |
Policy types
- Rate limits — requests per minute (RPM) and tokens per minute (TPM), per key, model, or endpoint
- Budgets — daily, weekly, and monthly spend caps with automatic enforcement
- Model allowlisting — restrict which models a key can reach
- Endpoint restrictions — control which API endpoints a key can call
- Guardrails — content policies for intent blocking, PII redaction, and toxicity thresholds
- Custom rules — regex and keyword patterns with block, flag, or reclassify actions
- Intent rules — per-intent actions such as allow, block, or force optimization
KYT (Know Your Tokens)
KYT is DeepMyst’s inline security engine. Every request is classified in under 15ms before it reaches a model, so threats are caught in the request path — not after the fact.Prompt injection
Detects attempts to override system prompts or exfiltrate instructions.
PII detection & redaction
Identifies personal information and redacts it before forwarding — replace,
mask, tokenize, or hash.
Toxicity scoring
Rates content against your thresholds and flags or blocks accordingly.
Intent classification
Categorizes every request so policy can act on what it’s actually trying to do.
Two passes
- Pass 1 (inline, <15ms) runs synchronously in the request path to classify and enforce guardrails before the model is called.
- Pass 2 (async) runs after the response for deeper enrichment, behavioral analytics, and anomaly detection across requests.
Guardrail enforcement
Based on classification, the gateway can:- Block the request and return an error
- Flag it for manual review
- Redact PII before forwarding
- Allow it with full logging
Identity and access
- SSO — SAML 2.0 and OIDC for enterprise sign-in
- SCIM — automatic user provisioning and de-provisioning
- RBAC — role-based access control across owner, admin, operator, team lead, auditor, member, and viewer roles, plus a read-only flag
- Access scopes — row-level grants that give a team lead read access to specific projects, departments, or users
Audit and replay
Every request and every agent run is written to an immutable audit log:- Audit-logged forever — inputs, outputs, model, tokens, cost, and policy decisions
- Replayable — replay any run on demand to reproduce what happened
- Evals on every output — each result is scored against your rubric
Credits and metering
Usage is metered per request and billed in credits (1 credit = $0.01). Every response carries DeepMyst headers — request ID, token counts, compression ratio, cost, credits, rate-limit and budget remaining, classification category, and whether the request was flagged or redacted — so cost and policy are observable on every call.Next steps
The OS Stack
See how governance sits beneath every agent and request
Agent Builder
Build agents that inherit this governance automatically

