Guardrails Reference

Guardrails are expressed with two gateway-agnostic Custom Resource Definitions:

GuardrailProvider — a backend that implements a specific guardrail API contract.
Guard — a policy that binds a provider to a concrete inspection point (pre-call, post-call, during-call) with provider-specific configuration.

Gateway resources (for example AiGateway) reference one or more Guards in their spec.guardrails list.

How Guardrails Work

Two-tier model

GuardrailProvider and Guard separate transport from policy:

GuardrailProvider declares where and how to reach a backend service that implements a guardrail contract (Presidio, OpenAI Moderation, AWS Bedrock). One provider can back many guards.
Guard binds a provider to a concrete policy: when it runs (mode), and what configuration to apply (entity actions, score thresholds, language, …). Each guard targets exactly one provider.

This split lets a platform team operate one set of provider deployments while application teams own the per-gateway policies that reference them.

Runtime model

Gateway CRDs expose a uniform guardrails field — an ordered list of references to Guard resources. The owning gateway operator turns each reference into runtime configuration in two steps:

Resolution — the operator reads the referenced Guard and its providerRef, then validates that the provider type is one it supports for that gateway. Missing references, unsupported provider types, and invalid configuration surface as Ready=False conditions on the Guard and/or the gateway. Operators reconcile gateways when referenced Guard and GuardrailProvider resources change, so policy updates roll out automatically.
Translation — the operator translates the resolved Guard plus provider into the gateway implementation’s native guardrail mechanism. The translation is implementation-specific:
- The AI Gateway LiteLLM Operator writes a guardrails: block into the LiteLLM ConfigMap, with each entry carrying the resolved provider URL, score thresholds, and entity actions.
- The Tool Gateway Agentgateway Operator deploys a dedicated guardrail-adapter workload per Guard and wires the gateway’s AgentgatewayPolicy extProc.backendRef at it.

At request time, the gateway invokes the configured guardrail at the points selected by Guard.spec.mode (pre_call, post_call, during_call — see Guard modes). The provider returns the inspected payload with sensitive entities replaced (action MASK) or signals that the call should be rejected (BLOCK); the gateway applies the decision before forwarding traffic.

Cross-namespace references

Guard.spec.providerRef and the ObjectReference entries in <Gateway>.spec.guardrails both accept explicit namespace fields, defaulting to the referencing resource’s own namespace. This supports a layered layout:

Shared GuardrailProvider resources in a guardrail-providers namespace.
Reusable Guard resources in a guards namespace.
Application gateways in their own namespaces, referencing both.

Gateway operators are granted RBAC to read Guard and GuardrailProvider resources cluster-wide.

Operator-specific behaviour

Translation, status conditions, and per-provider support vary by gateway operator. The Gateway Support Matrix below summarises what each operator implements today; for the full per-gateway behaviour see the operator’s own reference page.

Full API Reference

https://pkg.go.dev/github.com/agentic-layer/agent-runtime-operator/api/v1alpha1

CRDs

GuardrailProvider

A GuardrailProvider describes where and how to reach a guardrail backend. Per-policy tuning (which entities to catch, at which confidence, what to do with them) lives on the Guard, not on the provider.

Spec

Field Type Description

Field	Type	Description
`type`	`string` (required)	API contract implemented by this provider. One of `presidio-api`, `openai-moderation-api`, `bedrock-api`.
`openaiModeration`	object	Configuration for `type: openai-moderation-api`.
`bedrock`	object	Configuration for `type: bedrock-api`.
`presidio`	object	Configuration for `type: presidio-api`.

type

string (required)

API contract implemented by this provider. One of presidio-api, openai-moderation-api, bedrock-api.

openaiModeration

object

Configuration for type: openai-moderation-api.

bedrock

object

Configuration for type: bedrock-api.

presidio

object

Configuration for type: presidio-api.

`presidio` config

Field Type Description

Field	Type	Description
`baseUrl`	URI (required)	HTTP endpoint of the Presidio Analyzer service. Must be reachable from the gateway’s namespace.
`apiKeySecretRef`	`SecretKeySelector`	Optional API key for authenticated Presidio deployments.

baseUrl

URI (required)

HTTP endpoint of the Presidio Analyzer service. Must be reachable from the gateway’s namespace.

apiKeySecretRef

SecretKeySelector

Optional API key for authenticated Presidio deployments.

`openaiModeration` config

Field Type Description

Field	Type	Description
`baseUrl`	URI	Override the default `https://api.openai.com/…;` endpoint. Use this to point at a custom service that implements the OpenAI Moderation API contract. When omitted, the official OpenAI endpoint is used.
`apiKeySecretRef`	`SecretKeySelector`	Reference to the Secret key that contains the API key.

baseUrl

URI

Override the default https://api.openai.com/…; endpoint. Use this to point at a custom service that implements the OpenAI Moderation API contract. When omitted, the official OpenAI endpoint is used.

apiKeySecretRef

SecretKeySelector

Reference to the Secret key that contains the API key.

`bedrock` config

Field Type Description

Field	Type	Description
`region`	`string` (required)	AWS region hosting the Bedrock Guardrails service.
`credentialsSecretRef`	`SecretReference`	Secret containing `aws-access-key-id` and `aws-secret-access-key`. When omitted, the pod’s IRSA or pod-identity credentials are used.

region

string (required)

AWS region hosting the Bedrock Guardrails service.

credentialsSecretRef

SecretReference

Secret containing aws-access-key-id and aws-secret-access-key. When omitted, the pod’s IRSA or pod-identity credentials are used.

Guard

A Guard binds a provider to a concrete policy and defines when the policy runs relative to the LLM or tool call.

Spec

Field Type Description

Field	Type	Description
`mode`	`[]string` (required, min 1)	When the guard is applied. Each entry is one of `pre_call`, `post_call`, `during_call`. Multiple modes can be combined.
`description`	`string`	Human-readable description. Documentation only; has no runtime effect.
`providerRef`	`ObjectReference` (required)	Reference to the `GuardrailProvider`. `namespace` defaults to the Guard’s own namespace when omitted.
`openaiModeration`	object	Guard-level configuration for OpenAI Moderation.
`bedrock`	object	Guard-level configuration for AWS Bedrock Guardrails.
`presidio`	object	Guard-level configuration for Presidio.

mode

[]string (required, min 1)

When the guard is applied. Each entry is one of pre_call, post_call, during_call. Multiple modes can be combined.

description

string

Human-readable description. Documentation only; has no runtime effect.

providerRef

ObjectReference (required)

Reference to the GuardrailProvider. namespace defaults to the Guard’s own namespace when omitted.

openaiModeration

object

Guard-level configuration for OpenAI Moderation.

bedrock

object

Guard-level configuration for AWS Bedrock Guardrails.

presidio

object

Guard-level configuration for Presidio.

Guard modes

Mode What it inspects

Mode	What it inspects
`pre_call`	The outgoing payload before it reaches the LLM or tool server (prompt, tool call arguments).
`post_call`	The payload returned from the LLM or tool server (response text, tool results) before it reaches the caller.
`during_call`	Both the request and response. For protocols without a streaming body (for example MCP), this is equivalent to `pre_call + post_call`.

pre_call

The outgoing payload before it reaches the LLM or tool server (prompt, tool call arguments).

post_call

The payload returned from the LLM or tool server (response text, tool results) before it reaches the caller.

during_call

Both the request and response. For protocols without a streaming body (for example MCP), this is equivalent to pre_call + post_call.

`presidio` guard config

Field Type Description

Field	Type	Description
`language`	`string`	ISO language code for the content under inspection (for example `en`, `de`). Defaults to `en` when omitted.
`scoreThresholds`	`map[string]string`	Per-entity confidence cutoffs in the range `0.0`--`1.0`. Use `ALL` as the catch-all; per-entity keys override it. Example: `{"ALL": "0.5", "EMAIL_ADDRESS": "0.8"}`.
`entityActions`	`map[string]string`	Maps entity types to actions. `MASK` replaces the value with a placeholder (for example `<PERSON>`); `BLOCK` rejects the whole request. The keys also determine which entity types are detected. When omitted entirely, every supported entity is detected and blocked.

language

string

ISO language code for the content under inspection (for example en, de). Defaults to en when omitted.

scoreThresholds

map[string]string

Per-entity confidence cutoffs in the range 0.0--1.0. Use ALL as the catch-all; per-entity keys override it. Example: {"ALL": "0.5", "EMAIL_ADDRESS": "0.8"}.

entityActions

map[string]string

Maps entity types to actions. MASK replaces the value with a placeholder (for example <PERSON>); BLOCK rejects the whole request. The keys also determine which entity types are detected. When omitted entirely, every supported entity is detected and blocked.

Presidio entity types are documented at https://microsoft.github.io/presidio/supported_entities/. Common examples: PERSON, EMAIL_ADDRESS, PHONE_NUMBER, CREDIT_CARD, IP_ADDRESS, IBAN_CODE, US_SSN.

`openaiModeration` guard config

Field Type Description

Field	Type	Description
`model`	`string`	Moderation model to use (for example `omni-moderation-latest`). Defaults to the provider’s default when omitted.

model

string

Moderation model to use (for example omni-moderation-latest). Defaults to the provider’s default when omitted.

`bedrock` guard config

Field Type Description

Field	Type	Description
`guardrailId`	`string` (required)	Identifier of the Bedrock Guardrail.
`guardrailVersion`	`string`	Version of the Bedrock Guardrail. Defaults to the latest available version when omitted.

guardrailId

string (required)

Identifier of the Bedrock Guardrail.

guardrailVersion

string

Version of the Bedrock Guardrail. Defaults to the latest available version when omitted.

Attaching Guards to Gateways

Gateway CRDs expose a uniform guardrails field — an ordered list of ObjectReference entries pointing at Guard resources. The gateway’s operator resolves each reference, loads the referenced GuardrailProvider, and configures the underlying gateway accordingly.

spec:
  guardrails:
    - name: pii-guard
      namespace: guards
    - name: toxic-language-guard
      namespace: guards

The field is present on:

AiGateway.spec.guardrails
AgentGateway.spec.guardrails
ToolGateway.spec.guardrails

Gateway Support Matrix

Gateway Implementing operator Supported provider types Notes

Gateway	Implementing operator	Supported provider types	Notes
`AiGateway`	AI Gateway LiteLLM Operator	`presidio-api`	Generates LiteLLM `guardrails:` config from resolved Guards. `mode` maps directly to the LiteLLM `mode` field; `entityActions` map to LiteLLM’s `pii_entities_config`; `scoreThresholds` map to `presidio_score_thresholds`. Output PII tokens are parsed back to the original values on responses.
`AgentGateway`	agent-gateway-krakend-operator	—	Not yet implemented. The `guardrails` field is accepted by the CRD but ignored by the operator.
`ToolGateway`	Tool Gateway Agentgateway Operator	`presidio-api`	For each referenced `Guard`, the operator deploys a dedicated `guardrail-adapter` instance (`Deployment` + `ConfigMap` + `Service`) pre-configured from the `Guard` and its `GuardrailProvider`, and wires the gateway’s `AgentgatewayPolicy` `extProc.backendRef` at it. Single Guard per `ToolGateway` is supported today.

AiGateway

AI Gateway LiteLLM Operator

presidio-api

Generates LiteLLM guardrails: config from resolved Guards. mode maps directly to the LiteLLM mode field; entityActions map to LiteLLM’s pii_entities_config; scoreThresholds map to presidio_score_thresholds. Output PII tokens are parsed back to the original values on responses.

AgentGateway

agent-gateway-krakend-operator

—

Not yet implemented. The guardrails field is accepted by the CRD but ignored by the operator.

ToolGateway

Tool Gateway Agentgateway Operator

presidio-api

For each referenced Guard, the operator deploys a dedicated guardrail-adapter instance (Deployment + ConfigMap + Service) pre-configured from the Guard and its GuardrailProvider, and wires the gateway’s AgentgatewayPolicy extProc.backendRef at it. Single Guard per ToolGateway is supported today.

Samples

The agent-runtime-operator repository ships ready-to-apply samples that cover every supported provider type:

GuardrailProvider samples — https://github.com/agentic-layer/agent-runtime-operator/blob/main/config/samples/runtime_v1alpha1_guardrailprovider.yaml
Guard samples — https://github.com/agentic-layer/agent-runtime-operator/blob/main/config/samples/runtime_v1alpha1_guard.yaml

An end-to-end example that applies a Presidio guard to an AiGateway across three namespaces lives in the LiteLLM operator repository:

https://github.com/agentic-layer/ai-gateway-litellm-operator/blob/main/config/samples/v1alpha1_guardrail_presidio.yaml

Guardrails Reference

How Guardrails Work

Two-tier model

Runtime model

Cross-namespace references

Operator-specific behaviour

Full API Reference

CRDs

GuardrailProvider

Spec

presidio config

openaiModeration config

bedrock config

Guard

Spec

Guard modes

presidio guard config

openaiModeration guard config

bedrock guard config

Attaching Guards to Gateways

Gateway Support Matrix

Samples

`presidio` config

`openaiModeration` config

`bedrock` config

`presidio` guard config

`openaiModeration` guard config

`bedrock` guard config