Guardrails Overview

Guardrails are safety constraints that protect your customers and brand by defining what your agent cannot do and when it should escalate to humans.

What are Guardrails?

Guardrails are the safety net for your AI agents. They define boundaries and trigger actions when those boundaries are approached.

Two Types

Type	Purpose	When Triggered
Escalation Rules	Hand off to humans	Customer is upset, legal question, VIP, complexity
Restrictions	Block specific actions/topics	Forbidden topics, unsafe promises, confidential info

Escalation Rules

Define when the agent should stop and hand off to a human: Examples:

Rule	Description
Angry Customer	Customer expresses frustration or mentions canceling
Legal Question	Questions about contracts, liability, or legal terms
VIP Customer	Conversations from enterprise or high-value accounts
Complex Issue	Agent cannot confidently resolve after attempts
Human Request	Customer explicitly asks to speak with a person

When an escalation rule triggers:

Agent stops processing
Sends an appropriate message to the customer
Creates an internal note with context
Routes to human queue

Restrictions

Define absolute limits on what the agent cannot do: Examples:

Restriction	Response When Triggered
No refund promises	”I’d love to help! Let me connect you with our billing team who can review refund requests.”
No competitor discussion	”I’m focused on how we can help you. Let me tell you about what makes us great.”
No internal pricing	”For custom pricing, please contact our sales team at sales@example.com”
No medical/legal advice	”I can’t provide medical/legal advice, but I can help you find appropriate resources.”

Restrictions are the highest priority — they’re checked before anything else.

Priority Order

Guardrails are evaluated in this order:

1. Restrictions (highest priority)
        ↓
   If triggered → Block/Redirect
        ↓
2. Escalation Rules
        ↓
   If triggered → Escalate
        ↓
3. Normal Processing
        ↓
   Generate response

Detection Methods

Guardrails can detect triggers using:

Method	Best For	Example
AI-based	Complex context, sentiment, intent	”Customer is frustrated about repeated issues”
Keyword	Specific words or phrases	”cancel”, “lawsuit”, “speak to manager”
Regex	Structured patterns	Email addresses, order numbers

Testing Guardrails

Each guardrail includes a built-in playground:

Open the guardrail configuration
Enter test messages
See if the guardrail triggers
Refine detection criteria

Guardrails vs Guidelines

Guardrails	Guidelines
Safety constraints	Communication style
Hard limits	Soft guidance
Trigger actions (escalate, block)	Shape responses
Evaluated per-message	Always applied

Guardrail: “Never discuss competitor pricing” → Blocks the topic Guideline: “Be professional and helpful” → Shapes all responses

Common Guardrail Patterns

Safety Net

Catch situations that need human judgment:

Angry or threatening language
Legal or compliance topics
Account security concerns
Complex edge cases

Brand Protection

Prevent brand damage:

Competitor discussions
Unauthorized promises
Confidential information
Off-brand responses

Compliance

Meet regulatory requirements:

Privacy requests (GDPR, CCPA)
Financial advice disclaimers
Medical/legal limitations
Age-restricted content

Next Steps

Escalation Rules

Define when to hand off

Restrictions

Set hard limits

Getting Started

Building Agents

Training Your Agent

Tagging & Classification

Deploying

Testing

Analytics

Settings

What are Guardrails?

Two Types

Escalation Rules

Restrictions

Priority Order

Detection Methods

Testing Guardrails

Guardrails vs Guidelines

Common Guardrail Patterns

Safety Net

Brand Protection

Compliance

Next Steps

Escalation Rules

Restrictions

Getting Started

Building Agents

Training Your Agent

Tagging & Classification

Deploying

Testing

Analytics

Settings

​What are Guardrails?

​Two Types

​Escalation Rules

​Restrictions

​Priority Order

​Detection Methods

​Testing Guardrails

​Guardrails vs Guidelines

​Common Guardrail Patterns

​Safety Net

​Brand Protection

​Compliance

​Next Steps

Escalation Rules

Restrictions

What are Guardrails?

Two Types

Escalation Rules

Restrictions

Priority Order

Detection Methods

Testing Guardrails

Guardrails vs Guidelines

Common Guardrail Patterns

Safety Net

Brand Protection

Compliance

Next Steps