← Glossary
Agentic AI concepts

Guardrails

Rules, checks, permissions, and recovery paths that constrain an AI system so it stays within acceptable behavior and operational boundaries.

Guardrails are not a single feature. They include prompt instructions, input validation, output validation, tool permissions, policy classifiers, approval gates, rate limits, sandboxing, and monitoring.

The strongest guardrails are enforced outside the model. A model can be asked not to delete production data, but the safer design is to avoid giving it that permission unless an explicit approval gate has been satisfied.

Related terms
rbacprivilegehuman-in-the-loopaudit-trailsandboxing