Agentic AI concepts
Guardrails
Rules, checks, permissions, and recovery paths that constrain an AI system so it stays within acceptable behavior and operational boundaries.
Guardrails are not a single feature. They include prompt instructions, input validation, output validation, tool permissions, policy classifiers, approval gates, rate limits, sandboxing, and monitoring.
The strongest guardrails are enforced outside the model. A model can be asked not to delete production data, but the safer design is to avoid giving it that permission unless an explicit approval gate has been satisfied.
Related terms