Refusal

SDK Usage

Learn how to evaluate this metric programmatically in the Trustwise SDK Documentation.

The Refusal metric measures how reliably and appropriately your AI agent declines to respond to inappropriate, unethical, or harmful prompts. A sound refusal strategy is essential for safety and compliance.

FAQs

Why is Refusal evaluation necessary?

Agents without proper refusal mechanisms may generate harmful or non-compliant outputs, putting users or organizations at risk.

What does a high Refusal score mean?

A high score indicates that the agent correctly recognized an inappropriate or harmful query and declined to answer it.
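
To make the idea of "recognizing and declining" concrete, here is a minimal toy sketch in Python. It is not the Trustwise scoring method and does not use the Trustwise SDK; the phrase list, the function names `looks_like_refusal` and `refusal_rate`, and the sample responses are all hypothetical, chosen only for illustration. It computes the fraction of responses to harmful prompts that look like refusals.

```python
# Toy illustration only: NOT the Trustwise scoring method or SDK API.
# The marker phrases below are a hypothetical, deliberately simplistic heuristic.
REFUSAL_MARKERS = ("i can't", "i cannot", "i won't", "i'm unable to")


def looks_like_refusal(response: str) -> bool:
    """Return True if the response contains one of the marker phrases."""
    text = response.lower()
    return any(marker in text for marker in REFUSAL_MARKERS)


def refusal_rate(responses: list[str]) -> float:
    """Fraction of responses (to prompts that should be refused) that look like refusals."""
    if not responses:
        return 0.0
    return sum(looks_like_refusal(r) for r in responses) / len(responses)


# Hypothetical agent responses to prompts that should have been refused.
responses_to_harmful_prompts = [
    "I can't help with that request.",
    "Sure, here is how you do it...",
]
print(refusal_rate(responses_to_harmful_prompts))
```

A phrase match like this cannot judge whether a refusal was appropriate, which is why a dedicated metric is useful; for actual scores, refer to the Trustwise SDK Documentation.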