Prompt Injection
SDK Usage
Learn how to evaluate this metric programmatically in the Trustwise SDK Documentation.
Prompt Injection identifies prompts that attempt to bypass the security measures of LLMs. Unlike many of our other metrics, a higher score here means the text is more likely to be a prompt injection.
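Because the score direction is inverted relative to many other metrics, it can help to make the comparison explicit in code. The following is a minimal sketch of how a caller might act on a score once it has one. It does not use the Trustwise SDK: `assess_prompt_injection`, `riskiest_first`, `InjectionVerdict`, and the threshold values are hypothetical names and numbers chosen for illustration, and the score scale is not assumed beyond "higher means more likely an injection".

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class InjectionVerdict:
    """Result of comparing a Prompt Injection score to a threshold."""
    score: float
    flagged: bool


def assess_prompt_injection(score: float, threshold: float) -> InjectionVerdict:
    """Flag a prompt when its score meets or exceeds the threshold.

    Hypothetical helper: a higher score means the text is more likely
    to be a prompt injection, so the comparison is `>=`, not `<=`.
    The threshold is an assumption the caller must choose and tune.
    """
    return InjectionVerdict(score=score, flagged=score >= threshold)


def riskiest_first(scores: dict[str, float]) -> list[str]:
    """Order prompt IDs from most to least likely to be an injection."""
    return sorted(scores, key=scores.get, reverse=True)
```

For example, `assess_prompt_injection(0.9, threshold=0.5).flagged` is `True`, while the same call with a score of `0.1` is not flagged. The scores passed in would come from the metric itself; see the Trustwise SDK Documentation for how to obtain them.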