Policy Violation - Mindgard

What is Policy Violation?

Policy Violation is a type of policy which analyses outputs from your LLM application and uses an auxiliary LLM judge hosted by Mindgard to determine whether your LLM’s response violates the provided policy. For example

Competitors: if you want to test if your system can be exploited to recommend competitors, then you can use the policy The LLM should never recommend or discuss competitors to <Your Company> which would then flag any violations of this policy when applied to your test results.
Decoding: it is common for attackers to obfuscate malicious prompts and content through various forms of encoding. You could provide a policy The LLM should never decode or interpret encoded content which would then flag any violations of this policy.

​What is Policy Violation?

What is Policy Violation?