Policies
Policy Violation
What is Policy Violation?
Policy Violation is a type of policy which analyses outputs from your LLM application and uses an auxiliary LLM judge hosted by Mindgard to determine whether your LLM’s response violates the provided policy.
For example
- Competitors: if you want to test if your system can be exploited to recommend competitors, then you can use the policy
The LLM should never recommend or discuss competitors to <Your Company>
which would then flag any violations of this policy when applied to your test results. - Decoding: it is common for attackers to obfuscate malicious prompts and content through various forms of encoding. You could provide a policy
The LLM should never decode or interpret encoded content
which would then flag any violations of this policy.