// "tabs": [
  //   {
  //     "name": "API Reference",
  //     "url": "api-reference"
  //   }
  // ],
  // "anchors": [
  //   {
  //     "name": "Documentation",
  //     "icon": "book-open-cover",
  //     "url": "https://mintlify.com/docs"
  //   },
  //   {
  //     "name": "Community",
  //     "icon": "slack",
  //     "url": "https://mintlify.com/community"
  //   },
  //   {
  //     "name": "Blog",
  //     "icon": "newspaper",
  //     "url": "https://mintlify.com/blog"
  //   }
  // ],

Goal

Impact

How do these attacks work?

Example Threat Scenario

Remediation

Overview

Mindgard

Introduction

Sign up for a trial

Explore Mindgard using a demo model

Trial Limitations & PoC

Plan AI Risk Management

Prerequisites

Testing via Web

Testing via CLI

Testing via Burp

Python SDK

Testing Chatbot Applications

Test Results

Exporting Test Results

Workflow Integrations

Reporting

Custom Prompts

Generate your own dataset using the CLI to red-team your system with more context

Generating Custom Datasets

Using Custom Datasets

Running a subset of attacks

Input In Output

Empty Output

LLM Error

Decode

Regex Refusal

Policy Violation

AI Risk Remediation

Enterprise Setup

Support & Troubleshooting

Ever growing list of LLM/ML attack techniques.

Attack Library Overview

Introduction to Remediation

ML Output Obfuscation

Apply Context Windows

Structure prompts with clear delineation between system instructions and user input.

Separate System Instructions

Break the system prompt into distinct components to enhance security.

Dynamically Compile System Prompt

Limit the total number of queries a user can perform.

Query Restrictions

Use multiple models in consensus with each other.

Ensemble Methods

Make AI models more robust to adversarial inputs.

Model Hardening

Nullify or reverse potential adversarial perturbations.

Input Restoration

Encrypt data into a form the model can process

Homomorphic Encryption

Ensure privacy of individuals within datasets.

Differential Privacy

Overfitting Detection

Preprocess Input Text

Anonymise sensitive data during training.

Anonymization of Data

Change your System Prompt to reject malicious content.

Refine System Prompt

Filter potentially malicious content from the output of the LLM.

Filter Outputs

Deploy guardrails to block malicious prompts.

Welcome

User Guide

Attack Library

Remediation Library

Overview

Goal

Impact

How do these attacks work?

Example Threat Scenario

Remediation

Preprocess Input Text

Implement Guardrails

Refine System Prompt

Model Hardening

Further Reading

Welcome

User Guide

Attack Library

Remediation Library

​Goal

​Impact

​How do these attacks work?

​Example Threat Scenario

​Remediation

Preprocess Input Text

Implement Guardrails

Refine System Prompt

Model Hardening

​Further Reading

Goal

Impact

How do these attacks work?

Example Threat Scenario

Remediation

Further Reading