Mindgard

Microsoft Azure Prompt Shield Guardrail Evasion

Affected Vendor(s)

Microsoft

Affected Product(s)

Azure Prompt Shield

Summary

Azure Prompt Shield is a jailbreak and prompt injection classifier created by Microsoft. Prompt Shield can be enabled by developers using Azure OpenAI to protect their AI applications from these threats. We successfully demonstrated how an attacker can fully evade, or greatly degrade, classification accuracy of the classifier, enabling prompt injections and jailbreaks to pass through filters and subsequently to the protected AI application.

Timeline

Discovered on

Disclosed to Vendor on

June 6, 2024

Published on

June 24, 2024

Credit

William Hackett

Lewis Birch

Blog Post

Bypassing Azure AI Content Safety Guardrails

References