prompt extraction

prompt extraction

An attack that tries to divulge the system prompt or other information in the context of a large language model that would normally be hidden from a user.

📚 Reference: NIST AI 100-2e2025
🏷️ Category: Cybersecurity
📊 Commonality: common