Loading…
Loading…
Written by Max Zeshut
Founder at Agentmelt
A defense mechanism where the AI model is trained to prioritize instructions from different sources in a fixed order: system prompt > developer instructions > user messages > retrieved content. Instruction hierarchy prevents prompt injection attacks where malicious content in emails, documents, or web pages tries to override the agent's behavior. It is one of the most effective defenses for agents that process untrusted external data.
Back to glossary