Improving instruction hierarchy in frontier LLMs


Source: OpenAI Blog Published: March 10, 2026

Summary

IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.


Read full article on OpenAI Blog