shipfeedAI news, curated daily

00:36:32 CET
21 MAY00:36:32shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

Improving instruction hierarchy in frontier LLMs

IH-Challenge trains frontier LLMs to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.

Mar 10 · · primary fetch1 sourceupdated Mar 10 ·

IH-Challenge trains models to prioritize trusted instructions, improving instruction hierarchy, safety steerability, and resistance to prompt injection attacks.

read full article on openai.com
§ sources1 publication · timeline below
  1. openai.comImproving instruction hierarchy in frontier LLMsprimary