shipfeedAI news, curated daily

23:04:58 CET
20 MAY23:04:58shipfeed
pull to refreshlast sync
Just in — 30 new
§ feed · storyline

gpt-oss-safeguard technical report

OpenAI releases gpt-oss-safeguard-120b and gpt-oss-safeguard-20b, two open-weight reasoning models trained to classify content against a provided policy.

Oct 29 · · primary fetch1 sourceupdated Oct 29 ·

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are two open-weight reasoning models post-trained from the gpt-oss models and trained to reason from a provided policy in order to label content under that policy. In this report, we describe gpt-oss-safeguard’s capabilities and provide our baseline safety evaluations on the gpt-oss-safeguard models, using the underlying gpt-oss models as a baseline.

For more information about the development and architecture of the underlying gpt-oss models, see the original gpt-oss model model card⁠.

read full article on openai.com
§ sources1 publication · timeline below
  1. openai.comgpt-oss-safeguard technical reportprimary