shipfeed · AI news, curated daily

Finding GPT-4’s mistakes with GPT-4

CriticGPT, a GPT-4-based model, writes critiques of ChatGPT outputs to help human trainers identify errors during reinforcement learning from human feedback (RLHF).

Jun 27 · primary fetch · 1 source · updated Jun 27

CriticGPT, a model based on GPT-4, writes critiques of ChatGPT responses to help human trainers spot mistakes during RLHF.

read full article on openai.com
§ sources · 1 publication · timeline below
  1. openai.com · Finding GPT-4’s mistakes with GPT-4 · primary