§ feed · storyline
Weak-to-strong generalization
Weak-to-strong generalization
We present a new research direction for superalignment, together with promising initial results: can we leverage the generalization properties of deep learning to control strong models with weak supervisors?
§ sources1 publication · timeline below
- openai.comWeak-to-strong generalizationprimary