§ gpt-4o · storyline
AI model scaling and cognitive benchmarking research
Research finds model capabilities interact significantly above a 3.5B parameter threshold and identifies cognitive blind spots in GPT-4o and Claude 3.5 on attention tests.
New research studies demonstrate that model capabilities interact and cooperate significantly above a 3.5B parameter threshold, while also identifying fundamental cognitive blind spots in state-of-the-art models like GPT-4o and Claude 3.5 regarding attention tests.
§ sources1 publication · timeline below