Thumbnail 1666112
thumbnail
Large (256x256)

Articles

OpenAI gets caught vibe graphing
During its big GPT-5 livestream on Thursday, OpenAI showed off a few charts that made the model seem quite impressive - but if you look closely, some graphs were a little bit off. In one, ironically showing how well GPT-5 does in deception evals across models," the scale is all over the place. For coding [...]
1