Thread Easy
Explorar
Criar thread
Instalar
Trocar modo
Trocar idioma
Explorar
Newest first — browse tweet threads
Author handle
From date
To date
Apply filters
Reset
Blur thumbnails
Keep on to blur preview images; turn off to show them clearly
basically the only two evals I care about at this stage of the game are: -how reliably can it do 30min+ tasks without messing up -how much do the cot’s make me feel the AGI (this post isn’t meant as a slant against gemini 3 in any way, looks like a strong model and congrats to the team on the launch!)
researcher @OpenAI | prev CMU
James Campbell
Wed Nov 19 02:27:15
this halloween, I’m going as the chain of thought
researcher @OpenAI | prev CMU
James Campbell
Sat Nov 01 06:33:33
Mira, Ilya, Elon, Sam, and Dario are now all competing with each other for AGI despite all having worked together at OpenAI just a few years ago
researcher @OpenAI | prev CMU
James Campbell
Thu Oct 17 01:39:34
Mira, Ilya, Elon, Sam, and Dario are now all competing with each other for AGI despite all having worked together at OpenAI just a few years ago
researcher @OpenAI | prev CMU
James Campbell
Thu Oct 17 01:39:34
Mira, Ilya, Elon, Sam, and Dario are now all competing with each other for AGI despite all having worked together at OpenAI just a few years ago
researcher @OpenAI | prev CMU
James Campbell
Thu Oct 17 01:39:34
Previous
1
Next