Thread Easy
Explore
Newest first; browse threads as cards
basically the only two evals I care about at this stage of the game are: - how reliably can it do 30min+ tasks without messing up - how much do the CoTs make me feel the AGI (this post isn’t meant as a slant against Gemini 3 in any way, looks like a strong model and congrats to the team on the launch!)
researcher @OpenAI | prev CMU
James Campbell
Wed Nov 19 02:27:15
this halloween, I’m going as the chain of thought
researcher @OpenAI | prev CMU
James Campbell
Sat Nov 01 06:33:33
Mira, Ilya, Elon, Sam, and Dario are now all competing with each other for AGI despite all having worked together at OpenAI just a few years ago
researcher @OpenAI | prev CMU
James Campbell
Thu Oct 17 01:39:34