LogoThread Easy
  • 探索
  • 撰写 Thread
LogoThread Easy

您的一体化 Twitter 线程助手

© 2025 Thread Easy All Rights Reserved.

探索

最新在前,按卡片方式浏览线程

开启时会模糊预览图,关闭后正常显示

DPO was the most effective decelerationist paper ever written, but by accident; tons of academic time was spent on slightly different variants of it instead of building infrastructure for policy gradients at scale. PauseAI people could never

DPO was the most effective decelerationist paper ever written, but by accident; tons of academic time was spent on slightly different variants of it instead of building infrastructure for policy gradients at scale. PauseAI people could never

ML researcher (@primeintellect), speculator • extremely silly jester

avatar for kalomaze
kalomaze
Sun Nov 09 02:14:56
DPO was the most effective decelerationist paper ever written, but by accident; tons of academic time was spent on slightly different variants of it instead of building infrastructure for policy gradients at scale. PauseAI people could never

DPO was the most effective decelerationist paper ever written, but by accident; tons of academic time was spent on slightly different variants of it instead of building infrastructure for policy gradients at scale. PauseAI people could never

ML researcher (@primeintellect), speculator • extremely silly jester

avatar for kalomaze
kalomaze
Sun Nov 09 02:14:56
  • Previous
  • 1
  • Next