Thread Easy

Your all-in-one Twitter thread assistant

© 2025 Thread Easy All Rights Reserved.

Explore

Browse threads as cards, newest first

For instance, you don't need RL on something you have verifiably true data for that doesn't use any trajectory to get to the outcome for. 

aka single turn, no reasoning math problems for instance

Cofounder and Head of Post Training @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE

Teknium (e/λ)
Mon Nov 03 06:00:35
A brain is a machine for dealing with combinatorial explosions.

Professor of computer science at UW and author of '2040' and 'The Master Algorithm'. Into machine learning, AI, and anything that makes me curious.

Pedro Domingos
Mon Nov 03 06:00:34
I kind of want to write a blog post on some of the constraints, limitations, and things of that nature on RL with LLMs

For instance, you don't need RL on something you have verifiably true data for that doesn't use any trajectory to get to the outcome for. aka single turn, no reasoning math problems for instance

Teknium (e/λ)
Mon Nov 03 05:59:55
RT @mddanishyusuf: 📤 Maillayer v2.0 is live on @ProductHunt 

A self-hosted Mailchimp alternative that you buy once, host yourself, and use…

"The Micro Startups Guy" ❯ https://t.co/hwZ0eO0l5D ❯ https://t.co/RkKck3vdIO ❯ https://t.co/PyEJHvxCRn ❯ https://t.co/5hDIulx6OL Sold @nocodeapi for 6 figures

Mohd Danish
Mon Nov 03 05:56:39
RT @BoredElonMusk: We WILL cancel daylight savings time someday.

FOLLOWS YOU. Artificial Intelligence, Cognitive Architectures, Computation. The goal is integrity, not conformity. https://t.co/rFUNzdYXuK

Joscha Bach
Mon Nov 03 05:56:32
Socialists don’t understand compounding.

Former Quant Investor, now building @lumera (formerly called Pastel Network) | My Open Source Projects: https://t.co/9qbOCDlaqM

Jeffrey Emanuel
Mon Nov 03 05:56:20