LogoThread Easy
  • Explorar
  • Componer hilo
LogoThread Easy

Tu compañero integral para hilos de Twitter

© 2025 Thread Easy All Rights Reserved.

Explorar

Newest first — browse tweet threads

Keep on to blur preview images; turn off to show them clearly

2021到2025年,代码大型语言模型(Code-LLMs)及相关生态系统的演变概览。

2021到2025年,代码大型语言模型(Code-LLMs)及相关生态系统的演变概览。

AI驱动的代码生成中,编程开发和研究领域的演变。

avatar for Yangyi
Yangyi
Wed Dec 03 06:16:07
Incidentally, I only now noticed that DeepSeek's «Generative Reward Model (GRM) empowered by Self-Principled Critique Tuning (SPCT)» checkpoints were released at some point.
This still used Gemmas and V2-lite + V2.5 as teacher. Imagine how good their GRMs are when based on V3.2.

Incidentally, I only now noticed that DeepSeek's «Generative Reward Model (GRM) empowered by Self-Principled Critique Tuning (SPCT)» checkpoints were released at some point. This still used Gemmas and V2-lite + V2.5 as teacher. Imagine how good their GRMs are when based on V3.2.

We're in a race. It's not USA vs China but humans and AGIs vs ape power centralization. @deepseek_ai stan #1, 2023–Deep Time «C’est la guerre.» ®1

avatar for Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Wed Dec 03 06:10:37
RT @VoidAsuka: Claude buying Bun. It hit me today: the AI era is compressing the entire developer-tooling stack into the hands of a few pla…

RT @VoidAsuka: Claude buying Bun. It hit me today: the AI era is compressing the entire developer-tooling stack into the hands of a few pla…

ai agents @hud_evals | owned @AIHubCentral (1 million users,acq.) ex climate protester🦦 dont do the deferred life plan

avatar for Minh Nguyen✈️NeurIPS
Minh Nguyen✈️NeurIPS
Wed Dec 03 06:05:56
GRM was a big deal. Few paid attention, even though it was close to DeepSeek's post-R1 hype wave. Math-V2, V3.2 use its ideas. This will be slowly acknowledged. Innovations are remarkably slow to diffuse.

GRM was a big deal. Few paid attention, even though it was close to DeepSeek's post-R1 hype wave. Math-V2, V3.2 use its ideas. This will be slowly acknowledged. Innovations are remarkably slow to diffuse.

We're in a race. It's not USA vs China but humans and AGIs vs ape power centralization. @deepseek_ai stan #1, 2023–Deep Time «C’est la guerre.» ®1

avatar for Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Wed Dec 03 06:05:53
Give a man his dream job and you will sleep well with dreams. Running YC was @garrytan’s dream. Happy for my brother who is clearly killing it.

Give a man his dream job and you will sleep well with dreams. Running YC was @garrytan’s dream. Happy for my brother who is clearly killing it.

Founder: @mixpanel Pizzatarian, programmer, music maker

avatar for Suhail
Suhail
Wed Dec 03 06:05:09
This is basically a list of labs without the Mandate, fast-followers.
> RLVR gives small open-source models an edge
Always overviews of the last year's alpha. GRM not cited.
Math-V2 doesn't seem "very novel". I think it'll matter as much as Math-V1, in retrospect.

This is basically a list of labs without the Mandate, fast-followers. > RLVR gives small open-source models an edge Always overviews of the last year's alpha. GRM not cited. Math-V2 doesn't seem "very novel". I think it'll matter as much as Math-V1, in retrospect.

GRM was a big deal. Few paid attention, even though it was close to DeepSeek's post-R1 hype wave. Math-V2, V3.2 use its ideas. This will be slowly acknowledged. Innovations are remarkably slow to diffuse.

avatar for Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Wed Dec 03 06:02:42
  • Previous
  • 1
  • More pages
  • 1729
  • 1730
  • 1731
  • More pages
  • 5634
  • Next