Thread Easy

Your complete partner for Twitter threads

© 2025 Thread Easy All Rights Reserved.

Explore

Newest first — browse tweet threads

It's crazy how conservative I was, for all my fanboyism (then again I expected it to come sooner, around V3.2). They got most of the way to this target just with better post-training. It's now reasonable to expect that V4 will be stronger than Gemini 3/GPT 5.1 on most stuff

We're in a race. It's not USA vs China but humans and AGIs vs ape power centralization. @deepseek_ai stan #1, 2023–Deep Time «C’est la guerre.» ®1

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Thu Dec 04 19:47:49
It’s actually funny to mentally adjust to different training size/shapes after spending so much time optimizing the sub-350m space. Have to find new ways to let the data breathe.

Artisanal baker of reasoning models @pleiasfr

Alexander Doria
Thu Dec 04 19:46:03
RT @CarinaLHong: There is beauty in growing your student into your collaborator. 

Sir Hardy to Ramanujan. 
Professor Ono to me. 
And now,…

Market Design/Entrepreneurship Professor @HarvardHBS & Faculty Affiliate @Harvard Economics; Research @a16zcrypto; Editor @restatjournal; Econ @Quora; … | #QED

Scott Kominers
Thu Dec 04 19:45:50
RT @EERandomness: @elonmusk The thermodynamics of the nozzle cooling are so perfectly balanced that it doesn't melt but also doesn't form i…

All Things Engineering. Electrical, Mechanical, Software, Firmware, AI, Security and everything in between. Specialize in custom HW/FW/SW for motor control

Engineering Randomness
Thu Dec 04 19:44:42
I’m seeing less and less the point with distillation for small models. If you go full synthetic anyway, selectively target for task and memorization >>>> fuzzy compression.

Artisanal baker of reasoning models @pleiasfr

Alexander Doria
Thu Dec 04 19:41:58
RT @Jimmy_JingLv: The day after Bun was acquired by Anthropic,
my bun install got stuck and wouldn't move at all.

🤣 What kind of coincidence is this......

🚧 building https://t.co/AJfZ3LMlgq https://t.co/606cFUoda3 https://t.co/s0m0tpQMDH https://t.co/UQ5vrrYdAG 🐣learning/earning while helping others ❤️making software, storytelling videos 🔙alibaba @thoughtworks

吕立青_JimmyLv (闭关ing) 2𐃏25
Thu Dec 04 19:40:49