LogoThread Easy
  • Explorar
  • Componer hilo
LogoThread Easy

Tu compañero integral para hilos de Twitter

© 2025 Thread Easy All Rights Reserved.

Explorar

Newest first — browse tweet threads

Keep on to blur preview images; turn off to show them clearly

Google 玩明白了

Google 玩明白了

行道途中。非求速成,惟求通达。 2023 年扎进AI ,打通Know-How,不少赚钱项目,踩过坑,也见过光。 围城里待得够久了,出来聊聊世界,聊聊技术、聊聊赚钱。

avatar for 凡人小北
凡人小北
Fri Nov 07 11:21:36
RT @bradenjhancock: Today we announced Slingshots // ONE, the first batch of Laude Slingshots grant recipients. And dang — it’s an impressi…

RT @bradenjhancock: Today we announced Slingshots // ONE, the first batch of Laude Slingshots grant recipients. And dang — it’s an impressi…

Asst professor @MIT EECS & CSAIL (@nlp_mit). Author of https://t.co/VgyLxl0oa1 and https://t.co/ZZaSzaRaZ7 (@DSPyOSS). Prev: CS PhD @StanfordNLP. Research @Databricks.

avatar for Omar Khattab
Omar Khattab
Fri Nov 07 11:20:54
RT @getpy: DSPyWeekly Issue #10 is live! 

This issue covers:

- Articles: Bay Area DSPy Meetup, new papers (E-CARE, Agent-Omni), using ReA…

RT @getpy: DSPyWeekly Issue #10 is live! This issue covers: - Articles: Bay Area DSPy Meetup, new papers (E-CARE, Agent-Omni), using ReA…

Asst professor @MIT EECS & CSAIL (@nlp_mit). Author of https://t.co/VgyLxl0oa1 and https://t.co/ZZaSzaRaZ7 (@DSPyOSS). Prev: CS PhD @StanfordNLP. Research @Databricks.

avatar for Omar Khattab
Omar Khattab
Fri Nov 07 11:20:42
The new design is more cluttered, but I think it is more useful. 

Which do you think is better?

The new design is more cluttered, but I think it is more useful. Which do you think is better?

Productivity meets minimalism. Building, designing, and sharing my solutions.

avatar for Easlo
Easlo
Fri Nov 07 11:20:10
> the curvature for memorized training points is much sharper than non-memorized, meaning ordering weight components from high to low curvature can reveal a distinction without explicit labels
I've been talking of «spiky singularities» forever, they treat this formally

> the curvature for memorized training points is much sharper than non-memorized, meaning ordering weight components from high to low curvature can reveal a distinction without explicit labels I've been talking of «spiky singularities» forever, they treat this formally

We're in a race. It's not USA vs China but humans and AGIs vs ape power centralization. @deepseek_ai stan #1, 2023–Deep Time «C’est la guerre.» ®1

avatar for Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Fri Nov 07 11:20:07
Enough geopolitics, this is an extremely interesting advance in mech interpretability. Goodfire uses loss curvature analysis to decompose memorized and generalized structures on the level of weight-space, and suppress memorization via weight-editing.

Enough geopolitics, this is an extremely interesting advance in mech interpretability. Goodfire uses loss curvature analysis to decompose memorized and generalized structures on the level of weight-space, and suppress memorization via weight-editing.

> the curvature for memorized training points is much sharper than non-memorized, meaning ordering weight components from high to low curvature can reveal a distinction without explicit labels I've been talking of «spiky singularities» forever, they treat this formally

avatar for Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Fri Nov 07 11:17:46
  • Previous
  • 1
  • More pages
  • 573
  • 574
  • 575
  • More pages
  • 2111
  • Next