Thread Easy

Your All-in-One Twitter Thread Companion

© 2026 Thread Easy All Rights Reserved.

Explore

Newest first — browse tweet threads

To my surprise, Opus 4.5 one-shot my hardest λ-calculus problem (tying with Gemini 3), and it did solve the stack underflow bug that an old checkpoint of Gemini 3 (NOT the deployed version) solved. So, in terms of first hour impression, that couldn't be more promising I guess...

Keep in mind these prompts were all over X last week. Could Opus be accessing the internet? So take this with a grain of salt. I'll design new variations and harder prompts when I find some time, possibly this weekend

Taelin
Mon Nov 24 19:40:40
RT @joshdbirdwell: Our team got to go to the @aiDotEngineer code conference this past weekend. We learned a lot. Solid talks and conversati…

achieve ambition with intentionality, intensity, & integrity - @dxtipshq - @sveltesociety - @aidotengineer - @latentspacepod - @cognition + @smol_ai

swyx
Mon Nov 24 19:40:08
This isn't very good
Gemini 3.5 will obliterate it
Dario's b2b fortress is strong but not impregnable
Inflection, though incomparably wretched, gave up in a similar situation - almost reached the frontier, outmatched on compute
Nevertheless, "Ant is dead" postponed by 1 more year

We're in a race. It's not USA vs China but humans and AGIs vs ape power centralization. @deepseek_ai stan #1, 2023–Deep Time «C’est la guerre.» ®1

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞)
Mon Nov 24 19:38:44
wish this had made it in time for AIE CODE but it’s out now!

have been testing kevlar internally for a few weeks and people are VERY excited - this thing destroys @cognition’s internal held out benchmarks and is a notable step up in SOTA. Devin only gets an upgrade with step ups so am excited to see this roll out to everyone today.

achieve ambition with intentionality, intensity, & integrity - @dxtipshq - @sveltesociety - @aidotengineer - @latentspacepod - @cognition + @smol_ai

swyx
Mon Nov 24 19:38:01
Another cool piece of work from our team! Karpathy’s original quest of controlling a computer with pixels in, keyboard+mouse out can now be done with just a 7B model!

Mostly research and code. If universe is an optimizer, what is its loss function? All opinions are my own.

Shital Shah
Mon Nov 24 19:32:47