Thread Easy

The all-in-one partner for Twitter threads

© 2026 Thread Easy All Rights Reserved.

Explore

Newest first — browse tweet threads

Every problem we devise to test intelligence can also be solved without intelligence. So we must inspect not just whether a problem is solved, but how quickly, how cheaply, with what prior knowledge, and whether the way it was solved transfers well to other problems.

Sculpting, AI, Philosophy, Coding

Fri Dec 19 16:44:27

Hmm. Or maybe I'm overthinking it and we just use what works to construct something more robust from lots of LLMs.

Sculpting, AI, Philosophy, Coding

Fri Nov 28 16:44:18

NanBanPro image of what I was saying in January. Do AI labs investigate bottom-up training for LLMs much? I don't think embodiment is necessary for intelligence, but it sure would be handy for bottom-up learning... But that's way harder to scale, even if it's a virtual body.

Sculpting, AI, Philosophy, Coding

Fri Nov 28 10:49:48

Still one of the more interesting plots from Artificial Analysis. Makes me pretty intrigued by Grok 4.1 Fast. Here's to more efficiency gains over the next year, so we can get more dots in the top of that green zone!

Sculpting, AI, Philosophy, Coding

Tue Nov 25 08:48:30

Nano Banana Pro still can't count particularly well...
(with the classic @ESYudkowsky prompt)

Sculpting, AI, Philosophy, Coding

Fri Nov 21 10:52:08

This surprised me, given that the translation result without the custom instructions didn't even have em-dashes.

And by 'borderline wrong', I mean that the translation itself was technically correct, but it didn't fit the mood of the original message at all and sounded very depressed.

Sculpting, AI, Philosophy, Coding

Wed Nov 19 09:17:59
