开启时会模糊预览图,关闭后正常显示

💡 挖掘开源的价值 🧑🏻💻 坚持分享 GitHub 上高质量、有趣、实用的教程、AI工具、前沿 AI 技术 🧐 A list cool, interesting projects of GitHub. ✏️ 公众号:GitHubDaily


✦ Indie Hacker / AI Maker / Full Stacker ✦ Founder of https://t.co/HDnzUGieag(DR 75) & https://t.co/t6DoP7ODNe & https://t.co/YuOLvgIStF & https://t.co/ZvHVC3guiZ


独立开发者 | 个人IP教练 | 帮助新手在X上完成早期成长| 公众号:PandaTalk8


Root node of the web of threads: https://t.co/ifH80GcLpo

![huh @voooooogel @norvid_studies
Appears it not induction heads. I wrote a little script that tests for induction capability (just, accuracy on latter half of a bunch of sequences of the form [a,b,c,d,e,f, a,b,c,d,e,f] where a,b,c etc are random tokens)
And there is a distinct phase change at around 600 steps, where it learns induction. But that's a while after the second loss bump! huh @voooooogel @norvid_studies
Appears it not induction heads. I wrote a little script that tests for induction capability (just, accuracy on latter half of a bunch of sequences of the form [a,b,c,d,e,f, a,b,c,d,e,f] where a,b,c etc are random tokens)
And there is a distinct phase change at around 600 steps, where it learns induction. But that's a while after the second loss bump!](/_next/image?url=https%3A%2F%2Fpbs.twimg.com%2Fmedia%2FG7uK6m7XcAAXR9m.jpg&w=3840&q=75)
Interests: AI (Safety), meditation, philosophy, mathematics, algorithms If I say something you disagree with, please dm or quote tweet. I love to argue!


Root node of the web of threads: https://t.co/ifH80GcLpo
