Keep on to blur preview images; turn off to show them clearly

Reasoning models coming (very) soon. Co-founder @pleiasfr

🧑💻 https://t.co/Y30jsaHwz9 $20K/m ⚡️ https://t.co/vatLDmi9UG $17K/m 📈 https://t.co/3EDxln5mdi $16K/m ⭐️ https://t.co/MZc8tG9xWi $8K/m 🧬 https://t.co/SfrVXVtmdA $.5K/m 🍜 https://t.co/r07EpGSYJ2 $0K/m 🧾 https://t.co/7olaOzV8Xd $0/m +18 https://t.co/4zCWHGJp1S


This means MSL is trying ideas and executing with high entropy. Unexpected and interesting.


Now that I finally have controlled synthetic environments, seeing similar trade-off on the pretrain side. Like stacking layers is even more beneficial to some tasks/domains (math) than others.


模型数据


Building @SakanaAILabs 🧠
