Keep on to blur preview images; turn off to show them clearly

Data Science Director at Agoda | Ex-TOP-20 on Kaggle | Playing with robots for fun

If you want the full deep dive (with visuals + demo + and fine-tuning instructions), watch my new video: https://t.co/TDdhedJiDn


What changed vs Pi0? - FAST tokenization - there was an optional FAST version on Pi0, but for Pi0.5, it is an essential part of the training - System 2 - following the Hi Robot paper, Pi 0.5 is using its VLM part as a reasoning high-level system 2 to reason and plan complex tasks - Better training recipe and a couple of smaller tweaks.


We open-sourced everything in our solution: the code, model weights, and a detailed tech report. Code: https://t.co/LLSd6VtbaE Weights: https://t.co/f3ZUF175rV Tech report: https://t.co/TeFiiTha0d I will also record a video walkthrough with more details later. Stay tuned 🎥


Data Science Director at Agoda | Ex-TOP-20 on Kaggle | Playing with robots for fun

What is the BEHAVIOR Challenge? In this competition, we had to train a policy that could complete 50 robotic household tasks in a high-quality simulation. The policy controls a bimanual humanoid robot with a mobile base, and the tasks range from 1 to 14 minutes. Read more details in @drfeifei post: https://t.co/jDviv5d6pB
