Well, «beat» is a strong word, it's definitely more efficient and the base model is at parity, but it'll need quite some work to match Qwen's mature post-training, which they leave for later. But there's plenty of post-training knowledge now.
Loading thread detail
Fetching the original tweets from X for a clean reading view.
Hang tight—this usually only takes a few seconds.
