> The model couples the Mistral-3 24B parameter vision-language model [Mistral Small 3.2 apparently] with a rectified flow transformer
Loading thread detail
Fetching the original tweets from X for a clean reading view.
Hang tight—this usually only takes a few seconds.