RT @MustafaShukor1: VL-JEPA is out! A non-generative vision-language model, based on JEPA. Different from typical data-space autoregressive…
Loading thread detail
Fetching the original tweets from X for a clean reading view.
Hang tight—this usually only takes a few seconds.