RT @MatharyCharles: Check out our spotlight work at NeurIPS! DiLoCo and related methods are critical for scaling LLM training across the wo…
Loading thread detail
Fetching the original tweets from X for a clean reading view.
Hang tight—this usually only takes a few seconds.