insane. Competing with Qwen 3 (latest) while using ~6T tokens, and all in the open. They need to refine OLMoE recipe now, and they may well become a mainstream choice for research.
Cargando el detalle del hilo
Obteniendo los tweets originales de X para ofrecer una lectura limpia.
Esto suele tardar solo unos segundos.