Keep on to blur preview images; turn off to show them clearly

Have questions, or building something cool with Cloudflare's Developer products? We're here to help. For help with your account please try @CloudflareHelp


AI researcher & teacher @SCAI_ASU. Former President of @RealAAAI; Chair of @AAAS Sec T. Here to tweach #AI. YouTube Ch: https://t.co/4beUPOmf6y Bsky: rao2z


4/ The new version can be found at https://t.co/4LGWfiCZ5e These results will also be presented by the lead authors @karthikv792 @kayastechly & @PalodVardh12428 at #NeurIPS2025 workshops on LAW, ForLM and Efficient Reasoning next week. Please stop by and chat!


AI researcher & teacher @SCAI_ASU. Former President of @RealAAAI; Chair of @AAAS Sec T. Here to tweach #AI. YouTube Ch: https://t.co/4beUPOmf6y Bsky: rao2z


2/ We also look at the effect of DeepSeek R1-style RL on the trace validity--to see if RL improves trace validity of the base model. The results show that RL is basically neutral on trace validity. It improves solution accuracy even in the case of model trained on 100% swapped traces, without increasing trace validity.

If you're interested in being a future sponsor, reach out here: https://t.co/5Bf3JvyPIx
