Keep on to blur preview images; turn off to show them clearly

Co-founder @ndea. Co-founder @arcprize. Creator of Keras and ARC-AGI. Author of 'Deep Learning with Python'.


AI is cool i guess


Performance is strong across the board: 55.6% on SWE-Bench Pro, 52.9% or ARC-AGI-2, 40.3% on Frontier Math.


AI is cool i guess


For example, GDPval measures how often industry experts prefer the model's output to the output of other industry experts. GPT-5.2 gets a 70% (beat or tie); GPT-5 got a 38%. Try it to makes slides, spreadsheets, code, and much more.


making models learn • eXperiments lab
