OpenAI’s o3 system scored 85% on the ARC-AGI benchmark, well above the previous best AI score of 55% and on par with the average human score.
Large Language Models, like Google's PaLM and OpenAI's GPT-4, are at the forefront of AI conversation, capable of generating ...