Visual Reasoning and Tool Use Double GPT-5's Arc-AGI-2 Success Rate (github.com/zoecarver)
2 points by zoecarver, 10 months ago