A 164-parameter architecture beats a 6.5M transformer on SCAN by 94 points (github.com/Elgoghel) | 2 points | Elgoghel | 2 months ago