Quantitative AI progress needs accurate and transparent evaluationmathstodon.xyz209 pointsbertmana year ago