Open weight LLMs exhibit inconsistent performance across providerssimonwillison.net6 pointsneehao10 months ago