Performance of a large language model on the reasoning tasks of a physician (science.org)
10 points | kakoni | 2 months ago