Analyzing OpenAI's Reinforcement Fine-Tuning: Less Data, Better Resultsopenpipe.ai4 pointskcorbitta year ago