Post-Training Generative Recommenders with Advantage-Weighted Supervised Tuningnetflixtechblog.com1 pointCharlesW8 months ago