Training LLMs to generate text with citations via fine-grained rewardsarxiv.org170 pointsPaulHoule2 years ago