Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning

Heykuki News

239 points

2 years ago

28 comments

Threaded

Loading comments...

Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning | Heykuki News