Exploration Hacking: Can LLMs Learn to Resist RL Training?alignmentforum.org2 pointsProf_Sigmunda month ago