Towards Truly Open-Ended Reinforcement Learning