Model-based meta reinforcement learning for alchemy