On solving cooperation with multi-agent reinforcement learning