Multi-Agent RL — CTDE Framework

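CTDE (Centralized Training, Decentralized Execution): agents share information only during training, typically through a centralized critic that sees every agent's observations and actions; at execution time each agent acts on its own local observation alone.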
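A minimal sketch of the information flow in CTDE, not a full training algorithm. The class names (`Actor`, `CentralCritic`), the linear scoring, the binary action space, and the dimensions are all assumptions chosen for illustration; no learning update is performed. The point is only which component sees what: the critic takes the joint observations and actions, while each actor takes its own local observation.

```python
import random


class Actor:
    """Decentralized policy (hypothetical): sees only its own local observation."""

    def __init__(self, obs_dim, seed):
        rng = random.Random(seed)
        self.w = [rng.uniform(-1, 1) for _ in range(obs_dim)]

    def act(self, local_obs):
        # Linear score thresholded into a binary action (0 or 1).
        score = sum(w * o for w, o in zip(self.w, local_obs))
        return 1 if score > 0 else 0


class CentralCritic:
    """Centralized critic (hypothetical): sees all observations and all actions."""

    def __init__(self, joint_dim, seed):
        rng = random.Random(seed)
        self.w = [rng.uniform(-1, 1) for _ in range(joint_dim)]

    def q_value(self, all_obs, all_actions):
        # Flatten every agent's observation, then append every agent's action.
        joint = [x for obs in all_obs for x in obs] + [float(a) for a in all_actions]
        return sum(w * x for w, x in zip(self.w, joint))


def training_step(actors, critic, all_obs):
    """Training time: the critic scores the joint observation-action pair."""
    actions = [actor.act(obs) for actor, obs in zip(actors, all_obs)]
    q = critic.q_value(all_obs, actions)  # would drive the actor updates
    return actions, q


def execute(actors, all_obs):
    """Execution time: no critic, each actor uses only its own observation."""
    return [actor.act(obs) for actor, obs in zip(actors, all_obs)]


# Illustrative sizes, not taken from the source.
N_AGENTS, OBS_DIM = 2, 3
actors = [Actor(OBS_DIM, seed=i) for i in range(N_AGENTS)]
critic = CentralCritic(N_AGENTS * OBS_DIM + N_AGENTS, seed=99)

obs_rng = random.Random(7)
all_obs = [[obs_rng.uniform(-1, 1) for _ in range(OBS_DIM)] for _ in range(N_AGENTS)]

train_actions, q = training_step(actors, critic, all_obs)
exec_actions = execute(actors, all_obs)
```

Because the actors never read the critic's inputs, `execute` yields the same actions as `training_step` for the same observations; the critic can be discarded once training ends.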