Multi-Agent RL — CTDE Framework
Agents:
3
Comm bandwidth:
60%
Start
Stop
Reset
CTDE: Centralized Training, Decentralized Execution — agents share info during training only