Quantal Response Equilibrium

In QRE, agents choose actions probabilistically proportional to exp(λ·payoff) rather than deterministically choosing the best response. At λ→0 agents choose randomly; at λ→∞ QRE approaches Nash. This captures bounded rationality: mistakes happen, but more costly mistakes happen less often.

Rationality λ2.0
Payoff a (Coord)4.0
Payoff b (Coord)2.0
Punishment c1.0
QRE p(A):
Nash p(A):
QRE q(A):
Nash q(A):