Horner, J. M.; Staddon, John
(Elsevier, 1987)
When subjects must choose repeatedly between two or more alternatives, each of
which dispenses reward on a probabilistic basis (two-armed bandit), their behavior
is guided by the two possible outcomes, reward and nonreward. ...