Participants learn the latent rules that define action-reward associations. During the tutorial, participants are told that rewards are stochastic, that a single action can garner rewards in multiple states, and that multiple actions can be rewarded in a single state. On each trial, participants encounter an 'alien artefact' activated with one of four actions. The main task comprises two blocks. During block 1, participants learn the latent states from an initial set of examples. The transition to block 2 occurs without notice to the participant. During block 2, new examples are introduced that differ in a previously non-discriminative feature (left).