Quasi-Continuous Decision States in the Leaky Competing Accumulator Model

Quasi-Continuous Decision Statesin the Leaky Competing Accumulator Model Jay McClellandStanford University With Joel Lachter, Greg Corrado, and Jim Johnstonas well as Juan Gao and Marius Usher

Is the rectangle longer toward the northwest or longer toward the northeast?

Longer toward the Northeast! 1.99” 2.00”

A Classical Model of Decision Making:The Drift Diffusion Model of Choice Between Two Alternative Decisions • At each time step a small sample of noisy information is obtained; each sample adds to a cumulative relative evidence variable. • Mean of the noisy samples is +m for one alternative, –m for the other, with standard deviation s. • When a bound is reached, the corresponding choice is made. • Alternatively, in ‘time controlled’ or ‘interrogation’ tasks, respond when signal is given, based on value of the relative evidence variable.

The DDM is an optimal model, and it is consistent with neurophysiology • It achieves the fastest possible decision on average for a given level of accuracy • It can be tuned to optimize performance under different kinds of task conditions • Different prior probabilities • Different costs and payoffs • Variation in the time between trials… • The activity of neurons in a brain area associated with decision making seems to reflect the DD process

Neural Basis of Decision Making in Monkeys (Shadlen & Newsome; Roitman & Shadlen, 2002) RT task paradigm of R&T. Motion coherence anddirection is varied fromtrial to trial.

Neural Basis of Decision Making in Monkeys: Results Data are averaged over many different neurons that areassociated with intended eye movements to the locationof target.

Easy Prob. Correct Hard A Problem with the DDM • Accuracy should gradually improve toward ceiling levels as more time is allowed, even for very hard discriminations, but this is not what is observed in human data. • Two possible fixes: • Trial-to-trial variance in the direction of drift • Evidence accumulation may reach a bound and stop, even if more time is available

Usher and McClelland (2001)Leaky Competing Accumulator Model y2 y1 I1 I2 • Addresses the process of decidingbetween two alternatives basedon external input, with leakage, mutual inhibition, and noise: dy1/dt = I1-gy1–bf(y2)+x1 dy2/dt = I2-gy2–bf(y1)+x2 f(y) = [y]+ • Participant chooses the most active accumulator when the go cue occurs • This is equivalent to choosing response 1 iff y1-y2 > 0 • Let y = (y1-y2). While y1 and y2 are positive, the model reduces to: dy/dt = I-ly+x [I=I1-I2; l = g-b; x=x1-x2]

Wong & Wang (2006) ~Usher & McClelland (2001)

Time-accuracy curves for different |k-b| or |l| |k-b| = 0 |k-b| = .2 |k-b| = .4

Prob. Correct

Kiani, Hanks and Shadlen 2008 Random motion stimuli of different coherences. Stimulus duration follows an exponential distribution. ‘go’ cue can occur at stimulus offset; response must occur within 500 msec to earn reward.

The earlier the pulse, the more it matters(Kiani et al, 2008)

These results rule out leak dominance Still viable X

The Full Non-Linear LCAi Model y1 y2 Although the value of the differencevariable is not well-captured by thelinear approximation, the sign of thedifference is approximated very closely.

Three Studies Related to these Issues • Integration of reward and payoff information under time controlled conditions • Gao, Tortell & McClelland • Investigations of decision making with non-stationary stimulus information • Usher, Tsetsos & McClelland • Does the confidence of a final decision state vary continuously with the strength of the evidence? • Lachter, Corrado, Johnston & McClelland

Timeline of the Experiment

Proportion of Choices toward Higher Reward

Sensitivity varies with time

Three Hypotheses • Reward acts as an input from reward cue onset til the end of the integration period • Reward influences the state of the accumulators before the onset of the stimulus • Reward introduces an offset into the decision

Fits Based on Linear Model

Fits based on full LCAi

Relationship between response speed and choice accuracy

Different levels of activation of correct and incorrect responses in Inhibition-dominant LCA Final time slice errors correct

High-Threshold LCAi

Preliminary Simulation Results

Three Studies Related to these Issues • Integration of reward and payoff information under time controlled conditions • Gao, Tortell & McClelland • Investigations of decision making with non-stationary stimulus information • Usher, Tsetsos & McClelland • Does the confidence of a final decision state vary continuously with the strength of the evidence? • Lachter, Corrado, Johnston & McClelland

Continuous Report of ConfidenceLachter, Corrado, Johnston & McClelland (in progress) Expt 1: Probability bias included; trial could end at any time Expt 2: No probability bias; observers terminate trial when they have determined their best answer

Trial Duration Protocol and performance as a functionof time for participants groupedby performance

Results and Descriptive Model of Data from 1 Participant

Quasi-Continuous Decision States in the Leaky Competing Accumulator Model

Quasi-Continuous Decision States in the Leaky Competing Accumulator Model

Presentation Transcript

Competing Values Model

Quasi-Competitive Model

Accumulator

The continuous Quality Improvement model

Competing Values Model

Introducing The Decision Model

Decision Dynamics and Decision States in the Leaky Competing Accumulator Model

Decision Dynamics and Decision States: the Leaky Competing Accumulator Model

Continuous Model Synthesis

Competing Nation-States?

Constructing the Decision Model

The Leaky Pipeline

Constructing the Decision Model

Decision Dynamics and Decision States in the Leaky Competing Accumulator Model

Resonant states in the shell model

The Unified Model of Quasi-Stellar Objects

Using the Competing Behavior Model in Teacher Education

Electron Cooling in the Accumulator

The Leaky Pipeline

Competing Values Model

Quasi-Competitive Model

Decision Dynamics and Decision States in the Leaky Competing Accumulator Model