The Concrete Substrates of Abstract Rule Use

The Concrete Substrates of Abstract Rule Use Bradley Love www.ccc.utexas.edu

Three Domains • Tracking basic statistics in our environment. • Learning seemingly abstract rules. • Learning and reasoning about future rewards in dynamic environments.

Basic Story • Learning is constrained by • non-rational mechanisms • the nature of domain representations • trial-by-trial updates with regards to current representations in memory. • Seemingly abstract or rational explanations prove not to be upon closer inspection.

Tracking Basic Statistics • Variance • Mean

Tracking Variability

Rational Models • Fried and Holyoak (1984) • Maximum likelihood • unbiased estimator/consistent • Tenenbaum and Griffiths (2001) • Bayesian updating • Globally Bayesian

Tracking Variability

Obvious Mechanistic Approach • Error driven learning of cluster position and variance. • updates are trial by trial • are with regard to a memory representation of each category

Formally..

Mechanistic Model

Rational or Mechanistic? • Trial by trial error-driven learning makes local updates. • all statistics are local in error-driven learning • We can use this to disentangle the competing explanations.

Tracking Variability (locally)

What about the mean?

Tracking Statistics • People appear to update with regards to representations in memory in a trial-by-trial fashion. • Learning is error-driven and sensitive to the error term (i.e., task goal) • we see this in many domains (e.g., inference vs. classification learning)

Verbal Rules? • Certainly, we can implement and report strategies, but that doesn’t imply categories are represented as rules. • Are rules more concrete than they appear?

Are people really using rules? • Rules aren’t always rules (e.g., Allen & Brooks, 1991; Ramscar, 2002). • One alternative is clusters with selective attention. • more schema-like

Value 2 2 1 1 1 2 A 2 2 1 2 2 B 2 1 2 1 1 B 2 2 2 1 1 B 2 1 1 2 2 B Value 1 1 1 1 2 1 B 1 2 1 2 2 A 1 1 2 1 1 A 1 2 2 1 1 A 1 1 1 2 2 A 1 2 1 1 1 A 1 1 2 2 2 A 1 1 2 1 2 A 1 2 1 2 1 A Sakamoto and Love (2004)

Value 2 2 1 1 1 2 A 2 2 1 2 2 B 2 1 2 1 1 B 2 2 2 1 1 B 2 1 1 2 2 B Value 1 1 1 1 2 1 B 1 2 1 2 2 A 1 1 2 1 1 A 1 2 2 1 1 A 1 1 1 2 2 A 1 2 1 1 1 A 1 1 2 2 2 A 1 1 2 1 2 A 1 2 1 2 1 A Sakamoto and Love (2004) Rule Route If small, then A. If large, then B. Exception Route memorize item

Following-A Violating-B Following-A

“Abstract” Rules • Abstract rules are not specified by fixed-values. • e.g., learning an abstract notion of same and different • Could abstract responding be grounded in concrete episodes and trial-by-trial learning?

Exemplar Model Kruschke’s (1992) ALCOVE

Category association Learns association and attention weights Luminance Size

Attention Learns association and attention weights Luminance Size

Adding Structure: Chase Chase Chased Chased Chaser Chaser

one-to-one correspondence Chase Chase Chased Chased Chaser Chaser One or the other, not both

Parallel Connectivity Chase Chase Perfect relational match, but feature mismatches. Parallel connectivity satisfied.

Parallel Connectivity Chase Chase Two relational mismatches, but high feature match. Parallel Connectivity violated.

Disambiguation: Attention Chase Chase Attention determines the trade-off.

BRIDGES

Do, Do, Re (pause) Me, Me, Fa (pause) Ti, Ti, Sa (pause) ... Learning Abstract Rules • Marcus et al. showed infants can learn to distinguish simple grammars. • AAB pattern vs. ABB pattern • No diagnostic features • Infants habituated to one grammar, then tested with novel sounds from both grammars

Representation • Attention on position constrained • Only type-token relation used a a b Type_of( , ) Type_of( , ) Type_of( , ) a1 A a2 A b1 B

Same grammar • Relations: perfect similarity, parallel connectivity preserved a a b c c d Type_of( , ) Type_of( , ) Type_of( 1, ) Type_of( , ) Type_of( , ) Type_of( , ) a1 A c1 C a2 A c2 C b B d1 D

Opposite grammar • Relations: 1 mismatch, parallel connectivity not preserved a a b e f f Type_of( , ) Type_of( , ) Type_of( , ) Type_of( , ) Type_of( , ) Type_of( , ) a1 A e E a2 A f1 F b1 B f2 F

Learning • Attention shifts from inconsistent features to predictive relations • Concrete exemplar-based memory and attention shifting to relations explains performance

Lurking Concreteness in Abstract Concepts • Pigeons appear to learn the concept of same and different. Same Different Young, Wasserman and others - 1997, 2001, 2004

Experimental Details • 16 “Same” grids, 16 “Different” grids -> 4 x 4 • Pigeons trained to peck green for same, red for different • Then tested with novel icons and old icons • accuracy was 83% with old icons, 71% with new BRIDGES captures this pattern...

Concrete basis revealed in graded structure Shades of grey Same Different

Determining relational similarity Shades of grey Match Match Same Different Mismatch Mismatch 13 Mismatches 2 Mismatches

Relational categories are graded Shades of grey Shades of grey Same Different

Relational categories are graded Shades of grey Same Different

The Concrete Substrates of Abstract Rule Use