Curious Reconfigurable Robots

Curious Reconfigurable Robots Kathryn Merrick ITEE Seminar – Friday 29th August Collaborators: Dayne Schmidt, Elanor Huntington

Overview • Why build curious robots? • Background • Curious robots for creative play • Strengths and limitations • Current and future work

Why Build Curious Robots? • Explore the scalability of existing curiosity algorithms • Build adaptive robots: • Fault tolerant • Able to use tools • Reconfigurable • Creative

Adaptable Robots Big Dog (Boston Dynamics, 2008) (Bongard et al, 2006)

Self-Reconfigurable Robots • Research focuses on: • Hardware • Communication • Navigation (University of Pennsylvania, 2008)

Curious, Developmental Robots • Research focus is on behaviour • Static robot configuration Curious Aibo (Sony CSL, 2003-2008)

Curious, Reconfigurable Robots • Assume a changing robot configuration • Focus on behavioural algorithms

Perceiving the World • World is represented as a variable length string from a grammar, rather than a fixed length vector • World comprises states and actions S <device_data> <device_data>  <Sidata><device_data> | ε <Sidata>  <s><Sidata> | ε <s>  (<label>:<value>) <label>  A unique identifier <value>  Real number A <actions> <actions>  <AjActions><actions> | ε <AjActions>  A unique identifier

Modelling Curiosity • Curiosity is modelled as a function of novelty and interest

Curious Reinforcement Learning • Agent learns a mapping of states to actions and utility values • Agent selects best action most of the time and a random action some of the time Reward = Curiosity value

Architecture for a Curious, Reconfigurable Robot Curious Agent Memory Perception Curiosity Learning Activation Agent Layer Abstract Sensor Abstract Actuator Resource Manager Device Layer Device Manager

Curious Robots for Creative Play Explores relationship between structure and behaviour Creative thinking spiral (Resnick, 2007) Imagination, creativity, play, sharing and reflection

Strengths • Robot is reconfigurable – sensors and effectors can be added or removed • New behaviour emerges for new structure • Relationship between structure and behaviour revealed

S2 A2 T2 A1 T1 S3 A3 T3 S1 (a) (b) (c) Limitations • Not exhibiting the same cyclic behaviour seen in simulated agents • Cyclic behaviour harder to measure • Learning takes a long time

Curious Social Force Models for Reconfigurable Robots CDF Project: Dayne Schmidt

Measuring the Performance of Curious Robots • Characterising attention focus using bifurcation diagrams • A(move fwd port 4) • A(move fwd port 6) • A(move bkd port 4) • A(move bkd port 6) • A(stop port 4) • A(stop port 6) With Elanor Huntington

Measuring a Curious, Reconfigurable Robot • Focus of attention shifts as robot is reconfigured • A(move fwd port 4) • A(move bkd port 4) • A(stop port 4) • A(move fwd port 6) • A(move bkd port 6) • A(stop port 6)

s1 w11 w21 s2 A1 w31 s3 A2 w41 s4 A3 w51 s5 Learning Approaches for Curious Robots • Representing the state-action table as a neural network reduces memory requirements • Attention focus limited to simple tasks

Alternatives to Curiosity? • Modelling behaviour cycles as intrinsic reward for learning • In natural systems behaviour cycles occur at biological, cognitive and social levels • Sleep-wake cycle, seasonal, tidal or lunar cycles • Learning cycle, habituation and recovery • Fashion cycles, sociological cycles (Ahlgren and Halberg, 1990)

Behaviour Cycles as Intrinsic Reward • Advantages in natural systems include: • Anticipation, efficiency, competition, navigation • Creativity through exploration • Social self-advancement • Also potential advantages for artificial systems

Conclusions • We have created a curious, reconfigurable robot: • Sensors and effectors can be added or removed • New behaviour emerges for new structure • Ongoing work for: • Understanding curiosity in complex environments • Learning speed and representation • Measurement of emergent behaviour • Alternatives to curiosity as motivation for reconfigurable robots

Curious Reconfigurable Robots

Curious Reconfigurable Robots

Presentation Transcript

Self-Reconfigurable Robots

Concurrent and Real-Time Task Management for Self-Reconfigurable Robots

Multiagent Control of Modular Self-Reconfigurable Robots

Curious

Curious

CURIOUS

CURIOUS!?!?!?

The Curious Incident

Robots!!

Curious George

Curious Math Calculation

Robots

Curious

Curious

Robots

Curious Gir

“Curious Places”

Self-reconfigurable robots

Curious George

CURIOUS

Curious George WebQuest

Curious George