210 likes | 407 Views
Curious Reconfigurable Robots. Kathryn Merrick ITEE Seminar – Friday 29 th August Collaborators: Dayne Schmidt, Elanor Huntington. Overview. Why build curious robots? Background Curious robots for creative play Strengths and limitations Current and future work. Why Build Curious Robots?.
E N D
Curious Reconfigurable Robots Kathryn Merrick ITEE Seminar – Friday 29th August Collaborators: Dayne Schmidt, Elanor Huntington
Overview • Why build curious robots? • Background • Curious robots for creative play • Strengths and limitations • Current and future work
Why Build Curious Robots? • Explore the scalability of existing curiosity algorithms • Build adaptive robots: • Fault tolerant • Able to use tools • Reconfigurable • Creative
Adaptable Robots Big Dog (Boston Dynamics, 2008) (Bongard et al, 2006)
Self-Reconfigurable Robots • Research focuses on: • Hardware • Communication • Navigation (University of Pennsylvania, 2008)
Curious, Developmental Robots • Research focus is on behaviour • Static robot configuration Curious Aibo (Sony CSL, 2003-2008)
Curious, Reconfigurable Robots • Assume a changing robot configuration • Focus on behavioural algorithms
Perceiving the World • World is represented as a variable length string from a grammar, rather than a fixed length vector • World comprises states and actions S <device_data> <device_data> <Sidata><device_data> | ε <Sidata> <s><Sidata> | ε <s> (<label>:<value>) <label> A unique identifier <value> Real number A <actions> <actions> <AjActions><actions> | ε <AjActions> A unique identifier
Modelling Curiosity • Curiosity is modelled as a function of novelty and interest
Curious Reinforcement Learning • Agent learns a mapping of states to actions and utility values • Agent selects best action most of the time and a random action some of the time Reward = Curiosity value
Architecture for a Curious, Reconfigurable Robot Curious Agent Memory Perception Curiosity Learning Activation Agent Layer Abstract Sensor Abstract Actuator Resource Manager Device Layer Device Manager
Curious Robots for Creative Play Explores relationship between structure and behaviour Creative thinking spiral (Resnick, 2007) Imagination, creativity, play, sharing and reflection
Strengths • Robot is reconfigurable – sensors and effectors can be added or removed • New behaviour emerges for new structure • Relationship between structure and behaviour revealed
S2 A2 T2 A1 T1 S3 A3 T3 S1 (a) (b) (c) Limitations • Not exhibiting the same cyclic behaviour seen in simulated agents • Cyclic behaviour harder to measure • Learning takes a long time
Curious Social Force Models for Reconfigurable Robots CDF Project: Dayne Schmidt
Measuring the Performance of Curious Robots • Characterising attention focus using bifurcation diagrams • A(move fwd port 4) • A(move fwd port 6) • A(move bkd port 4) • A(move bkd port 6) • A(stop port 4) • A(stop port 6) With Elanor Huntington
Measuring a Curious, Reconfigurable Robot • Focus of attention shifts as robot is reconfigured • A(move fwd port 4) • A(move bkd port 4) • A(stop port 4) • A(move fwd port 6) • A(move bkd port 6) • A(stop port 6)
s1 w11 w21 s2 A1 w31 s3 A2 w41 s4 A3 w51 s5 Learning Approaches for Curious Robots • Representing the state-action table as a neural network reduces memory requirements • Attention focus limited to simple tasks
Alternatives to Curiosity? • Modelling behaviour cycles as intrinsic reward for learning • In natural systems behaviour cycles occur at biological, cognitive and social levels • Sleep-wake cycle, seasonal, tidal or lunar cycles • Learning cycle, habituation and recovery • Fashion cycles, sociological cycles (Ahlgren and Halberg, 1990)
Behaviour Cycles as Intrinsic Reward • Advantages in natural systems include: • Anticipation, efficiency, competition, navigation • Creativity through exploration • Social self-advancement • Also potential advantages for artificial systems
Conclusions • We have created a curious, reconfigurable robot: • Sensors and effectors can be added or removed • New behaviour emerges for new structure • Ongoing work for: • Understanding curiosity in complex environments • Learning speed and representation • Measurement of emergent behaviour • Alternatives to curiosity as motivation for reconfigurable robots