1 / 17

Collaboration Development through Interactive Learning between Human and Robot

Collaboration Development through Interactive Learning between Human and Robot. Tetsuya OGATA, Noritaka MASAGO, Shigeki SUGANO, Jun TANI. Introduction. “Recent” studies about welfare robots or robots as pets attracted lots of attention They must work flexibly and cooperatively with humans

roman
Download Presentation

Collaboration Development through Interactive Learning between Human and Robot

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Collaboration Development through Interactive Learning between Human and Robot • Tetsuya OGATA, Noritaka MASAGO, Shigeki SUGANO, Jun TANI.

  2. Introduction • “Recent” studies about welfare robots or robots as pets attracted lots of attention • They must work flexibly and cooperatively with humans • They would also have a establish relations with people in daily life

  3. The Aims • To demonstrate interactive learning between a human operator and a robot system • Both human and robot are in the role of the learner But... • These sorts of systems are usually difficult to stabilize over long operation times

  4. Previous Work • Most similar studies focus on short operations • Exploring collapse and modification of relationships between people

  5. The Robot • Robovie • 2 arms • 4 DOF • 'Human-like' head • Audiovisual sensors • Many tactile sensors attached to its body

  6. The Environment • A 4x4m course • The outside walls marked alternately red and blue

  7. The Experiment • The human and robot join arms and attempt to travel clockwise through the maze without hitting obstacles • Try to do it in the shortest time • The movement is a combination of the human's influence and the robot's neural network

  8. Limited Senses • Both the human and robot have very limited sensory information • Robot has poor vision, and only local information such as ultrasonic sensors. • It has no global position information • The human has a blindfold on • But can see the space before the experiment begins • Both sides are anticipating future sensory input and generating the next motor commands

  9. The Model • A Recurrent Neural Network (RNN) • The input consists of: • Current sensory input • Current motor values • The output is predictions of: • Next sensory input • Next motor values

  10. Their model can run in one of two modes • It can work in Open Loop Mode which directly maps inputs to outputs • Closed Loop Mode takes the output and puts it straight into the input • Can generate predictions of arbitrary length • Similar to mental rehearsal

  11. Consolidation Learning • When a RNN tries to learn something new, it severely damages everything it already knows. • One way to avoid this: • Save all past teaching data in a database • Add new data • Use all of the data to retrain • Learning time increases with data

  12. Consolidation Learning • Analogous to biology • Temporary memory stored in hippocampus • Consolidated into long-term memory during sleep • New data is stored in a database • The RNN corresponds to the long-term memory • The RNN is trained using both the rehearsed patterns and the sequence of the new experience • This enables the incremental learning without damaging the structure of the RNN

  13. Navigation • In initial stages, performs very badly • Has a collision avoidance system to help with the training • Simplified reinforcement learning for initial training • Robot and human go around workspace • Time measured • If performance is better, train RNN to incorporate new trial

  14. Experiments • A feed forward neural network (FFNN) • A RNN • A RNN with consolidation learning • Trials interlaced with questionnaires meant to judge workloads • Effort, workload, complexity, performance, concentration...

  15. Results • FFNN ultimately deteriorates • RNN ultimately stagnates • RNN with consolidation learning continued to improve

  16. Robustness • The analyze the effect of consolidation learning they compared the conventional RNN to the consolidation learning RNN when subject to noise • Used the closed loop mode and introduced different amounts of noise to the inputs • Consolidation learning proved far more robust • Linked to 'operability' – Robots which don't cope well with noise can seem unwieldy

  17. Collaboration • Miwa showed that human collaboration was developed through repeated phases (Miwa et. al, 2001) • The consolidation-learning method arguably demonstrates these phases • The RNN with consolidation learning might have similarity with human learning

More Related