1.16k likes | 1.18k Views
This talk explores the future of interactive robot theatre, focusing on the integration of machine learning techniques, quantum networks, and software-hardware methodology in humanoid robots. It discusses the design of inexpensive natural size humanoid caricature and realistic robot heads, as well as the importance of robots having a personality expressed through their behavior, facial gestures, emotions, and learned speech patterns.
E N D
Interactive Robot Theatre as a future toy Integration of Machine Learning, Quantum Networks and software-hardware methodology in humanoid robots Marek Perkowski, Dept. Electrical Engineering PSU, and Department of Electronics and Computer Science, Korea Advanced Institute of Science and Technology Talk presented at Department of Electronics, Technical University of Warsaw, December 2004
Talking Robots • Many talking toys exist, but they are still very primitive • Actors for robot theatre, agents for advertisement, education and entertainment. • Designing inexpensive natural size humanoid caricature and realistic robot heads Dog.com from Japan We concentrate on Machine Learning techniques used to teach robots behaviors, natural language dialogs and facial gestures. Work in progress
Robot with a Personality? • Future robots will interact closely with non-sophisticated users, children and elderly, so the question arises, how they should look like? • If human face for a robot, then what kind of a face? • Handsome or average, realistic or simplified, normal size or enlarged? • The famous example of a robot head • is Kismet from MIT. • Why is Kismet so successful? • We believe that a robot that will interact with humans should have some kind of “personality” and Kismet so far is the only robot with “personality”.
Robot face should be friendly and funny The Muppets of Jim Henson are hard to match examples of puppet artistry and animation perfection. We are interested in robot’s personality as expressed by its: • behavior, • facial gestures, • emotions, • learned speech patterns.
Behavior, Dialog and Learning • Robot activity as a mapping of the sensed environment and internal states to behaviors and new internal states (emotions, energy levels, etc). • Our goal is to uniformly integrate verbal and non-verbal robot behaviors. • Words communicate only about 35 % of the information transmitted from a sender to a receiver in a human-to-human communication. • The remaining information is included in para-language. • Emotions, thoughts, decision and intentions of a speaker can be recognized earlier than they are verbalized. NASA
Robot Head Construction, 1999 Furby head with new control Jonas We animate various kinds of humanoid heads with from 4 to 20 DOF, looking for comical and entertaining values.
Mister Butcher Latex skin from Hollywood 4 degree of freedom neck
Robot Head Construction, 2000 Skeleton Alien We use inexpensive servos from Hitec and Futaba, plastic, playwood and aluminum. The robots are either PC-interfaced, use simple micro-controllers such as Basic Stamp, or are radio controlled from a PC or by the user.
Technical Construction, 2001 Details Marvin the Crazy Robot Adam
2001 Virginia Woolf heads equipped with microphones, USB cameras, sonars and CDS light sensors
2002 BUG (Big Ugly Robot) Max Image processing and pattern recognition uses software developed at PSU, CMU and Intel (public domain software available on WWW). Software is in Visual C++, Visual Basic, Lisp and Prolog.
Visual Feedback and Learning based on Constructive Induction 2002
2002, Japan Professor Perky Professor Perky with automated speech recognition (ASR) and text-to-speech (TTS) capabilities • We compared several commercial speech systems from Microsoft, Sensory and Fonix. • Based on experiences in highly noisy environments and with a variety of speakers, we selected Fonix for both ASR and TTS for Professor Perky and Maria robots. • We use microphone array from Andrea Electronics. 1 dollar latex skin from China
Maria, 2002/2003 20 DOF
location of head servos Construction details of Maria skull • location of controlling rods • location of remote servos Custom designed skin
Software/Hardware Architecture • Network- 10 processors, ultimately 100 processors. • Robotics Processors. ACS 16 • Speech cards on Intel grant • More cameras • Tracking in all robots. • Robotic languages – Alice and Cyc-like technologies.
Currently the hands are not moveable. We have a separate hand design project.
HAHOE KAIST ROBOT THEATRE, KOREA, SUMMER 2004 Sonbi, the Confucian Scholar Paekchong, the bad butcher
Yangban the Aristocrat and Pune his concubine The Narrator
We base all our robots on inexpensive radio-controlled servo technology.
We are familiar with latex and polyester technologies for faces
Probabilistic State Machines to describe emotions “you are beautiful” / ”Thanks for a compliment” P=1 Happy state “you are blonde!” / ”I am not an idiot” P=0.3 “you are blonde!” / Do you suggest I am an idiot?” Unhappy state P=0.7 Ironic state
Facial Behaviors of Maria Do I look like younger than twenty three? Maria asks: Response: • “no” • “no” • “yes” 0.7 0.3 Maria smiles Maria frowns
Probabilistic Grammars for performances Speak ”Professor Perky”, blinks eyes twice P=0.1 Speak ”Professor Perky” Where? P=0.3 Who? P=0.5 P=0.5 P=0.5 Speak “in some location”, smiles broadly Speak “In the classroom”, shakes head Speak ”Doctor Lee” What? P=0.1 P=0.1 P=0.1 Speak “Was singing and dancing” P=0.1 Speak “Was drinking wine” ….
Human-controlled modes of dialog/interaction Human teaches “Thanks, I have a lesson” “Hello Maria” “Lesson finished” Robot performs Robot asks “Question” “Stop performance” “Questioning finished” “Command finished” “Thanks, I have a question” “Thanks, I have a command” Human asks Human commands
Robot-Receptionist Initiated Conversation Human Robot What can I do for you? Robot asks This represents operation mode
Robot-Receptionist Initiated Conversation Human Robot What can I do for you? I would like to order a table for two Robot asks
Robot-Receptionist Initiated Conversation Human Robot Smoking or non-smoking? Robot asks
Robot-Receptionist Initiated Conversation Human Robot Smoking or non-smoking? I do not understand Robot asks
Robot-Receptionist Initiated Conversation Human Robot Do you want a table in a smoking or non-smoking section of the restaurant? Non-smoking section is near the terrace. Robot asks
Robot-Receptionist Initiated Conversation Human Robot Do you want a table in a smoking or non-smoking section of the restaurant? Non-smoking section is near the terrace. A table near the terrace, please Robot asks
Human-Initiated Conversation Human Robot Hello Maria initialization Robot asks
Human-Initiated Conversation Human Robot Hello Maria What can I do for you? Robot asks
Human-Asking Human Robot Question Question Robot asks Human asks
Human-Asking Human Robot Yes, you ask a question. Question Human asks
Human-Asking Human Robot Yes, you ask a question. What book wrote Lee? Human asks
Human-Asking Human Robot I have no sure information. What book wrote Lee? Human asks
Human-Asking Human Robot I have no sure information. Try to guess. Human asks
Human-Asking Human Robot Lee wrote book “Flowers”. Try to guess. Human asks
Human-Asking Human Robot Lee wrote book “Flowers”. This is not true. Human asks
Human ends questioning Human-Teaching Human Robot Questioning finished Human asks Human teaches “Questioning finished” Robot asks Thanks, I have a lesson