470 likes | 640 Views
Vahid Mokhtari. RoboCup: A Case Study in Multiagent System. Trends. 1. What is an Agent?. 2. Multiagent System. 3. 3. Case study in RoboCup. 4. 4. Contents. Trends in History of Computing. From Programming Perspective. What is an Agent?. Agent and Environment. Environment.
E N D
Vahid Mokhtari RoboCup: A Case Study in Multiagent System
Trends 1 What is an Agent? 2 Multiagent System 3 3 Case study in RoboCup 4 4 Contents
Environment • Accessible vs. Inaccessible - can the agent “see” everything? • Deterministic vs. Non-deterministic - do actions have guaranteed effect? • Static vs. Dynamic - does the environment change on its own? • Discrete vs. Continuous - is the number of actions and percepts finite?
Examples of Intelligent Agents • Assistant agent in MS Office • Trading agents • Web spiders • Computer viruses • Characters in computer games
Why MAS? • Some domains require it • Parallelism • Robustness • Scalability • Simpler programming • To study intelligence • Geographic distribution • Cost effectiveness
MAS Research Area • Distributed Computing: Processors share data, but not control. Focus on low-levelparallelization, synchronization. • Distributed AI: Control as well as data is distributed. Focus on problemsolving, communication, and coordination. • Distributed Problem Solving (DPS): Task decomposition and/or solution synthesis. • Multiagent Systems (MAS): Behavior coordination or behavior management.
Homogeneous Non-Communicating Multiagent Systems • Several different agents with identicalstructure (sensors, effectors, domain knowledge, and decision functions). • Different sensor input and effectors output. • Situated differently in the environment and they make their own decisions regarding which actions to take.
Heterogeneous Non-Communicating Multiagent Systems • Agents are situated differently in the environment • Different sensory inputs and different actions
Homogeneous Communicating Multiagent Systems • Agents are identical that they are situated differently in the environment • Agents can communicate together directly
Heterogeneous Communicating Multiagent Systems • Different sensory data, goals, actions, and domain knowledge
Importance of MAS • Research in “Distributed AI” started over 30 years ago, but only in the mid of 1990s has it become a major research trend in AI. • Now the main conference (AAMAS) attracts around 800 submissions (of which 20-25% get accepted) each year. • In addition, there are dozens of smaller workshops and conferences. it’s a large, young and dynamic research community
RoboCup Case study in Multiagent System
RoboCup Soccer • Distributed • Multiagent • Teammates and adversaries domain • Partial world view • Noisy sensors and actuators • Real-time
The Agent-Environment Interface • Agent and environment interact at discrete time steps: t=0, 1, 2, … • Agent observes state at step t: st S • Produces action at step t: at A(st) • Gets resulting rewards: rt+1 R • And resulting next step: St+1
SARSA (State-Action-Reward-State-Action) • SARSA is a learning algorithm in the reinforcement learning area of machine learning. • On-policy learning method, • It learns state-action values (Q values).
Result keepers hold the ball for about 8.2 seconds on average keepers hold the ball for about 12 seconds on average