Multi-agent Systems & Reinforcement Learning

Multi-agent Systems &Reinforcement Learning A Presentation

What • Artificial Intelligence -> Distributed Artificial Intelligence • Concerned with information management issues and distributed/parallel problem solving • Distributed Artificial Intelligence -> Multi-agent Systems • Different problem solving agents with their own interests and goals

Why • Some of the trends in computing • Ubiquity, interconnection, intelligence, delegation. • The Internet of Things,self-steering cars, home automation devices. • What advantages does it offer over the alternatives? • In what circumstances is it useful?

Answers • Parallelism • Robustness • Fault-tolerance • Scalability • Simpler programming • Not for situations where parallel action is not possible and there is no action uncertainty.

Multi-Agent Systems • Two main dimensions: • Agent Heterogeneity • Amount of communication among agents • Multi-agent scenarios • Homogeneous non-communicating agents • Heterogeneous non-communicating agents • Homogeneous communicating agents • Heterogeneous communicating agents

The Predator/Prey (“Pursuit”) Domain

Homogeneous Non-Communicating • Issues • Reactive vs. Deliberative agents • Local vs. Global perspective • Modeling other agents’ states • How to affect others • Techniques • Reactive behaviors for formation maintenance • Local knowledge sometimes better • Recursive Modeling Method • Don’t model others – Just pay attention to reward • Stigmergy

Heterogeneous Non-Communicating • Issues • Benevolence vs. Competitiveness • Stable vs. evolving (arms race, credit/blame) • Modeling of others’ goals, actions, and knowledge • Social conventions • Roles • Techniques • Game theory, iterative play • Minimax-Q • Competitive co-evolution • Deduce intentions through observation • Autoepistemic reasoning (ignorance) • Model as a team (individuals follow roles) • Focal points/Emergent conventions • Design agents play different roles

Heterogeneous Communicating • Issues • Understanding each other • Planning communicative acts • Benevolence vs. competitiveness • Commitment/decommitment • Truth in communication • Techniques • Language Protocols: CL, ACL, KQML • Speech acts • Learning social behaviors • Multi-agent Q-learning • Training other agents’ Q-functions • Contract nets for electronic commerce/Market-based systems • Belief/Desire/Intention (BDI) models • Coalitions • Reasoning about truthfulness

The End • Thanks for listening • I skipped a lot of material • Multiagent Systems: A Survey from a Machine Learning Perspective, Peter Stone and Manuela Veloso, December 4, 1997 • No programming segment • Questions time

Multi-agent Systems & Reinforcement Learning