220 likes | 632 Views
Multi-agent Systems & Reinforcement Learning. A Presentation. What. Artificial Intelligence -> Distributed Artificial Intelligence Concerned with information management issues and distributed/parallel problem solving Distributed Artificial Intelligence -> Multi-agent Systems
E N D
Multi-agent Systems &Reinforcement Learning A Presentation
What • Artificial Intelligence -> Distributed Artificial Intelligence • Concerned with information management issues and distributed/parallel problem solving • Distributed Artificial Intelligence -> Multi-agent Systems • Different problem solving agents with their own interests and goals
Why • Some of the trends in computing • Ubiquity, interconnection, intelligence, delegation. • The Internet of Things,self-steering cars, home automation devices. • What advantages does it offer over the alternatives? • In what circumstances is it useful?
Answers • Parallelism • Robustness • Fault-tolerance • Scalability • Simpler programming • Not for situations where parallel action is not possible and there is no action uncertainty.
Multi-Agent Systems • Two main dimensions: • Agent Heterogeneity • Amount of communication among agents • Multi-agent scenarios • Homogeneous non-communicating agents • Heterogeneous non-communicating agents • Homogeneous communicating agents • Heterogeneous communicating agents
Homogeneous Non-Communicating • Issues • Reactive vs. Deliberative agents • Local vs. Global perspective • Modeling other agents’ states • How to affect others • Techniques • Reactive behaviors for formation maintenance • Local knowledge sometimes better • Recursive Modeling Method • Don’t model others – Just pay attention to reward • Stigmergy
Heterogeneous Non-Communicating • Issues • Benevolence vs. Competitiveness • Stable vs. evolving (arms race, credit/blame) • Modeling of others’ goals, actions, and knowledge • Social conventions • Roles • Techniques • Game theory, iterative play • Minimax-Q • Competitive co-evolution • Deduce intentions through observation • Autoepistemic reasoning (ignorance) • Model as a team (individuals follow roles) • Focal points/Emergent conventions • Design agents play different roles
Heterogeneous Communicating • Issues • Understanding each other • Planning communicative acts • Benevolence vs. competitiveness • Commitment/decommitment • Truth in communication • Techniques • Language Protocols: CL, ACL, KQML • Speech acts • Learning social behaviors • Multi-agent Q-learning • Training other agents’ Q-functions • Contract nets for electronic commerce/Market-based systems • Belief/Desire/Intention (BDI) models • Coalitions • Reasoning about truthfulness
The End • Thanks for listening • I skipped a lot of material • Multiagent Systems: A Survey from a Machine Learning Perspective, Peter Stone and Manuela Veloso, December 4, 1997 • No programming segment • Questions time