Monte-Carlo Methods in AI: Overview

Monte-Carlo Methods in AI: Overview Prasad Tadepalli

What is a Monte-Carlo Method? • Any method that relies on repeated random simulations to estimate something • Simplest case: Polling – who wins the election? • True probability of a person voting for Obama is • Ask N = 1000 random registered voters how they vote. • Calculate = #(Obama voters)/1000 • Apply Chernoff’s bound • Key idea: Although people are complex and varied, they can be treated as independent samples of an identical distribution for estimation

Applications • First modern use in simulating nuclear reactions in 1940’s by Stanislaw Ulam • Predicting the behavior of complex systems – weather, finance, fluid dynamics, markets, … • Planning and optimization - • Computer games: Bridge, Go, Solitaire, StarCraft • Optimal path planning in time-sensitive networks • True model either does not exist or is too complicated to reason about

Two Fundamental Problems • Prediction/Inference Problem • Given a probabilistic model of how the world operates (a “Bayesian Network”) and some observed evidence, what can we infer about a particular query variable? • Draw samples of the model where the observed evidence is true • Estimate the number of times the query variable is true • Planning/Optimization Problem • Given a faithful simulator of an environment, how can we use it to choose an optimal action? • Run lots and lots of trials • Combine the evidence in a “smart” way • Output the action that yields best results

Organization • Monday, Tuesday, Wednesday are divided into 2 parts • Mornings • Inference/Prediction Problem (Experiments with Genie) • Application Talk • Afternoons • Planning/Optimization Problem (Experiments with MCP) • Project/Lab (Galcon) • Wednesday evening dinner @5:30, McMenamins, Monroe • Thursday 2 talks plus tournament project work • Tournament code is due: Friday 9 AM. • Friday – Advanced topics, tournaments, student presentations

Monte-Carlo Methods in AI: Overview

Monte-Carlo Methods in AI: Overview

Presentation Transcript

BUFN 722

Introducing Psychology and its Methods

Chapter 6: Birth Control

Iterative Methods for System of Equations

Algorithms for Dynamical Fermions

DFT, Monte-Carlo and classical simulation studies of crystals, surfaces and zeolites

Python Programming: An Introduction to Computer Science

Reverse Stress Testing

Clustering Methods

Particle Filter/Monte Carlo Localization

Mesh Parameterizations

Carlo Colantuoni carlo@illuminatobiotech

Simulation Algorithms for Lattice QCD

Algorithms for Dynamical Fermions

Monte Carlo Detector Simulation

Monte Carlo Detector Simulation

Inference in Bayesian Networks

Monte Carlo Methods in Finance IIM Ahmedabad, Nov 6, 2005 Sandeep Juneja

From Selling Technology to Selling Value (2008)