Lectures prepared by: Elchanan Mossel Yelena Shvets

Lectures prepared by:Elchanan MosselYelena Shvets

Binomial Distribution:Mean • Recall: the Mean(m) • of a binomial(n,p) distribution • is given by: m = #Trials £ P(success) = n p • The Mean = Mode (most likely value). • The Mean = “Center of gravity” of the distribution.

Standard Deviation • The Standard Deviation (SD) of a distribution, denoted s measures the spread of the distribution. • Def:The Standard Deviation (SD)  of the binomial(n,p) distribution is given by:

Example 1: • Let’s compare Bin(100,1/4) to Bin(100,1/2). • 1 = 25, 2 = 50 • 1 = 4.33, 2 = 5 • We expect Bin(100,1/4) to be less spread than Bin(100,1/2). • Indeed, you are more likely to guess the value of Bin(100,1/4) distribution than of Bin(100,1/2) since: Binomial Distribution For p=1/2 : P(50) ¼ 0.079589237; For p=1/4 : P(25) ¼ 0.092132;

Histograms for binomial(100, 1/4) & binomial(100, 1/2);

Example 2: • Let’s compare Bin(50,1/2) to Bin(100,1/2). • 1 = 25, 2 = 50 • 1 = 3.54, 2 = 5 • We expect Bin(50,1/2) to be less spread than Bin(100,1/2). • Indeed, you are more likely to guess the value of Bin(50,1/2) distribution than of Bin(100,1/2) since: Binomial Distribution For n=100 : P(50) ¼ 0.079589237; For n=50 : P(25) ¼ 0.112556;

Histograms for binomial(50, 1/2) & binomial(100,1/2)

,  and the normal curve • We will see today that the • Mean (m) and the • SD (s) give a very good summary of the binom(n,p) distribution via • The Normal Curve with parameters  and .

binomial(n,½); n=50,100,250,500 m=25, s=3.54 m=50, s=5 m=125, s=7.91 m=250, s=11.2

Normal Curve y m - s m m + s x

The Normal Distribution a b x

Standard Normal Standard Normal Density Function: Corresponds to  = 0 and =1

Standard Normal Standard Normal Cumulative Distribution Function: • For the normal (,) distribution: • P(a,b) = F((b-m)/s) - F((a-m)/s);

Standard Normal

Standard Normal -z z

Standard Normal Standard Normal Cumulative Distribution Function: • For the normal (,) distribution: • P(a,b) = F((b-m)/s) - F((a-m)/s); • In order to prove this, suffices to show: • P(-1,a) = F((a-m)/s);

Change of variables

Properties of F: F(0) = 1/2; F(-z) = 1 - F(z); F(-1) = 0; F(1) = 1. does not have a closed form formula!

Normal Approximation of a binomial For n independent trials with success probability p: where:

Normal(m,s); m=25, s=3.54 m=50, s=5 m=125, s=7.91 m=250, s=11.2

Normal Approximation of a binomial For n independent trials with success probability p: • The 0.5 correction is called the “continuity correction” • It is important when  is small or when a and b are close.

Normal Approximation Question: Find P(H>40) in 100 tosses.

Normal Approximation to bin(100,1/2). 1-F((40-50)/5) =1-F(-2) = F(2) ¼ 0.9772 What do we get With continuity correction? Exact = 0.971556

When does the Normal Approximation fail? bin(1,1/2) m=0.5,s=0.5 N(0.5,0.5)

When does the NA fail? bin(100,1/100) m =1, s¼ 1 N(1,1)

Rule of Thumb • Normal works better: • The larger  is. • The closer p is to ½.

Fluctuation in the number of successes. • From the normal approximation it follows that: • P(m-s to m+s successes in n trials) ¼ 68% • P(m-2s to m+2s successes in n trials) ¼ 95% • P(m-3s to m+3s successes in n trials) ¼ 99.7% • P(m-4s to m+4s successes in n trials) ¼ 99.99%

Fluctuation in the proportion of successes. • Typical size of fluctuation in the number of successes is: Typical size of fluctuation in the proportion of successes is:

Square Root Law Let n be a large number of independent trials with probability of success p on each. The number of successes will, with high probability, lie in an interval, centered on the mean np, with a width a moderate multiple of . The proportion of successes, will lie in a small interval centered on p, with the width a moderate multiple of 1/ .

Law of large numbers Let n be a number of independent trials, with probability p of success on each. For each e > 0; P(|#successes/n – p|< e) ! 1, as n !1

Confidence intervals

Conf Intervals

Conf Intervals is called a 99.99% confidence interval.

Normal(m,s); m=25, s=3.54 m=50, s=5 m=125, s=7.91 m=250, s=11.2

bin(50,1/2) + 25 m=50, s =3.535534 bin(50,1/2) m = 25, s = 3.535534 Normal Approximation m=50, s=5 (bin(50,1/2) + 25)* 5/3.535534 m=50, s = 5 bin(100,1/2) m=50, s=5

Notre Dame de Rheims

Quote Bientôt je pus montrer quelques esquisses. Personnen'y comprit rien. Même ceux qui furent favorables à ma perception des vérités que je voulais ensuite graverdans le temple, me félicitèrent de les avoirdécouvertes au « microscope », quand je m'étais aucontraire servi d'un télescope pour apercevoir deschoses, très petites en effet, mais parce qu'elles étaient situées à une grande distance […]. Là où jecherchais les grandes lois, on m'appelait fouilleur dedétails. (TR, p.346)

Lectures prepared by: Elchanan Mossel Yelena Shvets

Lectures prepared by: Elchanan Mossel Yelena Shvets

Presentation Transcript

Yelena Yesha Olga Streltchenko

Language and art: From paleolithic art to writing Four lectures

Top Ten (or 11) EKG Killers

Supply Side--Lectures

Lectures for 3rd Edition

Group behavior, teams and conflicts

Employees Motivation

Lectures for 3rd Edition

CSE544 Transactions: Concurrency Control

Resident Physics Lectures

OGSA-DAI Lectures Part 2

Diseases of the Pituitary Gland

Organization of lectures

Prepared by Dave Smith, D.Min. Spring, TX

Lionel Robbins Lectures, London School Economics, Feb.23-25 Understanding Institutions

Chemical Kinetics Lecture notes edited by John Reif from PPT lectures by:

Lectures Series on English Learning 英语学习系列讲座

Lectures for 3rd Edition

Organization of lectures

Resident Physics Lectures

Diseases of the Pituitary Gland