160 likes | 310 Views
Biostatistics. Dr. Chenqi Lu Telephone: 021-55665269 E-mail: luchenqi @fudan.edu.cn Office: 2309 GuangHua East Main Building. Population Sample Parameter Statistic. Random Sampling. Sampling Distribution.
E N D
Biostatistics Dr. Chenqi Lu Telephone: 021-55665269 E-mail: luchenqi@fudan.edu.cn Office: 2309 GuangHua East Main Building
Population Sample Parameter Statistic Random Sampling Sampling Distribution Estimate parameter Statistical inference
An example of wing shape inheritance in Drosophila Vv x Vv ¼ VV ½ Vv ¼ vv (vestigial wing) p = 0.25 of vv genotype randomly sampling 20 flies Estimated value of p: Only those samples containing 5 vv individuals give , the probability is Chapter 4. Sampling Distributions § 4.1. Concept of random samples A sample from any given population is recognized as a random sample if every individual in the population is equally likely to be present in that sample. i.e. a random sample is a “fair” presentation of the population. probability frequency
x 0 1 2 3 4 5 6 0.2023 0.1444 0.0727 0.1025
Larger sample size tend to improve the probability that is close to p but decreases as n increases
§ 4.2. Distribution of the sample mean Let X = {X1, X2, …, Xn} be a random sample of size n from a population with mean m and variance s2, and {Xi} are i.i.d. It is clear that Independent identical distribution Mean of a random sample equals mean of the population from which the sample is collected is a r.v. Its mean and variance are given by Variance of a random sample is proportional to its population variance to a factor 1/n
Revisit of the example of two rolls of a die (sampling with replacement )
the example of two from six balls (sampling without replacement ) 0 1 1 2 2 3 2 2 1 1 0 15
If a random sample is from a population exhibiting normal distribution mean m and variance s2, then the sample mean as a r.v. follows normal distribution with Standard deviation of mean is also referred as to standard error In fact, if the sample size (n) is large enough, the sample mean approaches to a normal distribution no matter what population distribution the sample is from! (Central Limit Theorem)
0.95 0.05 -1.96 1.96 § 4.2. Confidence Intervals for population mean In many cases where population size is very large or even infinity, it is not practical to calculate the exact value of population mean. In many practices, it is useful to have ideas about the range of value of the population mean with certain confidence. Sampling distribution of a sample mean allows this to be done. In fact For a given probability confidence 1-a
When a = 0.05 is called the 1- a confidence interval of population mean m
Thus, estimated C.I. of m from a sample is In practice, calculation of C.I. for population mean needs s, the population standard deviation which is unknown but can be estimated from sample s.d., s. However is no longer a normal distributed r.v. Its distribution follows a t-distribution with d.f. = n-1.
Let s2 be variance of a random sample from a population with mean § 4.3. Confidence Intervals for population variance normal standard deviation
In a similar way but need to consider asymmetry in c2 distribution, the C.I. for population variance c 2 can be constructed for a given probability confidence that is, Calculation of using Minitab MiniTab => Calculate => Probability Distributions => Chi-square => Inverse CP (DF, input constant)