1 / 18

Population and Processes

Population and Processes. Population , the entire set of entities about which we make statements in an analysis Incomes of all families living in Milwaukee Votes of all voters in an election Process , an ongoing mechanism from which data are generated

israeln
Download Presentation

Population and Processes

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Population and Processes • Population, the entire set of entities about which we make statements in an analysis • Incomes of all families living in Milwaukee • Votes of all voters in an election • Process, an ongoing mechanism from which data are generated • An attribute of products of a production line • Daily return of a stock

  2. Parameter • A quantity that describes a characteristic of the distribution f(y)of a variable Y of a process variable or a population • distribution mean m • distribution standard deviation s • distribution median, etc. • The value of a parameter is usually unknown • Subject of statistical inference (estimation, testing)

  3. Sample • Data y1, . . ., yn • A subset of a population • A snapshot of a process • Simple Random Sample • Every set of n elements has an equal chance of being selected • Statistical replicas of a draw from f(y) • yi’sare Independently and Identically Distributed (IID)

  4. Statistic • A quantity computed from data • Sample mean • Sample variance • Sample standard deviation

  5. An Example • Given: f(y) normal, sy = 15 • Parameter of interest m, unknown • Data y1=97.65, y2 =101.76, y3 =54.27, y4=99.37 • Use the statistic as an estimate of the unknown parameter m • Compute the statistic

  6. Statistical Inference Population f(y) Unknown Parameter Statistic Sample Estimate

  7. Relative Frequency Inference Statistic All other possible samples f(y) Statistic Unknown Parameter Statistic Actual sample

  8. Variation of Sample mean • The sample mean varies from one sample to another sample • The distribution describes variation of all possible sample mean and is called the sampling distribution of • Parameters of are related to the parameters of the data distribution f(y)

  9. Averages of n rolls of a die Simulation Results Variable NMean StDev y(n = 1 ) 1000 3.5020 1.7008 (n = 4 ) 10003.4800 0.8475 (n = 9 ) 10003.5101 0.5525 (n =16) 10003.4922 0.4235

  10. Mean of the sampling distribution of the sample mean is the same as population distribution meanmy Mean and Standard Deviation of • Standard deviation of the sampling distribution of the sample mean is smaller than the population distribution standard deviationsy

  11. Sampling Distribution of • If f(y) is normal, then is normal P(85< y <115 )=68.3% sy=15 f(y)

  12. Central limit Theorem for • For any f(y) , as n grows, becomes approximately normal f(y)

  13. 1000 Incomes my=2, sy=2 P(0 < y < 4) = ?

  14. 1,000 Average Incomes, n = 4

  15. 1,000 Average Incomes, n = 25

  16. Bayesian Inference Actual sample observed Unknown Parameter Statistic Prior distribution g(m) g(m | data) Posterior distribution

  17. Bayesian Statistics • Degrees of belief about the values of the unknown parameter m are described by a probability distribution • Prior distribution g(m) • For a given sample, a statistic is a known value and is non-random • A statistic is used for updating the prior knowledge about m • Posterior distribution, g(m|data)

  18. Uniform Prior & Normal Posterior • No prior knowledge, g(m) = Uniform • Posterior distribution g(m|data) g(m)

More Related