1 / 16

Chapter 2 Organizing Data

Chapter 2 Organizing Data. Nutan s. Mishra. Raw Data. A data recorded in the form as it was collected without ranking or processing is called raw data. Example: consider the status of following 20 students. The four status are SO, F, J, SE

lbaltazar
Download Presentation

Chapter 2 Organizing Data

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Chapter 2 Organizing Data Nutan s. Mishra University of South Alabama

  2. Raw Data A data recorded in the form as it was collected without ranking or processing is called raw data. Example: consider the status of following 20 students. The four status are SO, F, J, SE This also called ungrouped data. University of South Alabama

  3. Grouped Data This is categorical (qualitative) data x= status f= frequency The data set consist of 20 members. The sum of frequencies of all the categories is equal to size of the data set . i.e. Σf = f1 + f2 + f3 + f4 = 20 University of South Alabama

  4. Relative frequency Relative frequency of a category = frequency of that category/ sum of all frequencies University of South Alabama

  5. Graphical presentation University of South Alabama

  6. Organizing Data (quantitative) Consider the following table This is the organized data for the quantitative variable GPA. The table shows that there are 10 students whose GPA falls between 0 and 1 and … The values of x are divided into four distinct classes. Each class is an interval of values. University of South Alabama

  7. Organizing Data (quantitative) This is called Frequency Distribution Table or just frequency distribution . Class width = upper boundary-lower boundary Class midpoint = (upper boundary+lower boundary)/2 University of South Alabama

  8. Construction of frequency distribution That is how to divide data into classes? How many classes? Depends on size of the data set. Vary between 5 and 20. What should be the class width? Approximately (largest value-smallest value)/number of classes University of South Alabama

  9. Example of classification Consider the following raw data on GPA of 30 students. We know that the variable GPA =x varies between 0 and 4 Number of classes can be 3 or 4 or 5. let it be 3 Then class width = (3.99-1.54)/3 = .81 1 University of South Alabama

  10. Example of classification And the frequency table is as follows Relative frequency of a class = frequency of that class/ sum of all frequencies = f/Σf University of South Alabama

  11. Important note about classification Most of the statistical software accept only raw data as input and they classify data for us. Thus if we are using a software to analyze our data, we do not have to worry about the classification part. Software gives us ability to change the number and width of the classes according to the need of the problem. University of South Alabama

  12. Cumulative frequency distribution A cumulative frequency distribution gives the total number of values that fall below the upper boundary of each class. Example: Application 2.34 from the textbook frequency cumulative frequency University of South Alabama

  13. Frequency Curve University of South Alabama

  14. Ogive (Cumulative frequency curve) University of South Alabama

  15. Stem and Leaf display A way of organizing and display quantitative data. Each value is divided into two parts – a stem and a leaf. \ To draw a stem-n-leaf plot its helpful to know the range of the data that is max value and min value If can not find out exact values of max and min, the approximate value can given us some idea about the range of the data. Consider the following data set of scores of 30 students in Statistics exam University of South Alabama

  16. Stem and leaf display In this data set the values range between 50’s and 90’s. Thus we would like to count the number of values in 50’s, in 60’s in 70’s and so on That we declare the tenth place as stem and unit place as a leaf. Thus the resulting stem and leaf plot is as follows: University of South Alabama

More Related