230 likes | 920 Views
Quantitative Analysis. Managing the Data. Coding of Questionnaires and Production of Code book. Quantitative Measurement: Nominal, ordinal & interval variables. Simple statistics: Frequencies (percentages) Descriptives (mean, median, mode). Data Analysis .
E N D
Quantitative Analysis • Managing the Data. • Coding of Questionnaires and Production of Code book. • Quantitative Measurement: • Nominal, ordinal & interval variables. • Simple statistics: • Frequencies (percentages) • Descriptives • (mean, median, mode)
Data Analysis • Having spent time designing your survey and collecting data, it is important to make a good job of your data analysis. • Your analysis is guided by the hypotheses that you are exploring and relies on your ingenuity to relate the data to the theory. • Data analysis is helped by efficient management of data, particularly coding.
Managing Data: • Keep all questionnaires safely and where you cannot violate your promise of anonymity and confidentiality (cabinet and key!). • Check your input of data regularly on the computer – for example, SPSS, check frequencies and cross-tabulations of responses to variables to ensure no strange input! • Also check frequencies of case numbers (numbers representing your respondents) to prevent entering a case (respondent) twice! • When finished processing data, add frequencies of variables to the codebook.
Coding the data • You need to extract the data from the questionnaires and put them into a form that is easier to refer to and to manipulate: • You code data and create a data file. • Draw up a coding frame. This lists all the alternative values for a given variable and allocates a number for each possible answer. • Coding frame for closed questions can be taken from the questionnaire. Additional coding for open questions needs to be created when imputing data. Coding frame is usually drawn up before you attempt to code data and input data into the matrix. • The job of analysis is made easier if you use a computer to create data files – e.g. SPSS, STATA, R, SAS or even Excel.
Coding Questions • What sex are you? • Male q • Female q • How old are you? ________ • What is your marital status? • Never married q • Married q • Cohabiting q • Separated q • Divorced q • Widowed q
Data File • This involves producing a grid on which to record the appropriate codes for each respondent. • Normally each row represents the respondent and each column a variable.
Data File • In some computer programmes you can click on a data label and reveal what each code means. • 99 or 999 are usually the codes we use when there is missing data – ‘non item’ responses.
Notes on the Codebook: • Start creating the codebook with the questionnaire. For each question: • Write down the actual question. • The name(s) of the variable(s) given to represent the question [eg:Q1walk] • The label of the variable. [eg: whether or not respondent walks to work]. • Values (codes) and value labels for each attribute. [egg: 1 (value) = walks to work (label); 0 (value) = does not walk to work (label) and 99 = missing data] • Variable type – nominal, ordinal or continuous. • Code open questions (string variables) and ‘other’ categories afterwards and manually.
Example of Tricky Question to Code: Q1. Which mode of transport do you use most often to work? (please tick all those that apply) • Walk q • Bicycle q • Rail q • Bus q • Car as driver q • Car as passenger q • Motorbike as driver q • Motorbike as passenger q
Coding of ‘Tricky’ Question: • Spilt the question into different variables, i.e. create multi-dichotomies. • Walk variable (Q1walk): 1 = walks to work 0 = does not walk to work 99 = missing data. • Bicycle variable (Q1bicycle): 1 = walks to work 0 = does not walk to work 99 = missing data. ETC.
Quantitative Measurement • Types of Variables: • Binary • Nominal • Ordinal • Interval • Discrete Count
Counting Responses • After compiling your data in your data matrix, the first step in data analysis is to summarise your data. • You can do this by: • Tabulating the data (‘Frequencies’) • Calculating the ‘Descriptives’ i.e. summaries and variability of the data • Graphs • This initial step is sometimes called univariate analysis – i.e. description of one variable
Frequencies • From a frequency table, you can tell how often (frequently) people gave each response. • It tells you how many people selected each response to a question. • Frequencies can also be used to check codes. If a code appears in the frequency table that wasn’t used in the coding scheme, you know that an error has happened in imputing the data.
IMPJOB – Importance to respondent of having a fulfilling job.
Frequencies • A frequency count alone is not a very good summary of the data so use both counts and percentages. • Percentages are easier to visualise – unlike counts you can compare percentages across surveys with different cases. • Use valid percentages – i.e. exclude the missing data.
Simple Statistics Descriptives: • Mode (most frequently recurring response) • Mean (average) • Median (middle value if all responses were laid out in a row from smallest to largest). • Used instead of mean for ordinal variables, i.e. is the mid-rank. • Also is not affected by ‘outliers’ so a better measure than mean for continuous variables like age and income.
Graphs • You can use a number of graphs to illustrate your findings such as: • Pie chart • Bar chart • Histogram • It depends on the type of variable!
Simple Statistics • Bivariate Analysis – this involves the relationship between two variables. • The simplest form is a cross-tabulation. • You need categorical variables for this. • For continuous variables, use a scatter plot to graph the relationship.