Bi-Variate Data PPDAC
We are looking for a set of data that is affected by the other data sets in our spreadsheet. • This variable is called dependent because its values are affected by the other data sets. • Sometimes it is called the response variable (because it responds). • This variable must be “variable 2” in iNZight, i.e. on the y-axis. Types of data
The variable on the x-axis is called the explanatory variable or the independent variable. Axis labels: y-axis = Dependent or Response Variable; x-axis = Explanatory or Independent Variable. Types of data
We can only plot scatter diagrams of numerical data. Non-numerical data such as colours or countries cannot be plotted. Choosing your variables
You may like to look at ‘advanced’ – ‘Scatter Plot Matrix’ to get an overview of all the combinations of graphs. • Look for areas where the fit isn’t good. • Clusters • Fanning out or in (data points are further away from the trendline) • Gaps in data Choosing your variables
Import the correct CSV file into iNZight and click and drag variables into variable 1 and 2 positions. • For each scatter diagram you must add a linear trend line and note the equation and ‘r’ value. Drawing Graphs
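To see what the trend line equation and the ‘r’ value stand for, here is a minimal Python sketch of the usual least-squares calculation. It is not iNZight's own code, the function name `linear_trend` is hypothetical, and the numbers are made up for illustration.

```python
from math import sqrt

def linear_trend(xs, ys):
    """Return (slope, intercept, r) for the least-squares line y = slope*x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations and cross-products about the means
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / sqrt(sxx * syy)  # correlation coefficient
    return slope, intercept, r

# Made-up illustrative values, not data from the slides
xs = [1, 2, 3, 4, 5]   # explanatory variable (x-axis)
ys = [2, 4, 5, 4, 5]   # response variable (y-axis)
slope, intercept, r = linear_trend(xs, ys)
print(f"y = {slope:.2f}x + {intercept:.2f}, r = {r:.2f}")
```

The slope and intercept give the equation to note down, and r is the value to compare against the correlation guide.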
For Achieved. 1. Problem: Write a question that clearly investigates the relationship between two variables. 2. Plan: I will use iNZight to produce a scatter plot and equation. I will observe the graph to decide if the equation is valid. 3. Data: Describe the data including the correct units and show some understanding of the context. 4. Analyse: Use iNZight to draw a scatter graph and produce the trend curve. For Achieved
5. Analyse: Describe what you see in the scatter graph (use T.A.R.S.O.G. for this). 6. Analyse: Describe the relationship between the two variables in terms of "as xxx increases, yyy ...". 7. Prediction: Make a prediction (interpolation) using the iNZight equation. Is it valid? Reliable? 8. Conclusion: Answer your problem question. Is there a relationship? For Achieved
Problem: This report considers the relationship between stride length and the time to complete a marathon, in minutes, for the purpose of predicting the time to run a marathon. • Plan: The independent variable is the stride length, which is measured in centimetres. The dependent variable is the marathon time, measured in minutes. • Data: The data is a sample taken from marathons in NZ. Purpose statement (Basic)
• T is for trend: is it linear or not? • A is for association: is it positive or negative? • R is for relationship: is it strong or weak? • S is for scatter: is it constant or not? Fan? • O is for outliers: can you spot any? • G is for groups: are there any? 5. Analysis
As the carat of the diamond increases, the price of the diamond increases. For every 1 carat increase, the price increases by approximately $7800. 6. Analysis
Must include • An interpolation and an extrapolation • A comment about the strength of the prediction (critique) 7. Predictions
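The difference between interpolation and extrapolation can be sketched in a few lines of Python. The helper `predict`, the trend line values and the data range below are all hypothetical and only illustrate the idea: a prediction inside the range of the observed x values is an interpolation, and one outside it is an extrapolation, which is generally less reliable.

```python
def predict(slope, intercept, x, x_min, x_max):
    """Return the predicted y and whether x is an interpolation or an extrapolation."""
    y_hat = slope * x + intercept
    # Inside the observed x range: interpolation; outside it: extrapolation
    kind = "interpolation" if x_min <= x <= x_max else "extrapolation"
    return y_hat, kind

# Hypothetical trend line and observed x range, for illustration only
slope, intercept = 0.6, 2.2
x_min, x_max = 1, 5

inside = predict(slope, intercept, 3, x_min, x_max)
outside = predict(slope, intercept, 10, x_min, x_max)
print(inside)
print(outside)
```

When critiquing a prediction, also check how closely the points follow the trend line near the chosen x value, since a wide scatter there weakens even an interpolation.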
Answer your purpose statement by highlighting the key points of the analysis. 8. Conclusion
The correlation coefficient is a number between -1 and 1 • The sign shows whether the correlation is positive or negative • These are only a guide (different books give different values). Correlation Coefficient
It is only designed to measure linear relationships! (Not appropriate for curved relationships.) [Example scatter plots: r = 1, r = -1, r = 0] Correlation Coefficient
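A small Python sketch shows why r is not appropriate for curved relationships. The data are made up: y is exactly x squared, so the relationship is perfect but curved, and the helper name `correlation` is hypothetical.

```python
from math import sqrt

def correlation(xs, ys):
    """Correlation coefficient r of paired numerical data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sqrt(sxx * syy)

# Made-up data: a perfect but curved (quadratic) relationship
xs = [-3, -2, -1, 0, 1, 2, 3]
ys = [x ** 2 for x in xs]

r_curved = correlation(xs, ys)   # close to 0 despite the perfect pattern
r_line = correlation(xs, xs)     # a perfect straight line for comparison
print(r_curved, r_line)
```

Here r comes out at about 0 for the curve and 1 for the straight line, so always look at the scatter graph before trusting r.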