300 likes | 442 Views
Developing General Education Course Assessment Measures. Anthony R. Napoli, PhD Lanette A. Raymond, MA Office of Institutional Research & Assessment Suffolk County Community College http://sccaix1.sunysuffolk.edu/Web/Central/IT/InstResearch/. Why Validity & Reliability ?.
E N D
Developing General Education Course Assessment Measures Anthony R. Napoli, PhD Lanette A. Raymond, MA Office of Institutional Research & Assessment Suffolk County Community College http://sccaix1.sunysuffolk.edu/Web/Central/IT/InstResearch/
Why Validity & Reliability ? • Assessment results must represent student achievement of course learning objectives • Evaluation of the validity and reliability of the assessment measure provides the evidence that it does so
Types of Measures • ‘Performance’ Measures • ‘Objective’ Measures
Validity for Performance Measures • Identified learning outcomes represent the course (domain sampling) • The measure addresses the learning outcomes (content validity) • There is a match between the measure and the rubric (criteria for evaluating performance) • Rubric scores can be linked to the learning outcomes, and indicate the degree of student achievement within the course
Validity for Objective Measures • Identified learning outcomes represent the course (domain sampling) • The items on the measure address specific learning outcomes (content validity) • Scores on the measure can be applied to the learning outcomes, and indicate the degree of student achievement within the course
Objective I II III Description Identify the basic methods of data collection Demonstrate an understanding of basic sociological concepts and social processes that shape human behavior Apply sociological theories to current social issues Content Validity (SO11) A 30-item test measured students’ mastery of the objectives
Reliability • Can it be done consistently? • Can the rubric be applied consistently across raters -- Inter-rater reliability • Can each of the items act consistently as a measure of the construct -- Inter-item reliability
Face Validity and Reliability… Is this enough? • Measures with face validity & adequate levels of reliability can produce misleading/inaccurate results. • Even content valid measures cannot guarantee accurate estimates of student achievement MA23 – earlier pilot
Motivational Comparison (PC11) • 2 Groups Graded Embedded Questions Non-Graded Form & Motivational Speech • Mundane Realism
Motivational Comparison (PC11) • Graded condition produces higher scores (t(78) = 5.62, p < .001). • Large effect size (d = 1.27).
Motivational Comparison (PC11) • Minimum competency 70% or better • Graded condition produces greater competency (Z = 5.69, p < .001).
Motivational Comparison (PC11) • In the non-graded condition this measure is neither reliable nor valid KR-20N-g = 0.29
Developing General Education Course Assessment Measures Anthony R. Napoli, PhD Lanette A. Raymond, MA Office of Institutional Research & Assessment Suffolk County Community College http://sccaix1.sunysuffolk.edu/Web/Central/IT/InstResearch/