CADIgen: A New Approach to Skills Diagnosis Data Simulation

CADIgen: A New Approach to Skills Diagnosis Data Simulation • Final Presentation • Becky Norman • Abdullah Ferdous • Louis Roussos

Background: What Is Skills Diagnosis? • Predefined list of skills (attributes) on an assessment • Each item requires mastery of one or more skills • Q matrix specifies whether a skill is required for an item • Outcome is a diagnostic report • Lists each skill and whether the student is a master or non-master • Report can be used to aid instruction
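A minimal sketch of the ideas on this slide, assuming a made-up 4-item, 3-skill Q matrix and placeholder skill names (none of these values come from the presentation). It shows how a Q matrix and a mastery pattern combine, and what a per-skill diagnostic report might look like.

```python
# Hypothetical Q matrix: 4 items (rows) by 3 skills (columns).
# Q[i][k] == 1 means item i requires skill k.
Q = [
    [1, 0, 0],
    [1, 1, 0],
    [0, 1, 1],
    [0, 0, 1],
]
SKILLS = ["skill_A", "skill_B", "skill_C"]  # placeholder names


def missing_skills(q_row, alpha):
    """Count the skills an item requires that the student has not mastered."""
    return sum(q * (1 - a) for q, a in zip(q_row, alpha))


def diagnostic_report(alpha, skill_names):
    """Map each skill name to 'master' or 'non-master'."""
    return {
        name: "master" if a == 1 else "non-master"
        for name, a in zip(skill_names, alpha)
    }


# One student's mastery pattern: master of the first two skills only.
alpha = [1, 1, 0]
missing = [missing_skills(row, alpha) for row in Q]
report = diagnostic_report(alpha, SKILLS)
```

Here `missing` counts, for each item, the required skills the student lacks, and `report` is the kind of skill-by-skill listing a diagnostic report would contain.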

Background: Fusion Model Characteristics • Skill space includes Q matrix and additional relevant attributes • Conjunctive • Assumes attribute homogeneity • Fusion Model Attribute Response Function (ARF) is a step function • Attribute location or jump-point (kk)
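The slides do not write out the item response function. As a hedged illustration, the sketch below uses the commonly cited reduced form of the fusion model, in which the probability of a correct response is π* multiplied by r* once for every required skill the examinee has not mastered. The term for additional attributes outside the Q matrix is left out, and the function name and parameter values are assumptions for illustration only.

```python
def item_correct_prob(pi_star, r_star, q_row, alpha):
    """Reduced fusion model sketch (residual ability term omitted).

    pi_star: probability of a correct response when all required skills are mastered
    r_star:  per-skill penalty multipliers (each between 0 and 1)
    q_row:   Q-matrix row for the item (1 = skill required)
    alpha:   examinee mastery pattern (1 = master)
    """
    prob = pi_star
    for r, q, a in zip(r_star, q_row, alpha):
        if q == 1 and a == 0:
            # Conjunctive: each required but non-mastered skill lowers the probability.
            prob *= r
    return prob


# Hypothetical item requiring two skills, pi* = 0.9, r* = (0.5, 0.6).
p_master_both = item_correct_prob(0.9, [0.5, 0.6], [1, 1], [1, 1])
p_master_one = item_correct_prob(0.9, [0.5, 0.6], [1, 1], [0, 1])
p_master_none = item_correct_prob(0.9, [0.5, 0.6], [1, 1], [0, 0])
```

A lower r* means a larger drop in probability for non-masters, which is why the slides describe r* as the discrimination parameter.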

Background: Attribute Heterogeneity • pks: The proportion of masters for a given attribute • piks: The proportion of masters for a given item associated with an attribute • Attribute heterogeneity occurs when the proportion of masters is allowed to vary across the individual items associated with an attribute. Homogeneity is the case in which each item associated with the skill has the same proportion of masters. • DADIgen lets the user specify the degree of heterogeneity

DADIgen • Dichotomous attributes, dichotomous items generation • pk estimates based on dichotomous piks • Generates data based on user specifications • Input files: qmatrix.in, xpar.in, iparms.in, xpc.in, xrstar.in, corrparm.in, corr.txt, pikparms.in, xpik.in, xpk.in • Output files: data.txt, alfcthet.out, alphad.out • Roussos, Xu, & Stout (2003)

Research Questions • How does increasing heterogeneity affect the shape of the score distribution of data simulated with the DADIgen program? • Does a modified program that simulates item-correct probabilities based on a continuous ability (CADIgen) produce more realistic score distributions? • How do CADIgen-simulated score distributions compare to distributions estimated by applying the homogeneous fusion model to the CADIgen-simulated data?

Methods: DADIgen • Q matrices: • Low complexity: average of 1.5 items per attribute • High complexity: average of 2.5 items per attribute • Examinees: 10,000 • Range of π*: .75 - .95 • Attribute difficulty parameter • Range of r*: .40 - .85 • Attribute discrimination parameter • pks: .45, .49, .53, .57, .61, .65, .69 • Range of correlation between skills: .55 - .89
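The slides do not describe how DADIgen draws mastery patterns internally. One common way to obtain correlated dichotomous attributes with target mastery proportions is to threshold correlated normal variables, and the sketch below does that with a single shared factor. The function name, the single latent correlation of 0.7 (a value inside the range on the slide), and the seed are assumptions; only the pk values and the examinee count come from the slide.

```python
import random
from statistics import NormalDist


def simulate_mastery(p_k, rho, n_examinees, seed=0):
    """Draw dichotomous mastery patterns by thresholding correlated normals.

    Each latent variable is sqrt(rho) * shared + sqrt(1 - rho) * unique,
    so every pair of latent variables has correlation rho.
    """
    rng = random.Random(seed)
    std_normal = NormalDist()
    # Cut point chosen so that P(z > cut) equals the target proportion of masters.
    cuts = [std_normal.inv_cdf(1.0 - p) for p in p_k]
    load, uniq = rho ** 0.5, (1.0 - rho) ** 0.5
    patterns = []
    for _ in range(n_examinees):
        shared = rng.gauss(0.0, 1.0)
        latent = [load * shared + uniq * rng.gauss(0.0, 1.0) for _ in p_k]
        patterns.append([1 if z > c else 0 for z, c in zip(latent, cuts)])
    return patterns


P_K = [0.45, 0.49, 0.53, 0.57, 0.61, 0.65, 0.69]  # pks from the slide
alphas = simulate_mastery(P_K, rho=0.7, n_examinees=10_000)
observed_p = [sum(col) / len(alphas) for col in zip(*alphas)]
```

With 10,000 examinees the entries of `observed_p` should land close to the target pks. Note that `rho` is the correlation of the latent normal variables, which is not the same as the correlation between the resulting 0/1 mastery indicators.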

DADIgen Score Distributions

Score Distributions: Low Complexity (figure panels: Heterogeneity = 0.00, Heterogeneity = 0.50)

Score Distributions: High Complexity (figure panels: Heterogeneity = 0.00, Heterogeneity = 0.50)

Methods: CADIgen • Continuous attributes, dichotomous items, generation • Probabilities based on continuous attribute ability parameters and continuous attribute application functions
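The slides do not give the functional form of CADIgen's continuous attribute application functions. Purely as an illustration of the contrast with a step-function ARF, the sketch below compares a step function with a logistic curve. The logistic form, the slope of 1.7, and the function names are assumptions, not the authors' specification.

```python
import math


def step_arf(ability, jump_point):
    """Step function: the attribute counts as applied only above the jump point."""
    return 1.0 if ability > jump_point else 0.0


def smooth_arf(ability, location, slope=1.7):
    """Assumed logistic form: the chance of applying the attribute rises gradually."""
    return 1.0 / (1.0 + math.exp(-slope * (ability - location)))


abilities = [-2.0, -1.0, 0.0, 1.0, 2.0]
step_values = [step_arf(a, 0.0) for a in abilities]
smooth_values = [smooth_arf(a, 0.0) for a in abilities]
```

With a step function, examinees fall into two groups on each attribute. A smooth curve lets examinees near the location differ by degree.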

CADIgen Score Distributions

Score Distributions: Low Complexity (figure panels: Heterogeneity = 0.50, Heterogeneity = 0.00)

Score Distributions: High Complexity

Background: Research Question 3 • Jang (2005) examined the relationship between observed cumulative score frequencies and estimated distributions on a 39-item test. • Score < 10: Observed slightly less than estimated • Middle range: Observed slightly more • Score > 30: Observed slightly less

Methods: Research Question 3 • Use CADIgen score matrices • Low complexity, heterogeneity = .50 • High complexity, heterogeneity = .25 • Run existing skills diagnosis estimation program • Obtain r* and π* estimates and specify in DADIgen • Compare simulated score distributions to score distributions based on fitting the fusion model to CADIgen simulated data
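The comparison in this step comes down to putting two score distributions side by side. A minimal sketch, assuming total scores are available as plain lists of integers (the function names and the toy scores are hypothetical), computes cumulative relative frequencies and their difference at each score point.

```python
def cumulative_score_freq(scores, n_items):
    """Cumulative relative frequency of total scores 0..n_items."""
    counts = [0] * (n_items + 1)
    for s in scores:
        counts[s] += 1
    total = len(scores)
    cumulative, running = [], 0
    for c in counts:
        running += c
        cumulative.append(running / total)
    return cumulative


def cumulative_gap(scores_a, scores_b, n_items):
    """Difference (a minus b) between two cumulative score distributions."""
    cum_a = cumulative_score_freq(scores_a, n_items)
    cum_b = cumulative_score_freq(scores_b, n_items)
    return [a - b for a, b in zip(cum_a, cum_b)]


# Toy example on a 3-item test.
simulated = [0, 1, 1, 3]
model_based = [1, 1, 2, 3]
cum_simulated = cumulative_score_freq(simulated, 3)
gap = cumulative_gap(simulated, model_based, 3)
```

A positive entry in `gap` means the first distribution has accumulated more examinees at or below that score than the second.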

Simulated vs. Generated Score Distribution: Low Complexity

Simulated vs. Generated Score Distribution: High Complexity

Summary and Conclusions • DADIgen tends to produce bimodal distributions • Most pronounced in the high-complexity condition and with less heterogeneity • By using a continuous attribute ability, CADIgen eliminated the bimodality • CADIgen produced score distributions similar to those obtained in real-data situations for the low-complexity condition, and somewhat similar for the high-complexity condition

Next Step • Calculate and output expected homogeneous r*’s and π*’s in CADIgen using heterogeneous r*’s and π*’s • Use for comparison with existing skills diagnosis program estimates

Any Questions?
