210 likes | 222 Views
This study investigates the relationship between petrochemical exposure and childhood brain and leukemia cancers using the Cumulative Geographic Residual Test. The study population consists of matched case-control participants from Kaohsiung County, Taiwan. The results show marginally significant clustering of childhood leukemia in certain areas without adjusting for smoking history.
E N D
Cumulative Geographic Residual Test Example: Taiwan Petrochemical Study Andrea Cook
Outline 1. Motivation • Petrochemical exposure in relation to childhood brain and leukemia cancers 2. Cumulative Geographic Residuals • Unconditional • Conditional 3. Application • Childhood Leukemia • Childhood Brain Cancer
Taiwan Petrochemical Study Matched Case-Control Study • 3 controls per case • Matched on Age and Gender • Resided in one of 26 of the overall 38 administrative districts of Kaohsiung County, Taiwan • Controls selected using national identity numbers (not dependent on location).
Study Population Due to dropout approximately 50% 3 to 1 matching, 40% 2 to 1 matching, and 10% 1 to 1 matching.
Cumulative Residuals • Unconditional (Independence) • Model definition using logistic regression • Extension to Cluster Detection • Conditional (Matched Design) • Model definition using conditional logistic regression • Extension to Cluster Detection
Logistic Model Assume the logistic model where, and the link function,
Residual Formulation Then define a residual as, Assuming the model is correctly specified would imply there is no pattern in residuals. => Use Residuals to test for misspecification. Cumulative Residuals for Model Checking; Lin, Wei, Ying 2002
Hypothesis Test Hypothesis of interest, Geographic Location, (ri, ti ) Independent of Outcome, Yi|Xi Cumulative Geographic Residual Moving Block Process is Patternless
Unconditional Cluster Detection Define the Cumulative Geographic Residual Moving Block Process as,
Asymptotic Distribution However, the distribution of, is hard to define analytically, but we have found another distribution that is asymptotically equivalent, which consists of a fixed component of data and random variables
Significance Test Testing the NULL • Simulate N realizations of by repeatedly simulating , while fixing the data at their observed values. • Calculate P-value
Conditional Logistic Model Type of Matching: 1 case to Ms controls Data Structure: Assume that conditional on , an unobserved stratum-specific intercept, and given the logit link, implies, The conditional likelihood, conditioning on is,
Conditional Residual Then define a residual as, => Use these correlated Residuals to test for patterns based on location.
Conditional Cumulative Residual However, the distribution of, is hard to define analytically, but we have found another distribution that is asymptotically equivalent, which consists of a fixed component of data and random variables
Testing the NULL Simulate N realizations of by repeatedly simulating , while fixing the data at their observed values. Calculate P-value Significance Test
Application • Study: Kaohsiung, Taiwan Matched Case-Control Study • Method: Conditional Cumulative Geographic Residual Test (Normal and Mixed Discrete)
Results Odds Ratio (p-values) Marginally Significant Clustering for both outcomes without adjusting for smoking history.
Discussion Cumulative Geographic Residuals • Unconditional and Conditional Methods for Binary Outcomes • Can find multiple significant hotspots holding type I error at appropriate levels. • Not computer intensive compared to other cluster detection methods Taiwan Study • Found a possible relationship between Childhood Leukemia and Petrochemical Exposure, but not with the outcome Childhood Brain Cancer.