850 likes | 1.1k Views
The Population Attributable Fraction (PAF) for Public Health Assessment: Epidemiologic Issues, Multivariable Approaches, and Relevance for Decision-Making. Deborah Rosenberg Kristin Rankin Craig A. Mason Juan Acu ña. Workshop Outline.
E N D
The Population Attributable Fraction (PAF) for Public Health Assessment: Epidemiologic Issues, Multivariable Approaches, and Relevance for Decision-Making Deborah Rosenberg Kristin Rankin Craig A. Mason Juan Acuña
Workshop Outline • Overview of the Population Attributable Fraction (PAF) • Methodological issues for the PAF in a multivariable context • A simple example with 2 variables • A modeling approach for a an example with 3 variables • Direct and indirect effects—the special case when variables are in a causal pathway • Issues for using the PAF for priority-setting, program planning, and to inform policy • What we won’t discuss: • Standard error and confidence interval estimation • Statistical testing
The Population Attributable Fraction (PAF) for Public Health Assessment: Epidemiologic Issues, Part I Deborah Rosenberg, PhD Research Assistant Professor Epidemiology and Biostatistics UIC School of Public Health
Overview of Attributable Risk Measures • Measures based on Risk Differences • Attributable Risk = • Attributable Fraction = • Pop. Attributable Risk = • Pop. Attributable Fraction =
Overview of Attributable Risk Measures • The PAF can also be computed as a function of the relative risk and the prevalence and distribution of exposure in the population: • directly in cohort and cross-sectional studies • substituting the odds ratio as an estimate when appropriate—in case control studies when “disease” is rare
Methodological Issues for the PAF in a Multivariable Context • In a multivariable context, the goal is to generate a PAF for each of multiple factors, taking into account relationships with other factors • The sum of this set of PAFs should equal the aggregate PAF calculated for all of the factors combined
Methodological Issues for the PAF in a Multivariable Context • Generating mutually exclusive and mutually adjusted PAFs is not straightforward because of the overlapping distributions of exposure in the population • Methods that go beyond the usual adjustment procedures, therefore, have to be used to address correlation between variables
Methodological Issues for the PAF in a Multivariable Context • In addition to different computational approaches, decisions about how variables will be considered may be different when focusing on the PAF than when focusing on the ratio measures of association • Differentiating the handling of modifiable and non-modifiable risk factors • Confounding and effect modification • Handling factors in a causal pathway
Methodological Issues for the PAF in a Multivariable Context • Having an explicit conceptual framework / logic model is always important for multivariable analysis, and is particularly critical when focusing on the PAF because decisions about how to handle variables will not only influence the substantive interpretation of results, but will change computational steps as well.
Methodological Issues for the PAF in a Multivariable Context • Approaches to Generating PAFs • Aggregate PAF: The total PAF for many factors considered in a single risk system • Component PAF: The separate PAF for each combination of exposure levels in a risk system • Sequential PAF: The PAF considering one possible order for eliminating risk factors • Average PAF: The PAF summarizing all possible sequences for eliminating a risk factor
The simple case of 2 variables • Smoking and Cocaine • Crude RR = 1.60 Crude RR = 4.77
Smoking and Cocaine Organized into a Risk System • Aggregate RR = 1.88
Components of the Smoking-Cocaine Risk System • Components • RR=5.89, both smoking and cocaine use • RR=4.30, cocaine use only • RR=1.36, smoking only • There is a component for each combination of exposure levels in the risk system.
Components of the Smoking-Cocaine Risk System • The aggregate PAF (PAFAGG) of • variables in a risk system equals • the sum of the component PAFs. • + • = • +
Components of the Smoking-Cocaine Risk System • While the component PAFs of a risk system sum to the aggregate PAF for the system as a whole, they do not provide mutually exclusive measures of the PAF for each risk factor • Here, the aggregate PAF = 0.16, • but the two factors are related: • some women are both smokers • and cocaine users
The “Adjusted” PAF:The Stratified Approach • The PAF for eliminating an exposure • controlling for other risk factors • PAF considering potential effect modification (This assumption-free approach always applies) • PAF assuming no effect modification
The “Adjusted” PAF:The PAF for Smoking, Controlling for Cocaine Use • RR=1.37 • + • RR=1.36 =
The “Adjusted” PAF:The PAF for Cocaine Use, Controlling for Smoking • RR=4.33 • + • RR=4.30 =
The “Adjusted” PAF: • While the usual adjustment methods control for other risk factors, the resulting adjusted PAFs still do not meet the criterion of summing to the aggregate PAF for all factors combined • ≠ • 0.042+0.062+0.056=0.16 0.076 + 0.099 = 0.175
The “Adjusted” PAF: • The typical adjustment procedures result in a PAF that, by itself, represents an estimate—perhaps unrealistic—of the impact of eliminating one exposure in a risk system, controlling for the effects of other risk factors in the system • The “adjusted” PAF becomes more useful when viewed as one element of a sequence for eliminating all risk factors in a system
Sequential PAFs (PAFSEQ) for theSmoking-Cocaine Risk System • For the smoking-cocaine risk system, there are 2 possible sequences: • Eliminate smoking first, controlling for cocaine use, then eliminate cocaine use • Eliminate cocaine use first, controlling for smoking, then eliminate smoking • And within each sequence, there are two sequential PAFs
Sequential PAFs (PAFSEQ) for theSmoking-Cocaine Risk System • The PAFSEQ for eliminating smoking, controlling for cocaine use: • PAFSEQ (S|C) = 0.076 • The PAFSEQ for eliminating cocaine use after smoking has already been eliminated is the remainder of the Aggregate PAF • PAFAGG – PAFSEQ (S|C) = 0.16 – 0.076 = 0.084
Sequential PAFs (PAFSEQ) for theSmoking-Cocaine Risk System • The PAFSEQ for eliminating cocaine use, controlling for smoking: • PAFSEQ (C|S) = 0.099 • The PAFSEQ for eliminating smoking after cocaine use has already been eliminated is the remainder of the Aggregate PAF • PAFAGG – PAFSEQ (C|S) = 0.16 – 0.099 = 0.061
Sequential PAFs (PAFSEQ) for theSmoking-Cocaine Risk System • By definition, the sequential PAFs within the two possible sequences sum to the Aggregate PAF • Smoking First Cocaine Use First • 0.076 + 0.084 = 0.16 0.099 + 0.061 = 0.16
Average PAF (PAFAVG) for theSmoking-Cocaine Risk System • While the sequential PAFs for each sequence sum to the aggregate PAF, they still do not provide a summary comparison of the impact of smoking and cocaine use regardless of the order in which they are eliminated • That is, regardless of the order of elimination, what would be the impact of eliminating smoking on average?
Average PAF (PAFAVG) for theSmoking-Cocaine Risk System • To calculate an average, the sequential PAFs are rearranged, grouping the two for smoking together and the two for cocaine together: • Eliminating smoking first, averaged with eliminating smoking second • Eliminating cocaine use first, averaged with eliminating cocaine use second
Average PAF (PAFAVG) for theSmoking-Cocaine Risk System • Averaging Sequential PAFs • Average PAF for Smoking: • = • Average PAF for Cocaine Use: • =
Average PAFs for theSmoking-Cocaine Risk System • The Average PAFs for each factor in the risk system are mutually exclusive and their sum equals the Aggregate PAF: • 0.0685 + 0.0915 = 0.16
Average PAFs for theSmoking-Cocaine Risk System • The average PAF is perhaps most realistic since typically there are multiple interventions operating simultaneously—risk reduction activities are unordered and often intersect • In addition, averages can be customized—instead of a simple average, the sequential PAFs can be differentially weighted to reflect other unmeasured issues such as funding streams or political will
In a Truly Multivariable Context • The number of average PAFs equals the number of variables in a risk system. The number of sequences is a function of the number of variables and becomes large quickly as the number of variables increases. • 2 variables 2 sequences • 3 variables 6 sequences • 5 variables 30 sequences • Computation of the sequential PAFs becomes cumbersome and an automated modeling approach is needed
The Population Attributable Fraction (PAF) for Public Health Assessment: Epidemiologic Issues, Part II Kristin Rankin, MSPH Assistant Director of Research CADE Research Data Management Group UIC School of Public Health
PAF from Modeling • Why isn’t the multivariable PAF used more commonly in the analysis of public health data? • No known standard statistical packages to complete all steps • What is the advantage of using modeling techniques over stratified analysis?
Advantages of Obtaining Estimates from Modeling • Modeling is not as sensitive to sparse data in individual cells when there are many strata • If you choose to consider confounding and effect modification in the same model, estimates are generated more easily • Note: Using an assumption-free approach, all variables are treated as effect modifiers
Using SAS PROC GENMOD • With cross-sectional data, such as birth certificate data, you can use PROC GENMOD in SAS with log link and binomial or Poisson distribution to model the relative risks (RR) of factors • As number of factors of interest increases, still only need one model to obtain relative risks for several different stratified relationships (using the Estimate statement in SAS)
Modifiable and Unmodifiable Risk Factors • In addition, within one model, we can differentiate between those factors considered to be modifiable and those factors considered to be unmodifiable • This differentiation has an impact on the resulting aggregate, sequential, and average PAFs.
Case Study • Scenario: You are asked to prioritize spending for interventions that target the high rate of lo birth weight (LBW) in your jurisdiction. • Data: You have a data set with relatively reliable data on smoking during pregnancy, cocaine use during pregnancy and poverty level. • Method: You would like to use one of the methods you just learned for calculating the PAFs for each of these factors.
Choose Your Own Adventure • Would you consider each of the following variables unmodifiable or modifiable for preventing LBW? • Smoking (1=Smoking during pregnancy, 0=No smoking) • Cocaine (1=Cocaine use during pregnancy, 0=No cocaine) • Poverty (1=Below Federal Poverty Level, 0=Above FPL) • What type of PAF is most appropriate? • Adjusted (only focused on one factor, controlling for others) • Sequential (specifying one ordering for targeting factors) • Average (account for all possible sequences of eliminating each factor)
Considering Poverty as Unmodifiable:Calculating Sequential and/or Average PAFs for Smoking and Cocaine Use
SAS Code: Obtaining Prevalence for Any Modifiable Exposure vs LBW, Stratified by Poverty • /*Must first sort data set to use “by” variable below*/ • proc sort ; • by poverty; • run; • /*Then, produce frequency tables for low birth weight (lbw) and any modifiable exposure (mod_exp), stratified by poverty*/ • proc freq order=formatted; • tables lbw*mod_exp/list nopercent; • /*mod_exp=1 if smoke=1 or cocaine=1*/ • by poverty; /*Stratified by poverty*/ • run;
SAS Code: Modeling to Obtain Stratified RRs for Any Modifiable Exposure vs LBW • /*Binomial regression run below to obtain RRs*/ • proc genmod; title2 "Smoke and Cocaine, Stratified by Poverty"; • model lbw = mod_exp poverty mod_exp * poverty • /*mod_exp=1 if woman has at least one of the modifiable • exposures*/ • / dist=bin link=log; /*Binomial distribution*/ • estimate “Smoke and/or Cocaine, where Poverty=Yes” • mod_exp 1 Poverty 0 mod_exp*Poverty 1/exp;/*Stratified RR*/ • estimate “Smoke and/or Cocaine, where Poverty=No” • mod_exp 1 /exp; /*Stratified RR*/ • run;
SAS Results: Elements of the PAFAGGfor Risk System, Stratified by Poverty Poverty=Yes Poverty=No
PAFAGG for Smoking and Cocaine Risk System, Considering Poverty Unmodifiable Poverty=Yes Poverty=No
SAS Code: Obtaining Prevalences for Smoke and Cocaine vs LBW, Stratified by Poverty • /*Must first sort data set to use “by” variable below*/ • proc sort ; • by poverty; • run; • /*Create a listing of the frequencies for each possible combination of smoke, poverty and lbw to calculate proportions*/ • proc freq order=formatted; • tables lbw*smoke*cocaine/list nopercent; • by poverty; /*Stratified by poverty*/ • run;
SAS Code: Modeling to Obtain RRs for Smoke and Cocaine vs LBW, Stratified by Poverty • /*Binomial regression run below to obtain RRs*/ • proc genmod; • title2 “RRs for Smoke and Coke with LBW, controlling for Poverty"; • model lbw = smoke cocaine poverty • smoke*cocaine smoke*poverty cocaine*poverty • smoke*cocaine*poverty • /*Every possible multiplicative term must be in model • if using assumption-free, stratified approach*/ • /dist=bin link=log obstats; /*Binomial distribution*/ • /*ESTIMATE Statements in future slides should be inserted here*/ • run;
SAS Code: Estimate Statements to Obtain Stratified RRs for Smoking • /*defining all possible parameter values for stratified model*/ • estimate “smoke, where cocaine=Yes and poverty=Yes” • smoke 1 cocaine 0 poverty 0 smoke*cocaine 1 smoke*poverty 1 • cocaine*poverty 0 smoke*cocaine*poverty 1 • / exp; /* “exp” option gives relative risks from betas */ • estimate “smoke, where cocaine=Yes and poverty=No” • smoke 1 cocaine 0 poverty 0 smoke*cocaine 1 smoke*poverty 0 • cocaine*poverty 0 smoke*cocaine*poverty 0 • / exp; • estimate “smoke, where cocaine=No and poverty=Yes” • smoke 1 cocaine 0 poverty 0 smoke*cocaine 0 smoke*poverty 1 • cocaine*poverty 0 smoke*cocaine*poverty 0 • /exp;estimate “smoke, where cocaine=No and poverty=No” • smoke 1 cocaine 0 poverty 0 smoke*cocaine 0 smoke*poverty 0 • cocaine*poverty 0 smoke*cocaine*poverty 0 • / exp;
SAS Results: Elements of PAFSEQ for Smoking Removed First Poverty=Yes Poverty=No
Elements of PAFSEQ for Smoking Removed First, Considering Poverty Unmodifiable Poverty=Yes Coke=Yes Coke=No Poverty=No Coke=Yes Coke=No
SAS Code: Estimate Statements to Obtain Stratified RRs for Cocaine • estimate “Cocaine, where smoke=Yes and poverty=Yes” • cocaine 1 smoke 0 poverty 0 cocaine*smoke 1 cocaine*poverty 1 • smoke*poverty 0 cocaine*smoke*poverty 1 • / exp e; • estimate “Cocaine, where smoke=Yes and poverty=No” • cocaine 1 smoke 0 poverty 0 cocaine*smoke 1 cocaine*poverty 0 • smoke*poverty 0 cocaine*smoke*poverty 0 • / exp e; • estimate “Cocaine, where smoke=No and poverty=Yes” • cocaine 1 smoke 0 poverty 0 cocaine*smoke 0 cocaine*poverty 1 • smoke*poverty 0 cocaine*smoke*poverty 0 • / exp e; • estimate “Cocaine, where smoke=No and poverty=No” • cocaine 1 smoke 0 poverty 0 cocaine*smoke 0 cocaine*poverty 0 • smoke*poverty 0 cocaine*smoke*poverty 0 • / exp e;