500 likes | 590 Views
What’s EPB? What’s so good about RCTs? (And what are we reading?). Philosophy of Social Science Phil 152 Winter 2011 Week 8. A theory of evidence for use. Foundation for a guide by C&H for the use of evidence in evaluating policy effectiveness. The (C&H) guide should be –
E N D
What’s EPB? What’s so good about RCTs?(And what are we reading?) Philosophy of Social Science Phil 152 Winter 2011 Week 8
A theory of evidence for use • Foundation for a guide by C&H for the use of evidence in evaluating policy effectiveness. • The (C&H) guide should be – • Simultaneously well-groundedand practicable. • More comprehensive than those currently available • For policy-makers not expert in natural and social science with limited amounts of time and resources.
The question of effectiveness Will the proposed policy produce the targeted outcomes were it to be implemented in the targeted setting in the way it would in fact be implemented there? So, what’s evidence – good evidence – for answers?
The rise of evidence-based policy In the UK, USA and increasingly in Europe we see a huge drive • to useevidence to inform policy and practice • mandated by executive and legislative branches • at international, national and local levels • pushed by national and international organisations like the Campbell and Cochrane Collaborations • with accompanying institutions and regulations to ensure evidence is appropriately considered.
Some guides for use of evidence to judge policy effectiveness • IARC: International Agency for Research on Cancer • SIGN: Scottish Intercollegiate Guidelines Network • What Works Clearinghouse • USEPA: US Environmental Protection Agency • CEPA: Canadian Environmental Protection Act • Cochrane Collaboration • Oxford Centre for Evidence-Based Medicine • Daubert decision!! (US Supreme Court)
For an evidence-based policy… We want • Evidence of high quality • That speaks for the policy.
High Quality Evidence We do not want to build an argument for a policy on shaky premises.
High Relevance Evidence No matter how sturdy this foundation: It won’t support these houses:
____________________________________________ The Weight of Argument When do symmetry arguments provide the best evidence? When are detailed calculations found more convincing? When does modeling appear conclusive? For how much does societal acceptance account? _____________________________________________ High Relevance v High Quality Evidence (From Martin Harwit, lecture, 12 Nov 2007, IAS Durham. Ital added.) • not QUALITY: Thelikely truthof evidence. • but RELEVANCE: Evidence that supports the conclusion.
High Quality Evidence • We want to admit as evidence only claims that are true, or highly probable: P(e) is high. • Ranking schemes rank evidence according to the method by which it is produced. Top-ranked evidence is produced by methods that make it likely that the result is true: P(e) is high.
Evidence-ranking Schemes: SIGNS (Scottish Intercollegiate Guidelines Network)
The RCT RCTs establish causal conclusions: • In Tennessee reducing class size caused better reading scores. • In two Zambian hospitals introducing tri-co… increased survival rates in HIV-positive children. If properly done, they guarantee • That there wasthe difference indicated. • That it was causedin the way indicated (and not, e.g., by accident or by some other factor introduced at the same time). The method itself makes it very probable that the resulting conclusion is true.
What’s an RCT for ‘T causes O’? • A Mill’s method-of-difference study: • 2 groups where all causal factors for O are distributed the same except for T and it’s downstream effects. • T is universally present in the treatment group, universally absent in the control. • If ProbT(O) > ProbC(O), T must be responsible for the difference. • SO: T was a contributing cause towards O in at least some members of study population.
What’s special about RCTs? • They clinchcausal conclusions: If the assumptions for the study are met and ProbT(O) > ProbC(O), it follows deductively that T causes O in some study members. • They are self-validating: • Blinding (quadruple maybe) • Random assignment • Placebo control aim to make the distribution of other factors the same in treatment and control groups.
Other methods… • Can be clinchers but none are self validating: they require substance-specific assumptions. The assumptions of the RCTs are supposedly all based on method. • Econometric modelling • Process tracing • Derivation from theory • Some methods merely suggest a conclusion or vouch for it. • Causal relations from analogue systems • ‘Observational’ studies’. • Standard advice guides tell you to ignore other clinchers and vouchers.
Daft advice HARD WON KNOWLEDGE
The RCT • The RCT can produce high quality claims – claims we have good reason to judge true – that we may adduce as evidence. • But what are they evidence for? • That’s our second requirement. We want high quality claims that speak for the policy. Showing that a claim is very likely true goes no wayto showing that it is relevant to the truth of our policy hypotheses.
Turn now to Relevance How do we decide which evidence supports which conclusions?
US Dept of Education Website • Strong evidence for your policy = two or more high quality RCTs in ‘settings similar to that of your schools/classrooms’. • Later elaboration adds 4 lines – trials on white suburban populations do not constitute strong evidence for large inner city schools serving primarily minority students.
A At least one meta analysis, systematic review, or RCT rated as 1++, and directly applicable to the target population; orA systematic review of RCTs or a body of evidence consisting principally of studies rated as 1+, directly applicable to the target population, and demonstrating overall consistency of results B A body of evidence including studies rated as 2++, directly applicable to the target population, and demonstrating overall consistency of results; orExtrapolated evidence from studies rated as 1++ or 1+ C A body of evidence including studies rated as 2+, directly applicable to the target population and demonstrating overall consistency of results; orExtrapolated evidence from studies rated as 2++ D Evidence level 3 or 4; orExtrapolated evidence from studies rated as 2+ SIGNS Grades of recommendation
Practical advice about relevance??? So—look for high quality evidence (1++) that is ‘directly applicable to the target population’!!!! For better advice read Cartwright & Hardie
RCTs cannot hand evidentiary support directly to effectiveness claims
The Theory and the Cakes it Calls For Philosophy 152 Philosophy of Social Science Week 9 2011
Causes are INUS conditions INUScondition: anInsufficientbut Necessary part of an Unnecessary but Sufficient condition.
Unnecessary • There’s more than one way to skin a cat. • Many different causes can produce contributions to the same effect. No single one of them is necessary to get a contribution. • Some will be negative. • California class-size reduction programme • Important side lesson: Beware what you do in implementation.
Sufficient • Sufficient = enough. • A bundle of factors is sufficient for a contribution means that you don’t need to add any more to get a contribution. • Notice I say ‘bundle’. This brings us to PARTS of unnecessary but sufficient conditions. And to causal cakes.
Cakes and their ingredients It’s like making pancakes. Given the 3 base ingredients – flour, milk and eggs – baking powder can turn the whole mix into pancakes. +=
But you can’t make pancakes without flour, without eggs or without milk no matter how much baking powder you pour in the bowl. + ≠
Necessary but Insufficient parts • Each causal factor in a bundle that makes up a sufficient condition is necessary – if one casual factor is missing you get no contribution at all. • But each is insufficient by itself. You need all of them together to get any contribution at all.
Smoking causes lung cancer, but not all smokers develop lung cancer. Genetic and environmental factors contribute. Sufficient Cause A is a constellation of factors, including smoking, that together cause lung cancer; smoking is C3. But people develop lung cancer without smoking. Sufficient Cause B is the constellation of factors, not including smoking (no C3), that together cause lung cancer. Working in a coal mine is C8. Sufficient Cause A. Sufficient Cause B.
Homework is one INUS condition contributing to higher test scores. Other conditions are necessary to ensure that homework is maximally effective. These include student motivation and student ability, having access to a proper study space, a supportive family, getting a consistent message from teachers and parents, and receiving teacher feedback on assignments Homework & test scores
Other practices might achieve the same outcome as assigning homework These would be different pies altogether. For example, directed in-class tutorials, while resource intensive, could achieve the aim of higher test scores We could depict more pies for the other factors that we expect to affect test scores (like smaller class sizes).
Why fuss about INUS conditions? They remind us that -- • There are usually a number of distinct causal bundles (cakes) that contribute separately to the effect. • Other factors are necessary along with the policy variable if the policy is to have any effect at all.
Our theory of evidence for EBP • Focuses on relevance: What kinds of facts matter to the truth of an effectiveness prediction? • Has two simple ideas.
Three presuppositions • Law-governedness: Outcomes are produced under the governance of causal principles. • Analyticity: • Causal principles allow that different distinguishable sources can contribute to the same outcome, some positively, some negatively. • So a causal principle for O in S dictates • what factors can contribute – positively or negatively, to O in S • what the form of the contribution is • what it takes for that contribution to occur. 3. Causes are INUS conditions.
So causal principles can be represented like this: O c= C11C12…C1n + … + Cm1Cm2…Cmr − P11P12…P1s − … − Pt1Pt2…Ptu. where the meaning of ‘+’ can vary. • When the focus is on a specific treatment/ programme T, this is often shortened to O c= βT + W. • Caution: β is not a constant. [Causes are INUS conditions.] β represents the support team without which T contributes nothing.
You need all those factors without which the policy variable cannot act. The support team
Theory claim 1 T as implemented will contribute positively to O in S iff • There is a causal principle that holds in S from implementation till time of outcome in which T figures as a cause of O. • All the factors obtain that are required in that principle for T to contribute to O obtain in S at the required times.
Theory claim 2 The facts relevant for predicting ‘T will contribute positively to the production of O in S’ are those that must obtain if this claim is to be true. So • Is there a causal principle that holds in S from implementation till time of outcome in which T figures as a cause of O? • Do all the factors obtain in S that are required in that principle for T to contribute to O and at the right times? These are the two kinds of facts that are directly relevant to predicting effectiveness. Other facts are relevant only if they support these.
Two cautions • ‘T will make a positive contribution’ does not tell us what will actually happen. What happens depends on the totality of causal factors present. • Both positive and negative. • The magnet may contribute but the final effect may not be what one hopes for. • Though better than what would happen otherwise. • Watch out for what happens when you implement. Your efforts might make things worse. • California class-size reduction : Implementing a positive cause introduced a strong negative cause as well.
So what can RCTs do for you? Recall Theory claim 2: The facts relevant for predicting ‘T will contribute positively to the production of O in S’ are those that must obtain if this claim is to be true. So • Is there a causal principle that holds in S from implementation till time of outcome in which T figures as a cause of O? • Do all the factors obtain in S that are required in that principle for T to contribute to O and at the right times? These are the two kinds of facts that are directly relevant to predicting effectiveness. Other facts are relevant only if they support these.
So what can RCTs do for you? They are at best conditionallyindirectly relevant. • They support [very well] the claim that there is a causal principle L for the study situation X in which T figures as a cause of O. • They are relevant to ‘T figures as a cause of O in S’ only conditional on separate support that L is shared with S. • They provide no support for the occurrence of the requisite support team in S. • They are relevant to S only conditional on separate support that the requisite support team occurs in S (and does so postimplementation).
Three kinds of causal claim • It-works-somewhere claims: T contributes positively to O somewhere under some conditions (e.g. in study population X, administered by method M). • ‘General’ claims: T contributes positively to O ‘widely’. • Effectiveness claims: T would contribute positively to O in S administered as it would be administered in S given policy P.
Hierarchy of evidential support Effectiveness claim about S Presence of support team in S ‘General’ claim It-works-somewhere claim ?? ?? ?? ?? R C T R C T ? ? ? ? ? ? ?
Finding ingredients for the cake • Tell a causal roll-forward story from T to O. • How is T to produce O step-by-step? • This will help identify the factors needed at each step to get the next. • Taken all together these constitute the requisite support team. • Imagine the policy has failed and try to explain why. • Simple decision tree: Try to think of the most salient features without which the policy won’t work. If you don’t have/can’t obtain those, you don’t need to think further.
Finding ingredients for the cake • Tell a causal roll-forward story from T to O. • How is T to produce O step-by-step? • This will help identify the factors needed at each step to get the next step. • Taken all together these constitute the requisite support team. • Imagine the policy has failed and try to explain why. • Simple decision tree: Try to think of the most salient features without which the policy won’t work. If you don’t have/can’t obtain those, you don’t need to think further.