410 likes | 515 Views
Giving GRADE D recommendations – challenges and solutions. The New York Academy of Medicine Teaching Evidence Assimilation for Collaborative Healthcare New York, August 8 , 2013. Yngve Falck-Ytter , MD, AGAF for the GRADE team
E N D
Giving GRADED recommendations – challenges and solutions The New York Academy of Medicine Teaching Evidence Assimilation for Collaborative Healthcare New York, August 8, 2013 Yngve Falck-Ytter, MD, AGAF for the GRADE team Associate Professor, Case Western Reserve University, Case & VA Medical Center Chief, Gastroenterology & Hepatology, VA Medical Center, Cleveland
How did we make clinical decision? • If the basis was not evidence – what was it? • Expert recommendations
Institute of Medicine • March 2011 report: “Clinical Practice Guidelines We Can Trust” • Establishing transparency • Management of conflict of interest • Guideline development group composition • Evidence based on systematic reviews • Method for rating strength of recommendations • Articulation of recommendations • External review • Updating
Quality of CPG based on IOM criteria • 169 oncology CPGs evaluated (2005-2010) • 60% published after 2007 • Not a single CPG met all 8 IOM criteria Reams et al. Journal of Clinical Oncology 2013
Quality of CPG based on IOM criteria 6. Wording 7. Ext. review 1. Transparency 8. Updating 4. Based on SR 5. Rating recs 2. COI 3. Group composition
Before GRADE Source of evidence Grades of recomend. Level of evidence I SR, RCTs A II Cohort studies B III Case-control studies IV Case series C V Expert opinion D
Before GRADE Source of evidence Grades of recomend. Level of evidence Ia Ib Meta-analysis RCTs A II Cohort studies B III Case-control studies IV Case series C V Expert opinion D
So what is quality of evidence? • Confidence in evidence • Confidence in the evidence of benefits • Confidence in the evidence of downsides • Confidence in the evidence in the balance • Recognizing that not all outcomes are equal
Importance of outcomes Final health outcomes Mortality Liver cancer Liver cirrhosis Chronic hepatitis B infection Acute symptom. infection Question (PICO) Should health care worker receive booster vaccination vs. not? Intermediate outcomes Positive hepatitis B core antibody Amnestic response to re-challenge Loss of protective surface antibody
A grading system needs to be outcome-centric Outcome #1 Quality Outcome #2 Quality Outcome #3 Quality I B II V III Old system GRADE
Grades of Recommendations Assessment, Development and Evaluation
70+ Organizations 2008 2010 2006 2005 2007 2009 2011
Where GRADE fits in Prioritize problems, establish panel Find/appraise or prepare: Systematic review Searches, selection of studies, data collection and analysis (Re-) Assess the relative importance of outcomes Prepare evidence profile: Quality of evidence for each outcome and summary of findings GRADE Guidelines: Assess overall quality of evidence Decide direction and strength of recommendation Draft guideline Consult with stakeholders and / or external peer reviewer Disseminate guideline Implement the guideline and evaluate
GRADE expands quality of evidence determinants Inconsistency of results Risk of bias Failure of blinding Methodological limitations Incomplete reporting Indirectness of evidence Losses to follow-up Allocation concealment Imprecision of results Publication bias
GRADE: Quality of evidence For guidelines: The extent to which our confidence in anestimate of the treatment effect is adequate to support a particular recommendation. Although quality of evidence is a continuum, we suggest using 4 categories: • High • Moderate • Low • Very low
Determinants of quality • RCTs start high • Observational studies start low
Quality of evidence: beyond risk of bias Definition: The extent to which our confidence in an estimate of the treatment effect is adequate to support a particular recommendation Methodological limitations Inconsistency of results Indirectness of evidence Imprecision of results Publication bias Sources of indirectness: Indirect comparisons Patients Interventions Comparators Outcomes Risk of bias: Allocation concealment Blinding Intention-to-treat Follow-up Stopped early
Quality assessment criteria Lower if… Higher if… Quality of evidence Study design Study limitations (design and execution) High RCTs Observational studies Moderate Inconsistency What can raise the quality of evidence? Low Indirectness Very low Imprecision Publication bias
Question to the audience You review all colonoscopies for average risk colon cancer screening in your health system and document a percentage of patient who developed a perforation after the procedure (evidence of free air on imaging). No comparison group without colonoscopy available. Rate the quality of evidencefor the outcome perforation: • High • Moderate • Low • Very low
Quality assessment criteria Lower if… Higher if… Quality of evidence Study design Study limitations (design and execution) High RCTs Observational studies Large effect (e.g., RR 0.5) Very large effect (e.g., RR 0.2) Moderate Inconsistency Evidence of dose-response gradient Low Indirectness All plausible confounding… …would reduce a demonstrated effect …would suggest a spurious effect when results show no effect Very low Imprecision Publication bias
GRADE evidence profile: HCC associated with HCV eradication Morgan R, Baack B, Smith B, Yartel A, Pitasi M, Falck-Ytter Y. Ann Intern Med. 2013;158:329-337.
From evidence to recommendations RCT Obser-vational study Balance between benefits, harms & burdens Quality of evidence Patients’ values & preferences High level recommen-dation Lower level recommen-dation Old system GRADE
Values and preferences Implicit value judgments in recommendations Trade-offs: example prevention of VTE in surgery Thrombotic events Deep vein thrombosis, pulmonary embolism Bleeding events Gastrointestinal bleeds, operative site bleeds Inconvenience of injections Variability in values and preferences
Case • 77 y/o patient with atrial fibrillation, mild CHF, HTN, DM and history of stroke (fully recovered) • Meds: warfarin, antihypertensives, statin, glyburide • Admitted with nausea/vomiting, then hematemesis; INR 2.5; 1 U blood transfused; EGD: no active bleed, possible Mallory Weiss • This is his second major bleed since he started warfarin one year ago
Acceptable additional bleeds? • Study: Patients at high risk for atrial fibrillation and high risk of stroke (h/o CHF/MI); internists and cardiologists • Warfarin decreases risk at cost of increased GI bleeds • Without treatment 100 patients will suffer: • 12 strokes (six major, six minor), 3 serious GI bleeds in 2 years • Warfarin would decrease strokes in 100 patients to 4 per 2 years (8 fewer strokes, 4 major, minor) • How many additional bleeds would you accept in 100 patients over a year, and still be willing to administer/take warfarin? Slide courtesy of: G. Guyatt; Study: Devereaux et. al., 2001
Strength of recommendation “The strength of a recommendation reflects the extent to which we can, across the range of patients for whom the recommendations are intended, be confident that desirable effects of a management strategy outweigh undesirable effects.” • Understanding values & preferences necessary to trade-off benefits and downsides • Values and preferences should ideally be informed by systematic reviews, but evidence is often sparse
Example recommendation ACCP AT9 recommendation: In patients undergoing major orthopedic surgery (e.g., total hip replacement), we suggest the use of LMWH in preference to the other agents. Patients who place a high value on avoiding bleeding complications and a low value on its inconvenience are likely to choose a compression device (IPCD) over the drug options.
4 determinants of the strength of recommendation Factors that can weaken the strength of a recommendation Explanation • Lower quality evidence The higher the quality of evidence, the more likely is a strong recommendation. • Uncertainty about the balance of benefits versus harms and burdens The larger the difference between the desirable and undesirable consequences, the more likely a strong recommendation warranted. The smaller the net benefit and the lower certainty for that benefit, the more likely is a weak recommendation warranted. • Uncertainty or differences in patients’ values The greater the variability in values and preferences, or uncertainty in values and preferences, the more likely weak recommendation warranted. • Uncertainty about whether the net benefits are worth the costs The higher the costs of an intervention – that is, the more resources consumed – the less likely is a strong recommendation warranted.
Monthly cost of cancer drugs 11 of 12 cancer drugs approved by the FDA in 2012 cost more than $100,000 / year
Implications of a strong recommendation • Population: Most people in this situation would want the recommended course of action and only a small proportion would not • Health care workers: Most people should receive the recommended course of action • Policy makers: The recommendation can be adapted as a policy in most situations
Implications of a conditional recommendation • Population: The majority of people in this situation would want the recommended course of action, but many would not • Health care workers: Be prepared to help people to make a decision that is consistent with their own values/decision aids and shared decision making • Policy makers: There is a need for substantial debate and involvement of stakeholders
Create evidence profile with GRADEpro Summary of findings & estimate of effect for each outcome Guideline development Rate overall quality of evidence across outcomes based on lowest quality of critical outcomes Rate quality of evidence for each outcome Outcomes across studies Formulate question Rate importance Select outcomes RCT start high, obs. data start low Risk of bias Inconsistency Indirectness Imprecision Publication bias P I C O Outcome Critical High Outcome Critical Moderate Grade down Low Outcome Important Very low Outcome Less important Large effect Dose response Confounders Grade up Panel • Formulate recommendations: • For or against (direction) • Strong or weak (strength) • By considering: • Quality of evidence • Balance benefits/harms • Values and preferences • Revise if necessary by considering: • Resource use (cost) Systematic review • “We recommend using…” • “We suggest using…” • “We recommend against using…” • “We suggest against using…”
What GRADE isn’t • Not another “risk of bias” tool • Not a quantitative system (no scoring required) • Not eliminate COI, but able to minimize • Not “expensive” • Builds on well established principles of EBM • Some degree of training is needed for any system • Proportionally adds minimal amount of extra time to a systematic review
Summary • Using GRADE enables organizations to produce methodologically rigorous recommendations • It’s sensible, transparent, and systematic and fulfills requirements for use in performance measure production (e.g., NQF, PCPI) • International standardization facilitates direct comparisons across organizations and has the potential to reduce redundancy in efforts