1 / 37

How to Publish a Paper: Statistics and Writing Tips

How to Publish a Paper: Statistics and Writing Tips. Kristin Sainani, PhD, Stanford University. Outline. 1. Making a Data Analysis Plan 2. Analyzing Your Data 3. Creating Tables and Figures 4. Writing Up Your Findings. 1. Make a data analysis plan before analyzing the data.

Download Presentation

How to Publish a Paper: Statistics and Writing Tips

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. How to Publish a Paper: Statistics and Writing Tips Kristin Sainani, PhD, Stanford University

  2. Outline • 1. Making a Data Analysis Plan • 2. Analyzing Your Data • 3. Creating Tables and Figures • 4. Writing Up Your Findings

  3. 1. Make a data analysis plan before analyzing the data. • Why make a plan? • Makes your analysis easier and more efficient. • Ensures completeness. • Keeps your analysis focused on the main hypotheses of interest. • Helps you avoid “p-value shopping” (type I errors!) • Makes it easier to write a clear statistical methods section.

  4. The data analysis plan—explanatory aims • What goes in your plan? (explanatory goal) • How will I clean and check my data? • How will I deal with missing data? • How will I visualize my data? • How will I define my primary exposures and outcomes? • What descriptive analyses will best inform the primary hypothesis? • How will I test my primary hypothesis? • How will I visualize my primary hypothesis? • How will I approach multivariate model building? • What interactions/subgroup analyses will I be testing? • Will I be performing any sensitivity analyses? • Will I be performing any secondary/exploratory analyses?

  5. The data analysis plan—predictive aims • What goes in your plan? (predictive goal) • How will I clean and check my data? • How will I deal with missing data? • How will I visualize my data? • How will I reduce my set of candidate variables? • How will I select variables for the final prediction model? • How will I assess model performance? • How will I validate my final model? • How will I address clinical utility?

  6. 2. Tips for analyzing your data • Follow your statistical analysis plan. • Document all your steps. • Aim for reproducible research! • Use common sense. • Draw lots of pictures. • Focus on effect sizes, not p-values. • If it’s a controlled study, focus on between-group comparisons. • Calculate absolute risks, not just relative risks. • Ask for help from a statistician.

  7. First rules of statistics… Use common sense! Draw lots of pictures!

  8. What’s wrong with this? • Study with sample size of 10 (N=10) • Results: “Objective scoring by blinded investigators indicated that the treatment resulted in improvement in all (100%) of the subjects. Of patients showing overall improvement, 78% were graded as having either excellent or moderate improvement.”

  9. Take-home message? Do the three groups differ meaningfully in weight change over time? Reproduced with permission from: JAMA. 2010;303(12):1173-1179. doi:10.1001/jama.2010.312

  10. Omit extraneous p-values Table 2. Outcome volume for the experimental and standard groups; mean (SD). * p< .05 greater than the contralateral side What is the only comparison that matters here?

  11. For controlled studies, focus on between-group comparisons From the abstract: • Design: A double-blinded, randomized, controlled trial. • Results: The treatment group showed significant increases in walking speed and stride length after the intervention but showed no significant changes in peak hip extension or anterior pelvic tilt during comfortable and fast-paced walking. The treatment group also showed significantly increased passive hip extension range of motion. But the comparison that matters in an RCT is treatment vs. control! This isn’t reported in the paper.

  12. Table 1. Comfortable walking primary parameters (mean ± SD) pre- and postintervention

  13. Situations where you may need a statistician: • Correlated data • Repeated-measures • Two hands/two sides of the face/two knees/two legs from the same person • 1:1 matched cases and controls • Sparse data • Multiple hypotheses • Multivariate models

  14. 3. Tips for Tables and Figures Data presentation matters!

  15. Recommended order for writing an original manuscript • 1. Tables and Figures • 2. Results • 3. Methods • 4. Introduction • 5. Discussion • 6. Abstract

  16. Tables and Figures are the foundation of your story! Figures and tables should stand alone and tell a complete story. Each figure and table should have a take-home point. The reader should not need to refer back to the main text.

  17. Tables and Figures are the story! “An article about computational science in a scientific publication isn’t the scholarship itself, it’s merely advertising of the scholarship. The actual scholarship is the complete software development environment and the complete set of instructions which generated the figures.”—Jon Claerbout, Stanford

  18. Tips on Tables and Figures • Use the fewest figures and tables needed to tell the story. • Do not present the same data in both a figure and a table.

  19. Tables vs. Figures • Figures • Visual impact • Show trends and patterns • Tell a quick story • Tell the whole story • Highlight a particular result • Tables • Give precise values • Display many values/variables

  20. Three horizontal lines Example table: Table 1. Descriptive characteristics of the study groups, means ± SD or N (%). *p<.05, ttest or Fisher’s exact test, as appropriate.

  21. Example table: Table 1. Descriptive characteristics of the study groups, means ± SD or N (%). *p<.05, ttest or Fisher’s exact test, as appropriate.

  22. What not to do! Table 1. Descriptive characteristics of the study groups, means ± SD or N (%). Remove grid lines! *p<.05, ttest or Fisher’s exact test, as appropriate.

  23. What not to do! Table 1. Descriptive characteristics of the study groups, means ± SD or N (%). Make sure everything lines up and looks professional! *p<.05, ttest or Fisher’s exact test, as appropriate.

  24. Use a reasonable number of significant figures. What not to do! Table 1. Descriptive characteristics of the study groups, means ± SD or N (%). *p<.05, ttest or Fisher’s exact test, as appropriate.

  25. What not to do! Give units! Table 1. Descriptive characteristics of the study groups, means ± SD or N (%). *p<.05, ttest or Fisher’s exact test, as appropriate.

  26. Omit unnecessary columns! What not to do! Table 1. Descriptive characteristics overall and by group, means ± SD or N (%), and p-values for the comparison between the groups.

  27. Types of Figures • Primary evidence • electron micrographs, gels, photographs, pathology slides, X-rays, etc. • indicates data quality • “Seeing is believing” • Graphs • line graphs, bar graphs, scatter plots, histograms, boxplots, etc. • Drawings and diagrams • illustrate an experimental set-up or work-flow • indicate flow of participants • illustrate cause and effect relationships or cycles • give a hypothetical model • represent microscopic particles or microorganisms as cartoons

  28. Types of graphs: • line graphs • scatter plots • bar graphs • individual-value bar graphs • histograms • box plots • survival curves

  29. 4. Writing up your findings • Tips for writing the Results section • Tips for writing in general!

  30. Results ≠ Raw Data • The results section should: • Summarize what the data show • Point out simple relationships • Describe big-picture trends • Cite figures or tables that present supporting data • Avoid simply repeating the numbers that are already available in tables and figures.

  31. The characteristics of the bad witches and the good witches are shown in Table 1. There was a significant difference in age between the groups. The mean age of the bad witches was 45 ± 5; and the mean age of the good witches was 36 ± 6. There was no significant difference in gender between the groups, with the bad witches having 85% females and the good witches having 83% females. BMI was not significantly different between the groups, which both had normal BMIs. Systolic blood pressure and exercise were significantly different. The bad witches had a mean blood pressure of 140 ± 10, whereas the good witches had a mean blood pressure of 120 ± 9. Estimated daily exercise was higher in the good witches (60 ± 30) than the bad witches (30 ± 20). Employment was not significantly different between the two groups… Hypothetical Example

  32. Original: The characteristics of the bad witches and the good witches are shown in Table 1. There was a significant difference in age between the groups. The mean age of the bad witches was 45 ± 5; and the mean age of the good witches was 36 ± 6. There was no significant difference in gender between the groups, with the bad witches having 85% females and the good witches having 83% females. BMI was not significantly different between the groups, which both had normal BMIs. Systolic blood pressure and exercise were significantly different. The bad witches had a mean blood pressure of 140 ± 10, whereas the good witches had a mean blood pressure of 120 ± 9. Estimated daily exercise was higher in the good witches (60 ± 30) than the bad witches (30 ± 20). Employment was not significantly different between the two groups… Revised: The witches were, on average, lean and predominantly female (Table 1). Bad witches were significantly older, had higher blood pressures, exercised less, and were more likely to smoke than good witches. More bad witches were unemployed, but this difference did not reach statistical significance. Edited version…

  33. What verb tense do I use? *Use past tense for completed actions: We found that… Women weremore likely to… Men smoked more cigarettes than… The average reaction time was… *Use the present tense for assertions that continue to be true, such as what the tables show, what you believe, and what the data suggest: Figure 1 shows… The findings confirm… The data suggest… We believe that this shows…

  34. Say what you mean clearly and succinctly! Example: “This paper provides a review of the basic tenets of cancer biology study design, using as examples studies that illustrate the methodologic challenges or that demonstrate successful solutions to the difficulties inherent in biological research.” s and “This paper reviews cancer biology study design, using examples that illustrate specific challenges and solutions.”

  35. Say what you mean clearly and succinctly! Example: “As it is well known, increased athletic activity has been related to a profile of lower cardiovascular risk, lower blood pressure levels, and improved muscular and cardio-respiratory performance.”  is associated with I fitness. “Increased athletic activity is associated with lower cardiovascular risk, lower blood pressure, and improved fitness.” “Increased athletic activity lowers cardiovascular risk and blood pressure, and improves fitness.” (stronger level of evidence)

  36. Good references on manuscript writing • Clinical Chemistry Guide to Scientific Writing: http://www.aacc.org/publications/clin_chem/ccgsw/Pages/default.aspx# • Mimi Zeiger. Essentials of Writing Biomedical Research Papers, McGraw Hill Professional

  37. For more training on writing or statistics, see my MOOCs: My Massive Open Online Courses (MOOCs): • Writing in the Sciences: https://class.stanford.edu/courses/Medicine/Sci-Write/Fall2014/about • Statistics in Medicine: https://class.stanford.edu/courses/Medicine/MedStats/Summer2014/about

More Related