470 likes | 698 Views
National Naval Medical Center Directorate for Professional Education Clinical Investigation Department. RESEARCH COURSE Statistical Data Analysis and Scientific Research Dr. Francois O. Tuamokumo, Mathematical Statistician , Department of Research Programs,
E N D
National Naval Medical CenterDirectorate for Professional EducationClinical Investigation Department RESEARCH COURSE Statistical Data Analysis and Scientific Research Dr. Francois O. Tuamokumo, Mathematical Statistician , Department of Research Programs, WRNMMC, Bethesda, MD “The National Naval Medical Center is an approved provider of continuing nursing education by the Navy Medicine Manpower, Personnel, Training and Education Command, an accredited approver by the American Nurses Credentialing Center’s Commission on Accreditation.”
Disclosure Statement This CE/CME activitydoes not have commercial support,and has no conflicts of interest. Research Course
DATA MANAGEMENT: INTRODUCTIONBy design of a study we mean planning the study in such a way that appropriate data can be collected and analyzed. Research Course
DATA are measurements collected on some characteristics called variables. • VARIABLES are the characteristics on which measurements are made Research Course
TYPES OF DATA • Qualitative data • Quantitative data • Qualitative data are categories. example: Gender (male, female) Stage of cancer (I, II, III, IV) • Quantitative data are numbers. example: Age, height, weight # of first trimester visits Research Course
Conclusion Methods of analysis depend on the data. • Data Management & Quality Assurance Access Excel Minitab BMDP SPSS STATA SAS Research Course
Identifying & resolving outliers • Identifying & and resolving missing data • Identifying duplicate records • Data Dictionary VariableAbbreviation Identification Code ID Low Birth Weight LoBrtWt Research Course
(0 = Birth Weight ≥ 2500g, 1 = Birth Weight < 2500g) Age of mother in years Age Weight in pounds at last wtLMst menstrual period Race (1=White, 2=black, Race 3=Hispanic, 4=other) Smoking status during Smoke Research Course
pregnancy (1=yes, 0=no) History of premature labor Prmtrlbo (0=none,1=one, 2=two, etc) History of hypertension Hptnsion (1= yes, 0 = no) Research Course
Presence of uterine Utrnirrt irritability (1=yes, 0= no) Number of physician visits visits during first trimester (0 = none, 1=one, 2=two, etc) Birth weight in grams brtwt Research Course
Some Considerations in Research • What are the variables of interest on which data will be collected? • What are the testable research questions of interest? • Are these questions clearly, concisely, and completely stated? Research Course
Objectives and Methods of Analysis • Objective: The effect of dose (two levels 250, 500mg) on pain status(relief/no relief); 2 controlling, for previous exposure (yes/no), gender (male/female); 3 age, bmi • Analysis Methods (vary) • Pearson’s chi-square (1) • Mantel-Haenszel Chi square (1,2) • Logistic regression (1, 2, 3)
Objective: The effect of dose (250, 500, 750mg) on pain status(quantified). • Analysis Method • independent variable - categorical • dependent variable - continuous • DO ANOVA • The effect of the type of medication (A, B, C, D) on cholesterol level
Objective: The effect of the number of miles run per day on weight loss Analysis method • independent variable - numerical • dependent variable - continuous • DO REGRESSION ANALYSIS
What is the purpose of the study? a. Descriptive b. Hypothesis testing, or c. Modeling • Descriptive study: To estimate a population parameter. • Ex: meanarterial blood pressure. proportion (percent) with improved respiratory outcome Research Course
Provide a 95% confidence interval for the estimates. • Confidence Interval: An interval over which the true value is expected to lie. Confidence Interval for: 1. population mean 2. population proportion Research Course
How large a sample do I need? Answer: Depends on type of study A. Estimation B. Testing Hypothesis A. Estimation of population mean, μ and population proportion, p Research Course
Example • A hospital administrator wishes to estimate the mean weight of babies born in her hospital. How large a sample of birth records should be taken if she wants to be 95% confident that the sample mean weight will be within 0.50 pound of the true mean weight of all babies born in her hospital? Research Course
Assume that a reasonable estimate of σ is 1 pound. Using the formula, Research Course
HYPOTHESES TESTING AND SAMPLE SIZE AN HYPOTHESIS IS AN ASSERTION ABOUT A POPULATION PARAMETER, SUCH AS THE POPULATION MEAN OR THE POPULATION PROPORTION. Research Course
TWO TYPES OF HYPOTHESES: NULL ALTERNATIVE THE RESEARCHER WISHES TO DISCREDIT THE NULL STATEMENT. Research Course
ERRORS IN HYPOTHESIS TESTING TYPE I ERROR TYPE II ERROR • TYPE I ERROR: REJECTING THE NULL HYPOTHESIS WHEN IT IS TRUE • TYPE II ERROR: ACCEPTING THE NULL HYPOTHESIS WHEN IT IS FALSE Research Course
PROBABILITY OF COMMITTING THESE ERRORS: ALPHA AND BETA Research Course
P-value • It is the smallest significance level for which the null hypothesis is rejected. • Compare it to level of significance, α (normally, .05) Research Course
II. SAMPLE SIZE FOR COMPARISON OF TWO GROUPS: • DATA TYPE: A. NUMERICAL DEPENDENT VARIABLE Research Course
PROBLEM:THE RESEARCH QUESTION IS WHETHER THERE IS A DIFFERENCE IN THE EFFICACY OF SALBUTAMOL AND IPRATROPIUM BROMIDE FOR THE TREATMENT OF ASTHMA. • DESIGN: RANDOMIZED TRIAL TO DETERMINE THE EFFECT OF THESE DRUGS ON FEV1 (FORCED EXPIRATORY VOLUME IN 1 SECOND) AFTER 1 WEEK OF TREATMENT. Research Course
ANALYSIS: DIFFERENCES IN MEANS • TEST: t-TEST • SPECIFICATIONS: 1. NULL AND ALTERNATIVE HYPOTHESES: NULL: MEAN FEV1 AFTER ONE WEEK OF TREATMENT IS THE SAME IN ASTHMATIC PATIENTS TREATED WITH SALBUTAMOL AS IN THOSE TREATED WITH IPRATROPIUM BROMIDE. ALTERNATIVE (2-SIDED): Research Course
2. MEAN FEV1 = 2 LITERS STD. DEVIATION = 1 LITER - IPRATROPIUM(LITERATURE) 3. EFFECT SIZE: = 0.2 LITERS (10% * 2) STANDARDIZED EFFECT SIZE = (EFFECT SIZE / STD.DEV.) = 0.2 LITERS Research Course
4. LEVEL OF SIGNIFICANCE = .05 POWER = .80 THUS SAMPLE SIZE PER GROUP = 393 Research Course
EXISTENCE OF HIGH IN-BETWEEN VARIABILITY AMONGST OBSERVATIONS • DESIGN: RANDOMIZED TRIAL • ANALYSIS: PRE-POST CHANGES • TEST: t-TEST Research Course
1. HYPOTHESES H0: CHANGE IN MEAN FEV1s ARE THE SAME HA: CHANGE IN MEAN FEV1s ARE DIFFERENT 2. STANDARD DEVIATION OF THE CHANGE = 0.25 (FROM PILOT) 3.EFFECT SIZE = 0.2 LITERS STANDARDIZED EFFECT SIZE = .80 4. LEVEL OF SIGNIFICANCE = .05, POWER = .80 FROM FORMULA, n = 25 PER GROUP Research Course
DATA TYPE - • B. CATEGORICAL (BINARY) DEPENDENT VARIABLE • EXAMPLE: PROPORTION OF MEN WHO DEVELOP CORONARY HEART DISEASE (CHD) WHILE TREATED WITH ASPIRIN COMPARED WITH THE PROPORTION WHO DEVELOP CHD WHILE TAKING A PLACEBO Research Course
SPECIFICATION REQUIREMENTS: • EFFECT SIZE IS SPECIFIED BY SPECIFYING P1 AND P2 TYPE OF STUDIES: A. COHORT STUDIES: P1AND P2 ARE PROPORTIONS OF SUBJECTS EXPECTED TO HAVE THE OUTCOME IN THE TWO GROUPS. 2. STATE THE NULL AND ALTERNATIVE HYPOTHESES. 3. SET ALPHA AND BETA. Research Course
PROBLEM: THE RESEARCH QUESTION IS WHETHER ELDERLY SMOKERSHAVE GREATER INCIDENCE OF SKIN CANCER THAN ELDERLY NONSMOKERS - COHORT Research Course
EXAMPLE: HOW MANY SMOKERS AND NONSMOKERS WILL NEED TO BE STUDIED TO DETERMINE WHETHER THE 5-YEAR SKIN CANCER INCIDENCE IS AT LEAST 30% IN SMOKERS? 1.H0: THE INCIDENCE IS THE SAME HA: THE INCIDENCE IS DIFFERENT 2. 5-YEAR INCIDENCE OF SKIN CANCER IS ABOUT 20% IN NONSMOKERS – LITERATURE REVIEW. 3. ALPHA = 0.05 AND POWER = 0.80 n = 313, FOR A TWO-SIDED HA n = 250, FOR A ONE-SIDED HA ** ABOVE PROBLEM MAY BE STATED IN FORM OF RELATIVE RISK. Research Course
EXAMPLE: AN INVESTIGATOR IS INTERESTED IN WHETHER WOMEN WHO USE ORAL CONTRACEPTIVES ARE AT A MUCH HIGHER RISK OF HAVING A MYOCARDIAL INFARCTION WHEN COMPARED TO NON-USERS (PROSPECTIVE) Research Course
B. CASE-CONTROL STUDY: • SPECIFICATION REQUIREMENTS: • 1. THE ODDS RATIO TO BE DETECTED IN THE CASE GROUP • 2. P2: THE PROPORTION OF CONTROLS EXPOSED TO THE PREDICTOR VARIABLE Research Course
WHERE,P1IS THE PROPORTION OF CASES EXPOSED TO THE PREDICTOR VARIABLE Research Course
EXAMPLE: 1. EXPECTS THAT 10% OF CONTROLS WILL BE EXPOSED TO ORAL CONTRACEPTIVES (P2) 2. WISHES TO DETECT AN ODDS RATIO OF 3 ASSOCIATED WITH THE EXPOSURE FROM FORMULA, P1 = 0.25 HENCE, FOR A TWO-SIDED HYPOTHESIS, n = 112 PER GROUP Research Course
Questions ??? Research Course
Thank You! My contact Information: Dr. Francois O. Tuamokumo Phone: (301) 319 8788 Email: francois.tuamokumo@med.navy.mil Research Course