1 / 33

Microarray Design and Analysis

Microarray Design and Analysis. Jeremy D. Glasner. Genetics 875 November 20, 2007. What is a Microarray?. A collection of DNA sequences arrayed on a solid substrate usually with thousands of individual DNA spots. Gene expression analysis. Massively parallel biochemistry aimed

terrel
Download Presentation

Microarray Design and Analysis

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Microarray Design and Analysis Jeremy D. Glasner Genetics 875 November 20, 2007

  2. What is a Microarray? A collection of DNA sequences arrayed on a solid substrate usually with thousands of individual DNA spots

  3. Gene expression analysis Massively parallel biochemistry aimed at measuring RNA levels

  4. Why do gene expression analysis? • Predict new gene functions by expression patterns • Determine the effect of a drug on gene expression • Compare a mutant strain to wild-type • Identify an expression signature for a cancer type • Find the targets of a transcriptional regulator • Measure the half-lives of RNAs in the cell . . .

  5. Typical gene expression experiment

  6. TraSH: transposon site hybridization Genome wide localization of insertion mutations Sassetti CM, Boyd DH, Rubin EJ. Proc Natl Acad Sci U S A. 2001 98(22):12712-7

  7. CGH: Comparative genome hybridization Rajashekara G, Glasner JD, Glover DA, Splitter GA. J Bacteriol. 2004 186(15):5040-51.

  8. ChIP-Chip: Chromatin immunoprecipitation, chip hybridization a.k.a. genome-wide occupancy profiling Identify the chromosomal locations of a DNA binding protein

  9. Flow of Information in Array Analyses Experimental Design Array Production Sample Preparation Scanning Image Analysis Data Processing Data Analysis Information Integration

  10. Experimental Design Issues Number of Replicates What samples should be compared? Directly on same chip/across arrays? Calibrators and common references What controls are necessary?

  11. Considerations when designing the sequences for a chip Sequence Annotation ORF, UTR, functional RNA prediction Oligo Selection Array Design, Replication

  12. Sensitivity vs. specificity as a function of oligo length They are inversely related Hughes TR, et al., Nat Biotechnol 2001. Apr;19(4):342-7.

  13. Two methods for array production photolithography spotting

  14. Affymetrix Chips 105-106 “Probes” Perfect Match and Mismatch Average Difference Values

  15. Affymetrix “Units” PM MM A “probe set” A “probe pair”

  16. DMD “The Digital Light Switch” DMD Close-Up • Mirrors spacing 17 um • Mirror transit time <20 us • Tilt angle +10 degrees • Five mirrors = diameter human hair • Analog pictures from digital switches?

  17. NimbleChip Technology

  18. Flexibility of NimbleGen Arrays

  19. Sample Preparation RNA samples are extracted for the experiment and fluorescent dyes are incorporated RNA stabilization Direct vs. indirect labeling

  20. Hybridization & Scanning PMT settings Lasers Focus Data Tracking

  21. Image Analysis Spot Finding Background subtraction Intensity Calculation

  22. Automatic Grid Finding Sum signal intensities in X and Y directions

  23. Estimating Foreground and Background with the “Fixed Circle” Method

  24. Estimating Foreground and Background with the “Histogram” Method # pixels Intensity

  25. Quality Filtering of Data From Tseng et al., 2001. NAR 29(12):2549-2557

  26. Data Normalization Data Normalization is necessary if the overall signal differs between experiments and can be complicated if the relationship is nonlinear. Internal controls can also be used for normalization. Data from Schadt et al 2001. Journal of Cellular Biochemistry Supplement 37:120-125.

  27. M-A Plots

  28. Print Tip Group Normalization

  29. Intensity Dependent Normalization

  30. Normalization Methods Assume linear relationship Apply non-linear normalization Normalize to “house-keeping genes” Normalize to internal Standards

  31. Ratio Calculation Methods

  32. Detecting differential expression Determine which changes are significant: Fixed cutoff (fold-change>4) Replication allows assessment of variability Common statistics such as the t-test are often used for gene expression data. Significance of the value is then determined by referring to the t distribution. This assumes that the data is normally distributed, which may not be true. Gene expression experiments may require thousands of statistical tests and significance should be adjusted to reflect this. A standard Bonferroni correction is the p-value multiplied by the number of tests but is likely too conservative.

  33. Different methods, different results Millenaar et al., BMC Bioinformatics2006, 7:137

More Related