260 likes | 540 Views
Lecture 7. Functional Genomics: Gene Expression Profiling using DNA microarrays. Functional Genomics: Development and Application of Genome-Wide Experimental Approaches to Assess Gene Function by making use of the information and reagents provided by Structural Genomics.
E N D
Lecture 7. Functional Genomics: Gene Expression Profiling using DNA microarrays
Functional Genomics: Development and Application of Genome-Wide Experimental Approaches to Assess Gene Function by making use of the information and reagents provided by Structural Genomics
Goals of Functional Genomics: 1)DNA 2)RNA 3) Protein 4) Whole organism 5) Society Lander, E. 1996. The New Genomics: Global Views of Biology. Science 274: 536-539.
2. RNA Simultaneous monitoring of the expression of all genes EG: What do gene expression patterns look like in tumor vs. normal cells? What about following chemotherapy? Will reveal Regulatory Networks
Gene Expression Profiling Experimental Strategies for Analyzing Gene Expression Patterns: 1) Sequencing of cDNA Either complete cDNAs or partial cDNAs (Serial Analysis of Gene Expression {SAGE} technique) 2) DNA Microarrays-"Chip" technologies a) oligonucleotides synthesized in situ on glass slides using light directed combinatorial chemistry b) "printing" of cDNA onto glass slides Two things to consider: -the chip -how to label the RNA for hybridization to the chip
A. Oligonucleotide chip Multiple Oligonucleotides synthesized in situ on glass slides using light directed combinatorial chemistry
Oligonucleotide chip: 10-20 25mer oligos represent each mRNA Control: same oligos with a one bp mismatch @ center
Example: Simultaneous monitoring of gene expression in yeast grown in rich media or after starvation. Wodicka, Dong, Mittmann, Ho, and Lockhart. 1997 Genome-wide expression monitoring in S. cerevisiae. Nature Biotech. 15: 1359-1367. 1) 1.28 X 1.28 cm chip contains 65,000 independent 25-mer oligonucleotides. ~70,000 copies of each individual oligo per slot on the chip 2) 6200 yeast gene each represented by 20 different oligonucleotides 6200 X 20= 128,000 oligos 3) Control- For each oligo, also synthesize a second oligo having one base mismatch (center base in the 25 mer) Total of 260,000 oligos on 4 chips: total is about 1 sq. in.
RNA from yeast cells: starved In rich media
B. cDNA microarrays a) PCR amplify individual cDNA clones (500-1000 bp) displayed in 96 well dishes; b) stamp each well on a glass slide (robot); ~1 ul of DNA per spot. Denature and bake the DNA onto the glass slide c) label test and control RNA with different fluorescent markers d) hybridize to the chip; detect hybridization by color of fluorescence
Density of microarray is not as great; however cDNA is longer and hybridization more efficient than oligonucleotide hybridization (remember: hybridization is COOPERATIVE). EM of cDNA Microarray
Alternative to PCR Amplified sequences: Long (70-80) Oligonucleotides Spotted in the Same Manner
Detection Limit can be INCREASED depending on how RNA is labeled
DATA ANALYSIS In a typical experiment, several hundred-or even several thousand- genes might change expression pattern when two conditions are compared! How do you make sense of this massive amount of data? Cluster Analysis Tamayko et al. 1999. Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. PNAS 96:2907-2912 SELF ORGANIZING MAPS (SOMS) Mathematical technique for identifying underlying patterns in complex data arrays. Essentially clusters data points in multidimensional space. SOMS impose structure on a data set, clustering like data in “nodes”. GENECLUSTER: program developed to produce SOMS from microarray data:and available from these authors
Example: An oligonucleotide microarray containing 6416 genes (5223 known, 1193 unknown ESTs) was used to monitor gene in HL60 cell line induced to differentiate into macrophage-like cells by phorbol ester (PMA) treatment: Cells were treated for 0, 0.5, 6 or 24 hrs with PMA, RNA extracted for each treatment, and used to make cRNA. The analysis indicated that the expression of 567 genes varied by more than 4-fold.
4X3 SOM of the HL60 data Node Related nodes are closer together in the SOM
More complicated example: differentiation of four cell lines was studied by microarray analysis: two that differentiate into macrophages (HL60 &U937); one into neutrophils (NB4), and one into activated T-cells (Jurkat) 1,036 genes varied in the analysis 6X4 SOM representing the data
HIERARCHICAL CLUSTERING Relationships among objects (genes) are represented by a tree whose branch lengths reflect the degree of similarity between the objects, as assessed by a pairwise similarity function. In sequence comparison, these methods are used to infer the evolutionary history of sequences being compared.
For analysis of gene profiling data such methods are useful in their ability to represent varying degrees of similarity and more distant relationships among groups of closely related genes, as well as in requiring few assumptions about the nature of the data. The computed trees can be used to order genes in the original data table, so that genes or groups of genes with similar expression patterns are adjacent. The ordered table can then be displayed graphically, with a representation of the tree to indicate the relationships among genes.
After growth factor stimulation Example: Growth factor stimulation Of Fibroblast Proliferation Eisen MB, Spellman PT, Brown PO, Botstein D. 1998. PNAS 95:14863-14868. Growth Factor http://rana.lbl.gov/EisenSoftware.htm Green=increased expression Fibroblast RNA cDNA (Fluorescent label) vs. cDNA from unstimulated cells Red=decreased expression
Hierarcical Cluster Analysis of 3800 Yeast Genes whose expression changed one or more times under 365 different experimental treatments (including genetic manipulation, drugs, growth conditions) Known genes all involved in yeast mating Hypothesis: Unknown Genes are involved in yeast mating