460 likes | 508 Views
Explore the world of gene expression profiling and discover how it can help us understand cellular profiles and uncover patterns in gene expression. Learn about the different applications and techniques used in gene expression profiling.
E N D
Gene Expression Profiling Brad Windle, Ph.D. 628-1956 bwindle@hsc.vcu.edu http://www.people.vcu.edu/~bwindle/Courses/BIOC605/WindleGEPlecture.ppt
Profile A set of data or characteristics pertaining to an item Profiles are sometimes referred to as Signatures or Fingerprints
Gene/Protein Sequence Protein Structure Drug Structure Disease Cellular Profiles Gene Expression Protein SNPs Expression Protein States Structural Genomic Cell State Drug Response Methylation Misc Metabolitics Data
Gene/Protein Sequence Protein Structure Drug Structure Cellular Profiles Gene Expression Protein SNPs Expression Protein States Structural Genomic Drug Response Methylation Misc Metabolitics Data
Factors in Gene Expression 1. Presence or absence of the genes, and the number of genes Differences within the human population and big differences that occur during oncogenesis 2. Epigenetics, chromatin state Cell to cell and host to host variability unknown 3. Homeostasis and environmental factors Cell to cell and host to host variability unknown but environmental factors is variable of interest
The bigger picture Are cells or tissues related based on the genes they express? For an experimental cell model, are there conditions that are similar based on changes in gene expression? For certain experimental conditions, are there genes that show similar patterns of change (co-regulated)? What do we want to know? The smaller picture What genes went up or down under an experimental condition? What are the differences in gene expression between two cell types?
Profiles Have Two Sides A gene profile across samples and a sample profile across genes
Microarrays and Gene Expression Profiling
How did we use to do this? Probe for 1 gene Analyze ~10 samples
Now with Microarrays Analyzing 1 gene for 10 samples Analyzing thousands of genes for 1 sample
Gene Expression Profiling cell or condition of interest control or reference cell hybridize to microarray
Issues of Multiplicity 10,000 genes analyzed p=0.001 Therefore, there should be ~10 genes found even when there is no significant difference
Applications of Gene Expression Profiling Tissue or Tumor Classification Gene Classification Drug Classification Drug Target Identification Drug Response Prediction
Gene Expression Array Genomic Content Array Methylation Array (Chromatin Array) SNP Array
Structural Genomic Profiling Comparative Genomic Hybridization (CGH) cell with losses or gains normal cell hybridize to metaphase chromosomes
Genome Representation Profiling Using Arrays Normal Cancer Label DNA Label DNA BAC or Oligo Array
CGH Detection mainly for cancer and inherited deletions Tumor suppressor genes are deleted Oncogenes are amplified
me me PCR linkers CCGG CCGG CpG Island Array GGCC GGCC me me Hpa II / PCR Amplify/ Label PCR Amplify / Label hybridize to array Methylation Profiling
Profile cells based on methylation state cell-type profiles Differences in the methylated state of cancers Compare methylation profiles to gene expression profiles
Profiling Transcription Factor-Interactive DNA Immuno-precipitate w/ Ab to protein Chromatin IP or ChIP total genomic DNA
Analyzing Gene Expression Data and Profiles Cluster Analysis
Cluster Analysis We start with no hypothesis or a very general hypothesis We want the data to reveal what is relatively significant This approach is a hypothesis generator
We can’t observe the patterns unaided? The patterns are too complex or abstract. There’s too much data. There’s too much noise.
Gene A Gene X
Clustering Methods Divisive Agglomerative (Aggregative)
Cluster Linkage Methods Nearest Neighbor or Single Linkage Furthest Neighbor or Complete Linkage Average Neighbors or Average Linkage
The Color Map Cells 1 2 3 4 5 a b c d e 1 2 3 4 a d c d Genes
The Profile Data Sources Gene expression DNA microarrays, oligo or PCR, 20-30,000 genes Structural genomics DNA microarrays, BACs, ~one per 1Mb Methylation DNA microarrays, upstream sequences, CpG islands SNPs DNA microarray, oligos, millions of SNP sites Protein expression Ab microarray, 2D gels, Mass Spectrometry Protein states 2D gels, <1000 proteins resolved Drug response brute force, 70,000 compounds screened Metabolitics GC-MS DNA/protein sequence Sequencing, <20 people sequenced, brute force Drug structure in silico Protein structure 3D crystallography, NMR, brute force