Gene Set Enrichment Analysis

colin

Presentation Transcript


  1. Gene Set Enrichment Analysis

  2. GSEA: Key Features. Ranks all genes on the array by their differential expression. Identifies gene sets whose member genes cluster towards the top or the bottom of the ranked list (i.e. up- or down-regulated). Calculates an enrichment score for each category (gene set). Uses a permutation test to identify significantly enriched categories. Extensive gene sets are provided via MSigDB (the Molecular Signatures Database): GO terms, chromosome location, KEGG pathways, and transcription factor or microRNA target genes.

  3. GSEA. [Figure: genes ranked by differential expression between Disease and Control, from the most significantly up-regulated genes, through unchanged genes, to the most significantly down-regulated genes.] Each gene category is tested by traversing the ranked list. The running score starts at 0, receives a weighted increment when a member gene is encountered, and is decremented otherwise. The enrichment score is the value of the running score at the point where it is most different from zero.
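The traversal described on this slide can be sketched in a few lines of Python. This is a minimal illustration, not the GSEA software itself. The function name `running_enrichment`, the weight exponent `p`, and the toy gene names are assumptions made for the example. It also assumes the usual formulation in which hits are weighted by the absolute ranking metric and every miss subtracts the same constant step.

```python
def running_enrichment(ranked_genes, scores, gene_set, p=1.0):
    """Return (ES, running_sum) for one gene set over a ranked gene list.

    ranked_genes: gene identifiers, sorted from most up- to most down-regulated
    scores:       ranking metric for each gene, in the same order
    gene_set:     identifiers of the member genes
    p:            exponent applied to |score| when weighting hits (assumed default 1)
    """
    members = set(gene_set)
    # Total weight of all member genes, used to normalise the increments.
    total_hit = sum(abs(s) ** p for g, s in zip(ranked_genes, scores) if g in members)
    n_miss = sum(1 for g in ranked_genes if g not in members)
    if total_hit == 0 or n_miss == 0:
        raise ValueError("gene set must contain some, but not all, ranked genes "
                         "and have a non-zero total weight")

    running, curve = 0.0, []
    for gene, score in zip(ranked_genes, scores):
        if gene in members:
            running += abs(score) ** p / total_hit   # weighted increment on a hit
        else:
            running -= 1.0 / n_miss                  # constant decrement on a miss
        curve.append(running)

    # The enrichment score is the running value furthest from zero (sign kept).
    es = max(curve, key=abs)
    return es, curve


# Toy example: six genes, gene set concentrated at the top of the list.
genes = ["A", "B", "C", "D", "E", "F"]
metric = [3, 2, 1, -1, -2, -3]
es, curve = running_enrichment(genes, metric, {"A", "B"})
print(es, curve)
```

A gene set whose members sit at the bottom of the list gives a negative score by the same procedure, which is how up- and down-regulated sets are distinguished.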

  4. Enrichment Score. The Enrichment Score (ES) is calculated by evaluating the fraction of genes in S ("hits"), weighted by their correlation, and the fraction of genes not in S ("misses") present up to a given position i in the ranked gene list L, where the N genes are ordered according to their correlation with the phenotype.
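The slide's equation did not survive in the transcript. For reference, the formulation usually given for GSEA, and assumed here to be the one the slide showed, is written below. The symbols r_j (the correlation of gene g_j with the phenotype), p (the weighting exponent) and N_H (the number of genes in S) are not named in the transcript and follow the standard notation.

```latex
\begin{align}
P_{\mathrm{hit}}(S, i)  &= \sum_{\substack{g_j \in S \\ j \le i}} \frac{|r_j|^{p}}{N_R},
\qquad N_R = \sum_{g_j \in S} |r_j|^{p} \\
P_{\mathrm{miss}}(S, i) &= \sum_{\substack{g_j \notin S \\ j \le i}} \frac{1}{N - N_H} \\
ES(S) &= P_{\mathrm{hit}}(S, i^{*}) - P_{\mathrm{miss}}(S, i^{*}),
\qquad i^{*} = \arg\max_{i} \left| P_{\mathrm{hit}}(S, i) - P_{\mathrm{miss}}(S, i) \right|
\end{align}
```

With p = 0 every hit contributes equally and the statistic reduces to a Kolmogorov-Smirnov-style running sum. With p = 1 each hit is weighted by the magnitude of its correlation.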

  5. GSEA algorithm

  6. GSEA: Permutation Test. Randomise the data (the group labels), rank the genes again and repeat the test 1000 times. This gives a null distribution of 1000 ES values for the gene set. An FDR q-value is then computed, corrected for gene set size and for testing multiple gene sets. [Figure: null distribution of enrichment scores, with the actual ES marked.]
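The label-shuffling step can be sketched as follows, using only the Python standard library. This is a rough illustration under stated assumptions. The function names are hypothetical. The ranking metric is a plain difference of group means, standing in for whatever metric the real analysis uses. The sketch returns only a nominal p-value for a single gene set, so the size normalisation and FDR q-value mentioned on the slide are not implemented.

```python
import random
from statistics import mean


def mean_difference(expr, labels):
    """Ranking metric per gene: mean of group 1 minus mean of group 0 (stand-in metric)."""
    scores = []
    for row in expr:
        group1 = [x for x, lab in zip(row, labels) if lab == 1]
        group0 = [x for x, lab in zip(row, labels) if lab == 0]
        scores.append(mean(group1) - mean(group0))
    return scores


def enrichment_score(scores, in_set):
    """Largest deviation from zero of the running sum over genes sorted by score."""
    order = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
    total_hit = sum(abs(scores[j]) for j in order if in_set[j])
    n_miss = sum(1 for flag in in_set if not flag)
    if total_hit == 0 or n_miss == 0:
        return 0.0  # degenerate case: nothing to accumulate
    running, best = 0.0, 0.0
    for j in order:
        if in_set[j]:
            running += abs(scores[j]) / total_hit
        else:
            running -= 1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best


def permutation_test(expr, labels, in_set, n_perm=1000, seed=0):
    """Return (observed ES, null ES list, nominal p-value) by shuffling group labels."""
    rng = random.Random(seed)
    observed = enrichment_score(mean_difference(expr, labels), in_set)
    shuffled = list(labels)
    null = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # randomise group membership
        null.append(enrichment_score(mean_difference(expr, shuffled), in_set))
    # Compare the observed ES with the part of the null that has the same sign.
    same_sign = [es for es in null if (es >= 0) == (observed >= 0)]
    extreme = sum(1 for es in same_sign if abs(es) >= abs(observed))
    p_value = extreme / len(same_sign) if same_sign else float("nan")
    return observed, null, p_value


# Toy data: 6 genes x 6 samples, first three samples control (0), last three disease (1).
expr = [
    [1, 1, 1, 5, 5, 5],
    [1, 1, 1, 4, 4, 4],
    [2, 3, 2, 3, 2, 3],
    [3, 2, 3, 2, 3, 2],
    [2, 2, 3, 2, 2, 3],
    [4, 4, 4, 3, 3, 3],
]
labels = [0, 0, 0, 1, 1, 1]
in_set = [True, True, False, False, False, False]
observed, null, p_value = permutation_test(expr, labels, in_set, n_perm=200)
print(observed, p_value)
```

Shuffling sample labels rather than gene labels keeps the gene-to-gene correlation structure intact in the null distribution, which is why the slide says to rank the genes again after each randomisation.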
