
Mining publicly available microarray data

Mining publicly available microarray data. Frances Turner Fsturner@ic.ac.uk. Introduction. Publicly available data Method for data mining Application to Tuberculosis and Campylobacter. Capsule synthesis in C.jejuni. In which dataset(s) do these genes show changed expression?


Presentation Transcript


  1. Mining publicly available microarray data Frances Turner Fsturner@ic.ac.uk

  2. Introduction • Publicly available data • Method for data mining • Application to Tuberculosis and Campylobacter

  3. Capsule synthesis in C.jejuni • In which dataset(s) do these genes show changed expression? • Identify useful data • Improve biological understanding

  4. Publicly available data • Increasing volume of data • Different depositories • Different standards • Difficult to compare experiments

  5. Publicly available data • Campylobacter: 18 experiments, 126 conditions • M.bovis/M.tuberculosis: 34 experiments, 539 conditions

  6. Identification of sets of differentially expressed genes • GSEA commonly used (Subramanian et al. 2005) • Threshold independent • Detects small but biologically significant changes
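
The running-sum statistic behind GSEA can be illustrated with a minimal sketch. This is not code from the talk: the function name `enrichment_score`, the weighting exponent `p`, and the toy gene names and scores are all assumptions made for illustration. It walks down a ranked gene list, stepping up at genes in the set and down at genes outside it, and returns the largest deviation from zero, so no expression threshold is needed.

```python
import numpy as np

def enrichment_score(ranked_genes, scores, gene_set, p=1.0):
    """Weighted running-sum enrichment score for one ranked gene list.

    ranked_genes: gene identifiers, already sorted by their score.
    scores: the ranking metric for each gene, in the same order.
    gene_set: collection of identifiers to test (hypothetical input).
    p: weighting exponent; p=0 gives the unweighted Kolmogorov-Smirnov form.
    """
    in_set = np.array([g in gene_set for g in ranked_genes])
    weights = np.abs(np.asarray(scores, dtype=float)) ** p
    n_hit = int(in_set.sum())
    n_miss = len(ranked_genes) - n_hit
    if n_hit == 0 or n_miss == 0:
        return 0.0  # score is undefined when the set is empty or covers everything
    hit_weights = np.where(in_set, weights, 0.0)
    if hit_weights.sum() == 0:
        return 0.0
    # Step up by a score-weighted amount at set members, down evenly elsewhere.
    steps = hit_weights / hit_weights.sum() - np.where(in_set, 0.0, 1.0 / n_miss)
    running = np.cumsum(steps)
    # The enrichment score is the running-sum value furthest from zero.
    return float(running[np.argmax(np.abs(running))])

# Toy example with made-up genes and scores (not data from the talk).
genes = ["g1", "g2", "g3", "g4"]
metric = [4.0, 3.0, -1.0, -2.0]
top_set_score = enrichment_score(genes, metric, {"g1", "g2"})
bottom_set_score = enrichment_score(genes, metric, {"g3", "g4"})
```

A set concentrated at the top of the ranking gives a positive score and one concentrated at the bottom gives a negative score; significance would normally be assessed by permutation, which this sketch leaves out.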

  7. GSEA applied to multiple expression datasets Cj1099 Cj0812 Cj1494c Cj1457c Cj0434 Cj1307 Cj0028 Cj1294 Cj1393 Cj1303 Cj1368 Cj0597 Cj1309c Cj0505c

  8. GSEA applied to multiple expression datasets Cj1099 Cj0812 Cj1494c Cj1457c Cj0434 Cj1307 Cj0028 Cj1294 Cj1393 Cj1303 Cj1368 Cj0597 Cj1309c Cj0505c Cj0172 Cj1099 Cj0028 Cj0812 Cj1494c Cj0741 Cj1457c Cj1303 Cj0434 Cj1393 Cj1307 Cj1294 Cj1393 Cj1309c Cj0812 Cj1494c Cj1307 Cj0434 Cj1393 Cj0028 Cj1294 Cj0597 Cj0145c Cj1368 Cj0432 Cj1309c Cj0505c

  9. GSEA applied to multiple expression datasets • Allows correction for multiple datasets • Not confounded by correlations between datasets
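
One way to correct for testing a gene set against many datasets without assuming the datasets are independent is a max-statistic permutation test. The sketch below is an assumption about how this could be done, not a description of the method used in the talk; `ks_enrichment`, `max_stat_pvalues`, and the input layout (a genes by datasets score matrix plus a boolean membership mask) are hypothetical. Each random gene set is scored against every dataset at once, so correlations between datasets are carried into the null distribution.

```python
import numpy as np

def ks_enrichment(order, in_set):
    """Unweighted running-sum score for genes visited in `order`.

    Assumes the set contains at least one gene and excludes at least one.
    """
    hits = in_set[order]
    n_hit = hits.sum()
    n_miss = hits.size - n_hit
    steps = np.where(hits, 1.0 / n_hit, -1.0 / n_miss)
    running = np.cumsum(steps)
    return running[np.argmax(np.abs(running))]

def max_stat_pvalues(scores, in_set, n_perm=1000, seed=0):
    """Enrichment score per dataset with max-statistic adjusted p-values.

    scores: array of shape (n_genes, n_datasets), e.g. log ratios.
    in_set: boolean array of length n_genes marking gene set members.
    """
    rng = np.random.default_rng(seed)
    # Rank genes from highest to lowest score within each dataset.
    orders = [np.argsort(-scores[:, j]) for j in range(scores.shape[1])]
    observed = np.array([ks_enrichment(o, in_set) for o in orders])
    null_max = np.empty(n_perm)
    for i in range(n_perm):
        # One random gene set of the same size, scored on all datasets together.
        shuffled = rng.permutation(in_set)
        null_max[i] = max(abs(ks_enrichment(o, shuffled)) for o in orders)
    # Adjusted p-value: how often the largest null score matches or beats the observed one.
    adjusted = np.array(
        [(1 + np.sum(null_max >= abs(e))) / (n_perm + 1) for e in observed]
    )
    return observed, adjusted
```

Datasets whose adjusted p-value is small are the experimental conditions in which the gene set as a whole shifts towards one end of the ranking.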

  10. Capsule synthesis in C.jejuni

  11. Nitrogen metabolism in M.bovis

  12. Summary • Collect available microarray data • GSEA-based analysis • Put different datasets into comparable formats • Identification of experimental conditions of interest

  13. Work in progress • Collaboration with Chris Tomlison to create user interface • Host on CISBIC server • Allow users to test their own gene sets or expression datasets
