420 likes | 583 Views
Web Service Deployment & Programmatic Access. BioMart. Syed Haider (Rice Group) Nov’ 2008. BioMart. A Collaboration: European Bioinformatics Institute (EBI) Ontario Institute for Cancer Research (OICR) . BioMart.
E N D
Web Service Deployment & Programmatic Access BioMart Syed Haider (Rice Group) Nov’ 2008
BioMart A Collaboration: • European Bioinformatics Institute (EBI) • Ontario Institute for Cancer Research (OICR)
BioMart A generic data management system with a particular focus on supporting biological research featuring: - Built-in query optimisation for fast data retrieval - Data Federation • Easy to use interfaces and APIs • Web Services and DAS
In a nutshell Source data (MySQL, Oracle, Postgres) DB Mart ATGCTGTTGTGC ATGCTGGACTGG ATGGCCCGATGG ATGCTGTTGTGC ATGCTGGACTGG ATGGCCCGATGG
Deploying BioMart STEP 1 - Transformation STEP 2 - Configuration
1. Transformation Source data (MySQL, Oracle, Postgres) DB Mart ATGCTGTTGTGC ATGCTGGACTGG ATGGCCCGATGG ATGCTGTTGTGC ATGCTGGACTGG ATGGCCCGATGG
1. Transformation MartBuilder
2. Configuration Mart Mart Mart
2. Configuration MartEditor
Basic Abstractions Dataset Filter Attribute
Examples • of all rat genes located on chromosome 1, expressed in lungs name, chromosome, description of all mouse genes ENSMUSG00000042351 exon sequences in FASTA format
MartService wget --post-data 'query= <Query> <Dataset name="hsapiens_gene_ensembl" > <Filter name="chromosome_name" value="1"/> <Attribute name="ensembl_gene_id"/> <Attribute name="ensembl_transcript_id"/> <Attribute name="biotype"/> </Dataset> </Query> ‘http://www.biomart.org/biomart/martservice
MartView – XML Access wget --post-data 'query= <Query> <Dataset name="hsapiens_gene_ensembl" > <Filter name="chromosome_name" value="1"/> <Attribute name="ensembl_gene_id"/> <Attribute name="ensembl_transcript_id"/> <Attribute name="biotype"/> </Dataset> </Query> ‘http://www.biomart.org/biomart/martview
MartView – URL Access VIRTUALSCHEMANAME=default &ATTRIBUTES=hsapiens_gene_ensembl.default.feature_page.ensembl_gene_id &FILTERS=hsapiens_gene_ensembl.default.filters.chromosome_name."1" http://biomart.org/biomart/martview?
BioMart DAS Access http://www.YourBioMart.org/biomart/das/DATASET/features? segment=FILTERS http://www.biomart.org/biomart/das/default__hsapiens_gene_ensembl__ensembl_das_chr/features? segment=1:1,100000 http://www.biomart.org/biomart/das/default__hsapiens_gene_ensembl__ensembl_das_gene/features? segment=ENSG00000197194
Learn as you go.... Show URL Request Show XML Query Show Perl Script
MartService – Get Metadata 1. Get Databases/Marts www.biomart.org/biomart/martservice?type=registry Live Demo!
MartService – Get Metadata 2. Get Datasets www.biomart.org/biomart/martservice? type=datasets&mart=<MART_NAME> Live Demo!
MartService – Get Metadata 3. Get Attributes www.biomart.org/biomart/martservice? type=attributes&dataset=<DATASET_NAME> Live Demo!
MartService – Get Metadata 4. Get Filters www.biomart.org/biomart/martservice? type=filters&dataset=<DATASET_NAME> Live Demo!
Credits • BioMart Team Arek Kasprzyk (OICR) Junjun Zhang (OICR) Syed Haider (Rice Group-EBI) • Acknowledgements Benoit Ballester (Ensembl) Richard Holland (Ensembl) Andreas Kahari (Ensembl) Craig Melsopp (Ensembl) Bert Overduin (Ensembl) Damian Smedley (Ensembl) Giulietta Spudich (Ensembl) Arne Stabenau (Ensembl) Phil Jones (EBI) Asif Kibria (EBI) Gulam Patel (EBI) Stephen Robinson (EBI) Katerina Tzouvara (EBI) Will Spooner (CSHL) Gudmundur Thorisson (CSHL) Darin London (Duke University) Don Gilbert (Indiana University) Steffen Durinck (NCI NIH) Eric Just (Northwestern University) Paul Donlon (Unilever) Christina Yung (OICR) Igor Antoshechkin (Caltech) References Galaxy WebLab