410 likes | 588 Views
EBI patent related services. 3 rd Annual Forum for SMEs. September 3-4 th 2009. Jennifer McDowall Senior Scientist, EMBL-EBI. Overview. Databases available Sequence archives Searching the database. EBI patent related services. Databases available…. EBI patent related services.
E N D
EBI patent related services 3rd Annual Forum for SMEs September 3-4th 2009 Jennifer McDowall Senior Scientist, EMBL-EBI
Overview • Databases available • Sequence archives • Searching the database EBI patent related services
Databases available… EBI patent related services
USPTO JPO GenBank DDBJ September 2009nucl >9.4m sequencesprot > 2.5m sequences EMBL EPO policy: data released topublic (and to EMBL) 18 months after the patent application date, independent of whether patent has been granted. . EPO Sequence data from patent literature EBI patent related services
Know the Data…Nucleotides EMBL • Release and updates EBI patent related services
Know the Data…Nucleotides EMBL Release and updates • Divided into classes and divisions... ANN – Annotated Constructed Seq PAT – Patent CON – Constructed Sequence STS – Sequence Tagged Site EST – Expressed Sequence Tag STD – Standard GSS – Genome Survey Sequence TPA – Third Party Annotation HTC – High Throughput cDNA TSA – Transcriptome Shotgun Assembly HTG – High Throughput Genome WGS – Whole Genome Shotgun EBI patent related services
Know the Data…Nucleotides EMBL Release and updates • Divided into classes and divisions... HUM – Human MUS – Mouse ROD – Rodent (excluding mouse) MAM – Mammal (excluding human, mouse, rodent) VRT – Vertebrate (excluding human, mouse, rodent, mammal) FUN – Fungi PRO – Prokaryote ENV – Environment INV – Invertebrate PHG – Phage SYN – Synthetic PLN – Plant VIR – Viral TGN – Transgenic UNC – Unclassified EBI patent related services
Know the Data…Nucleotides EMBL Release and updates Divided into classes and divisions... • Supplementary sets: • EMBL-CDS, EMBL-MGA • Specialist databases: • Immunoglobulins (IMGT/HLA, IMGT/LIGM) • Alternative splicing (ASDT) • Completed proteomes (Ensembl, Integr8) • Variation (HGVBase, dbSNP) EBI patent related services
EMBL Patent Sequence Entry Version, dates, archive Patent number, title, link to patent EBI patent related services
EBI patent related services Know the Data…Proteins UniProt • Release and updates
Know the Data…Proteins UniProt Release and updates Divided into 3 sections: UniProtKB UniParc UniRef • Taxonomic info • Annotated sequence • Combines sequences by % ID • UniRef100, 90, 50 • Protein archive • Covers ALL proteins (including UniMess) SwissProt TrEMBL Automatic annotation Manual annotation EBI patent related services
Know the Data…Proteins UniProt Release and updates Divided into 3 sections • Specialist databases linked to UniProt: • Structure (PDBe, SGT) • Immunoglobulins (IMGT/HLA) • Alternative splicing (ASDT) • Completed proteomes (Ensembl, Integr8) • Protein interactions (IntAct) • Protein signatures (InterPro) • Patent proteins (EPO, USTPO, JPO, KIPO) EBI patent related services
Bulk download http://www.ebi.ac.uk/patentdata/ Nucleotide sequences Protein sequences EBI patent related services
Bulk download ftp.ebi.ac.uk/pub/databases/embl/patent/ EBI patent related services
Sequence archives… EBI patent related services
Sequence archives • EMBL nucleotide sequence version archive (SVA)www.ebi.ac.uk/embl/sva • UniSave – UniProt sequence/annotation version archivewww.ebi.ac.uk/uniprot/unisave EBI patent related services
Enter accession # View old entries EMBL sequence version archive (SVA) EBI patent related services
Sequence record from EMBL SVA EBI patent related services
Select and compare versions Comparing versions in EMBL SVA EBI patent related services
UniProtKB sequence annotation version server - UniSave Enter accession # EBI patent related services
Select and compare versions View old entries UniSave results EBI patent related services
Searching the databases… EBI patent related services
EB-eye search by patent number Search for patent WO0146262 EBI patent related services
EB-eye search by patent number EBI patent related services
EB-eye nucleotide sequences from WO0146262 EBI patent related services
Sequence Similarity Search Tools Toolbox PSI search BLAST FASTA Smith-Waterman PSI-BLAST Wu-BLAST FASTA suite MPsrch PSI-SEARCH NCBI-BLAST ScanPS SSEARCH EBI patent related services
Fasta v. patent protein sequences EBI patent related services
Tools: Genomes & Proteomes FASTA EBI patent related services
When to use which search? NCBI BLAST Query length WU-BLAST PSI-SEARCH FASTA Database size EBI patent related services
When to use which search? NCBI BLAST time to search WU-BLAST PSI-SEARCH FASTA PDB Swiss-Prot UniRef50 UniRef 90 UniRef100 UniProtKB UniParc EBI patent related services
InterProScan protein signature search www.ebi.ac.uk/interpro/ EBI patent related services
InterPro signature database EBI patent related services
Some search guidelines… EBI patent related services
Search Guidelines #1 Use the most appropriate tool for your search - Don’t assume one tool will cater to all your search needs NCBI BLAST Query length WU-BLAST PSI-SEARCH FASTA Database size EBI patent related services
Search Guidelines #1 Use the most appropriate tool for your search #2 Best search option protein seq v. protein DB 2nd translated DNA seq v. protein DB 3rd DNA seq v. DNA DB Worst protein seq v. transl DNA BD EBI patent related services
Search Guidelines #1 Use the most appropriate tool for your search #2 Best search option protein seq v. protein DB #3 Search the smallest DB likely to have your sequence #4 Check statistics – histograms... #6 Don’t assume homologues have the same function #5 Change parameters when necessary (gap penalties, scoring matrices...) • Orthologs have similar functions • Paralogs acquire different functions EBI patent related services
Search Guidelines #1 Use the most appropriate tool for your search #2 Best search option protein seq v. protein DB #3 Search the smallest DB likely to have your sequence #4 Check statistics – histograms... #6 Don’t assume homologues have the same function #5 Change parameters when necessary (gap penalties, scoring matrices...) #7 Use multiple sequence alignments to validate relatedness #8 Consider filtering low complexity regions EBI patent related services
Typical workflow search Check stats review function evolution compare EBI patent related services