1 / 10

First release of HOGENOM, a database of homologous genes from complete genome

POSTER JO 60. First release of HOGENOM, a database of homologous genes from complete genome. Simon Penel, Laurent Duret, Pascal Calvat, Jean-Fran ç ois Dufayard, Guy Perri è re, Manolo Gouy. Equipe Bioinformatique et Génomique Evolutive Laboratoire de Biométrie et Biologie Evolutive

limei
Download Presentation

First release of HOGENOM, a database of homologous genes from complete genome

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. POSTER JO 60 First release of HOGENOM, a database of homologous genes from complete genome Simon Penel, Laurent Duret, Pascal Calvat, Jean-François Dufayard, Guy Perrière, Manolo Gouy. Equipe Bioinformatique et Génomique Evolutive Laboratoire de Biométrie et Biologie Evolutive Université Claude Bernard - Lyon 1

  2. Homologous Genes Databases Research fields: • Proteome/genome comparative analysis • Phylogenetic studies • Orthology/Paralogy relationship assignments • Development of generic databases, specialised databases • HOVERGEN: families of homologous vertebrate genes • HOBACGEN: families of homologous bacterial genes • NureBase, RTKdb, Hoppsigen, Mitalib, Polymorphix..

  3. The HoGenom database: Homologous Genes Families from fully Sequenced Organisms European project TEMBLOR Contents: • Nucleic and protein sequences • Sequence annotations • Taxonomic data • Protein multiple alignments • Phylogenetic trees

  4. 1 sequence  1 species Protein sequences Mouse Human etc. Rat Proteome sets SwissProt TrEMBL TrEMBL-new The HoGenom database: Building of Database Data selection European Bioinformatic Institute 1 sequence  many species

  5. BLASTP  BLOSUM62 E ≤ 10-4 Parralelised calculations at IN2P3 The HoGenom database: Building of Database Similarity search Filtering (SEG) Local pairwise alignments

  6. A A B C HSP ≥ 80% length Similarity ≥ 50% A B Protein Family C Cluster A, B, C The HoGenom database: Building of Database Clustering into families 1 : Clustering of complete sequences into families 2 : Including partial sequences to the families defined previously

  7. A B C D E F G CLUSTAL W Default parameters Protein family A B C BIONJ D Neighbor joining, Observed divergence Partial sequences: distance matrix with missing values E F G Phylogenetic tree The HoGenom database: Building of Database Alignments and trees A B C D E F G Multiple alignment Rooting: mid-point

  8. 16 10 91 9% 31% 423 577 proteins, 527 925 cds 41 907 families 60% The HoGenom database: Contents Arabidopsis thaliana (plant) Caenorhabditis elegans (nematod) Drosophila melanogaster (fly) Encephalitozoon cuniculi (microsporidia)  Guillardia theta (alguae)  Homo sapiens (man) Mus musculus (mouse) Rattus norvegicus (rat) Saccharomyces cerevisiae (yeast) Schizosaccharomyces pombe (fungus) 117 organisms

  9. Querying the databases WWW Query Query on sequences and families according to multiple criteria Cross Taxa Query on families according to complex taxonomic criteria

  10. POSTER JO-60 à suivre…

More Related