200 likes | 358 Views
Genome Viewer Sequence Alignments Sharing Data Automation Map Viewer Genetic Marker Correspondences FPC Map FPC I-Map Future Goals EnsEMBL Pipeline. Genome Viewer. Aligned Data Sets:. Rice TIGR GIs Rice BGI EST Clusters Rice dbEST ESTs Rice BGI ESTs Rice Complete CDSs.
E N D
Genome Viewer • Sequence Alignments • Sharing Data • Automation • Map Viewer • Genetic Marker Correspondences • FPC Map • FPC I-Map • Future Goals • EnsEMBL Pipeline
Aligned Data Sets: Rice TIGR GIs Rice BGI EST Clusters Rice dbEST ESTs Rice BGI ESTs Rice Complete CDSs Maize Unigene Clusters Maize TIGR Gis Maize dbEST ESTs Barley dbEST ESTs Wheat dbEST ESTs Sorghum dbEST ESTs Rice CUGI BACends Rice JRGP/Cornell RFLP Markers Rice Cornell SSRs
Alignment Methods: • Rice ESTs: • BLAT • pslReps • Discard based on percent of EST length matched • Non-Rice ESTs: • BLAT • pslReps • Discard based on hit length and hit frequency • Rice BACends: • BLAT • Discard based on gap length, percent of BACend length matched, percent identity, and hit frequency.
Total BACs/PACs: 1,847 Total bp: 250,879,896 (250MB ) Phase 1: 78 Phase 2: 1,238 Phase 3: 531 Transcripts: 8,034 (~80,000) Phase 3 with genes: 330 (~%18)
Alignment Methods: • Rice Markers: • BLAT • Discard based on percent of marker length matched and the gap length in case of genomic markers. • Utilize loci information, outputting hits only for those whose chromosome matches that of the BAC/PAC. • Rice SSRs: • e-PCR
Sharing Results • All processed alignments available under “Downloads”. • Send to GrainGenes a list of BAC/PAC and EST Accessions for Barley and Wheat. • Send to maizdb locations of each mapped Unigene cluster and if available, closest RFLP marker.
Automating Alignments: • For each group of data sets, there is a script to automatically: • Run pslReps • Load results into the database • Discard low-quality matches • Update documentation
Map Correspondences Same marker on multiple mapping studies • Name-identity • Curated evidence • Sequence-based correspondences for JRGP and Cornell markers: • BLAT • Utilize loci information, discarding matches from different chromosomes or more than 30cM apart.
curator same name sequence-based
same name curator
Cornell/JRGP markers mapped to sequenced clones were assigned positions on the FPC contigs.
Total: 2,272 4,417
EnsEMBL Pipeline • Install the EnsEBML annotation pipeline on the Clemson cluster • Use the pipeline to update alignments • Integrate genomic data to produce uniform annotation of rice