630 likes | 1.09k Views
Virus Bioinformatics Resource Centers – ViPR & IRD. Richard H. Scheuermann, Ph.D. Department of Pathology, UT Southwestern March 30, 2011. Take home messages. NIAID Bioinformatics Resource Centers Virus Pathogen Resource ( ViPR ; www.viprbrc.org )
E N D
Virus Bioinformatics Resource Centers – ViPR & IRD Richard H. Scheuermann, Ph.D. Department of Pathology, UT Southwestern March 30, 2011
Take home messages • NIAID Bioinformatics Resource Centers • Virus Pathogen Resource (ViPR; www.viprbrc.org ) • Influenza Research Database (IRD; www.fludb.org ) • Free and open access virus database and analysis resources • Comprehensive data collection • Sequence, structure, function, phenotype, surveillance, clinical, immune response • Integrated • Surveillance metadata with sequence • Sequence features and sequence conservation with protein structures • Development and adoption of data standards • Data sharing infrastructure and policies • Novel approach to identification of genetic determinants of important virus characteristics • Influenza NS1 nuclear export signal and host range
Pathogen Portal pathogenportal.net
Home page www.viprbrc.org
IRD www.fludb.org
Other Data Standards • Non-human mammalian surveillance • Virus isolate characteristics • Immuno-assays • Virus passage • Virus nomenclature • Clinical data
Policies • Sequences – 30 days • Initial Surveillance Record – 90 days • Completed Virus Isolated Record – 1 year
Do variations in NS1 sequence featureS influence influenza virus host range?
SFVT approach Influenza A_NS1_nuclear-export-signal_137(10) Influenza A_NS1_alpha-helix_171(17) VT-1 I F D R L E T L I L VT-2 I F N R L E T L I L VT-3 I F D R L E T IV L VT-4 L F D Q L E T L VS VT-5 I F D R L E N L T L VT-6 I F N R L E A L I L VT-7 I Y D R L E T L I L VT-8 I F D R L E T L V L VT-9 I F D R L E NIVL VT-10 I F E R L E T L I L VT-11 L F D QM E T L VS • Identify regions of protein/gene with known structural or functional properties – Sequence Features (SF) • alpha-helical region, host protein binding site, enzyme active site, immune epitope • Determine the extent of sequence variation for each SF by defining each unique sequence as a Variant Type (VT) • High-level, comprehensive grouping of all virus strains by VT membership for each SF independently • Genotype-phenotype association statistical analysis (virulence, pathogenesis, host range, immune evasion, drug resistance) • While phylogenetics reflect evolutionary history of the virus, it is the focused regions in the viral genes/proteins that effect protein expression, structure and/or protein function that are responsible for important virus characteristics
ViPRSFVTs • Starting with 3 species • DENV = 192 SF’s • HCV = 335 SF’s • Pox = 14 genes, 295 total SF’s • HSV coming soon w/ DBP • Beginning automatic definition for other virus families • Most SF’s pulled from • UniProtKB protein annotations • IEDB immune epitopes (future)