20 likes | 212 Views
Protein Information Resource. Protein Information Resource, 3300 Whitehaven St., Georgetown University, Washington, DC 20007. PIR: Integrated Protein Informatics Resource For Genomic & Proteomic Research.
E N D
Protein Information Resource Protein Information Resource, 3300 Whitehaven St., Georgetown University, Washington, DC 20007 PIR: Integrated Protein Informatics Resource For Genomic & Proteomic Research For four decades the Protein Information Resource (PIR) has provided databases and protein sequence analysis tools to the scientific community including the Protein Sequence Database (PIR-PSD), which grew from the Atlas of Protein Sequence and Structure, edited by Margaret Dayhoff [1965-1978]. PIR’s major activities include i) UniProt - Universal Protein Resource developmentii) iProClass - protein data integration, iii) PIRSF - protein family classification, and iv) iProLINK – source for text mining and ontology development. UniProt Universal Protein Resource iProClass Integrated Protein Knowledgebase PIRSF Protein Family Classification System iProLINK Integrated Protein Literature, Information and Knowledge UniProt is a comprehensive catalog of information on proteins. UniProt builds upon the solid foundations laid by the consortium members, Protein Information Resource, European Bioinformatics Institute, and Swiss Institute of Bioinformatics, over many years. When to use UniProt Use UniProt to retrieve curated, reliable, comprehensive information on proteins. iProClass provides summary descriptions of protein family, function and structure for UniProt protein sequences, with links to over 80 biological databases. When to use iProClass Use iProClass to retrieve thorough information about a protein. By integrating data from multiple sources iProClass presents comprehensive up-to-date protein information and data mapping. The PIRSF (superfamily) protein classification system reflects the evolutionary relationship of full-length proteins and domains. The primary PIRSF classification unit is the homeomorphic family, whose members are both homologous and homeomorphic (sharing full-length sequence similarity). When to use PIRSF Use PIRSF to retrieve curated information on membership, domain architecture, and function for protein families. iProLINK provides annotated literature, protein name dictionary and other information to facilitate Natural Language Processing technology development and protein name tagging and ontology. When to use iProLINK Use iProLINK for bibliography mapping, for literature mining of protein phosphorylation (RLIMS-P), for mapping protein/gene names to UniProtKB entries, and for text mining algorithm development. PIR Team Additional Projects NIAID Proteomics Resource: PIR is an administrative resource center providing the scientific community with proteomic information about NIAID's Category A-C biodefense organisms. caBIG™: PIR participates in the voluntary network connecting individuals and institutions, creating a World Wide Web of information on cancer research. iProXpress: Platform for analysis of proteomic and genomic data. Protein Ontology (PRO): A project geared towards formally defining protein classes and their relations. PRO will be developed within the framework of the Open Biomedical Ontologies (OBO) Foundry. CUPID: Provides strain-, species- and genus-specific proteins. Such proteins can provide insight into the criteria that define an organism and can also serve as taxon-specific diagnostic targets. Acknowledgements: PIR activities are supported by National Institutes of Health (NIH) grant 2U01 HG02712-04 for UniProt, the NIH grants for NIAID proteomic resource (HHSN266200400061C) and grid enablement (NCI-caBIG-ICR), and National Science Foundation grants for protein ontology (ITR-0205470) and BioTagger (IIS-0430743). Contact pirmail@georgetown.edu