PSI Structural Genomics Knowledgebase
Helen M. Berman, Rutgers University
EMBO Practical Course Section: Searching Structure Databases
September 26, 2008
Knowledgebase Vision
The PSI Structural Genomics Knowledgebase (PSI SGKB) will turn the products of the PSI effort into major advances in knowledge that can be used to understand living systems and human disease. It will be a key resource for the advancement of biology, biochemistry, functional genomics, pharmacology, bioinformatics, chemistry, education and clinical medicine.
Knowledgebase Goals
To provide a “marketplace of ideas” that
• connects protein sequence information to 3D structures and homology models
• enhances functional annotations
• provides access to new experimental protocols and materials
To kick-start and enable advancements in structural genomics
• by communicating the information and technology advances of the PSI and making them visible and accessible
• by presenting and discussing the most provocative challenges with the general community
• by fostering community collaborations
PSI SGKB features
• Database searchable by sequence, text, and PDB ID
• Search results include aggregate reports and inventories
• Links to PSI projects, external resources, and publications
• SG Gateway with Nature delivers featured articles, PSI news and events, featured molecules and technologies, molecules of unknown function, and broader SG content
• Notifications to the public about recently solved PSI structures and new editorial content
Scope
[Diagram: the structure determination pipeline (Genomic Based Target Selection; Isolation, Expression, Purification, Crystallization; Data Collection; Structure Determination; PDB Deposition & Release) shown with the Knowledgebase areas: Experimental Tracking, Materials, Target Selection, Models, Annotations, Publications, Technology, Metrics]
• To capture, make accessible, and highlight elements of the high-throughput pipelines for use by various scientific communities
• To leverage such information through the generation of molecular models and functional annotation
Knowledgebase Users
• Biologists
• Biochemists
• Functional Genomicists
• Pharmacologists
• Bioinformaticians
• Chemists
• Clinical Researchers and Physicians
• Teachers and Students
PSI SGKB Homepage
• Receive e-mail alerts
• Explore structures of unknown function
• View latest structures & statistics
• Teasers for this month’s editorial content
Structural Genomics Update
• Search Box available
• Editorial content: Research Advances, Featured Molecule, Research Library, News, Events Calendar
About this site
• Additional help content (getting started), site map, contact information, and terms of use
About PSI
• Information about the Protein Structure Initiative and the PSI SGKB
PSI centers
• Links to the PSI Large-Scale and Specialized Centers
PSI Resources
• Links to a list of our Biomedical Protein Target themes, Target Selection documentation, and the Modeling, Technology, Experimental Data Tracking, Materials, and Publications Resources
NPG Resources
• Links to the other Nature gateways, journals and other resources provided by the Nature Publishing Group
E-alerts: Receive news of PSI SGKB updates by email or RSS feed
• Updates to editorial content (monthly)
• Newly released structures (weekly)
Functional Sleuth: Explore protein structures solved by the PSI whose functions are unknown
Latest PSI statistics: Current tallies of structures solved
• View detailed reports of which structures have been solved by the PSI (“Metrics”)
• View the latest structures solved by the PSI
Metrics
PSI-2 Summary Statistics (updated Sept 5, 2008)
• Novel structures: structures with less than 30% sequence identity to any existing structure at the time of PDB deposition
• Distinct proteins: structures with non-redundant sequences, i.e. less than 98% sequence identity to one another
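The two cutoffs above can be illustrated with a minimal Python sketch. It is not the PSI SGKB's actual procedure: the function names are hypothetical, identity is counted over ungapped aligned columns (one of several common conventions), and the greedy pass over the sequences is an assumed way of picking non-redundant representatives.

```python
from typing import Callable, List, Sequence

NOVEL_CUTOFF = 30.0     # percent identity, from the "novel structures" definition
DISTINCT_CUTOFF = 98.0  # percent identity, from the "distinct proteins" definition


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption: both inputs are already aligned (equal length, '-' for gaps).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = [(a, b) for a, b in zip(aligned_a, aligned_b) if a != "-" and b != "-"]
    if not compared:
        return 0.0
    matches = sum(1 for a, b in compared if a == b)
    return 100.0 * matches / len(compared)


def is_novel(identities_to_existing: Sequence[float]) -> bool:
    """True if every identity to an already deposited structure is below 30%."""
    return all(identity < NOVEL_CUTOFF for identity in identities_to_existing)


def distinct_representatives(
    sequences: Sequence[str],
    identity: Callable[[str, str], float] = percent_identity,
) -> List[str]:
    """Greedy selection: keep a sequence only if it is below 98% identity
    to every representative kept so far."""
    representatives: List[str] = []
    for seq in sequences:
        if all(identity(seq, rep) < DISTINCT_CUTOFF for rep in representatives):
            representatives.append(seq)
    return representatives
```

With the default `percent_identity`, the sequences passed to `distinct_representatives` must be pre-aligned to a common length; a pairwise aligner could be supplied through the `identity` argument instead.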
Searching the PSI SGKB
All PSI SGKB data and resources are accessible from one central Search Box. Begin your search here:
• By protein sequence
• By keyword (plain text)
• By structure (PDB ID)
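A single search box has to decide which of the three query types it was given. The Python sketch below shows one plausible heuristic; it is an assumption for illustration, not how the PSI SGKB routes queries. `classify_query` and the 20-residue minimum are hypothetical, and the PDB ID pattern follows the classic four-character form (a digit followed by three alphanumeric characters).

```python
import re

# Classic PDB IDs: one digit followed by three letters or digits, e.g. "4HHB".
PDB_ID_PATTERN = re.compile(r"[0-9][A-Za-z0-9]{3}")

# One-letter codes of the 20 standard amino acids.
AMINO_ACIDS = set("ACDEFGHIKLMNPQRSTVWY")


def classify_query(query: str, min_sequence_length: int = 20) -> str:
    """Return 'pdb_id', 'sequence', or 'text' for a search-box query.

    The length threshold is a hypothetical guard: short English words such as
    'kinase' consist only of amino acid letters and would otherwise be
    mistaken for sequences.
    """
    compact = "".join(query.split())  # drop spaces and line breaks
    if PDB_ID_PATTERN.fullmatch(compact):
        return "pdb_id"
    if len(compact) >= min_sequence_length and set(compact.upper()) <= AMINO_ACIDS:
        return "sequence"
    return "text"
```

The heuristic is inherently ambiguous (a long keyword made only of amino acid letters would be read as a sequence), so a real interface might ask the user to confirm the query type.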
Sequence/PDB ID search
A sequence or PDB ID search returns:
• Available structures of proteins with similar or identical amino acid sequences
• Any structural and functional properties (annotations) determined from these protein structures
• Available theoretical/homology models created with amino acid sequences similar to your query
• Any information about similar protein sequences (targets) studied by the PSI structural genomics efforts
• The protocols used during those PSI research efforts
• Ordering information to obtain DNA clone materials, if available
Structures
In the Structures tab, experiment and reference information about the structure is displayed:
• View matching sequence alignment and sequence identity
• Link to RCSB PDB’s Structure Explorer to learn more about the structure
• View information about chemical substrates in the experiment (bound ligands and substrates)
• Download the 3D atomic coordinates for the molecule
• If published, connect to its citation and abstract at PubMed
Annotations
• Genomic features: gene identifier, name and synonyms, operon/regulon mappings from databases
• Protein sequence features: amino acid sequence, taxonomy & phylogeny, isoforms, single nucleotide polymorphisms, post-translational modifications, and sequence families
• Structure features: secondary structure, oligomeric state, structure and functional domains, DNA binding motifs, sites of interaction
• Ligands: information about bound ligands
• Functional/Biochemical classifications: enzyme class, substrate specificity and catalysis, epitope mapping, cellular location, organ location
• Protein Networks and Biological Systems: enzymatic pathways and networks information
• Literature: synonyms for protein names, links to PubMed by database identifier and related text and authors
• Information from more than 50 external annotation resources
Annotations
• Every annotation provided is a link to more content
Future Annotations Layout
• Annotations will be organized by scientific category
• A Quick Annotations Summary will indicate available information
Models
In the Models tab, a list of the homology models available from the integrated Protein Models Portal is displayed:
• View the structural model, and interact with it in a Java window (AstexViewer)
• Download the model’s atomic coordinates
• View predicted domain annotations from databases such as InterPro
• View sequence/domain annotations related to the template structure, such as SCOP and CATH
Models
• AstexViewer lets you view the model
Experimental Data Tracking
TargetDB contains worldwide structural genomics protein target information.
• Search by sequence, Target ID, project site, status, update date, protein name, and source organism
• Links to other sequence databases, domain databases, other structural genomics centers, and the RCSB PDB
• Download target data
• Target statistics summary
PepcDB contains all the functionality of TargetDB plus
• Experimental protocols
• Detailed status history of experimental trials
• Information on failed experiments
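To make the TargetDB search fields concrete, here is a small in-memory Python sketch of filtering target records by several of the listed criteria. It does not use the TargetDB schema or interface: the `Target` class, `search_targets`, and every record and status value below are made up for illustration.

```python
from dataclasses import dataclass
from datetime import date
from typing import Iterable, List, Optional


@dataclass(frozen=True)
class Target:
    """Hypothetical target record covering some of the searchable fields."""
    target_id: str
    project_site: str
    status: str
    updated: date
    protein_name: str
    source_organism: str


# Made-up example records; not real TargetDB entries.
EXAMPLE_TARGETS = [
    Target("EX-0001", "Center A", "Purified", date(2008, 8, 1),
           "Hypothetical protein", "Escherichia coli"),
    Target("EX-0002", "Center A", "Crystallized", date(2008, 9, 1),
           "Putative kinase", "Thermotoga maritima"),
    Target("EX-0003", "Center B", "Purified", date(2008, 9, 10),
           "Putative hydrolase", "Escherichia coli"),
]


def search_targets(
    targets: Iterable[Target],
    *,
    project_site: Optional[str] = None,
    status: Optional[str] = None,
    source_organism: Optional[str] = None,
    name_contains: Optional[str] = None,
    updated_since: Optional[date] = None,
) -> List[Target]:
    """Return the targets that satisfy every criterion that was supplied."""
    results = []
    for target in targets:
        if project_site is not None and target.project_site != project_site:
            continue
        if status is not None and target.status != status:
            continue
        if source_organism is not None and target.source_organism != source_organism:
            continue
        if name_contains is not None and name_contains.lower() not in target.protein_name.lower():
            continue
        if updated_since is not None and target.updated < updated_since:
            continue
        results.append(target)
    return results
```

For example, `search_targets(EXAMPLE_TARGETS, status="Purified")` would select the two made-up records whose status is "Purified". Criteria left as `None` are ignored, which mirrors a search form in which unused fields stay blank.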
Experimental Tracking
• PepcDB search form
Materials Repository
• Directly order targets of interest
Text Search
With a plain text search, find information from:
• PSI Center web pages
• Publications resource
• Technology resource
• Annotation database
Text Search: Site Search
• Access web sites and files from 10 PSI centers and the Technology Portal
Text Search: Structure Publications
Records display:
• The PDB ID and a link to the RCSB PDB Structure Explorer page
• Their DOI and PubMed identifier
• A link to the abstract
Text Search: Annotations
• A text search may find annotations from the database if the text query is a biological term
Text Search: Methodology Publications
Records display:
• Their DOI and PubMed identifier
• A link to the abstract
Technology Module
PSI Centers are actively developing technologies and methodologies for all aspects of the structure determination pipeline:
• Genomic Based Target Selection
• Isolation, Expression, Purification, Crystallization
• Data Collection
• Structure Determination
• PDB Deposition & Release
• Publications
• Functional Annotation
Acknowledgements
KB Group: Wendy Tao, Raship Shah, James Chun, Margaret Gabanyi, Tom Oldfield, John Westbrook
PSI Resources: Andrei Kouranov (Exp. Data Tracking), Torsten Schwede (Models), Paul Adams (Technology), Josh LaBaer (Materials), Wladek Minor (Publications)
Nature: Matthew Day, Boyana Konforti
KB Steering Committee: Chair, Eaton Lattman
Access Information: http://kb.psi-structuralgenomics.org