1 / 16

Secondary structure prediction

Secondary structure prediction. There have been many strategies since the 1970’s to predict secondary structure: –  stereochemical principles (periodicity of helix, strands) – statistics (amino acid preference for helix/strand/coil) PSIPRED is one of the latest prediction methods

temple
Download Presentation

Secondary structure prediction

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Secondary structure prediction There have been many strategies since the 1970’s to predict secondary structure: – stereochemical principles (periodicity of helix, strands) – statistics (amino acid preference for helix/strand/coil) PSIPRED is one of the latest prediction methods – uses multiple alignments of sequences and structural information – not as computer intensive as its predecessors that also used databases The accuracy is > 70% against a known structure test dataset. This may be the practical limit of secondary structure prediction owing to the natural variation of secondary structures in homologous proteins that are used to train the prediction method.

  2. This PSIPRED prediction of a DNA binding domain is very close to reality. There are four helices in this domain. Only the boundaries vary from the actual structure

  3. Secondary structure is useful to find folds A real life example ! I was interested in a protein called Coilin which serves as a scaffold or glue that helps proteins involved in RNA maturation to come together (splicing factors, telomerase, etc). I knew the C-terminal portion of Coilin had a folded domain in it by NMR spectroscopy. mystery Domain

  4. Secondary structure is useful to find folds From PSIPRED, the mystery domain was predicted to have four strands and one small 3-10 helix as well as two large undefined regions Mystery domain

  5. Secondary structure is useful to find folds I determined the structure of the Coilinmystery domain by NMR methods and observed a similar fold with two very large disordered loops Because the loops took up half of the protein domain’s sequence, primary sequence search methods such as BLAST could not find a homolog in the PDB (too many gaps, probably). However, a secondary search using SSEA did find many homologous folds that aided my structure determination. For example, I could build a 3D molecular model and then compare my NMR experimental observations to it.

  6. Secondary structure is useful to find folds Some hits discovered by SSEA that had the same number of strands in the same order as the Coilin mystery domain I made molecular models of Coilin from these structures and then challenged my NMR experimental observations against them

  7. Actual structure of the Coilin mystery domain

  8. Secondary structure is useful to find folds Tudor domains are known to binding methylated lysines and arginines in histones. Other related domains bind peptides and even DNA/RNA. They are very versatile. I performed biochemical experiments and ruled out every binding mode except for protein-protein interactions for the Coilin mystery domain.

  9. Finding folds that are similar in the PDB This is a comparison of 3D coordinates versus all of the entries in the Protein Data Bank A program called SSM is very popular and fast - pairwise comparison and 3D alignment of protein structures - multiple comparison and 3D alignment of protein structures - examination of a protein structure for similarity with the whole archive

  10. Finding folds that are similar in the PDB This protein was used as an example in the previous unit when I presented TOPS diagrams. It’s a viral protein from bacteriophage lambda that serves as the “collar” between the “head” and the “tail” of the virus gpU

  11. Finding folds that are similar in the PDB Like the Coilin mystery domain, PSIPRED correctly predicted all of the beta strands and alpha helices in the protein. Here is the TOPS diagram of that prediction:

  12. Finding folds that are similar in the PDB Despite having a very similar fold, a BLAST primary sequence search failed to detect any homologs in the Protein Data Bank, presumably to the extremely evolutionary distance between viruses and other representatives in the database. Once I performed a 3D SSM search for similar folds, I found these three, among others. My solved structure of gpU has the PDB accession number 1ZIZ.

More Related