290 likes | 425 Views
RNA Bioinformatics Genes and Secondary Structure. Anne Haake Rhys Price Jones & Tex Thompson. Types of RNA. Coding mRNA (messenger) Non-Coding rRNA (ribosomal) tRNA (transfer) snRNA (small nuclear;splicing) RNAseP (ribozyme) siRNA (small inhibitory) Others…. RNA genes?. Recall:
E N D
RNA BioinformaticsGenes and Secondary Structure Anne Haake Rhys Price Jones & Tex Thompson
Types of RNA • Coding • mRNA (messenger) • Non-Coding • rRNA (ribosomal) • tRNA (transfer) • snRNA (small nuclear;splicing) • RNAseP (ribozyme) • siRNA (small inhibitory) • Others…
RNA genes? Recall: • Protein-coding genes; we have relatively good methods • Ab initio • Homology-based • RNA genes • Poor sequence homology • Secondary structure useful
Functional Roles: RNA Secondary Structure • mRNA • Regulation of transcription termination • Regulation of translation initiation • rRNA • ribosomal structure • tRNA • adaptor in translation • RNA interference • Regulation of gene expression • Anti-viral activity
mRNAs have functional secondary structures • e.g. Transcription Termination Signal
Ribozymes: Enzymes made of RNA • RNA molecules in Tetrahymena were shown to splice out introns without the aid of proteins • Ribozymes have been discovered in higher organisms, and may play a role in processing mRNA
Ribonuclease P • Enzyme found in many organisms, cleaves the 5’ end of tRNA molecules • Heterodimer consisting of a protein molecule and an RNA molecule • Without RNA molecule, Ribonuclease P loses all activity • Without protein, Ribonuclease P shows only reduced activity
Pictures from the Web • http://www.mbio.ncsu.edu/RNaseP/rna/threed/threed.html • http://proteinexplorer.org (molecule 1d6t)
Discovery of Ribonuclease P http://www.mun.ca/biochem/courses/3107/Lectures/Topics/Splicing.html
Can identify RNA genes that belong to a known family • Infer secondary structure by comparing sequences (multiple alignments) • e.g. Look for covariance; positions that covary to maintain Watson-Crick base-pairing;implies role in secondary structure • Rfam: a collection of multiple alignments and covariance models for ncRNAs • Rfam
Prediction of RNA Secondary Structure • Find the configuration that maximizes the number of base pairs • Scoring all possibilities would be computationally expensive • Use dynamic programming • Thermodynamics approach • Mfold: uses an energy minimization method of Zuker • http://www.bioinfo.rpi.edu/applications/mfold/ • http://bioweb.pasteur.fr/seqanal/interfaces/mfold-simple.html
RNA Interference • Breakthrough of the year in 2002 • Discovered in C. elegans • dsRNA involved in sequence-specific gene silencing • Post-transcriptional gene silencing • 21-25 nucleotide dsRNAs (siRNAs) facilitate the degradation of homologous RNAs
RNAi • Useful for gene targeting to study function • Other techniques for gene targeting • “knock-out” by homologous recombination • Antisense • siRNA-direct “knock-down” has potential to allow systematic study of each gene in a pathway • siRNA might allow silencing of pathogenic genes or pathogens (e.g. viruses)
Mechanism • siRNAs: 21-23 nt dsRNA with 2-3 nt 3’ overhangs • Produced from cleavage of long dsRNAs by “Dicer” enzyme • Form a siRNA-protein complex “RISC” • Cleaves homologous mRNA target • Also can start with a hairpin precursor rather than dsRNA
siRNA demo • RNA Interference links and refs
Introduction to Proteomics Techniques & Computational Issues
Experimental Techniques • As with transcriptome analysis, proteome analysis is limited by the techniques currently available • But, proteome analysis even more difficult and less precise due to the nature of proteins
Two-dimensional Gel Electrophoresis • 2D gels • First dimension: isoelectric focusing • Separates proteins on basis of charge • Second dimension: SDS-PAGE • Able to resolve thousands of proteins in a single gel • Proteins are usually radioactively labeled
Challenges of 2D gel Analysis • Reproducibility • Software is available to assist in aligning the spots between gels and integrating the intensities of the spots • Identification of the proteins of interest • Some underrepresented e.g. membrane proteins • Some below levels of detection • Which protein is represented by each spot? • Mass spectrometry has greatly enhanced ability to identify individual proteins
2-Dimensional Gel Electrophoresis • http://us.expasy.org/ • For other examples and tools
Mass Spectrometry • Able to uniquely identify the proteins associated with individual spots in 2D gels • Spots are excised from gels • Proteins are digested into peptide fragments using proteases such as trypsin • Trypsin cleaves peptide bonds next to the amino acids lysine and arginine. • Peptides are ionized for Mass Spec analysis • For a quick explanation of Mass Spec see: http://www.jeol.com/ms/docs/whatisms.html
Mass Spectrometry • Generates a peptide mass fingerprint • Computational challenge: the fingerprint must be matched up with the theoretical mass spectrum of the proteins derived from genomics databases • Analysis software ProteinProspector
Protein Microarrays • High-throughput techniques similar to gene chips • Probes (e.g. antibodies) attached to chips • Fluorescently-labeled proteins washed over chips • Fluorescence intensity indicative of relative levels • Variations include protein-compound (drug) interactions, protein-DNA etc.
Protein Microarrays • Major problems with analysis of proteins in this way • Protein-protein binding not determined by strict rules as it is in nucleic acids (base-pairing) • One protein may bind several others on the chip • Protein interactions very sensitive to chemistry • Application of protein arrays often used as a follow-up to gene chip studies