280 likes | 424 Views
Intermolecular Interactions and Biological Pathways Chapter 10 Bioinformatics A Practical Guide to the Analysis of Genes and Proteins Third Edition By Andreas Baxevanis B. F. Francis Ouellette. - Central Dogma of molecular biology: “DNA makes RNA makes Protein”
E N D
Intermolecular Interactions and Biological Pathways Chapter 10 Bioinformatics A Practical Guide to the Analysis of Genes and Proteins Third Edition By Andreas Baxevanis B. F. Francis Ouellette
- Central Dogma of molecular biology: • “DNA makes RNA makes Protein” • - But proteins interact to make metabolism • In other words, we must understand protein interactions in order to understand gene function. • Since proteins do not interact just in pairs or in linear pathways, the interactions are described as networks • together with microarray studies (to quantify gene expression) this work is the foundation of systems biology
Interaction Networks are composed of pathways. • These pathways can be further divided into: • metabolic pathways • cell signaling pathways • gene regulation pathways • These pathway categories are not mutually exclusive
- There are several different interaction databases available. • - These databases are not as simple to construct as are the sequence databases • There is a tradeoff between complexity (completeness) and simplicity (user friendliness) • They may curated or automated (e.g., derived from text- mining of the literature). • Data updating is a big challenge, especially for the curated databases.
Algorithms for identifying protein-protein interactions take the following data into account • Direct experimental evidence • Indirect evidence such as: • - co-purification • - yeast two-hybrid assays • - molecular cross-linking
Methods for biological pathway reconstruction also depend on direct and indirect methods. • direct experimental evidence • “gene neighborhood” • gene fusion • phylogenetic profile • gene sequence similarity • similar patterns of gene expression in microarray expts • “interlogs” (orthologous interactions)
The KEGG database focuses primarily on biochemical pathways.
The KEGG database builds pathways based on consensus data from many organisms (although the data are mainly from a relatively few “model” organisms). It includes enzymes, substrates and products. It also provides links to the genes encoding the enzymes.
The STRING database contains pre-computed results for protein interactions and gene interactions. These results are based on: - gene fusion data - phylogenetic profiles - co-expression patterns - co-mentioned in PubMed abstract (text mining)
STRING database – useful when you start from a single gene *