230 likes | 499 Views
Combining SELEX with quantitative assays to rapidly obtain accurate models of protein–DNA interactions. Jiajian Liu and Gary D. Stormo Presented by Aliya Sadeque. Protein-DNA interactions. Methods for measuring: Yeast 1 hybrid ChIP on chip, DNA microarray
E N D
Combining SELEX with quantitative assays to rapidly obtain accurate models of protein–DNA interactions Jiajian Liu and Gary D. Stormo Presented by Aliya Sadeque
Protein-DNA interactions • Methods for measuring: • Yeast 1 hybrid • ChIP on chip, DNA microarray • Important distinction in terms of specificity • Enzymes vs. transcription factors Bioinformatics. 2000 Jan;16(1):16-23.
TFBSTranscription Factor Binding Sites • Goal: Knowing the specificity of a TF in order to locate its binding sites within the genome • Sites represented as consensus sequences or weight matrices • Databases: TRANSFAC, JASPAR
SELEXSystematic Evolution of Ligands By Exponential Enrichment http://rulai.cshl.edu/tools/ESE2/ESEmatrix.html
QuMFRA Nucleic Acids Res. 2001 Jun 15;29(12):2471-8. PMID: 11410653
Selling points • Provides a general method that can be used for any DNA-binding protein even if nothing is known about its specificity. • Can isolate a small set of specific binding sites from a very large pool of random sequences Nucleic Acids Res. 2001 Jun 15;29(12):2471-8.
SELEX Binding Model • Assuming an additive model • Frequency: • Weight
QuMFRA Procedure • 15 sequences • Cover the space of possible sequence • Competitive binding assay http://www.answers.com/topic/gel-abpp-eg-png
Uh oh, math. • Intensities of each DNA in a separated band • Obtained from emission matrix and output vector • Relative binding constant of a test site with respect to a reference site • Reference was GGGT bound unbound
QuMFRA Binding Model Weight matrix Consensus Sequence Matrix values are proportional to binding affinity according to the Berg and von Hippel theory.
Comparing notesBetween SELEX and QuMFRA • Experimental Kafor 15 sequences • Renormalizedon consensussequence
Comparing notesBetween predictions and empirical binding affinities • Found affinity measures for 8 variants of the consensus sequence with one or two changed positions
A Probabilitistic Recognition CodeI promise those are all real words
AlternativesSAGE-SELEX • Improves on the number of binding sites found by SELEX alone • Large sample size reqired for statistical significance • Biases in SELEX
Alternatives dsDNA chips • Chips contain binding sites for a TF of interest • High throughput quantitative data • Almost all possible binding sites would have to be on the chip…that’s a lot.
AlternativesSimilar Sequences • optimized selection of DNA variants to be tested experimentally • quantitative protein-DNA binding assay • prediction of binding affinity for all variants using a statistical model • Can be done in high throughput with high accuracy, provided the consensus sequence is known
Praises General: • Can be used for a TF of unknown specificity and size • Efficient • Parallel (different colours of fluorophore) • First step narrows down sample size
Heckles • Sequenced a sample of ~20 - too small. • Could skew data. Secondary preference? • Why no A at position 1 in QuMFRA? • Weakness/inherent bias in SELEX • Empirical data - is an average appropriate?
Heckles • Correlation values (0.54, 0.6, 0.77, 0.94) – are these significant? No T-test or anything?