Welcome to Introduction to Bioinformatics, Friday, 19 September 2014. Scenario 2: Simulation. Finding biologically important sites in DNA. How to avoid being fooled by imposters?
Welcome to Introduction to Bioinformatics
Friday, 19 September 2014
Scenario 2: Simulation
Finding biologically important sites in DNA
How to avoid being fooled by imposters?
• Scenario
• Gene regulation
Scenario 2: Finding biologically important sites in DNA
Your object of study: Cyanobacteria
Critical position in food web
• CO2 → sugar
• N2 → ammonia
• H2O → electrons
How do they do it?
N2 fixation in cyanobacteria
[Figure labels: heterocysts, sucrose, NH3, N2, CO2, O2]
Matveyev and Elhai (unpublished)
Differentiation in cyanobacteria
[Diagram: -NH3 → ? ? ? ? ? → Heterocysts]
Response to environment
How do bacteria respond to the environment?
From gene to protein: DNA → RNA → protein
How do bacteria respond to the environment?
From gene to protein: DNA → RNA → protein
[Diagram labels: RNAPol, P, DNA, RNA, protein]
How do cyanobacteria respond to NH3?
From gene to protein: High N vs. Low N
[Diagram labels: NH3, glutamine, α-ketoglutarate; DNA binding protein, NtcA; RNAPol; DNA; Binding site; P; No RNA]
How do cyanobacteria respond to NH3?
From gene to protein: Low N
[Diagram labels: α-ketoglutarate, NtcA, Binding site, RNAPol, P, DNA, RNA, protein]
Differentiation in cyanobacteria
[Diagram: -NH3 → Activates NtcA (Nitrogen Control) → ? ? ? → Heterocysts]
Differentiation in cyanobacteria: What DNA site does NtcA bind to?
[Diagram labels: NtcA, Binding site, RNAPol, P]
Differentiation in cyanobacteria: What DNA site does NtcA bind to?
GTA…(8)…TAC …(20-24)…TAnnnT
[Diagram labels: NtcA, Binding site, RNAPol, P, mRNA]
Herrero et al. (2001) J Bacteriol 183:411-425
Differentiation in cyanobacteria: Integration of signals through HetR
[Diagram labels: -N, NtcA, ???, HetQ, Position in cell cycle, Level of PatS, Level of HetN; HetR (Master regulator); Genes needed for differentiation]
Strategy
• PCR out hetQ
• Random mutagenesis
• Look for effects on HetR expression/activity
Differentiation in cyanobacteria: Find primers to PCR out hetQ
cctatctccgccctatggcgatttgggcaatatatttgatgattggttag  ...hypothetical
ttgtcagttgtcagacgtagtagcgcgtctagtctaatgtgttgttatat  protein
tatttgctactagaaatgaggagagggttatttttctcactgcttcccaa
ttctatgagaatataaaattttccttaagtttctcatggcaataatggaa
aaaaccgaccattctgatgaataagtccggttttttccaaaaaatatttt
tgctttttcgctttatttatctatatttccaagttttagtacatcggtga
ggggtgacaactatcttgccaatattgtcgttattgttaggttgctatcg
gaaaaaatctgtaacatgagatacacaatagcatttatatttgctttagt
atctctctcttgggtgggattctgcctgcaatttaaaaaccagtgttaac
aattttcggctttattttccgggagttaaatcaaccaagggaaaatgtaa
ctaatgtttaaatatcttcggatacacacaaagtaaaaccaatttttaca
gatgtcgatgttgctcacattttttagaaatattactaaattaaaaatgt
tattaaatttatgttcatagagaaccttttccaaataaaaaaataatttt
cctgatgttttaagaaaattactgttgttataaattaaaggtgattcaac
aaaatatagatagttctttcaataactatctacttttaccattaagtgaa
cttactcatgaataatcaacaggaattaaaaataaagttcatgaatactg
gttaaagattcagtaaagtttgaggaaataccggaataaatttccaccca
aatatgattttttaaaagatacattggcagtacattaaaatgccgatgtt
agataaatttgccttcatagctgttatctatttgctcagaactaagccaa
gagtttacacaccaaacagaaattaaactatgaatccctcttcgtcgtta  hetQ...
Differentiation in cyanobacteria: Find primers to PCR out hetC
ttgtcagttgtcagacgtagtagcgcgtctagtctaatgtgttgttatat
tatttgctactagaaatgaggagagggttatttttctcactgcttcccaa
ttctatgagaatataaaattttccttaagtttctcatggcaataatggaa
aaaaccgaccattctgatgaataagtccggttttttccaaaaaatatttt
tgctttttcgctttatttatctatatttccaagttttagtacatcggtga
ggggtgacaactatcttgccaatattgtcgttattgttaggttgctatcg
gaaaaaatcTGTAacatgagaTACAcaatagcatttatatttgctttagt
atctctctcttgggtgggattctgcctgcaatttaaaaaccagtgttaac
aattttcggctttattttccgggagttaaatcaaccaagggaaaatgtaa
ctaatgtttaaatatcttcggatacacacaaagtaaaaccaatttttaca
gatgtcgatgttgctcacattttttagaaatattactaaattaaaaatgt
tattaaatttatgttcatagagaaccttttccaaataaaaaaataatttt
cctgatgttttaagaaaattactgttgttataaattaaaggtgattcaac
aaaatatagatagttctttcaataactatctacttttaccattaagtgaa
cttactcatgaataatcaacaggaattaaaaataaagttcatgaatactg
gttaaagattcagtaaagtttgaggaaataccggaataaatttccaccca
aatatgattttttaaaagatacattggcagtacattaaaatgccgatgtt
agataaatttgccttcatagctgttatctatttgctcagaactaagccaa
gagtttacacaccaaacagaaattaaactatgaatccctcttcgtcgtta  hetC...
Highlighted: GTA…(8)…TAC
Differentiation in cyanobacteria
ttctatgagaatataaaattttccttaagtttct
aaaaccgaccattctgatgaataagtccggtttt
tgctttttcgctttatttatctatatttccaagt
ggggtgacaactatcttgccaatattgtcgttat
gaaaaaatctGTAacatgagaTACacaatagcatttatatttgcttTAgtaTctctctcttgggtggg
• GTA…(8)…TAC: NtcA binding site
• …(20-24)…TAnnnT: Promoter
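The pattern on this slide, GTA…(8)…TAC …(20-24)…TAnnnT, can be written as a regular expression and searched for mechanically. The following is a minimal sketch, not the course's own tool: the function name `find_ntca_promoters` is hypothetical, "any base" is assumed to mean A, C, G, or T only, and the fragment is the last sequence line of the slide.

```python
import re

# Fragment copied from the slide (upper case marks the highlighted bases).
fragment = "gaaaaaatctGTAacatgagaTACacaatagcatttatatttgcttTAgtaTctctctcttgggtggg"

# GTA, 8 of any base, TAC, 20-24 of any base, TA, 3 of any base, T.
# The lookahead (?=...) lets hits that overlap each other all be reported.
NTCA_PROMOTER = re.compile(
    r"(?=(GTA[ACGT]{8}TAC([ACGT]{20,24})TA[ACGT]{3}T))"
)

def find_ntca_promoters(seq):
    """Return (start, matched_text, spacer_length) for each start position that matches.

    Only one spacer length is reported per start position (the longest that fits).
    """
    seq = seq.upper()
    return [(m.start(), m.group(1), len(m.group(2)))
            for m in NTCA_PROMOTER.finditer(seq)]

hits = find_ntca_promoters(fragment)
for start, text, spacer in hits:
    print(f"start={start} spacer={spacer} site={text}")
```

Finding the pattern this way only shows that it is present. Whether its presence means anything is the question the following slides take up.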
Differentiation in cyanobacteria: Integration of signals through HetR
[Diagram labels: -N, NtcA, HetQ, ??? (three unknowns), Position in cell cycle, Level of PatS, Level of HetN; HetR (Master regulator); Genes needed for differentiation]
Stockholm
How to proceed? Choice #1
• Publish
• Grant proposals
• Build a career
Likely result
• Reviewers trash the manuscript: too speculative
How to proceed? Choice #2
• Forget about it
• Back to PCR
Likely result
• Sometimes miss a spectacular finding
How to proceed? Choice #3
• Forget about PCR
• Do backbreaking NtcA binding studies
"I'd knock out NtcA, reintroduce it on a plasmid into Nostoc, and do RT-PCR to check gene expression."
Likely result
• Might demonstrate binding of NtcA
• Risky, may lose many months
How to proceed? Choice #4
• Determine whether site is likely to be real
How?
• High school math approach: $\frac{N!}{a!\,(N-a)!}$
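One way the "high school math" route could go is sketched below. The slide gives only the binomial coefficient, so everything else here is an assumption made for illustration: bases are taken as independent and equally frequent (1/4 each), every window along the sequence is treated as an independent trial, and the region searched is taken to be about 1000 nucleotides. The names `binomial_tail`, `seq_len`, and `fixed_positions` are hypothetical.

```python
from math import comb

def binomial_tail(n, p, k):
    """P(at least k successes in n independent trials, each with success chance p).

    comb(n, i) is the binomial coefficient n! / (i! (n-i)!).
    """
    return 1.0 - sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k))

# Assumed for illustration, not taken from the slides:
seq_len = 1000            # rough length of the region being searched
motif_len = 3 + 8 + 3     # GTA + 8 unspecified bases + TAC
fixed_positions = 6       # only G,T,A and T,A,C are specified

p = 0.25 ** fixed_positions        # chance that one window matches
n = seq_len - motif_len + 1        # number of windows in the region
expected = n * p                   # expected number of chance matches
p_at_least_one = binomial_tail(n, p, 1)

print(f"windows={n} p={p:.6f} expected={expected:.3f} "
      f"P(>=1 match)={p_at_least_one:.3f}")
```

GTA…(8)…TAC is its own reverse complement, so counting windows on one strand already covers both. The equal-frequency assumption is the weak point for a sequence as AT-rich as the one shown, which is one reason to prefer a simulation based on the real sequence.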
How to proceed? Choice #4
• Determine whether site is likely to be real
How? BIOINFORMATICS
• Simulation
• Exhaustive pattern search
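A simulation along these lines might look like the sketch below: shuffle the real sequence many times, which keeps its base composition but destroys any real sites, and ask how often GTA…(8)…TAC turns up by chance. This is an illustrative sketch under assumptions, not the procedure used in the course: the function names, the number of trials, the fixed random seed, and the choice of shuffling (rather than some other null model) are all made up for the example, and the short fragment from the earlier slide stands in for the full upstream region.

```python
import random
import re

# GTA, 8 of any base, TAC; the lookahead counts overlapping hits too.
MOTIF = re.compile(r"(?=GTA[ACGT]{8}TAC)")

def count_motif(seq):
    """Number of start positions in seq where the motif occurs."""
    return sum(1 for _ in MOTIF.finditer(seq.upper()))

def fraction_with_motif(seq, trials=2000, seed=0):
    """Fraction of shuffled copies of seq that contain at least one motif."""
    rng = random.Random(seed)      # fixed seed so the run is repeatable
    bases = list(seq.upper())
    with_motif = 0
    for _ in range(trials):
        rng.shuffle(bases)         # same composition, scrambled order
        if count_motif("".join(bases)) > 0:
            with_motif += 1
    return with_motif / trials

real = "gaaaaaatctgtaacatgagatacacaatagcatttatatttgctttagtatctctctcttgggtggg"
print("matches in real sequence:", count_motif(real))
print("fraction of shuffles with a match:", fraction_with_motif(real))
```

If a sizeable fraction of shuffled sequences contain the motif, finding it in the real sequence is weak evidence on its own. A pitfall worth keeping in mind is that shuffling preserves only single-base frequencies, not longer-range features of real DNA.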
Regulatory Proteins and Their Binding Sites: What do we talk about?
• Significance of palindromes (SQ7 and topic H)
• Nature of regulation (through gene fusions) (SQ8)
• Gene fusions: e.g. ntcA / lacZ (SQ8)
• How many promoters? CRP-binding sites? (SQ5)
• Simulations?
• Why do them? (SQ10)
• Pitfalls? (SQ9)
Regulatory Proteins and Their Binding Sites: Palindromic sequences
What is it? Backwards = forwards
• ROTATOR
• GCTATCG
What about with DNA?
• DNA is double stranded
• DNA is redundant
• DNA has direction (read 5'->3')
5'-TTAATGTGAGTTAGCTCACTCATT-3'
3'-AATTACACTCAATCGAGTGAGTAA-5'
Top strand read 5'->3': TTAATGTGAGTTAGCTCACTCATT
Bottom strand read 5'->3': AATGAGTGAGCTAACTCACATTAA
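Because DNA is double stranded and each strand is read 5'->3', a DNA "palindrome" is a sequence that equals its own reverse complement, not one that merely reads the same backwards as text. The sketch below illustrates the difference; the function names are hypothetical, and GAATTC is an example sequence that does not appear on the slides.

```python
# Map each base to its complement, preserving case.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")

def reverse_complement(seq):
    """The opposite strand, written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def is_dna_palindrome(seq):
    """True if the sequence reads the same 5'->3' on both strands."""
    s = seq.upper()
    return s == reverse_complement(s)

def palindrome_mismatches(seq):
    """Number of positions where the two strands, both read 5'->3', differ."""
    s = seq.upper()
    return sum(a != b for a, b in zip(s, reverse_complement(s)))

site = "TTAATGTGAGTTAGCTCACTCATT"
print(reverse_complement(site))          # bottom strand, read 5'->3'
print(is_dna_palindrome("GCTATCG"))      # text palindrome, but not a DNA palindrome
print(is_dna_palindrome("GAATTC"))       # a perfect DNA palindrome
print(palindrome_mismatches(site))       # the slide's site is only approximately palindromic
```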
Regulatory Proteins and Their Binding Sites: Palindromic sequences
5'-TTAATGTGAGTTAGCTCACTCATT-3'
3'-AATTACACTCAATCGAGTGAGTAA-5'
[Figure: a palindromic DNA duplex folded back on itself, with TAT loops on each strand. DNA: cruciform. RNA: stem/loop]
Regulatory Proteins and Their Binding Sites: Palindromic sequences
5'-TTAATGTGAGTTAGCTCACTCATT-3'
3'-AATTACACTCAATCGAGTGAGTAA-5'
[Figure: a single RNA strand folded into a stem with a UAU loop; example: tRNA. DNA: cruciform. RNA: stem/loop]
Regulatory Proteins and Their Binding Sites: Palindromic sequences
Top strand, 5'->3': TTAATGTGAGTTAGCTCACTCATT
Bottom strand, 5'->3': AATGAGTGAGCTAACTCACATTAA
[Figure: the site embedded in flanking sequence (NNN…NNN); the protein recognizes GTGAGTT]
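One consequence of a near-palindromic site is that the same recognition element can be found on both strands when each is read 5'->3'. The sketch below looks for GTGAGTT on the two strands shown on the slide, allowing a small number of mismatches. It is only an illustration: the function name and the one-mismatch threshold are assumptions, not something the slides specify.

```python
def approximate_hits(seq, word, max_mismatches=1):
    """Return (position, mismatches) for each window of seq within max_mismatches of word."""
    seq, word = seq.upper(), word.upper()
    hits = []
    for i in range(len(seq) - len(word) + 1):
        window = seq[i:i + len(word)]
        mismatches = sum(a != b for a, b in zip(window, word))
        if mismatches <= max_mismatches:
            hits.append((i, mismatches))
    return hits

top = "TTAATGTGAGTTAGCTCACTCATT"       # top strand, 5'->3'
bottom = "AATGAGTGAGCTAACTCACATTAA"    # bottom strand, 5'->3', as on the slide

print("top:   ", approximate_hits(top, "GTGAGTT"))
print("bottom:", approximate_hits(bottom, "GTGAGTT"))
```

Allowing mismatches makes the search more forgiving, but it also raises the number of chance hits, which feeds straight back into the question of how to avoid being fooled by imposters.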