Mature protein Signal peptide Ta-18 ATGAAGACCTTCCTCATCTTTGCTCTCCTCGCCACTGCAGCGACAAGTGCCATTGCACAAATGGAGACTAGCCGCGTCCCTGGTTTGGAGAAACCATGGCAG 102 Ta-36 ----G---------TG------------------T------------------------------------------------------------------- 102 M K/R T F L I/V F A L L A T/I A A T S A I A Q M E T S R V P G L E K P W Q 34 CAGCAACCATTATCACCACAACAACAACCACCATGTTCACAGCAACAACAACCACTTCCGCAGCAACAACAACCAATTATTATACTGCAGCAACCACCATTT 204 ------------------------------------------------------------------------------------------------------ 204 Q Q P L S P Q Q Q P P C S Q Q Q Q P L P Q Q Q Q P I I I L Q Q P P F 68 TCGCAGCAACAACAACCAGTTCTACCGCAGCAACAACAACCAGTTATTATACTACAACAACCACCATTTTTGGAGCAACAACAACCAGTTCTACCACAACAA 306 ------------------------------------------------------------------------------------------------------ 306 S Q Q Q Q P V L P Q Q Q Q P V I I L Q Q P P F L E Q Q Q P V L P Q Q 102 CCATCATTTTCACAACAACAACAACAACAACAACAACAACCACCATTTTTGGAGCAACAACAACCAGTTCTACCACAACAACCATCATTTTCACAACAACAA 408 ------------------------------------------------------------------------------------------------------ 408 P S F S Q Q Q Q Q Q Q Q Q P P F L E Q Q Q P V L P Q Q P S F S Q Q Q 136 CAACAACAACAACAACCATTTCCGCAGCAGCAACAACCATCTTCACAACAACAACCTTTTCCACAACAACACCAACATCTTCTGCAACAACAAATCCCTGTT 510 ------------------------------------------------------------------------------------------------------ 510 Q Q Q Q Q P F P Q Q Q Q P S S Q Q Q P F P Q Q H Q H L L Q Q Q I P V 170 GTTCAACCATCCGTTTTGCAGCAGCTACACCCATGCAAGGTATTCCTCCAGCAGCAGTGCAGCCATGTGGCAATGTCGCAACGTCTTGCTAGGTCGCAAATG 612 ------------------------------------------------------------------------------------------------------ 612 V Q P S V L Q Q L H P C K V F L Q Q Q C S H V A M S Q R L A R S Q M 204 TGGCAGCAGAGCAGTTGCCATGTGATGCAGCAACAATGTTGCCAGCAGCTGCCGCAAATCCCCGAACAATCCCGCTATGAGGCAATCCGTACCATCGTCTAC 714 ------------------------------------------------------------------------------------------------------ 714 W Q Q S S C H V M Q Q Q 
C C Q Q L P Q I P E Q S R Y E A I R T I V Y 238 TCCATCATCCTGCAAGAACAACAACAGGGGTTTGTCCAACCTCAGCAGCAACAACCCCAACAGTTGGGCCAAGGTGTCTCCCAACCCCAACAGCAGTCGCAG 816 ------------------------------------------------------------------------------------------------------ 816 S I I L Q E Q Q Q G F V Q P Q Q Q Q P Q Q L G Q G V S Q P Q Q Q S Q 272 CAACAGCAGCTCGGACAGTGTTCTTTCCAACAACCTCAACAACAACAACTGGGTCAGCAGCCTCAACAACAACAGATACCACAGGGTACATTCTTGCAGCCA 918 ------------------------------------------------------------------------------------------------------ 918 Q Q Q L G Q C S F Q Q P Q Q Q Q L G Q Q P Q Q Q Q I P Q G T F L Q P 306 CACCAGATATCTCAACTTGAGGTGATGACTTCCATTGCACTCCGTACCCTGCCAACGATATGCGGTGTCAATGTGCCGTTGTACAGCTCCACCACTAGTGTG 1020 ------------------------------------------------------------------------------------------------------ 1020 H Q I S Q L E V M T S I A L R T L P T I C G V N V P L Y S S T T S V 340 CCATTCGGCATTGGAACCGGAGTTGGTGGCTACTGATAA 1059 --------------------------------------- 1059 P F G I G T G V G G Y * * 351 a Mature protein Signal peptide Ta-24 ATGAAGACATTCCTCGTCTTTGCCCTCCTCGCCATTGTGGCGACAAGTGTCATTGCGCAGATGGAGACTAGCTGCATCCCTGGTTTGGAGAGACCATGGCAG 102 Ta-32 ----G---C-----T--------------------------------------------------------------------------------------- 102 M K/R T F L V F A L L A I V A T S V I A Q M E T S C I P G L E R P W Q 34 CAGCAACCATTACCACCACAACAGACATTATTTCCACAACAACAACCATTTCCACAACAACAACAACCACCATTTTCACAACAACAACCATCATTTTCGCAG 204 ------------------------------------------------------------------------------------------------------ 204 Q Q P L P P Q Q T L F P Q Q Q P F P Q Q Q Q P P F S Q Q Q P S F S Q 68 CAACAACCACCATTTTCGCAGCAACAACCAATTCTACCGCAGCAACCACCATTTTCACAGCAACAACAACCAGCTCTACCGCAACAATCACCATTTTTGCAG 306 ------------------------------------------------------------------------------------------------------ 306 Q Q P P F S Q Q Q P I L P Q Q P P F S Q Q Q Q P A L P Q Q S P F L Q 102 
CAACAACAACTAGTTTTACCTCCACAACAACAACACCAACAGCTTCTGCAACAACAAATCCCTATTGTTCAACCATCCGTTTTGCAGCAGCTAAACCCATGC 408 ------------------------------------------------------------------------------------------------------ 408 Q Q Q L V L P P Q Q Q H Q Q L L Q Q Q I P I V Q P S V L Q Q L N P C 136 AAGGTATTCCTCCAGCAGAAGTGCAGCCCTGTAGCAATGCCACAACGTCTTGCTAGGTCGCAAATGTGGCAGCAGAGCAGTTGCCATGTGATGCAACAACAA 510 ------------------------------------------------------------------------------------------------------ 510 K V F L Q Q K C S P V A M P Q R L A R S Q M W Q Q S S C H V M Q Q Q 170 TGTTGCCAGCAGTTGCCGCAAATCCCCGAACAATCCCGCTATGAGGCAATCCGTGCCATCACCTACTCCATCATCCTGCAAGAACAACAACAGGGTTTTGTC 612 ------------------------------------------------------------------------------------------------------ 612 C C Q Q L P Q I P E Q S R Y E A I R A I T Y S I I L Q E Q Q Q G F V 204 CAACCTCAGCAGCAACAGCCCCAACAGTCGGGTCAAGGTGTCTCCCAATCCCAACAGCAGTCGCAGCAGCAGCTCGGACAATGTTCTTTCCAACAACCTCAA 714 -----------------A------------------------------------------------------------------------------------ 714 Q P Q Q Q Q P Q Q S G Q G V S Q S Q Q Q S Q Q Q L G Q C S F Q Q P Q 238 CAGCAACTGGGTCAACAGCCTCAACAACAACAAGTACTACAGGGTACCTTTTTGCAACCACACCAGATAGCTCACCTTGAGGTGATGACTTCCATTGCACTC 816 ------------------------------------------------------------------------------------------------------ 816 Q Q L G Q Q P Q Q Q Q V L Q G T F L Q P H Q I A H L E V M T S I A L 272 CGTACCCTGCCAACGATGTGCAGCGTCAATGTGCCGTTGTACAGCTCCACCACTAGTGTGCCATTCAGCGTTGGCACCGGAGTTGGTGTCTACTGATAA 915 --------------------------------------------------------------------------------------------------- 915 R T L P T M C S V N V P L Y S S T T S V P F S V G T G V G V Y * * 303 b
Mature protein Signal peptide Ta-17 ATGAAGACCTTCCTCATCTTTGCCCTCCTCGCCATTGTGGCGACAAGTGTCATTGCGCAGATGGAGACTAGCTGCATCCCTGGTTTGGAGAGACCATGGCAG 102 Ta-41 -------------C-G------------C------------------------------------------------------------------------- 102 M K T F L/PI/V F A L L/P A I V A T S V I A Q M E T S C I P G L E R P W Q 34 CAGCAACCATTACCACCACAACAGACATTATTTCCACAACAACAACCATTTCCACAACAACAACAACCACCATTTTCACAACAACAACCATCATTTTCGCAG 204 ------------------------------------------------------------------------------------------------------ 204 Q Q P L P P Q Q T L F P Q Q Q P F P Q Q Q Q P P F S Q Q Q P S F S Q 68 CAACAACCACCATTTTCGCAGCAACAACCAATTCTACCGCAGCAACCACCATTTTCACAGCAACAACAACCAGCTCTACCGCAACAATCACCATTTTTGCAG 306 ------------------------------------------------------------------------------------------------------ 306 Q Q P P F S Q Q Q P I L P Q Q P P F S Q Q Q Q P A L P Q Q S P F L Q 102 CAACAACAACTAGTTTTACCTCCACAACAACAACACCAACAGCTTCTGCAGCAACAAATCCCTATTGTTCAACCATCCGTTTTGCAGCAGCTAAACCCATGC 408 --------------------------------------------------A--------------------------------------------------- 408 Q Q Q L V L P P Q Q Q H Q Q L L Q Q Q I P I V Q P S V L Q Q L N P C 136 AAGGTATTCCTCCAGCAGAAGTGCAGCCCTGTAGCAATGCCACAACGTCTTGCTAGGTCGCAAATGTGGCAGCAGAGCAGTTGCCATGTGATGCAACAACAA 510 ------------------------------------------------------------------------------------------------------ 510 K V F L Q Q K C S P V A M P Q R L A R S Q M W Q Q S S C H V M Q Q Q 170 TGTTGCCAGCAGTTGCCGCAAATCCCCGAACAATCCCGCTATGAGGCAATCCGTGCCATCACCTACTCCATCATCCTGCAAGAACAACAACAGGGTTTTGTC 612 ------------------------------------------------------------------------------------------------------ 612 C C Q Q L P Q I P E Q S R Y E A I R A I T Y S I I L Q E Q Q Q G F V 204 CAACCTCAGCAGCAACAACCCCAACAGTCGGGTCAAGGTGTCTCCCAATCCCAACAGCAGTCGCAGCAGCAGCTCGGACAATGTTCTTTCCAACAACCTCAA 714 ------------------------------------------------------------------------------------------------------ 714 Q P Q Q Q Q P Q Q S G Q 
G V S Q S Q Q Q S Q Q Q L G Q C S F Q Q P Q 238 CAGCAACTGGGTCAACAGCCTCAACAACAACAAGTACTACAGGGTACCTTTTTGCAACCACACCAGATAGCTCACCTTGAGGTGATGACTTCCATTGCACTC 816 ------------------------------------------------------------------------------------------------------ 816 Q Q L G Q Q P Q Q Q Q V L Q G T F L Q P H Q I A H L E V M T S I A L 272 CGTACCCTGCCAACGATGTGCAGCGTCAATGTGCCGTTGTACAGCTCCACCACTAGTGTGCCATTCAGCGTAGGCACCGGAGTTGGTGCCTAA 909 -----------------------------------------------------------------------T--------------------- 909 R T L P T M C S V N V P L Y S S T T S V P F S V G T G V G A * 303 c Mature protein Signal peptide Ta-4 ATGAAGACCTTCCCCGTCTTTGCCCTCCTCGCCATTGTGGCGACAAGTGTCATTGCGCAGATGGAGACTAGCTGCATCCCTGGTTTGGAGAGACCATGGCAG 102 Ta-21 -------------T-A-------------------------------------------------------------------------------------- 102 Ta-28 --------A----TT--------------------------------------------------------------------------------------- 102 M K T F L/PI/V F A L L A I V A T S V I A Q M E T S C I P G L E R P W Q 34 CAGCAACCATTACCACCACAACAGACATTATTTCCACAACAACAACCATTTCCACAACAACAACAACCACCATTTTCACAACAACAACCATCATTTTCGCAG 204 ------------------------------------------------------------------------------------------------------ 204 ------------------------------------------------------------------------------------------------------ 204 Q Q P L P P Q Q T L F P Q Q Q P F P Q Q Q Q P P F S Q Q Q P S F S Q 68 CAACAACCACCATTTTCGCAGCAACAACCAATTCTACCGCAGCAACCACCATTTTCACAGCAACAACAACCAGCTCTACCGCAACAATCACCATTTTTGCAG 306 ------------------------------------------------------------------------------------------------------ 306 ------------------------------------------------------------------------------------------------------ 306 Q Q P P F S Q Q Q P I L P Q Q P P F S Q Q Q Q P A L P Q Q S P F L Q 102 CAACAACAACTAGTTTTACCTCCACAACAACAACACCAACAGCTTCTGCAACAACAAATCCCTATTGTTCAACCATCCGTTCTGCAGCAGCTAAACCCATGC 408 
---------------------------------------------------------------------------------T-------------------- 408 ---------------------------------------------------------------------------------T-------------------- 408 Q Q Q L V L P P Q Q Q H Q Q L L Q Q Q I P I V Q P S V L Q Q L N P C 136 AAGGTATTCCTCCAGCAGAAGTGCAGCCCTGTAGCAATGCCACAACGTCTTGCTAGGTCGCAAATGTGGCAGCAGAGCAGTTGCCATGTGATGCAACAACAA 510 ------------------------------------------------------------------------------------------------------ 510 ------------------------------------------------------------------------------------------------------ 510 K V F L Q Q K C S P V A M P Q R L A R S Q M W Q Q S S C H V M Q Q Q 170 TGTTGCCAGCAGTTGCCGCAAATCCCCGAACAATCCCGCTATGAGGCAATCCGTGCCATCACCTACTCCATCATCCTGCAAGAACAACAACAGGGTTTTGTC 612 ------------------------------------------------------------------------------------------------------ 612 ------------------------------------------------------------------------------------------------------ 612 C C Q Q L P Q I P E Q S R Y E A I R A I T Y S I I L Q E Q Q Q G F V 204 CAACCTCAGCAGCAACAACCCCAACAGTCGGGTCAAGGTGTCTCCCAATCCCAACAGCAGTCGCAGCAGCAGCTCGGACAATGTTCTTTCCAACAACCTCAA 714 ------------------------------------------------------------------------------------------------------ 714 ------------------------------------------------------------------------------------------------------ 714 Q P Q Q Q Q P Q Q S G Q G V S Q S Q Q Q S Q Q Q L G Q C S F Q Q P Q 238 CAGCAACTGGGTCAACAGCCTCAACAACAACAAGTACTACAGGGTACCTTTTTGCAACCACACCAGATAGCTCACCTTGAGGTGATGACTTCCATTGCACTC 816 ------------------------------------------------------------------------------------------------------ 816 ------------------------------------------------------------------------------------------------------ 816 Q Q L G Q Q P Q Q Q Q V L Q G T F L Q P H Q I A H L E V M T S I A L 272 CGTACCCTGCCAACGATGTGCAGCGTCAATGTGCCGTTGTACAGCTCCACCACTAGTGTGCCATTCAGCGTTGGCACCGGAGTTGCTGCCTACTGATAA 915 
--------------------------------------------------------------------------------------------------- 915
--------------------------------------------------------------------------------------------------- 915
R T L P T M C S V N V P L Y S S T T S V P F S V G T G V A A Y * * 305
d

Supplementary Fig. 2 The deduced amino acid sequence alignments of four groups of genes with the same mature proteins. The three, two and two sequences with the same N-terminal METSCIP- are shown in a, b and c, respectively. The two sequences with the same N-terminal METSRVP- are shown in d. Short bars (-) indicate nucleotides that were the same among (between) the sequences compared; * indicates stop codons.
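The short-bar notation described in the caption can be read mechanically: wherever a row shows a bar, the nucleotide is taken from the first sequence of the group. The following is a minimal Python sketch of that reading, not part of the original figure. The function names `expand_variant` and `differences` are hypothetical, and the example uses only the first 18 nucleotides of the Ta-18 and Ta-36 rows.

```python
def expand_variant(reference: str, variant: str) -> str:
    """Rebuild the full sequence of a row written in short-bar notation.

    A '-' in `variant` means "same nucleotide as the reference row";
    any other character replaces the reference nucleotide at that position.
    """
    if len(reference) != len(variant):
        raise ValueError("reference and variant rows must have the same length")
    return "".join(r if v == "-" else v for r, v in zip(reference, variant))


def differences(reference: str, variant: str) -> list[tuple[int, str, str]]:
    """List (1-based position, reference base, variant base) for each substitution."""
    return [
        (i + 1, r, v)
        for i, (r, v) in enumerate(zip(reference, variant))
        if v != "-" and v != r
    ]


# First 18 nucleotides of the Ta-18 row and the matching part of the Ta-36 row.
ta18 = "ATGAAGACCTTCCTCATC"
ta36 = "----G" + "-" * 9 + "TG--"  # written piecewise so the bar count is explicit

print(expand_variant(ta18, ta36))
print(differences(ta18, ta36))
```

For this fragment the substitutions fall at positions 5, 15 and 16, which is consistent with the K/R and I/V alternatives shown in the translated line beneath the signal peptide.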