1 / 19

Data formats in Bioinformatics

Data formats in Bioinformatics. >CRAB_ANAPL ALPHA CRYSTALLIN B CHAIN (ALPHA(B)-CRYSTALLIN). MDITIHNPLI RRPLFSWLAP SRIFDQIFGE HLQESELLPA SPSLSPFLMR SPIFRMPSWL ETGLSEMRLE KDKFSVNLDV KHFSPEELKV KVLGDMVEIH GKHEERQDEH GFIAREFNRK YRIPADVDPL TITSSLSLDG VLTVSAPRKQ SDVPERSIPI TREEKPAIAG AQRK

harsha
Download Presentation

Data formats in Bioinformatics

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data formats in Bioinformatics

  2. >CRAB_ANAPL ALPHA CRYSTALLIN B CHAIN (ALPHA(B)-CRYSTALLIN). MDITIHNPLI RRPLFSWLAP SRIFDQIFGE HLQESELLPA SPSLSPFLMR SPIFRMPSWL ETGLSEMRLE KDKFSVNLDV KHFSPEELKV KVLGDMVEIH GKHEERQDEH GFIAREFNRK YRIPADVDPL TITSSLSLDG VLTVSAPRKQ SDVPERSIPI TREEKPAIAG AQRK >CRAB_BOVIN MDIAIHHPWI RRPFFPFHSP SRLFDQFFGE HLLESDLFPA STSLSPFYLR PPSFLRAPSW IDTGLSEMRL EKDRFSVNLD VKHFSPEELK VKVLGDVIEV HGKHEERQDE HGFISREFHR KYRIPADVDP LAITSSLSSD GVLTVNGPRK QASGPERTIP ITREEKPAVT AAPKK >CRAB_CHICK ALPHA CRYSTALLIN B CHAIN (ALPHA(B)-CRYSTALLIN). MDITIHNPLV RRPLFSWLTP SRIFDQIFGE HLQESELLPT SPSLSPFLMR SPFFRMPSWL ETGLSEMRLE KDKFSVNLDV KHFSPEELKV KVLGDMIEIH GKHEERQDEH GFIAREFSRK YRIPADVDPL TITSSLSLDG VLTVSAPRKQ SDVPERSIPI TREEKPAIAG SQRK >CRAB_HUMAN ALPHA CRYSTALLIN B CHAIN (ALPHA(B)-CRYSTALLIN) (ROSENTHAL FIBER). MDIAIHHPWI RRPFFPFHSP SRLFDQFFGE HLLESDLFPT STSLSPFYLR PPSFLRAPSW FDTGLSEMRL EKDRFSVNLD VKHFSPEELK VKVLGDVIEV HGKHEERQDE HGFISREFHR KYRIPADVDP LTITSSLSSD GVLTVNGPRK QVSGPERTIP ITREEKPAVT AAPKK >CRAB_MESAU ALPHA CRYSTALLIN B CHAIN (ALPHA(B)-CRYSTALLIN). MDIAIHHPWI RRPFFPFHSP SRLFDQFFGE HLLESDLFST ATSLSPFYLR PPSFLRAPSW IDTGLSEMRM EKDRFSVNLD VKHFSPEELK VKVLGDVVEV HGKHEERQDE HGFISREFHR KYRIPADVDP LTITSSLSSD GVLTVNGPRK QASGPERTIP ITREEKPAVT AAPKK FASTA

  3. ID X03006; SV 1; linear; mRNA; STD; MAM; 620 BP. XX AC X03006; XX SV X03006.1 XX DT 28-JAN-1986 (Rel. 08, Created) DT 12-SEP-1993 (Rel. 36, Last updated, Version 2) XX DE Bovine mRNA for lens beta-s-crystallin XX KW beta-crystallin; beta-gamma-crystallin; crystallin. XX OS Bos taurus (cow) OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; OC Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; Pecora; Bovidae; OC Bovinae; Bos. XX RN [1] RP 1-620 RX PUBMED; 4054100. RA Quax-Jeuken Y.E.F.M., Driessen H., Leunissen J., Quax W.J., de Jong W., RA Bloemendal H.; RT "Beta-s-crystallin: structure and evolution of a distinct member of the RT beta-gamma-superfamily"; RL EMBO J. 4(10):2597-2602(1985). XX CC Data kindly reviewed (06-MAR-1986) by Y. Quax-Jeuken XX ... EMBL

  4. ... FH Key Location/Qualifiers FH FT source 1..620 FT /db_xref="taxon:9913" FT /mol_type="mRNA" FT /organism="Bos taurus" FT CDS 11..547 FT /db_xref="GOA:P06504" FT /db_xref="PDB:1A7H" FT /db_xref="UniProtKB/Swiss-Prot:P06504" FT /note="beta-s-crystallin (aa 1-177)" FT /protein_id="CAA26791.1" FT /translation="MSKAGTKITFFEDKNFQGRHYDSDCDCADFHMYLSRCNSIRVEGG FT TWAVYERPNFAGYMYILPRGEYPEYQHWMGLNDRLSSCRAVHLSSGGQYKLQIFEKGDF FT NGQMHETTEDCPSIMEQFHMREVHSCKVLEGAWIFYELPNYRGRQYLLDKKEYRKPVDW FT GAASPAVQSFRRIVE" FT misc_feature 602..607 FT /note="polyadenylation signal" FT polyA_site 620..620 FT /note="polyadenylation site" XX ... EMBL

  5. SQ Sequence 620 BP; 156 A; 161 C; 165 G; 138 T; 0 other; tgcaccaaac atgtctaaag ctggaaccaa aattactttc tttgaagaca aaaactttca 60 aggccgccac tatgacagcg attgcgactg tgcagatttc cacatgtacc tgagccgctg 120 caactccatc agagtggaag gaggcacctg ggctgtgtat gaaaggccca attttgctgg 180 gtacatgtac atcctacccc ggggcgagta tcctgagtac cagcactgga tgggcctcaa 240 cgaccgcctc agctcctgca gggctgttca cctgtctagt ggaggccagt ataagcttca 300 gatctttgag aaaggggatt ttaatggtca gatgcatgag accacggaag actgcccttc 360 catcatggag cagttccaca tgcgggaggt ccactcctgt aaggtgctgg agggcgcctg 420 gatcttctat gagctgccca actaccgagg caggcagtac ctgctggaca agaaggagta 480 ccggaagccc gtcgactggg gtgcagcttc cccagctgtc cagtctttcc gccgcattgt 540 ggagtgatga tacagatgcg gccaaacgct ggctggcctt gtcatccaaa taagcattat 600 aaataaaaca attggcatgc 620 // EMBL

  6. LOCUS X03006 620 bp mRNA linear MAM 12-SEP-1993 DEFINITION Bovine mRNA for lens beta-s-crystallin. ACCESSION X03006 VERSION X03006.1 GI:152 KEYWORDS beta-crystallin; beta-gamma-crystallin; crystallin. SOURCE Bos taurus (cattle) ORGANISM Bos taurus Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia; Pecora; Bovidae; Bovinae; Bos. REFERENCE 1 (bases 1 to 620) AUTHORS Quax-Jeuken,Y., Driessen,H., Leunissen,J., Quax,W., de Jong,W. and Bloemendal,H. TITLE beta s-Crystallin: structure and evolution of a distinct member of the beta gamma-superfamily JOURNAL EMBO J. 4 (10), 2597-2602 (1985) PUBMED 4054100 COMMENT Data kindly reviewed (06-MAR-1986) by Y. Quax-Jeuken. FEATURES Location/Qualifiers source 1..620 /organism="Bos taurus" /mol_type="mRNA" /db_xref="taxon:9913" CDS 11..547 /note="unnamed protein product; beta-s-crystallin (aa 1-177)" /codon_start=1 /protein_id="CAA26791.1" /db_xref="GI:153" ... GenBank

  7. ... /db_xref="GI:153" /db_xref="GOA:P06504" /db_xref="PDB:1A7H" /db_xref="UniProtKB/Swiss-Prot:P06504" /translation="MSKAGTKITFFEDKNFQGRHYDSDCDCADFHMYLSRCNSIRVEG GTWAVYERPNFAGYMYILPRGEYPEYQHWMGLNDRLSSCRAVHLSSGGQYKLQIFEKG DFNGQMHETTEDCPSIMEQFHMREVHSCKVLEGAWIFYELPNYRGRQYLLDKKEYRKP VDWGAASPAVQSFRRIVE" misc_feature 602..607 /note="polyadenylation signal" polyA_site 620 /note="polyadenylation site" ORIGIN 1 tgcaccaaac atgtctaaag ctggaaccaa aattactttc tttgaagaca aaaactttca 61 aggccgccac tatgacagcg attgcgactg tgcagatttc cacatgtacc tgagccgctg 121 caactccatc agagtggaag gaggcacctg ggctgtgtat gaaaggccca attttgctgg 181 gtacatgtac atcctacccc ggggcgagta tcctgagtac cagcactgga tgggcctcaa 241 cgaccgcctc agctcctgca gggctgttca cctgtctagt ggaggccagt ataagcttca 301 gatctttgag aaaggggatt ttaatggtca gatgcatgag accacggaag actgcccttc 361 catcatggag cagttccaca tgcgggaggt ccactcctgt aaggtgctgg agggcgcctg 421 gatcttctat gagctgccca actaccgagg caggcagtac ctgctggaca agaaggagta 481 ccggaagccc gtcgactggg gtgcagcttc cccagctgtc cagtctttcc gccgcattgt 541 ggagtgatga tacagatgcg gccaaacgct ggctggcctt gtcatccaaa taagcattat 601 aaataaaaca attggcatgc // GenBank

  8. @SRR020799.18225 FTFWBJ101C758G length=132 ATCAGACACGATTTCTAAAGAATTAAAAAAAAAGAATCAAAAAAATTAAAAAAAAAAGTCTTTTGTGGTT TTAAAGATTTTCAAAGAAAGAGCACCCATTTTCCTAAAAGTCGCAAAAGAAACTAAAAAACT +SRR020799.18225 FTFWBJ101C758G length=132 AAAAAAAAAAA???A=666A==BB666666660>*'88.......22''''''''''25;56659-8888 <<>><=6999<><<<<///8;?AAA???????AA488888==>>96434***..,,,,,,.. @SRR020799.18226 FTFWBJ101CYKNR length=268 ATCAGACACGGGCTGGGGATGCATCGTTGCTCGCGGATGAGCGGAGAAAGGAAGGGAAAGATCCTGCGGC GATGTCCGCGGCTGAGTATTTGTCAAAGCTGGCGATACAGAGAGGAAGCAAAGACAACATAAGTGTGGTG GTGGTTGATTTGAAGCCTCGGAGGAAACTCAAGAGCAAACCCTTGAACTGAGGCAGAGAGGGTCCTTTTT TCTTAAATTTTTAAAATGAATATGGGTCTCTCCAAGAAAAAGTATTTACTATTATTAA +SRR020799.18226 FTFWBJ101CYKNR length=268 EEEEEEEEEEEEDA====EEFFFFFHEGGGGGGGHGGGIIIIIIIEEEEEEEEEEE===ADEEEEEEEEE EEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEEAA===AAEEEEEEEEEEEEEEEE DDDDDDEEEEEEEEEEEEEEEEEEEEEEEEED@@@@@@@@@@EEEEEEEEEEEEEEDDDEDD@@@;;;84 4<--441/////1111.>???;9444::?9999>>6.....6;2...;>9::777777 @SRR020799.18227 FTFWBJ101BYQE4 length=157 ATCAGACACGAAGCAGTGGTATCAACGCAGAGTACGCGGGGACGGCGGACGCGCTTTCTTCAATCAAAAA CTCGTCTGCTAATAAGTCTCTTTTCGTCTTCAGCTTCTTCCGCTTTTTCTGATTGCTTCTCTGATCTCTT ATCTCTAATTCTAATCC +SRR020799.18227 FTFWBJ101BYQE4 length=157 AAAAAAAAAAAAAAAAAAIIIIIIIHHHFFFFFFFFF====IIIIAAAAAAAAA>>=?<6=:==;22222 22=>>??@AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA;;;;8A2AA@@A?<<<AAAAAAAAAAA AAAAAAAAAAA?@?=0. @XXXXXXXXX.18225 FTFWBJ101C758G length=132 ATCAGACACGATTTCTAAAGAATTAAAAAAAAAGAATCAAAAAAATTAAAAAAAAAAGTCTTTTGTGGTT TTAAAGATTTTCAAAGAAAGAGCACCCATTTTCCTAAAAGTCGCAAAAGAAACTAAAAAACTN +XXXXXXXXX.18225 FTFWBJ101C758G length=132 AAAAAAAAAAA???A=666A==BB666666660>*'88.......22''''''''''25;56659-8888 <<>><=6999<><<<<///8;?AAA???????AA488888==>>96434***..,,,,,,..! FASTQ

  9. FASTQ quality encoding SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS..................................................... ..........................XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX...................... ...............................IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII...................... .................................JJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJ...................... !"#$%&'()*+,-./0123456789:;<=>?@ABCDEFGHIJKLMNOPQRSTUVWXYZ[\]^_`abcdefghijklmnopqrstuvwxyz{|}~ | | | | | | 33 59 64 73 104 126 S - Sanger Phred+33, raw reads typically (0, 40) X - Solexa Solexa+64, raw reads typically (-5, 40) I - Illumina 1.3+ Phred+64, raw reads typically (0, 40) J - Illumina 1.5+ Phred+64, raw reads typically (3, 40) with 0=unused, 1=unused, 2=Read Segment Quality Control Indicator (bold)

  10. taken from wikipedia

  11. SAM

  12. seq1 272 T 24 ,.$.....,,.,.,...,,,.,..^+. <<<+;<<<<<<<<<<<=<;<;7<& seq1 273 T 23 ,.....,,.,.,...,,,.,..A <<<;<<<<<<<<<3<=<<<;<<+ seq1 274 T 23 ,.$....,,.,.,...,,,.,... 7<7;<;<<<<<<<<<=<;<;<<6 seq1 275 A 23 ,$....,,.,.,...,,,.,...^l. <+;9*<<<<<<<<<=<<:;<<<< seq1 276 G 22 ...T,,.,.,...,,,.,.... 33;+<<7=7<<7<&<<1;<<6< seq1 277 T 22 ....,,.,.,.C.,,,.,..G. +7<;<<<<<<<&<=<<:;<<&< seq1 278 G 23 ....,,.,.,...,,,.,....^k. %38*<<;<7<<7<=<<<;<<<<< seq1 279 C 23 A..T,,.,.,...,,,.,..... ;75&<<<<<<<<<=<<<9<<:<< Pile up

  13. BLASTN 2.2.20 [Feb-08-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= 170079668_prom (100 letters) Database: all/Escherichia_coli_K_12_substr__DH10B/NC_010473.out 3902 sequences; 390,200 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value 170079668_prom 198 5e-53 >170079668_prom Length = 100 Score = 198 bits (100), Expect = 5e-53 Identities = 100/100 (100%) Strand = Plus / Plus Query: 1 gtaacttagagattaggattgcggagaataacaaccgccgttctcatcgagtaatctccg 60 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||| Sbjct: 1 gtaacttagagattaggattgcggagaataacaaccgccgttctcatcgagtaatctccg 60 Query: 61 gatatcgacccataacgggcaatgataaaaggagtaacct 100 |||||||||||||||||||||||||||||||||||||||| Sbjct: 61 gatatcgacccataacgggcaatgataaaaggagtaacct 100 Blast

  14. <?xml version="1.0"?> <!DOCTYPE BlastOutput PUBLIC "-//NCBI//NCBI BlastOutput/EN" "http://www.ncbi.nlm.nih.gov/dtd/NCBI_BlastOutput.dtd"> <BlastOutput> <BlastOutput_program>blastn</BlastOutput_program> <BlastOutput_version>blastn 2.2.20 [Feb-08-2009]</BlastOutput_version> <BlastOutput_reference>~Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, ~Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), ~&quot;Gapped BLAST and PSI-BLAST: a new generation of protein database search~programs&quot;, Nucleic Acids Res. 25:3389-3402.</BlastOutput_reference> <BlastOutput_db>NC_010473.out</BlastOutput_db> <BlastOutput_query-ID>lcl|1_0</BlastOutput_query-ID> <BlastOutput_query-def>170079668_prom</BlastOutput_query-def> <BlastOutput_query-len>100</BlastOutput_query-len> <BlastOutput_param> =</BlastOutput_param> <BlastOutput_iterations> <Iteration> <Iteration_iter-num>1</Iteration_iter-num> <Iteration_query-ID>lcl|1_0</Iteration_query-ID> <Iteration_query-def>170079668_prom</Iteration_query-def> <Iteration_query-len>100</Iteration_query-len> <Iteration_hits> <Hit> <Hit_num>1</Hit_num> <Hit_id>gnl|BL_ORD_ID|0</Hit_id> <Hit_def>170079668_prom</Hit_def> <Hit_accession>0</Hit_accession> <Hit_len>100</Hit_len> <Hit_hsps> <Hsp> <Hsp_num>1</Hsp_num> <Hsp_bit-score>198.728</Hsp_bit-score> <Hsp_score>100</Hsp_score> <Hsp_evalue>4.54021e-53</Hsp_evalue> <Hsp_query-from>1</Hsp_query-from> <Hsp_query-to>100</Hsp_query-to> Blast xml

  15. <Hsp_hit-from>1</Hsp_hit-from> <Hsp_hit-to>100</Hsp_hit-to> <Hsp_query-frame>1</Hsp_query-frame> <Hsp_hit-frame>1</Hsp_hit-frame> <Hsp_identity>100</Hsp_identity> <Hsp_positive>100</Hsp_positive> <Hsp_align-len>100</Hsp_align-len> <Hsp_qseq>GTAACTTAGAGATTAGGATTGCGGAGAATAACAACCGCCGTTCTCATCGAGTAATCTCCGGATATCGACCCATAACGGGCAATGATAAAAGGAGTAACCT</Hsp_qseq> <Hsp_hseq>GTAACTTAGAGATTAGGATTGCGGAGAATAACAACCGCCGTTCTCATCGAGTAATCTCCGGATATCGACCCATAACGGGCAATGATAAAAGGAGTAACCT</Hsp_hseq> <Hsp_midline>||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||</Hsp_midline> </Hsp> </Hit_hsps> </Hit> </Iteration_hits> <Iteration_stat> <Statistics> <Statistics_db-num>3902</Statistics_db-num> <Statistics_db-len>390200</Statistics_db-len> <Statistics_hsp-len>12</Statistics_hsp-len> <Statistics_eff-space>3.02171e+07</Statistics_eff-space> <Statistics_kappa>0.710603</Statistics_kappa> <Statistics_lambda>1.37406</Statistics_lambda> <Statistics_entropy>1.30725</Statistics_entropy> </Statistics> </Iteration_stat> </Iteration> </BlastOutput_iterations> </BlastOutput> Blast xml

  16. Escherichia coli str. K-12 substr. DH10B, complete genome - 1..4686137 4126 proteins Location Strand Length PID Gene Synonym Code COG Product 190..255 + 21 170079664 thrL ECDH10B_0001 - - thr operon leader peptide 337..2799 + 820 170079665 thrA ECDH10B_0002 - COG0527E bifunctional aspartokinase I/homeserine dehydrogenase I 2801..3733 + 310 170079666 thrB ECDH10B_0003 - COG0083E homoserine kinase 3734..5020 + 428 170079667 thrC ECDH10B_0004 - COG0498E threonine synthase 5234..5530 + 98 170079668 yaaX ECDH10B_0005 - - hypothetical protein 5683..6459 - 258 170079669 yaaA ECDH10B_0006 - COG3022S hypothetical protein 6529..7959 - 476 170079670 yaaJ ECDH10B_0007 - COG1115E transporter 8238..9191 + 317 170079671 talB ECDH10B_0008 - COG0176G transaldolase B 9306..9893 + 195 170079672 mogA ECDH10B_0009 - COG0521H molybdenum cofactor biosynthesis protein 9928..10494 - 188 170079673 yaaH ECDH10B_0010 - COG1584S hypothetical protein 10643..11356 - 237 170079674 yaaW ECDH10B_0011 - COG4735S hypothetical protein 10830..11315 + 161 170079675 htgA ECDH10B_0012 - - hypothetical protein 11382..11786 - 134 170079676 yaaI ECDH10B_0013 - - hypothetical protein 12163..14079 + 638 170079677 dnaK ECDH10B_0014 - COG0443O molecular chaperone DnaK 14168..15298 + 376 170079678 dnaJ ECDH10B_0015 - COG0484O chaperone protein DnaJ 15445..16557 + 370 170079679 insL-1 ECDH10B_0016 - COG3385L IS186/IS421 transposase 16751..16960 - 69 170079680 mokC ECDH10B_0017 - - regulatory protein for HokC, overlaps CDS of hokC 16751..16903 - 50 170079681 hokC ECDH10B_0018 - - toxic membrane protein, small 17489..18655 + 388 170079682 nhaA ECDH10B_0020 - COG3004P pH-dependent sodium/proton antiporter 18715..19620 + 301 170079683 nhaR ECDH10B_0021 - COG0583K transcriptional activator NhaR 19811..20314 - 167 170079684 insB-1 ECDH10B_0022 - COG1662L IS1 transposase InsAB' 20233..20508 - 91 170079685 insA-1 ECDH10B_0023 - COG3677L IS1 repressor protein InsA 20815..21078 - 87 170079686 rpsT ECDH10B_0024 - COG0268J 30S ribosomal protein S20 21181..21399 + 72 170079687 yaaY ECDH10B_0025 - - hypothetical protein 21407..22348 + 313 170079688 ribF ECDH10B_0026 - COG0196H bifunctional riboflavin kinase/FMN adenylyltransferase 22391..25207 + 938 170079689 ileS ECDH10B_0027 - COG0060J isoleucyl-tRNA synthetase 25207..25701 + 164 170079690 lspA ECDH10B_0028 - COG0597MU lipoprotein signal peptidase 25826..26275 + 149 170079691 fkpB ECDH10B_0029 - COG1047O FKBP-type peptidyl-prolyl cis-trans isomerase (rotamase) 26277..27227 + 316 170079692 ispH ECDH10B_0030 - COG0761IM 4-hydroxy-3-methylbut-2-enyl diphosphate reductase 27293..28207 + 304 170079693 rihC ECDH10B_0031 - COG1957F ribonucleoside hydrolase RihC 28374..29195 + 273 170079694 dapB ECDH10B_0032 - COG0289E dihydrodipicolinate reductase 29651..30799 + 382 170079695 carA ECDH10B_0033 - COG0505EF carbamoyl phosphate synthase small subunit 30817..34038 + 1073 170079696 carB ECDH10B_0034 - COG0458EF carbamoyl phosphate synthase large subunit 34300..34695 + 131 170079697 caiF ECDH10B_0035 - - DNA-binding transcriptional activator CaiF 34781..35371 - 196 170079698 caiE ECDH10B_0036 - COG0663R carnitine operon protein CaiE 35377..36162 - 261 228937031 caiD ECDH10B_0037 - COG1024I carnitinyl-CoA dehydratase PTT

  17. browser position chr22:10000000-10025000 browser hide all track name=regulatory description="TeleGene(tm) Regulatory Regions" visibility=2 chr22 TeleGene enhancer 10000000 10001000 500 + . touch1 chr22 TeleGene promoter 10010000 10010100 900 + . touch1 chr22 TeleGene promoter 10020000 10025000 800 - . touch2 GFF

  18. So: if you start writing a new software application, first create your own dataformat...

  19. NOT So: if you start writing a new software application, first create your own dataformat...

More Related