Learn the systematic approach of bioinformatics, explore NCBI's website, utilize tools in Microsoft Excel, and review software in molecular epidemiology.
Bioinformatics and Data Management Jeff LeJeune Lejeune.3@osu.edu 330-263-3739
Lecture Goals • Define Bioinformatics • Explore NCBI’s website • Introduce some useful tools available in Microsoft Excel • Review other computer software used in Molecular Epidemiology
Bioinformatics Defined A systematic approach to storing, classifying, and analyzing data, information, and metadata that allows for the acquisition of knowledge and enhances the understanding of biological systems.
Data Numbers derived from observations, experiments, or calculations. Examples: Positive/Negative; A, T, G, C; ill/healthy
Information Data in context. Data with associated explanations and interpretations. Examples: Publications, available sequence data, and other freely available material.
Metadata • Data about data • Context in which information is used • One application's metadata is another application's data Examples: Descriptive studies, reference lists, sources, GenBank accession numbers.
Knowledge and Understanding • This is what we’re all doing here! • Advancement of Science • Solving of problems
Bioinformatics Defined • Data • Information • Metadata • Knowledge & Understanding
Data Analysis Tools • NCBI • Bibliographic information • Sequence analysis (nucleotide, protein) • Other • Data scanning using Microsoft Excel • Other tools • Gel comparisons • Spatial data (GIS) • Temporal data • Statistical considerations
Microsoft Excel • Easily available • Filters • Pivot Table Reports and Charts
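For readers who want to see what a filter and a pivot table of counts actually compute, the following is a minimal Python sketch using only the standard library. It is not how Excel is implemented; the isolate records, the field names (`isolate`, `farm`, `result`), and the helper functions `filter_rows` and `pivot_count` are all hypothetical and exist only for illustration.

```python
from collections import Counter

# Hypothetical isolate records; in a spreadsheet each dict would be one row.
records = [
    {"isolate": "A1", "farm": "Farm1", "result": "Positive"},
    {"isolate": "A2", "farm": "Farm1", "result": "Negative"},
    {"isolate": "B1", "farm": "Farm2", "result": "Positive"},
    {"isolate": "B2", "farm": "Farm2", "result": "Positive"},
]


def filter_rows(rows, column, value):
    """Keep only the rows whose `column` equals `value` (like a filter)."""
    return [row for row in rows if row[column] == value]


def pivot_count(rows, row_field, col_field):
    """Count rows per (row_field, col_field) pair (like a pivot table of counts)."""
    counts = Counter((row[row_field], row[col_field]) for row in rows)
    table = {}
    for (row_key, col_key), n in counts.items():
        table.setdefault(row_key, {})[col_key] = n
    return table


# Filter: show only the positive isolates.
positives = filter_rows(records, "result", "Positive")

# Pivot: farms as rows, test results as columns, counts in the cells.
table = pivot_count(records, "farm", "result")

for farm in sorted(table):
    print(farm, table[farm])
```

The same two operations correspond roughly to applying a column filter and building a pivot table report with a count summary in the spreadsheet itself.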
Other Software • ClustalW multiple sequence alignments • www.ebi.ac.uk/clustalw/index.html • BioNumerics
Fingerprint types • Character types • Sequence types • 2-D gel types • Matrix types
Cardinal Rules of Data Management • Save your data • Back up your data • Record the databases and program versions used • Write down sequence numbers • Record program parameters • Use E-values • Double-check your results visually • In spreadsheets, one entry per column
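Several of the rules above (record the databases, program versions, parameters, and sequence numbers) can be followed by saving a small record alongside each analysis. The sketch below shows one possible way to do this in Python. The function `make_run_record`, the field names, and every value passed to it are hypothetical placeholders, not real programs, databases, or results.

```python
import json
from datetime import date


def make_run_record(program, version, database, database_date,
                    parameters, accessions):
    """Bundle the details worth recording for one analysis into a dict."""
    return {
        "run_date": date.today().isoformat(),
        "program": program,
        "program_version": version,
        "database": database,
        "database_date": database_date,
        "parameters": parameters,
        "accessions": accessions,
    }


# All values below are placeholders for illustration only.
record = make_run_record(
    program="example-search-tool",
    version="0.0.0",
    database="example-db",
    database_date="YYYY-MM-DD",
    parameters={"e_value_cutoff": 1e-5},
    accessions=["EXAMPLE0001"],
)

# Serialize to text so the record can be saved and backed up with the data.
text = json.dumps(record, indent=2, sort_keys=True)
print(text)
```

Writing `text` to a file next to the results keeps the record with the data it describes, so the analysis can be repeated later with the same settings.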