1 / 25

Q1 Books in NCBI

Bioinformatics Exercise Stephen Tsui School of Biomedical Sciences http://www.bch.cuhk.edu.hk/teaching/bioinfo_exercise_question_2013.ppt. Q1 Books in NCBI. Find a figure showing the genome map of hepatitis B virus. Q2 PubMed.

hedya
Download Presentation

Q1 Books in NCBI

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Bioinformatics ExerciseStephen TsuiSchool of Biomedical Scienceshttp://www.bch.cuhk.edu.hk/teaching/bioinfo_exercise_question_2013.ppt

  2. Q1 Books in NCBI Find a figure showing the genome map of hepatitis B virus.

  3. Q2 PubMed Professor TSUI Lap Chee published an article, which describing the discovery of the cystic fibrosis gene, in a journal called ‘Science’ in 1989. What is the title of this article?

  4. Q3 PubMed Which university published the article “Genetic modulation of polyglutamine toxicity by protein conjugation pathways in Drosophila”? Who is the first author of this article? Try to download this paper.

  5. Q4 Entrez What is the sequence of the first five amino acid of a human protein called cysteine rich heart protein discovered in 1995?

  6. Q5 Entrez What is the full name of the gene with the accession number AF133732? How many amino acids are there in the protein it encodes?

  7. Q6 OMIM What is the chromosomal location of glucose-6 phosphate dehydrogenase gene?

  8. Q7 Taxonomy Browser What is the species represented by the name "Ovis aries"?

  9. Q8 Genome Biology How many nucleotides are there in rattus norvegicus (rat) chromosome 5? How many genes can be found in this chromosome?

  10. Q9 Codon Usage How many CGG codons have been used to code for the amino acid arginine in the following piece of coding DNA? atgcccaagtgtcccaagtgcaacaaggaggtgtacttcgccgagagggtgacctctctgggcaaggactggcatcggccctgcctgaagtgcgagaaatgtgggaagacgctgacctctgggggccacgctgagcacgaaggcaaaccctactgcaaccacccctgctacgcagccatgtttgggcctaaaggctttgggcggggcggagccgagagccacactttcaagtaa

  11. Q10 Restriction Site Analysis How many BamH1 cutting sites are there in the following DNA segment? cagaacaaca gtgcgggctc acctgccaag ggaggagaag agagcgcccc taaacatgcg gctgcggctg ctggtgtccg cgggcatgct gctggtggct ctgtcgccct gtctgccttg cagggccctg ctgagcaggg gatccgtctc tggagcgccg cgggccccgc agccgttgaa tttcttgcaa ccggagcagc cccagcaacc tcagccgatt ctgatccgca tgggtgaaga atacttcctc cgcctgggga acctcaacag aagtcccgct gctcggctgt cccccaactc cacgcccctc accgcgggtc gcggcagccg cccctcgcac gaccaggctg cggctaactt tttccgcgtg ttgctgcagc agctgcagat gcctcagcgc ccgctcgaca gcagcacgga gctggcggaa cgcggcgccg aggatgccct cggtggccac cagggggcgc tggagaggga gaggcggtcc gaggagccgc ccatctctct ggatctcacc ttccaccttc tgagggaagt cttggaaatg gccagggcag agcagttagc tcagcaagct cacagcaaca ggaaactgat

  12. Q11 Hydrophobicity Plot Which of the following is the most hydrophilic region, a.a. 50-60, a.a. 70-80, a.a. 330-340?   1 mdgsgerslp epgsqssaas ddieivvnvg gvrqvlygdl lsqypetrla elinclaggy  61 dtifslcddy dpgkrefyfd rdpdafkcvi evyyfgevhm kkgicpicfk nemdfwkvdl 121 kflddccksh lsekreelee iarrvqlild dlgvdaaegr wrrcqkcvwk flekpesscp 181 arvvaelsfl lilvssvvmc mdtipelqvl daegnrvehp tlenvetaci gwftleyllr 241 lfsspnklhf alsfmnivdv lailpfyvsl tlthlgarmm eltnvqqavq alrimriari 301 fklarhssgl qtltyalkrs fkelglllmy lavgifvfsa lgytmeqshp etlfknipqs 361 fwwaiitmtt vgygdiypkt tlsklnaais flcgviaial pihpiinnfv ryynkqrvle 421 taakhelelm elnsssggeg ktggsrsdld nlppepagke apscssrlkl shsdtfipll 481 teekhhrtrl qsck

  13. Q12 pI and Molecular Weight What are the pI and molecular weight of the following protein?   1 mdgsgerslp epgsqssaas ddieivvnvg gvrqvlygdl lsqypetrla elinclaggy  61 dtifslcddy dpgkrefyfd rdpdafkcvi evyyfgevhm kkgicpicfk nemdfwkvdl 121 kflddccksh lsekreelee iarrvqlild dlgvdaaegr wrrcqkcvwk flekpesscp 181 arvvaelsfl lilvssvvmc mdtipelqvl daegnrvehp tlenvetaci gwftleyllr 241 lfsspnklhf alsfmnivdv lailpfyvsl tlthlgarmm eltnvqqavq alrimriari 301 fklarhssgl qtltyalkrs fkelglllmy lavgifvfsa lgytmeqshp etlfknipqs 361 fwwaiitmtt vgygdiypkt tlsklnaais flcgviaial pihpiinnfv ryynkqrvle 421 taakhelelm elnsssggeg ktggsrsdld nlppepagke apscssrlkl shsdtfipll 481 teekhhrtrl qsck

  14. Q13 Protein Subcellular Location Prediction What is the predicted subcellular location of the SARS-3a protein? MDLFMRFFTLGSITAQPVKIDNASPASTVHATATIPLQASLPFGWLVIGVAFLAVFQSAT KIIALNKRWQLALYKGFQFICNLLLLFVTIYSHLLLVAAGMEAQFLYLYALIYFLQCINA CRIIMRCWLCWKCKSKNPLLYDANYFVCWHTHNYDYCIPYNSVTDTIVVTEGDGISTPKL KEDYQIGGYSEDRHSGVKDYVVVHGYFTEVYYQLESTQITTDTGIENATFFIFNKLVKDP PNVQIHTIDGSSGVANPAMDPIYDEPTTTTSVPL

  15. Q14 Transmembrane Region and Orientation Is the following a transmembrane protein? If yes, where is the transmembrane region? MDLFMRFFTLGSITAQPVKIDNASPASTVHATATIPLQASLPFGWLVIGVAFLAVFQSAT KIIALNKRWQLALYKGFQFICNLLLLFVTIYSHLLLVAAGMEAQFLYLYALIYFLQCINA CRIIMRCWLCWKCKSKNPLLYDANYFVCWHTHNYDYCIPYNSVTDTIVVTEGDGISTPKL KEDYQIGGYSEDRHSGVKDYVVVHGYFTEVYYQLESTQITTDTGIENATFFIFNKLVKDP PNVQIHTIDGSSGVANPAMDPIYDEPTTTTSVPL

  16. Q15 BLAST Sequence Alignment What are the identities of the following DNA sequence? cagaacaaca gtgcgggctc acctgccaag ggaggagaag agagcgcccc taaacatgcg gctgcggctg ctggtgtccg cgggcatgct gctggtggct ctgtcgccct gtctgccttg cagggccctg ctgagcaggg gatccgtctc tggagcgccg cgggccccgc agccgttgaa tttcttgcaa ccggagcagc cccagcaacc tcagccgatt ctgatccgca tgggtgaaga atacttcctc cgcctgggga acctcaacag aagtcccgct gctcggctgt cccccaactc cacgcccctc accgcgggtc gcggcagccg cccctcgcac gaccaggctg cggctaactt tttccgcgtg ttgctgcagc agctgcagat gcctcagcgc ccgctcgaca gcagcacgga gctggcggaa cgcggcgccg aggatgccct cggtggccac cagggggcgc tggagaggga gaggcggtcc gaggagccgc ccatctctct ggatctcacc ttccaccttc tgagggaagt cttggaaatg gccagggcag agcagttagc tcagcaagct cacagcaaca ggaaactgat

  17. Q16 BLAST Sequence Alignment What are the identities of the following protein sequence? skpmgtqtht mifdnafnct feyisdafsl dvseksgnfk hlrefvfknk dgflyvykgy qpidvvrdlp sgfntlkpif klplginitn frailtafsp aqdtwgtsaa ayfvgylkpt tfmlkydeng titdavdcsq

  18. Q17 Multiple Sequence Alignment Tools Which two of the following are closely related? >A MPKCPKCNKEVYFAERVTSLGKDWHRPCLKCEKCGKTLTSGGHAEHEGKPYCNHPCYAAMFGPKGFGRGGAESHTFK >B MPKCPKCDKEVYFAERVTSLGKDWHRPCLKCEKCGKTLTSGGHAEHEGKPYCNHPCYSAMFGPKGFGRGGAESHTFK >C MPKCPKCDKEVYFAERVTSLGKDWHRPCLKCEKCGKTLTSGGHAEHEGKPYCNHPCYSAMFGPKGFGRGGAESHTFK >D MPKCPKCQKEVYFAERVSSLGKDWHRPCLKCEKCSKTLTPGSHAEHEGKPYCNQPCYGALFGPKGFGRGGTESHSYK

  19. Q18 Structure Visualization • Get rasmol software from http://www.umass.edu/microbio/rasmol/getras.htm#raswin • Download the structure of a protein called CRIP and view the structure by rasmol. • Save the picture as a bmp file.

  20. Q19 Secondary Structure Prediction How many alpha helices are there in the hemoglobin alpha subunit? MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSH GSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLL SHCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR

  21. Q20 Protein-protein Interaction Which proteins are the interacting partners of FHL2?

  22. Q21 Secondary Structure Prediction What is the identities of the following fingerprint? 2882.50 1182.44 1423.52 1191.50 1000.33 814.43 1624.74 1355.53 958.35 1165.39 1300.47 1426.57 2653.39 2265.11 2544.41

  23. Q22 Enzymes What are the substrate and products of an enzymes called "catalase"? What is the enzyme number of catalase?

  24. Q23 Protein Domain Analysis What domain can be find in the following protein?   1 msesfdcakc neslygrkyi qtdsgpycvp cydntfantc aecqqlighd srelfyedrh  61 fhegcfrccr cqrsladepf tcqdsellcn dcycsafssq csacgetvmp gsrkleyggq 121 twhehcflcs gceqplgsrs fvpdkgahyc vpcyenkfap scarcsktlt qggvtyrdqp 181 whreclvctg cqtplarqqf tsrdedpycv acfgelfapk cssckrpivg lgggkyvsfe 241 drhwhhncfs carcstslvg qgfvpdgdqv lcqgcfqagp

  25. Answer http://www.bch.cuhk.edu.hk/teaching/bioinfo_exercise_answer_2013.ppt

More Related