130 likes | 318 Views
University of Toronto (Mississauga Campus). CSC411- Machine Learning and Data Mining. Tutorial 10– March 23 th , 2007. Data Mining and Machine Learning Applications.
E N D
University of Toronto (Mississauga Campus) CSC411- Machine Learning and Data Mining Tutorial 10– March 23th, 2007
Data Mining and Machine Learning Applications Case 1: In order to improve the business, a national-chain supermarket starts a project to keep track of their customers. Regular customers can collect points or receive discounts by using their store card on each purchase. Temporary customers who are not members to the store will be assigned to a same temporary store card. Now supermarket is hiring the data mining analyst to help them on this project. Question: If you are the data mining analyst, how will you design the project and what data you need for the project?
Data Mining and Machine Learning Applications Case 2: Researchers found that individuals have different responses or reactions to the same drug treatment. For example, two smokers have the same smoking history. One is detected to have lung cancer and the other one does not. Single Nucleotide Polymorphisms (SNPs) are an important resource to explain these phenomenons. One possible project is study the association between the SNPs and the DNA sequences. Question: If you are the researcher, how will you design this project?
Cancer – Different Fates This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation
SNPs May Be the Solution SNPs A SNPs B SNPs C SNPs D This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation
What Is Variation in the Genome? Common Sequence Variations Polymorphism Deletions Insertions Chromosome This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation Translocations
SNPs Are the Most Common Type of Variation At least 1 percent of the population Most of the population G to C Common sequence Variant sequence SNP site This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation
The Genome Contains Genes Gene 1 Coding region Protein 1 Noncoding region Protein 2 Gene 2 Coding region Noncoding region This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation
Variation in the Human Genome Person 1 Person 2 = Variations in DNA This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation
Variations Causing No Changes This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation = Variations in DNA that cause no changes
Variations Causing Harmless Changes = Variations in DNA that cause harmless changes This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation
Variations Causing Harmful Changes No Disease Hemophilia = Variation in DNA that causes harmful change No Disease This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation
Variations Causing Latent Changes = Variations in DNA that cause latent effects Many years later Many years later This slide is copied from National Cancer Institute, Understanding cancel series: Genetic Variation (SNPs): http://www.nci.nih.gov/cancertopics/understandingcancer/geneticvariation