1 / 12

Classification of microarray gene expression data using support vector machines ( SVM )

Classification of microarray gene expression data using support vector machines ( SVM ). A presentation on the topic For CIS 595 Bioinformatics course by Despina Kontos Spring 2003 – Temple University. Overview… . What are microarray gene expression data ?

niyati
Download Presentation

Classification of microarray gene expression data using support vector machines ( SVM )

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Classification of microarraygene expression data using support vector machines (SVM) A presentation on the topic For CIS 595 Bioinformatics course by Despina Kontos Spring 2003 – Temple University

  2. Overview… • What are microarraygene expression data? • What are Support Vectors Machines? • How can we use them to utilize these gene expression data? CLASSIFICATION EXPERIMENTS !!!

  3. Microarrays… • What are they anyway??Gene expression levels on tissue or cell for varying environment conditions

  4. Microarrays… • From a machine learning point of view… Tissue classification Function classification

  5. Support Vector Machines (SVM) • Linear classifiers • Attempt to avoid overfitting by finding the optimal hyperplane that separates the data HOW??? By maximizing the Margin.. Support Vectors Introduced by V.Vapnic and co-workers in 1995

  6. Support Vector Machines (SVM) • And what about datasets that are not linearly separable?? Map the data into higher dimensional space and make linear classification there (theorem!!)

  7. Support Vector Machines (SVM) Some mathematical formulations… We need ONLYthe support vectors for computations!! We can use KERNELfunctions to avoidcomputations in higher dimensional space

  8. Some experiments… M.P.S.Brown, W.N.Grundy, D.Lin, N.Cristianini, C.W.Sugnet, T.S.Furey, M.Ares Jr. and D.Haussler,“Knowledge-based analysis of microarray gene expression data by using support vector machines", Proc.Natl.Acad.Sci.USA,97, 1, pp.262-267, 2000. Classification of gene function from microarray data using SVM 2,476 genes 79 DNA hybridization experiments 6 gene function families SVM providedoptimal classification!!! F1 F2 F3 ... Function Classification

  9. More experiments… T.furey, N.Cristianini, N. Duffy, D. Bednarski, M. Schummer and D Haussler, “Support Vector Machine Classification and Validation of Cancer Tissue Samples Using Microarray Expressioin Data”, Bioinformatics, 2000. Gene expression data on tissue 97,802 DNA clones 31 tissue samples Cancer ovarian Normal ovarian Normal non-ovarian Cancer Not Cancer ... ... Cancer Tissue Classification

  10. Conclusions • Microarray gene expression data are a very useful format of biological information (..expensive to obtain!!) • SVMnew and very promising classification apprach • A lot of research still to be done on Biologicalinformation processing using techniques developed in fields such as Machine Learning, Data Mining, etc..

  11. Additional resources.. Osuna, R. Freund, and F. Girosi. Support vector machines: Training and applications. In A.I. Memo. MIT A.I. Lab, 1996 N. Cristianini. ICML'01 tutorial, 2001 http://www.kernel-machines.org/ http://research.microsoft.com/users/jplatt/svm.html http://www.isis.ecs.soton.ac.uk/resources/svminfo/

  12. THANK YOU!!!!!

More Related