1 / 34

Data Processing Technologies for DNA Micr oarray

Data Processing Technologies for DNA Micr oarray. Nini Rao School of Life Science And Technology UESTC 14/11/2004. Introduction The Applications of SVD Technology The Applications of NMF Technology Summarization. Introduction. 1. Gene and Genomes

wyome
Download Presentation

Data Processing Technologies for DNA Micr oarray

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Processing Technologies for DNA Microarray Nini Rao School of Life Science And Technology UESTC 14/11/2004

  2. Introduction • The Applications of SVD Technology • The Applications of NMF Technology • Summarization

  3. Introduction • 1. Gene and Genomes Gene ----The basic unit of genetic function Gene Expression ----The process by which genetic information at the DNA level is converted into functional proteins.

  4. Introduction Genome Structure ---- each organism contains a unique genomic sequence with a unique structure.

  5. Gene structure

  6. Genome Data with unknown biological meanings exponentially increase. There are needs for mining these data.

  7. Analysis of these new data requires mathematical tools that are adaptable to the large quantities of data, while reducing the complexity of the data to make them comprehensible.

  8. 2. A Microarray A small analytical device. That allows genomic exploration with speed and precision unprecedented in the history of biology. This technology was presented in 1990s.

  9. 3. Microarray Analysis The process of using microarrays for scientific exploration. Massive Technologies for microarray analysis have been adopted since the early 1990s.

  10. 4. Type of Microarray

  11. 5. The Roles of Microarray To monitor gene expression levels on a genomic scale To enhance fundamental understanding of life on the molecular level regulation of gene expression gene function cellular mechanisms medical diagnosis, treatment, drug design

  12. The microarray data form a matrix

  13. Applications of SVD Mathematical definition of the SVD U is an mxn matrix S is an nxn diagonal matrix VT is also an nxn matrix

  14. One important result of the SVD of X

  15. X(l) is the closest rank-l matrix to X. • The term “closest” means that X(l) minimizes the sum of the squares of the difference of the elements of X and X(l) ∑ij|xij – x(l)ij|2=min

  16. SVD analysis of gene expression data

  17. The results for Elutriation Dataset

  18. Pattern Inference

  19. The result analysis for Pattern Inference • (a) Raster display of v’ , the expression of 14 eigengenes in 14 arrays. • (b) Bar chart of the fractions of eigenexpression • (c) Line-joined graphs of the expression levels of r1 (red) and r2 (blue) in the 14 arrays fit dashed graphs of normalized sine(red) and osine(blue) of period T =390 min and phase = 2*3.14/13, respectively.

  20. Data Sorting

  21. The results analysis for data sorting Fig.3.Genes sorted by relative correlation with r1 and r2 of normalized elutriation. • Normalized elutriation expression of the sorted 5,981 genes in the 14 arrays, showing traveling wave of expression. • Eigenarrays expression; the expression of a1 and a2, the eigenarrays corresponding to r1 and r2, displays the sorting. • Expression levels of a1(red) and a2(green) fit normalized sine and cosine functions of period Z=N-1= 5,980 and phase Q=2*3.14/13 (blue), respectively.

  22. Other Applications for SVD • Missing data • Comparison between two genomic sequences

  23. The Applications of NMF Mathematical definition of the NMF V (nm) = W (nr) . H (rm) In general, (n+m)r < nm. It can be used to extract the features that are hidden in dataset.

  24. Comparison with SVD

  25. The results for Elutriation Dataset

  26. The results for a - factor Dataset

  27. Summarization 1. SVD:Normalization 。 no data limitation NMF:No Normalization Positive data 2. SVD: Missing data, Cluster, Pattern inference, weak pattern extraction, Comparison NMF: Pattern inference, Cluster, Finding similarity 3. ICA is used to mining DNA microarray data.

  28. Thanks a lot!

More Related