
Datamining @ ARTreat

Datamining @ ARTreat. Veljko Milutinović vm@etf.rs, Zoran Babović zbabovic@gmail.com, Nenad Korolija nenadko@gmail.com, Goran Rakočević g.rakocevic@gmail.com, Marko Novaković atisha34@yahoo.com.


Presentation Transcript


  1. Datamining @ ARTreat • Veljko Milutinović vm@etf.rs • Zoran Babović zbabovic@gmail.com • Nenad Korolija nenadko@gmail.com • Goran Rakočević g.rakocevic@gmail.com • Marko Novaković atisha34@yahoo.com

  2. Agenda • ARTreat – the project • Atherosclerosis – the basics • Plaque classification • Hemodynamic analysis • Data mining for the hemodynamic problem • Data mining from patient records

  3. ARTreat – the project • ARTreat aims to provide a patient-specific computational model of the cardiovascular system, used to improve the quality of prediction of atherosclerosis progression and its propagation into life-threatening events. • FP7 Large-scale Integrating Project (IP) • 16 partners • Funding: 10,000,000 €

  4. Atherosclerosis • Atherosclerosis is the condition in which an artery wall thickens as the result of a build-up of fatty materials such as cholesterol

  5. Atherosclerotic plaque • Begins as a fatty streak (an ill-defined yellow lesion, or fatty plaque), then develops edges that evolve into fibrous plaques: whitish lesions with a grumous lipid-rich core

  6. Plaque components • Fibrous, Lipid, Calcified, Intra-plaque Hemorrhage

  7. Plaque classification • Different types of plaque pose different risks • Manual plaque classification (done by doctors) is a difficult task, and is error-prone • Idea: develop an AI algorithm to distinguish between different types of plaque • Visual data mining

  8. Plaque classification (2) • Developed by Foundation for Research and Technology • Based on Support Vector Machines • Looks at images that are produced by IVUS and MRI and hand-labeled by physicians • Up to 90% accurate

  9. Data mining task in Belgrade • Two separate paths: • Data mining from the results of hemodynamic simulations • Data mining from medical patient records • Goal: to provide input regarding the progression of the disease to be used for medical decision support

  10. Hemodynamics – the basics • Study of the flow of blood through the blood vessels • Maximum Wall Shear Stress (MWSS) – an important parameter for plaque development prognoses

  11. Hemodynamics - CFD • Classical methods for hemodynamic calculations employ Computational Fluid Dynamics (CFD) methods • Involves solving the Navier-Stokes equation • …but involves solving it millions of times! • One simulation can take weeks
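
For reference, one common form of the Navier-Stokes equations is sketched below. It assumes blood is modelled as an incompressible Newtonian fluid, which the slide does not state; the symbols (velocity u, pressure p, density ρ, dynamic viscosity μ) are likewise assumptions of this sketch and may differ from the form the authors used.

```latex
\rho\left(\frac{\partial \mathbf{u}}{\partial t}
      + (\mathbf{u}\cdot\nabla)\,\mathbf{u}\right)
  = -\nabla p + \mu\,\nabla^{2}\mathbf{u},
\qquad
\nabla\cdot\mathbf{u} = 0
```

A CFD solver discretizes these equations over a mesh of the vessel geometry and solves them at every mesh point and time step, which is where the "millions of times" cost comes from.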

  12. Data mining from hemodynamic simulations (first path) • Idea: use results of previously done simulations • Train a data mining AI system capable of regression analysis • Use the system to estimate the desired values in a much shorter time

  13. Neural Networks - background • Systems that are inspired by the principle of operation of biological neural systems (brain)

  14. Neural Networks – the basics • A parallel, distributed information processing structure • Each processing element has a single output which branches (“fans out”) into as many collateral connections as desired • One input layer, one output layer, and one or more hidden layers

  15. Artificial neurons • Each node (neuron) consists of two segments: • Integration function • Activation function • Common activation function • Sigmoid
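
The two segments of a neuron can be sketched in a few lines of Python. This is a minimal illustration, assuming the integration function is a weighted sum plus a bias (the slide does not define it); the function names are hypothetical.

```python
import math


def integrate(inputs, weights, bias):
    """Integration function (assumed here to be a weighted sum plus a bias)."""
    return sum(x * w for x, w in zip(inputs, weights)) + bias


def sigmoid(z):
    """Sigmoid activation: maps any real value into the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def neuron_output(inputs, weights, bias):
    """A neuron applies the activation function to the integrated input."""
    return sigmoid(integrate(inputs, weights, bias))
```

With all-zero inputs and a zero bias the integrated value is 0, so the sigmoid returns its midpoint, 0.5.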

  16. Neural Networks - backpropagation • A training method for neural networks • Try to minimize the error function by adjusting the weights • Gradient descent: • Calculate the “blame” of each input for the output error • Adjust the weights by a step scaled by γ, the learning rate
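
The slide's formulas are not reproduced in the transcript. In the usual textbook formulation, which is assumed here rather than taken from the authors, the error function and the gradient-descent weight update read as follows. The symbols t_k (target output), y_k (network output) and w_ij (weight from unit i to unit j) are assumptions; γ is the learning rate named on the slide.

```latex
E = \frac{1}{2}\sum_{k}\left(t_k - y_k\right)^{2},
\qquad
\Delta w_{ij} = -\gamma\,\frac{\partial E}{\partial w_{ij}}
```

The partial derivative is the "blame" assigned to each weight: backpropagation computes it layer by layer with the chain rule, starting from the output error.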

  17. Input data set • Carotid artery • 11 geometric parameters and the MWSS value

  18. The model • One hidden layer • Input layer: linear • Hidden and output: sigmoid • Learning rate 0.6 • 500K training cycles • Decay and momentum
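
A network of this shape can be sketched in plain Python as below. This is an illustrative sketch, not the authors' implementation: the hidden-layer size, the momentum value, the number of epochs and the toy data are all assumptions, and weight decay is omitted for brevity. Only the single hidden layer, the sigmoid hidden and output units, and the learning rate of 0.6 come from the slide.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(x, w_h, w_o):
    """Linear input layer, sigmoid hidden layer, single sigmoid output."""
    xb = list(x) + [1.0]  # inputs plus a constant bias input
    h = [sigmoid(sum(w * v for w, v in zip(row, xb))) for row in w_h]
    hb = h + [1.0]        # hidden activations plus a constant bias input
    y = sigmoid(sum(w * v for w, v in zip(w_o, hb)))
    return xb, hb, y


def mse(data, w_h, w_o):
    """Mean squared error of the network over (input, target) pairs."""
    return sum((t - forward(x, w_h, w_o)[2]) ** 2 for x, t in data) / len(data)


def train(data, n_hidden=4, lr=0.6, momentum=0.5, epochs=2000, seed=0):
    """Online backpropagation with momentum (n_hidden, momentum, epochs assumed)."""
    rng = random.Random(seed)
    n_in = len(data[0][0])
    w_h = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + 1)]
           for _ in range(n_hidden)]
    w_o = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden + 1)]
    v_h = [[0.0] * (n_in + 1) for _ in range(n_hidden)]  # momentum terms
    v_o = [0.0] * (n_hidden + 1)
    for _ in range(epochs):
        for x, t in data:
            xb, hb, y = forward(x, w_h, w_o)
            # Output delta for squared error with a sigmoid output unit.
            d_o = (y - t) * y * (1.0 - y)
            # Hidden deltas: the output delta propagated back through w_o.
            d_h = [d_o * w_o[j] * hb[j] * (1.0 - hb[j]) for j in range(n_hidden)]
            for j in range(n_hidden + 1):
                v_o[j] = momentum * v_o[j] - lr * d_o * hb[j]
                w_o[j] += v_o[j]
            for j in range(n_hidden):
                for i in range(n_in + 1):
                    v_h[j][i] = momentum * v_h[j][i] - lr * d_h[j] * xb[i]
                    w_h[j][i] += v_h[j][i]
    return w_h, w_o


# Toy data only: one input and targets already scaled into (0, 1).
toy = [([0.0], 0.2), ([0.5], 0.5), ([1.0], 0.8)]
```

Calling `train(toy)` returns the trained weights, and `mse(toy, *train(toy))` can be compared against `mse(toy, *train(toy, epochs=0))` to check that training reduced the error. Because the output unit is a sigmoid, a real MWSS target would first have to be scaled into the interval (0, 1).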

  19. Current results • Average error: 8.6% • Maximum error: 16.9%
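
The slide does not say how these percentages were computed. One plausible reading, assumed here, is the absolute relative error of each predicted MWSS value against the CFD value, averaged and maximized over the test set. The function names are hypothetical.

```python
def percent_errors(predicted, actual):
    """Absolute relative error of each prediction, in percent."""
    return [abs(p - a) / abs(a) * 100.0 for p, a in zip(predicted, actual)]


def summarize(predicted, actual):
    """Return (average error, maximum error) in percent."""
    errs = percent_errors(predicted, actual)
    return sum(errs) / len(errs), max(errs)
```

Under this definition a single badly predicted case, such as an outlier in the test set, dominates the maximum while moving the average much less.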

  20. The “dreaded” line 4 • Line 4 of the original test set proved difficult to predict • Error was over 30% • Turned out to be an outlier • Combination of parameters was such that it couldn’t • But the CFD worked, NN worked • Visually the geometry looked fine • Goes to show how challenging the data preprocessing can be

  21. Dataset analysis • Two distinct areas of MWSS values: • the subset with lower values of MWSS, where a similar clear pattern can be seen against all of the input variables, • scattered cloud of values in the subset with higher MWSS values. • Histogram shows the majority of values grouped in the lower half of the values in the set, with only a small number of points in the higher half.

  22. MWSS value prediction • Two approaches: • Single model • Two models: • one for the low MWSS value data, • one for higher values, • classifier to choose the appropriate model • Models based on Linear Regression and SVM
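
The two-model approach can be sketched as follows. This is a simplified stand-in, not the authors' pipeline: it uses a single input variable, ordinary least squares for both regimes, and a nearest-centroid rule as the classifier, whereas the slide mentions Linear Regression and SVM models and does not name the classifier. The `split` threshold and all function names are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = a*x + b with one input variable."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    a = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    return a, mean_y - a * mean_x


def fit_two_models(xs, ys, split):
    """Fit one line per MWSS regime and record each regime's input centroid."""
    groups = {
        "low": [(x, y) for x, y in zip(xs, ys) if y < split],
        "high": [(x, y) for x, y in zip(xs, ys) if y >= split],
    }
    models = {name: fit_line(*zip(*pts)) for name, pts in groups.items()}
    centroids = {name: sum(x for x, _ in pts) / len(pts)
                 for name, pts in groups.items()}
    return models, centroids


def predict(x, models, centroids):
    """Classify x by nearest input centroid, then apply that regime's model."""
    regime = min(centroids, key=lambda name: abs(x - centroids[name]))
    a, b = models[regime]
    return a * x + b
```

The key design point is that the classifier sees only the inputs, since the MWSS value is unknown at prediction time; the training labels ("low" or "high") are the only place the target is used to separate the regimes.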

  23. Results • Poor results for higher values of MWSS – insufficient values to train a model

  24. MWSS position • A few outliers and “strange” values in the data set • After elimination: • Further investigation needed into the data and the “outlier” values, although it is only a small number of them

  25. Genetic data • Single coronary angiography • Blood chemistry • Medications • Single Nucleotide Polymorphism (SNP) data on selected DNA sequences

  26. …and now for something completely different

  27. Questions

  28. Datamining @ ARTreat Project • Veljko Milutinović vm@etf.rs • Zoran Babović zbabovic@gmail.com • Nenad Korolija nenadko@gmail.com • Goran Rakočević g.rakocevic@gmail.com • Marko Novaković atisha34@yahoo.com
