1 / 28

Datamining @ ARTreat

Explore the ARTreat project targeting patient-specific cardiovascular models for atherosclerosis prediction. Learn about plaque classification, hemodynamics, and data mining from medical records and simulations. Discover the challenges and advancements in this vital field.

nmccray
Download Presentation

Datamining @ ARTreat

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Datamining @ ARTreat Veljko Milutinović vm@etf.rsZoran Babović zbabovic@gmail.comNenad Korolija nenadko@gmail.comGoran Rakočević g.rakocevic@gmail.comMarko Novaković atisha34@yahoo.com

  2. Agenda • ARTReat – the project • Arteriosclerosis – the basics • Plaque classification • Hemodynamic analysis • Data mining for the hemodynamic problem • Data mining from patent records

  3. ARTreat – the project • ARTreat targets at providing a patient-specific computational modelof the cardiovascular system, used to improve the quality of predictionfor the atherosclerosis progression and propagation into life-threatening events. • FP7 Large-scale Integrating Project (IP)‏ • 16 partners • Funding: 10,000,000 €

  4. Atherosclerosis • Atherosclerosis is the condition in which anartery wall thickens as the result of a build-up of fatty materials such as cholesterol

  5. Artheriosclerotic plaque • Begins as a fatty streak, an ill-defined yellow lesion–fatty plaque, develops edges that evolve to fibrous plaques, whitish lesions with a grumous lipid-rich core

  6. Plaque components • Fibrous, Lipid, Calcified, Intra-plaque Hemorrhage

  7. Plaque classification • Different types of plaque pose different risks • Manual plaque classification (done by doctors)is a difficult task, and is error prone • Idea: develop an AI algorithmto distinguish between different types of plaque • Visual data mining

  8. Plaque classification (2)‏ • Developed by Foundation for Research and Technology • Based on Support Vector Machines • Looks at images produced by IVUS and MRIand are hand labeled by physicians • Up to 90% accurate

  9. Data mining task in Belgrade • Two separate paths: • Data mining from the results of hemodynamic simulations • Data mining form medical patient records • Goal: to provide input regarding the progression of the diseaseto be used for medical decision support

  10. Hemodynamics – the basics • Study of the flow of blood through the blood vessels • Maximum Wall Shear Stress – an important parameterfor plaque development prognoses

  11. Hemodynamics - CFD • Classical methods for hemodynamic calculations employ Computer Fluid Dynamics (CFD) methods • Involves solving the Navier-Stokes equation: • …but involves solving it millions of times! • One simulation can take weeks

  12. Data mining form hemodynamic simulations (first path)‏ • Idea: use results of previously done simulations • Train a data mining AI system capable of regression analysis • Use the system to estimate the desired valuesin a much shorter time

  13. Neural Networks - background • Systems that are inspired by the principle of operationof biological neural systems (brain)

  14. Neural Networks – the basics • A parallel, distributed information processing structure • Each processing element has a single output which branches (“fans out”) into as many collateral connections as desired • One input, one output and one or more hidden layers

  15. Artificial neurons • Each node (neuron) consists of two segments: • Integration function • Activation function • Common activation function • Sigmoid

  16. Neural Networks - backpropagation • A training method for neural networks • Try to minimize the error function:by adjusting the weights • Gradient descent: • Calculate the “blame” of each input for the output error • Adjust the weights by:(γ- the learning rate)

  17. Input data set • Carotid artery • 11 geometric parameters and the MWSS value

  18. The model • One hidden layer • Input layer: linear • Hidden and output: sigmoid • Learning rate 0.6 • 500K training cycles • Decay and momentum

  19. Current results • Average error: 8.6% • Maximum error 16,9%

  20. The “dreaded” line 4 • Line 4 of the original test set proved difficult to predict • Error was over 30% • Turned out to be an outlier • Combination of parameters was such that it couldn’t • But the CFD worked, NN worked • Visually the geometry looked fine • Goes to show how challenging the data preprocessing can be

  21. Data mining from medical data(second path)‏ • Use a large medical database (3000 patients)to attempt to find patterns that help predicating progression of arteriosclerosis • Data include: • Coronary angiography results • Blood chemistry • Risk factors (such as smoking, obesity, family histrory, etc.)

  22. Repeated angio dataset • 90 different parameters • Includes data from two coronary angiographiestaken at different times (distances between 3 months and 10 years)‏

  23. Current approach • Divide the patients into three categories, according to the second angio: • Less then 50% stenosis • 50-75% stenosis • More than 75% stenosis(percentages chosen based on the dataset values)‏ • Use Neural and SVM classifiers to attempt classification

  24. Current resutls • Current results: 80% accuracy, • But: • Division is very crude (“inherited” form the dataset)‏ • Misclassifications sometimes happen between class 1 and class 3 • Dataset lacks healthy and less critical patients • LDL data are missing • Further improvements, both in algorithms and the data needed,to make the results significant

  25. Genetic data • Single coronary angiography • Blood chemistry • Medications • Single Nucleotide Polymorphism (SNP) data on selected DNA sequences

  26. …and now for something completely different

  27. Questions

  28. Datamining @ ARTreat Project Veljko Milutinović vm@etf.rsZoran Babović zbabovic@gmail.comNenad Korolija nenadko@gmail.comGoran Rakočević g.rakocevic@gmail.comMarko Novaković atisha34@yahoo.com

More Related