280 likes | 295 Views
Explore the ARTreat project targeting patient-specific cardiovascular models for atherosclerosis prediction. Learn about plaque classification, hemodynamics, and data mining from medical records and simulations. Discover the challenges and advancements in this vital field.
E N D
Datamining @ ARTreat Veljko Milutinović vm@etf.rsZoran Babović zbabovic@gmail.comNenad Korolija nenadko@gmail.comGoran Rakočević g.rakocevic@gmail.comMarko Novaković atisha34@yahoo.com
Agenda • ARTReat – the project • Arteriosclerosis – the basics • Plaque classification • Hemodynamic analysis • Data mining for the hemodynamic problem • Data mining from patent records
ARTreat – the project • ARTreat targets at providing a patient-specific computational modelof the cardiovascular system, used to improve the quality of predictionfor the atherosclerosis progression and propagation into life-threatening events. • FP7 Large-scale Integrating Project (IP) • 16 partners • Funding: 10,000,000 €
Atherosclerosis • Atherosclerosis is the condition in which anartery wall thickens as the result of a build-up of fatty materials such as cholesterol
Artheriosclerotic plaque • Begins as a fatty streak, an ill-defined yellow lesion–fatty plaque, develops edges that evolve to fibrous plaques, whitish lesions with a grumous lipid-rich core
Plaque components • Fibrous, Lipid, Calcified, Intra-plaque Hemorrhage
Plaque classification • Different types of plaque pose different risks • Manual plaque classification (done by doctors)is a difficult task, and is error prone • Idea: develop an AI algorithmto distinguish between different types of plaque • Visual data mining
Plaque classification (2) • Developed by Foundation for Research and Technology • Based on Support Vector Machines • Looks at images produced by IVUS and MRIand are hand labeled by physicians • Up to 90% accurate
Data mining task in Belgrade • Two separate paths: • Data mining from the results of hemodynamic simulations • Data mining form medical patient records • Goal: to provide input regarding the progression of the diseaseto be used for medical decision support
Hemodynamics – the basics • Study of the flow of blood through the blood vessels • Maximum Wall Shear Stress – an important parameterfor plaque development prognoses
Hemodynamics - CFD • Classical methods for hemodynamic calculations employ Computer Fluid Dynamics (CFD) methods • Involves solving the Navier-Stokes equation: • …but involves solving it millions of times! • One simulation can take weeks
Data mining form hemodynamic simulations (first path) • Idea: use results of previously done simulations • Train a data mining AI system capable of regression analysis • Use the system to estimate the desired valuesin a much shorter time
Neural Networks - background • Systems that are inspired by the principle of operationof biological neural systems (brain)
Neural Networks – the basics • A parallel, distributed information processing structure • Each processing element has a single output which branches (“fans out”) into as many collateral connections as desired • One input, one output and one or more hidden layers
Artificial neurons • Each node (neuron) consists of two segments: • Integration function • Activation function • Common activation function • Sigmoid
Neural Networks - backpropagation • A training method for neural networks • Try to minimize the error function:by adjusting the weights • Gradient descent: • Calculate the “blame” of each input for the output error • Adjust the weights by:(γ- the learning rate)
Input data set • Carotid artery • 11 geometric parameters and the MWSS value
The model • One hidden layer • Input layer: linear • Hidden and output: sigmoid • Learning rate 0.6 • 500K training cycles • Decay and momentum
Current results • Average error: 8.6% • Maximum error 16,9%
The “dreaded” line 4 • Line 4 of the original test set proved difficult to predict • Error was over 30% • Turned out to be an outlier • Combination of parameters was such that it couldn’t • But the CFD worked, NN worked • Visually the geometry looked fine • Goes to show how challenging the data preprocessing can be
Data mining from medical data(second path) • Use a large medical database (3000 patients)to attempt to find patterns that help predicating progression of arteriosclerosis • Data include: • Coronary angiography results • Blood chemistry • Risk factors (such as smoking, obesity, family histrory, etc.)
Repeated angio dataset • 90 different parameters • Includes data from two coronary angiographiestaken at different times (distances between 3 months and 10 years)
Current approach • Divide the patients into three categories, according to the second angio: • Less then 50% stenosis • 50-75% stenosis • More than 75% stenosis(percentages chosen based on the dataset values) • Use Neural and SVM classifiers to attempt classification
Current resutls • Current results: 80% accuracy, • But: • Division is very crude (“inherited” form the dataset) • Misclassifications sometimes happen between class 1 and class 3 • Dataset lacks healthy and less critical patients • LDL data are missing • Further improvements, both in algorithms and the data needed,to make the results significant
Genetic data • Single coronary angiography • Blood chemistry • Medications • Single Nucleotide Polymorphism (SNP) data on selected DNA sequences
Datamining @ ARTreat Project Veljko Milutinović vm@etf.rsZoran Babović zbabovic@gmail.comNenad Korolija nenadko@gmail.comGoran Rakočević g.rakocevic@gmail.comMarko Novaković atisha34@yahoo.com