1 / 21

HEARTbeat anomaly detection using adversarial oversampling

This presentation discusses the use of adversarial oversampling to improve the detection of heart anomalies through machine learning. The focus is on addressing the imbalanced sampling problem faced in minority cases. The proposed approach includes classification of heartbeat images, generation of synthetic images using Infogan architecture, and comparison with conventional methods. The results show improved accuracy for minority classes using the adversarial oversampling technique.

negri
Download Presentation

HEARTbeat anomaly detection using adversarial oversampling

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. HEARTbeat anomaly detection using adversarial oversampling Presenter: Syed Sharjeelullah Course: CS-732 Authors: Jefferson L. P. Lima David Macedo CleberZanchettin

  2. introduction • Cardiovascular diseases among the most common causes of death • Early detection and prevention mechanisms are the best way to avoid this • Machine learning at the forefront of solving this problem • Biggest problem faced is with minority cases and unbalanced sampling

  3. Introduction • Classification of heartbeat images in cardiovascular diseases • Infogan architecture used to generate synthetic images for minority classes • Adversarial oversampling and comparison with conventional methods • 2-dimensional convolutional neural network used • Data taken from MIT-BIH arrythmia database

  4. Historical work • ECG best form of prevention indicators for heart problems • Ecg classification is very difficult because of differing waveforms and distinct types • Plus the intra class variation • Current techniques use feature extraction process • Although spatial location of the heartbeat images play major part in classification

  5. Historical work • Ecg analysis is commonly done using convolutional neural networks • Datasets have high degree of imbalance • Example, rare types of classes have considerably low amount of data for training • This imbalance results in very bad accuracy for minority classes • Conventional oversampling methods include random oversampling, smote, adasyn

  6. Oversampling methods 1. Random oversampler: Choosing samples from minority class randomly and with replacement 2. Synthetic minority oversampling techniques: synthesizes new instances between New and existing instances 3. Adaptive synthetic sampling approach: uses density distribution as criteria For deciding no. of synthetic samples for minority classes

  7. Generative adversarial networks

  8. Explanation of the approach • Three phases 1. Pre processing 2. Adversarial oversampling (InfoGAN) 3. Training and classification

  9. Pre processing • The experiment dataset contains 48 half-hour excerpts of two-channel ambulatory ECG 360hz recordings obtained from 47 subjects. • THE ecg signals are cut into 25 time slices for each signal • the peak of the heartbeat at the center of each sliced image • Each sliced image is classified individually • Then binary images are produced with 112x112 resolution • These two dimensional labelled images would be the training and test data for the classifier

  10. Pre processing

  11. Adversarial oversampling • This phase generates the synthetic fake images for the minority classes • This is the adversary model because its speed of convergence, robustness to mode collapse and quality of synthetic images • Generator G takes standard random noise of 64x1 • Generator produces a synthetic image at the end of the model • The discriminator D takes both input and fake images of resolution 112x112 • The D is composed of 2-dimensional convolutional layers, dropout to minimize overfitting, batch normalization, Leaky ReLU as activation function, and Sigmoid function

  12. Adversarial oversampling • The general InfoGAN objective function is given by the lowerbound approximation to the Mutual Information as follows:

  13. Adversarial oversampling-infogan architecture

  14. Training and classification • Convolutional neural network is the classification model • Network takes 2D array as an input with a resolution of 112x112 • Rectified Linear Unit has been used as the activation function.

  15. Training and classification • Given the data set X, for a single sample of Xi from X, the network training consists in optimize the following cross-entropy objective function: • M is the number of classes; y is a binary indicator (0 or 1); • if class label c is the correct classification for observation (heartbeat) o, and p is the predicted probability observation o is of class c.

  16. Infogan training &CNN Architecture

  17. Experiments • The experiment dataset contains 48 half-hour excerpts of two-channel ambulatory ECG 360hz recordings obtained from 47 subjects. • The CNN training consisted of 20 epochs and an early-stop if the loss in the validation set did not decrease considering the previous epoch. • . The experiments were repeated ten times, where 20% the data was used as the test set, 10% as the validation set, and 70% to training. • A total of 1000 samples were generated for minority classes for balancing data

  18. Results

  19. Results

  20. results

  21. Conclusion • For ECG classification, bi-dimensional approach is better • Generation of synthetic samples from infogan improves accuracy for minority classes • Infogan synthetic samples maintain features of the actual classes • Adverserial sampling overcomes traditional methods • Future work would include working on automatic generation and selection of samples for minority classes

More Related