130 likes | 212 Views
The effects of applying cell-suppression and perturbation to aggregated genetic data.
E N D
The effects of applying cell-suppression andperturbation to aggregated genetic data Athos Antoniades, John Keane, Aristos Aristodimou, Christa Philipou, Andreas Constantinou, Christos Georgousopoulos, Federica Tozzi, Kyriacos Kyriacou, Andreas Hadjisavas, Maria Loizidou, Christiana Demitriou and Constantinos Pattichis
Introduction • Why Share Data? • What are the current legal and ethical limitations? • How have scientists shared medical data so far? • Key Problems • Perturbation • Cell Suppression
The Problem • Why share data: • Replication Testing • Statistical Power • Multiple Testing Problem • Legal and Ethical Issues • AnonymizationvsPseudoanonimization • Limitations derived from consent form signed by subjects • Other, regional, study, or subject specific issues.
How have scientists shared medical data Contingency Table and Data Cube example
16 year old widow Problem A paper that analyzes data from a specific study reports:
16 year old widow Problem A paper that analyzes data from a specific study reports:
16 year old widow Problem A paper that analyzes data from a specific study reports:
Categorization Differences Paper 2 that analyzes data from the same study reports: Paper 1 that analyzes data from a specific study reports:
Perturbation and Cell Suppression Perturbation (+-1) and Cell Suppression (<5) Original Data
Evaluation • Most common parameters tested • Perturbation:[0], [-1,1], [-3,3], [-5,5], [-10,10] • Cell Supression: <0, <=1, <=3,<=5,<=10 • Standard main effect test usingChi Square • Pearson’s Correlation Coefficient used to evaluate deviation of each parameter combination to original results. • A-priory defined threshold for Pearson’s correlation coefficient <=0.95.
Conclusion and Future Work • We were able to identify for this dataset, the maximum noise that can be added to the data without significantly affecting the outcomes. • Results only relevant to MASTOS, all other datasets need to repeat the analytical approach described. • Further investigation is necessary to identify the minimum parameter settings to satisfy legal and ethical requirements.
Who to Contact • Athos Antoniades • University of Cyprus • email: athos@cs.ucy.ac.cy