Explore the benefits of analyzing DNA profile databases, including quality control, improved search strategies, and population genetics evaluation. Discover how real-world data can enhance statistical weights and refine testing kits. Uncover the significance of disclosure in the National DNA Index System.
The Time Has Come to Analyze DNA Profile Databases Dan E. Krane, Ph.D., Wright State University, Dayton, OH Forensic Bioinformatics (www.bioforensics.com)
National DNA Index System (NDIS) • Established in 1994, controlled by the FBI. • Explicit expectation that records would be subject to research and quality control. • No published research derived from NDIS to date.
Benefits of analyses of anonymized records • Quality assurance/quality control • Potential for improved search strategies • Potential for refinements of test kits • Evaluation of population genetic assumptions used for statistical weights • Use of real-world data in place of simulations
Review of Victoria State Database Krane/Paoletti analysis: >11,000 profiles, each compared to all others across 9 loci:
Shared alleles    Observed occurrences
14                401
15                27
16                1
17                16
18                0
[Figure: AussieBump]
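The all-against-all comparison described above can be sketched in a few lines. This is a minimal illustration, not the Krane/Paoletti procedure: the profile layout (one pair of alleles per locus), the function names `shared_alleles` and `shared_allele_histogram`, and the toy profiles are all assumptions made for the example.

```python
from collections import Counter
from itertools import combinations

def shared_alleles(profile_a, profile_b):
    """Count alleles shared between two profiles, locus by locus.

    Each profile is assumed to be a sequence of loci, and each locus a pair
    of alleles. A multiset intersection is used, so (12, 12) vs (12, 13)
    shares one allele and (12, 12) vs (12, 12) shares two.
    """
    total = 0
    for locus_a, locus_b in zip(profile_a, profile_b):
        overlap = Counter(locus_a) & Counter(locus_b)
        total += sum(overlap.values())
    return total

def shared_allele_histogram(profiles):
    """Compare every profile to every other one and tally shared-allele counts."""
    histogram = Counter()
    for a, b in combinations(profiles, 2):
        histogram[shared_alleles(a, b)] += 1
    return histogram

# Hypothetical two-locus profiles, for illustration only.
toy_profiles = [
    ((12, 13), (8, 9)),
    ((12, 13), (8, 9)),
    ((12, 12), (9, 10)),
]
histogram = shared_allele_histogram(toy_profiles)
for n_shared in sorted(histogram):
    print(n_shared, histogram[n_shared])
```

With n profiles this performs n(n-1)/2 comparisons, which is roughly 60 million pairs for a database of 11,000 records.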
Benefits of analyses of anonymized records • Quality assurance/quality control • Evaluation of population genetic assumptions used for statistical weights • Are populations uniform at city/state/region/nation levels? • Are similar close relatives common? • Use of real-world data in place of simulations
How would you determine the frequency of Obama supporters in Seattle? • Obama vote share: WA 61.2% • Region 60.6% • U.S. 52.9% • Utah? 24.7% • Map: popular vote in 2012 by Congressional district. Romney won red districts, Obama won blue districts.
Benefits of analyses of anonymized records • Quality assurance/quality control • Evaluation of population genetic assumptions used for statistical weights • Use of real-world data in place of simulations • What fraction of 3-person mixtures look like 2-person mixtures? • How evenly distributed is DNA profile space?
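The mixture question above could be asked of real records rather than simulated ones. The sketch below assumes a simple allele-counting criterion (a mixture "looks like" a 2-person mixture when no locus shows more than four distinct alleles), which is an assumption of this example and not a method stated in the slides. The function names and the profile layout (one pair of alleles per locus) are likewise hypothetical.

```python
import random

def min_contributors(mixture):
    """Minimum number of contributors implied by allele counting.

    `mixture` is a list of sets, one set of observed alleles per locus.
    Each person contributes at most two alleles per locus, so the bound is
    ceil(max distinct alleles / 2).
    """
    return max(-(-len(alleles) // 2) for alleles in mixture)

def pool_profiles(profiles):
    """Combine several single-source profiles into one mixture (sets per locus)."""
    n_loci = len(profiles[0])
    return [
        {allele for profile in profiles for allele in profile[i]}
        for i in range(n_loci)
    ]

def fraction_masked(profiles, n_trials, rng):
    """Estimate how often three randomly drawn profiles pool into a mixture
    that allele counting cannot distinguish from a 2-person mixture."""
    masked = 0
    for _ in range(n_trials):
        trio = rng.sample(profiles, 3)
        if min_contributors(pool_profiles(trio)) <= 2:
            masked += 1
    return masked / n_trials

# Hypothetical single-locus records, for illustration only.
toy_records = [((10, 11),), ((10, 12),), ((11, 12),), ((13, 14),), ((15, 16),)]
print(fraction_masked(toy_records, n_trials=1000, rng=random.Random(0)))
```

Drawing trios from actual database records, in place of genotypes generated from assumed allele frequencies, is what would make this an estimate from real-world data.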
It is time for disclosure of the NDIS database • Open access to data is a fundamental tenet of science. • Anonymized records would be easily copied and pose little (if any) privacy risk. • "Time for DNA disclosure," Science, 2009; 326:1631-1632.