480 likes | 707 Views
DQO Training Course Day 1 Module 6. How Many Samples do I Need? Part 3. Presenter: Sebastian Tindall. 60 minutes. How Many Samples do I Need?. REMEMBER:. HETEROGENEITY IS THE RULE!. Sampling for Environmental Activities.
E N D
DQO Training Course Day 1 Module 6 How Many Samples do I Need?Part 3 Presenter: Sebastian Tindall 60 minutes
How Many Samples do I Need? REMEMBER: HETEROGENEITY IS THE RULE!
Sampling for Environmental Activities Chuck RamseyEnviroStat, Inc.PO Box 636Fort Collins, CO 80522970-689-5700970-229-9977 fax chuck@envirostat.org www.envirostat.org
Sampling for Analytical Purposes Pierre Gy Translated by A.G. Royle John Wiley & Sons 1998 ISBN: 0-471-97956-2
Pierre Gy’s Sampling Theory and Sampling Practices, 2nd Edition Francis F. Pitard CRC Press 1993 ISBN: 0-8493-8917-8 Heterogeneity, Sampling Correctness, and Statistical Process Control
Seven Major Sampling Errors • Fundamental Error - FE • Grouping and Segregation Error - GSE • Materialization Error - ME • Delimination Error - DE • Extraction Error - EE • Preparation Error - PE • Trends - CE2 • Cycles - CE3
Seven Major Sampling Errors SE = FE + GSE + DE + EE + PE + CE2 +CE3
Ramsey’s “Rules” • All measurements are an average • With discreet sampling, the sample average is a random variable • With discreet sampling, the sample SD is an artifact of the sample collection process
Ramsey’s “Rules” • Heterogeneity is the rule • Multi-increment sampling can drive a skewed distribution towards normal (by invoking the CLT) • FE2 • proportional to particle size • inversely proportional to mass • Lab data are suspect (error can be large)
Ramsey’s “Rules” (cont.) • Good sampling technique is critical • Typical sample sizes will underestimate the mean • Quality control (QC) is important • NO boiler plate; (e.g., PARCC) • QC must be problem specific • Maximize the use of onsite analysis to guide planning & decisions • DQOs are the most important component of the process
Ramsey’s “Rules” (cont.) • One measurement is a crap shoot: • Tremendous heterogeneity (variability) between: • Particles within a sample • Aliquots of a sample • Duplicate samples • Never take ONE grab sample to base a decision • Always collect X increments and use AT LEAST one multi-increment sample to make the decision
Average Exposure In discreet sampling: • the sample mean is a random variable. • the 95% UCL is a random variable. • the sample range is a random variable. • the sample standard deviation is a random variable • the sample standard deviation is an artifact of sample collection process. • n (# samples) is NOT proportional to the size of the population (e.g. area, mass, or volume).
A A A A B B B A B B A B B A A Average depends on locations sampled Average A = 16 ppm Average B = 221 ppm Average from discrete sampling is a random variable
Hot Spots x • 1,000,000 g at site • 100,000 g > AL • Take 10 samples • 1> AL • Remove that 1 • Re-sample = clean • Wrong! • If 100,000 >AL • Minus 1 • Still 99,999>AL AL= action level
Hot Spots Simply Means: “I want to look at units (e.g. Mass, volume) that are becoming smaller and smaller and smaller and smaller and smaller and smaller and smaller” $ $ $ $ $ $ $ $$
Additional Population Considerations • Sample support - “physical size, shape and orientation of the material that is extracted from the sampling unit that is actually available to be measured or observed, and therefore, to represent the sampling unit.” • Assure enough sample for analyses • Specify how the sample support will be processed and sub-sampled for analysis. EPA Guidance on Choosing a Sampling Design for Environmental Data Collection, EPA QA/G-5S, December 2002, EPA/240/R-02/005
Sub-Sampling • The DQO must define what represents the population in terms of laboratory sample size: • Typical laboratory sample sizes that are digested or extracted: metals - 1g, volatiles - 5g, semi-volatiles - 30 g • The 1g or 30g sample analyzed by the lab is supposed to represent a larger area/mass (e.g., acre). Does it?
Multi-Increment Sampling is the Way to Go Next slides show “How to” perform multi-increment sampling
n = m * k Collect “n” samples Group into “k” increments k = 3 k = 3 Remember; we want the AVERAGE over the Decision Unit Combine “k” into “m” multi- increments m = 2 FAM/Laboratory
Multi-Increment Sampling n = number of samples required k = increments m= samples analyzed
Multi-Increment Sampling is the Way to Go exposure unit = decision unit [DU] (1) Lab(7) Samples & QC (6) calc d & FE & mass(2,3,4) 10 scoops(5) Re-Calculate particle size(8) Average concentration for DU(12,13) Sub sample mass for lab analysis(10) Analyze entire sub sample(11) Grind(9)
Multi-Increment Sampling is the Way to Go • Agree on exposure unit or decision unit. • Select or measure a reasonable maximum sample particle size. • Select the FE. • Calculate the mass of sample needed based on the FE and particle size. • Select n, m, & k • Using a square scoop large enough to capture the maximum particlesize, collect enough sample increments (k) to equal the masscalculated in #4 and place in a jar, combining increments into one “sample”. • Repeat within a given decision unit to produce replicates (duplicate, triplicates, etc.) to generate QC “samples”. • Deliver the sample and QC sample(s) to the lab (m).
Multi-Increment Sampling is the Way to Go, continued 9. Calculate the particle size of sample needed based on the desired sub-sampling FE and the mass that the lab normally uses for a given analysis (extraction). 10. Lab may have to grind entire mass of field sample (& QCs) to the agreed upon maximum analytical particle size in #8. 11. Lab must perform one-dimensional sub-sampling of entire mass [spread entire ground sample on flat surface in thin layer, then systematically or randomly collect sufficient small mass sub-sampling increments to equal the mass the laboratory requires for an analysis; do likewise for each QC sample]. 12. Combine sub-sampling increments into the “sample”, then digest/extract/analyze the sample and QC samples. 13. Calculate the COPC concentration from each sample. 14. Concentration represents average concentration or activity per decision unit.
Comparison of Discrete vs. Multi-Increment Remember: (In discreet sampling) • An average is a random variable; • The SD is an artifact of the sample collection process.
SHOW VDT File X-bar as Random Variable
Effects of Grinding a Soil Walsh, Marianne E.; Ramsey, Charles A.; Jenkins, Thomas F., The Effect of Particle Size Reduction by Grinding on Subsampling Variance for Explosives Residues in Soil, Chemosphere 49 (2002) 1267-1273.
Fundamental Error FE = fundamental error M = mass of sample (g) d = maximum particle size <5% oversize (cm) 3 d 2 = FE 22 . 5 ~ M EPA/600/R-92/128, July 1992
Fundamental Error 3 d 2 FE = 22 . 5 22.5= ~ clfg • c - mineralogical factor - density factor (for soil ~ 2.5) • l - liberation factor (between 0 -1) • f - shape factor (for soil ~0.5) • g - granulometric factor ~0.25 ~ M
2 M ( FE ) = d 3 22 . 5 3 d = M 22 . 5 2 FE Fundamental Error Solve for particle size OR Solve for mass of sample
Constant Particle Size 9217 gm 20% 4097 gm 30% Particle Size - 2.54 cm
Examples of FE, Mass, Particle Size May not work well or at all with some media • Clay • Water • Air
Example • Soil like material • Largest particle about 4 mm • Action limit is 500 ppm • Analytical aliquot is one gram • Is this acceptable? Compliments of EnviroStat, Inc.
Example (cont) Check particle size representatives FE = = 1.2 FEpercent = 1.2 * 100 FEpercent = 120% EPA/600/R-92/128, July 1992 Compliments of EnviroStat, Inc.
Example (cont) What mass is required to reduce FE to 15%? But lab can analyze 10 grams at the most Compliments of EnviroStat, Inc.
Example (cont) To what particle size does the sample need to be reduced to achieve FE of 15%? Compliments of EnviroStat, Inc.
Example (cont) What is the FE to take 64 grams and grind it to 0.1 cm and take one gram? Ignoring all the other errors Compliments of EnviroStat, Inc.
Example (cont) • Option 1 • take at least 64 grams and grind to 0.1 cm • analyze one gram • Option 2 • take at least 64 grams and grind to 0.22 cm • analyze 10 grams • Other options • investigate/estimate sampling factors (clfg) Compliments of EnviroStat, Inc.
Multi-increment Sampling • Saves money by taking fewer samples to make decision • Eliminates the classical statistics obstacles • Samples are representative of population • Results are defensible • Does not excite the public • Faster • Cheaper
Key Points • All measurements are an average • In discreet sampling, • the sample average is a random variable • The sample range is a random variable • The sample UCL is a random variable • The sample standard deviation is a random variable • In discreet sampling, the SD is an artifact of the sample collection process • Heterogeneity is the rule • Multi-increment sampling can save your butt! • Multi-increment sampling can get you defensible data within your sampling & analyses budget
Key Points (cont.) • Due to inherent heterogeneity, collecting representative sample is difficult • Managing Uncertainty approach and “Ramsey’s Rules” advocate • using cheaper, real-time, on-site methods • increasing sample density or coverage • Controlling laboratory analysis quality does not control all error • Errors occur in each step of the collection and analysis process
Key Points (cont.) • Managing Uncertainty approach encourages use of DWP to provide flexibility to obtain sufficient sample density • Larger the “mass”, the lower the sampling error • Smaller the “particle”, the lower the sampling error • Proper sub-sampling is critical • Sample design must assess the normal, skewed, and badly skewed distributions • For badly skewed computer simulations are needed • Multi-increment samples drive the distribution to normal
How Many Samples do I Need? REMEMBER: HETEROGENEITY IS THE RULE!
Summary • Use Classical Statistical sampling approach: • Very likely to fail to get representative data in most cases • Use Other Statistical sampling approaches: • Bayesian • Geo-statistics • Kriging • Use M-Cubed Approach: Based on Massive FAM • Use Multi-Increment sampling approach: • Can use classical statistics • Cheaper • Faster • Defensible: restricted to surfaces (soils, sediments, etc.) MASSIVE DATA Required
End of Module 6 Thank you Questions? Comments? This concludes our presentation for Day 1 See you here at 8:30 AM tomorrow for Day 2.