270 likes | 280 Views
BLM Data Quality. Purpose - after this course you will be able to…. D escribe why Data Quality matters D efine what is data quality Show how Data Quality fits into the Data Life Cycle E xplain the measures of data quality D escribe the general data quality process
E N D
Purpose - after this course you will be able to… • Describe why Data Quality matters • Define what is data quality • Show how Data Quality fits into the Data Life Cycle • Explain the measures of data quality • Describe the general data quality process • Demonstrate a knowledge of how to measure data quality
Data Quality • Data: A representation of facts, which when put into context becomes information that is used to draw a conclusion or make a decision. • The only people who DO NOTneed to worry about data quality are those who neither create nor use data.
Data Life Cycle - Evaluate and QA/QC PLAN • The Evaluate Phase of the BLM Data Life Cycle is where numerous data evaluation factors are addressed • The QA/QC location in the middle shows that data quality should be addressed throughout the entire life cycle ARCHIVE ACQUIRE QA/QC EVALUATE MAINTAIN ACCESS
What is Data Quality? • BLM has defined data quality as “fitness for the intended use” Data quality may be considered as the sum of all data characteristics that determine how useful the data is in performing specific business processes.
Data Quality Dimensions • Validity • Non-Duplication • Completeness • Relationship Validity • Consistency • Concurrency
Data Quality Dimensions • Timeliness • Accurate (to reality) • Accurate (to surrogate source) • Precision • Derivation Integrity
Data Quality Dimensions • Completeness (Features) • Positional Accuracy • Logical Consistency • Attribute Accuracy • Geometric Accuracy (Raster) • Radiometric Accuracy (Raster)
Data Quality Dimensions • Geospatial Data quality measures can include the previous list as well as additional quality measures • Errors may be propagated from one dataset to the next and need to be measured and tracked
Data Quality Management Business Process Use the table to define what combination of quality dimensions to apply. Plan who does what and when. Plan Use the tools and methods selected in the planning step to acquire, compile, and summarize (quantitatively or qualitatively) the results for each Measure Apply professional judgment to form an aggregate conclusion or report specific quantifiable measurements. Evaluate Insert result into metadata file and report result to data maintenance function. Report quality measurement to the “Plan” process. Report
Error Rates Beware of First Impressions
Error Rates • An Error Rate is the number of times an error is made divided by the total number of entries (ER = # of Errors/Number of Entries) • HOWEVER, the trick with establishing what is an acceptable error rate and how you do quality control to prevent it is in determining what you are measuring against
Points to Remember • Determine the relative importance of the Fields you are entering (compared to other fields you are entering) • Adjust any quality control factors (# in sample, for instance) to ensure that accuracy level is properly accounted for • Target training and review to those fields with the highest accuracy level requirement • Do not assume overall quality based on entries alone; ensure that the relative importance of certain entries are factored in • Anyone can lie (or at least mislead) with statistics
Addressing Data Quality • Data Quality Plans should be developed during project planning • During all data acquisitions, regardless of method; collecting, buying, sharing, converting legacy data • Review and analysis of existing Data Sets and Applications • Whenever data are accessed and used
Data Quality Support • Data Quality Staff are available at the National Operations Center (NOC)- Branch of Resource Data in the Division of Resource Services • Data Quality Tools are available through the NOC Staff • https://blmspace.blm.doi.net/wo/wodm/Pages/ HomePage.aspx
Purpose - After this course you will be able to… • Describe why Data Quality matters • Define Data Quality • Show how Data Quality fits into the Data Life Cycle • Explain the Dimensions of data quality • Describe the general data quality process • Demonstrate a knowledge of how to measure data quality
Summary • Data Quality is the responsibility of every BLM employee who collects, manages, or uses data in their decision making processes.