BLM Data Quality

BLM Data Quality. Purpose - after this course you will be able to… Describe why Data Quality matters. Define what data quality is. Show how Data Quality fits into the Data Life Cycle. Explain the measures of data quality. Describe the general data quality process.

dsacco


Presentation Transcript


  1. BLM Data Quality

  2. Purpose - after this course you will be able to… • Describe why Data Quality matters • Define what data quality is • Show how Data Quality fits into the Data Life Cycle • Explain the measures of data quality • Describe the general data quality process • Demonstrate a knowledge of how to measure data quality

  3. Data Quality • Data: A representation of facts, which when put into context becomes information that is used to draw a conclusion or make a decision. • The only people who DO NOT need to worry about data quality are those who neither create nor use data.

  4. Data Life Cycle - Evaluate and QA/QC • The Evaluate phase of the BLM Data Life Cycle is where numerous data evaluation factors are addressed • The QA/QC location in the middle shows that data quality should be addressed throughout the entire life cycle [Diagram labels: PLAN, ARCHIVE, ACQUIRE, EVALUATE, MAINTAIN, ACCESS, with QA/QC in the middle]

  5. What is Data Quality? • BLM has defined data quality as “fitness for the intended use” • Data quality may be considered as the sum of all data characteristics that determine how useful the data is in performing specific business processes.

  6. Data Quality Dimensions • Validity • Non-Duplication • Completeness • Relationship Validity • Consistency • Concurrency

  7. Data Quality Dimensions • Timeliness • Accurate (to reality) • Accurate (to surrogate source) • Precision • Derivation Integrity

  8. Data Quality Dimensions • Completeness (Features) • Positional Accuracy • Logical Consistency • Attribute Accuracy • Geometric Accuracy (Raster) • Radiometric Accuracy (Raster)

  9. Data Quality Dimensions • Geospatial Data quality measures can include the previous list as well as additional quality measures • Errors may be propagated from one dataset to the next and need to be measured and tracked
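Two of the dimensions listed above, Completeness and Non-Duplication, lend themselves to simple quantitative measures. The slides do not define how BLM computes them, so the following Python is only a minimal sketch under assumed definitions: completeness as the share of records with a non-empty value in a field, and duplication as the share of records whose key repeats an earlier record. The field names and sample records are hypothetical.

```python
def completeness(records, field):
    """Share of records with a non-empty value for `field` (assumed definition)."""
    if not records:
        raise ValueError("records must not be empty")
    filled = sum(1 for r in records if r.get(field) not in (None, ""))
    return filled / len(records)


def duplication_rate(records, key_fields):
    """Share of records whose key repeats an earlier record's key (assumed definition)."""
    if not records:
        raise ValueError("records must not be empty")
    seen = set()
    duplicates = 0
    for r in records:
        key = tuple(r.get(f) for f in key_fields)
        if key in seen:
            duplicates += 1
        else:
            seen.add(key)
    return duplicates / len(records)


# Hypothetical sample records, not BLM data.
sample = [
    {"site_id": "A1", "state": "NV", "acres": 120},
    {"site_id": "A2", "state": "", "acres": 75},
    {"site_id": "A1", "state": "NV", "acres": 120},  # repeats the first site_id
    {"site_id": "A3", "state": "UT", "acres": None},
]

state_completeness = completeness(sample, "state")
site_duplication = duplication_rate(sample, ["site_id"])
print(f"Completeness of 'state': {state_completeness:.0%}")
print(f"Duplication on 'site_id': {site_duplication:.0%}")
```

Which fields count as the key, and what counts as "empty", are planning decisions that depend on the intended use of the data.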

  10. Data Quality Management Business Process • Plan: Use the table to define what combination of quality dimensions to apply. Plan who does what and when. • Measure: Use the tools and methods selected in the planning step to acquire, compile, and summarize (quantitatively or qualitatively) the results for each [quality dimension]. • Evaluate: Apply professional judgment to form an aggregate conclusion or report specific quantifiable measurements. • Report: Insert result into metadata file and report result to data maintenance function. Report quality measurement to the “Plan” process.

  11. Error Rates: Beware of First Impressions

  12. Error Rates • An Error Rate is the number of times an error is made divided by the total number of entries (ER = Number of Errors / Number of Entries) • However, the key to establishing an acceptable error rate, and to designing quality control that achieves it, is determining what you are measuring against
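As a minimal sketch of the formula above, the following Python computes an error rate two ways for the same batch of entries: once against the number of individual field values and once against the number of records. The counts are invented for illustration and are not BLM data; the point is only that the choice of denominator changes the result.

```python
def error_rate(num_errors: int, num_entries: int) -> float:
    """ER = number of errors / number of entries."""
    if num_entries <= 0:
        raise ValueError("num_entries must be positive")
    return num_errors / num_entries


# Hypothetical batch: 100 records with 10 fields each.
# 20 field values are wrong, and they fall within 15 distinct records.
records = 100
fields_per_record = 10
wrong_field_values = 20
records_with_any_error = 15

per_field_rate = error_rate(wrong_field_values, records * fields_per_record)
per_record_rate = error_rate(records_with_any_error, records)

print(f"Per-field error rate:  {per_field_rate:.1%}")
print(f"Per-record error rate: {per_record_rate:.1%}")
```

The same batch looks far better when measured per field than per record, which is why the measurement basis needs to be stated alongside any error rate.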

  13. Points to Remember • Determine the relative importance of the fields you are entering (compared to other fields you are entering) • Adjust any quality control factors (# in sample, for instance) to ensure that accuracy level is properly accounted for • Target training and review to those fields with the highest accuracy level requirement • Do not assume overall quality based on entries alone; ensure that the relative importance of certain entries is factored in • Anyone can lie (or at least mislead) with statistics
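One way to factor relative importance into an overall figure, as the points above suggest, is to weight each field's errors by an importance score. The slides do not prescribe a weighting scheme, so the following Python is a sketch under that assumption; the field names, weights, and counts are hypothetical.

```python
def weighted_error_rate(errors, entries, weights):
    """Importance-weighted error rate across fields.

    errors, entries, weights: dicts keyed by field name.
    Each field's errors and entries are scaled by that field's weight.
    """
    weighted_entries = sum(weights[f] * entries[f] for f in entries)
    if weighted_entries <= 0:
        raise ValueError("weighted entry count must be positive")
    weighted_errors = sum(weights[f] * errors.get(f, 0) for f in entries)
    return weighted_errors / weighted_entries


# Hypothetical fields: one critical, one low importance.
entries = {"legal_description": 100, "comments": 100}
errors = {"legal_description": 8, "comments": 2}
weights = {"legal_description": 5, "comments": 1}

unweighted = sum(errors.values()) / sum(entries.values())
weighted = weighted_error_rate(errors, entries, weights)

print(f"Unweighted error rate: {unweighted:.1%}")
print(f"Weighted error rate:   {weighted:.1%}")
```

Here the unweighted rate understates the problem because most of the errors sit in the field that matters most.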

  14. Addressing Data Quality • Data Quality Plans should be developed during project planning • During all data acquisitions, regardless of method: collecting, buying, sharing, or converting legacy data • During review and analysis of existing data sets and applications • Whenever data are accessed and used

  15. Data Quality Support • Data Quality Staff are available at the National Operations Center (NOC), Branch of Resource Data, in the Division of Resource Services • Data Quality Tools are available through the NOC Staff • https://blmspace.blm.doi.net/wo/wodm/Pages/HomePage.aspx

  16. Purpose - After this course you will be able to… • Describe why Data Quality matters • Define Data Quality • Show how Data Quality fits into the Data Life Cycle • Explain the Dimensions of data quality • Describe the general data quality process • Demonstrate a knowledge of how to measure data quality

  17. Summary • Data Quality is the responsibility of every BLM employee who collects, manages, or uses data in their decision-making processes.
