140 likes | 434 Views
Why Data Standardization?. Data Standardization – Ensuring all objects of a specific class conform to a predefined, expected format. Comparisons – Knowing you will find all entities linked to the search target.
E N D
Why Data Standardization? • Data Standardization – Ensuring all objects of a specific class conform to a predefined, expected format. • Comparisons – Knowing you will find all entities linked to the search target. • Calculations – Improved accuracy of statistical measurements and entity resolution performed on data subsets. • Correctness – Every record from every data source can be compared to the standard which may help isolate the root cause of individual errors allowing mass corrections at the source. FOR OFFICIAL USE ONLY
Setting The Standard • Telephone Numbers • North American Numbering Plan Administrator • International Telecommunication Union 2024467790 (202) 446-7790 94467790 *8212024467790 10103452024467790 0012024467790 18002255528430133498020244612 1-2024467790 • IMSI, UFMI, IMEI, MESID, ESN, ENUM… FOR OFFICIAL USE ONLY
Setting The Standard • Telephone Numbers • North American Numbering Plan Administrator • International Telecommunication Union (202) 446-7790 *8212024467790 10103452024467790 0012024467790 1-2024467790 • Addresses • Country Specific Postal Standards • USPS Delivery Sequence File 14410 North Broad Street Apartment 113A Los Angeles California 90377-3344 FOR OFFICIAL USE ONLY
Setting The Standard • Addresses • Country Specific Postal Standards • USPS Delivery Sequence File 14410 North Broad Street Apartment 113A Los Angeles California 90377-3344 FOR OFFICIAL USE ONLY
Setting The Standard • Names Last, First First Last First Middle Last Dr. First Last First Last Jr. Last, First MI First MI Last • Foreign Names, Nicknames? FOR OFFICIAL USE ONLY
DM Definition Data management is the structured processes and systems that plan for, acquire, and maintain data, consistent with requirements, throughout the data life cycle. FOR OFFICIAL USE ONLY
Proposed CNTPO Initiative • Initiative • Develop a Prototype Standardization System Capable of Handling Multiple Disparate Data Types and Demonstrating Value-Added Capabilities by Employing Innovated Data Cleansing Modules for data Transformation and Loading Processes. • Goals • Reduce Costs • Integrate Disparate Information • Enable Data Reuse • Moving Forward • Completing Acquisition Plan • Present to the PIC FOR OFFICIAL USE ONLY