100 likes | 386 Views
ETL and Metadata. The ETL Process. Presentation System. Source Systems. Staging Area. Transform. Extract. Load. Source. Business Owner. IS Owner. Platform. Location. Data Source Description. Source Data. Record the name location and data that exists in the TPS environment.
E N D
The ETL Process Presentation System Source Systems Staging Area Transform Extract Load
Source Business Owner IS Owner Platform Location Data Source Description Source Data Record the name location and data that exists in the TPS environment. • File names and location • Layout • Attribute meaning
Target Table Target Column Data Type Len Target Column Description Source System Source Table / File Source Col / Field Data Txform Notes Extraction Copy specific data directly from the source tables into a working dataset in the staging area.
Transformation(Dimension Tables) • Generate surrogate key in a primary-surrogate table. Make this permanent. • Insert the surrogate key into the working dimension tables. • Conduct any editing/cleaning operations you need (usually on the working table) • Generate any derived attributes you need. • Generate and retain process logs.
Chg Flag Fact Group Derived Fact Name Derived Fact Description Type Agg Rule Formula Constraints Transfor- mations Transformation(Fact tables) • Join all dimensions to the fact table (using original primary keys). Insert surrogate keys • Generate derived facts • Generate indicator flags
Table Name Column Name Data Type Len Nulls? Column Description PK PK Order FK Target Data Describe the presentation data structure. • Model • Metadata • Usage and constraints
Flow Documentation • DFD for the ETL process • ERD for Source, Staging and Target databases. • Metadata • Usage notes.