280 likes | 396 Views
Leveraging the Microsoft Repository in Your Warehouse. Saeed K. Rahimi Graduate Programs in Software University of St. Thomas DWSoft Corporation. Contents. Data Warehousing and Repository The Open Information Model and Meta Data Coalition Microsoft’s Data Warehousing Strategy
E N D
Leveraging the Microsoft Repository in Your Warehouse Saeed K. Rahimi Graduate Programs in Software University of St. Thomas DWSoft Corporation
Contents • Data Warehousing and Repository • The Open Information Model and Meta Data Coalition • Microsoft’s Data Warehousing Strategy • Integrating the Repository Into Your Warehouse DAMA Presentation, August 2000
Data Warehouse Architecture DM DM DM Data warehouse Life Cycle Operational Data • Extract • Cleans • Transform • Load DW Metadata Services SQL Server 7.5 - Object Persistence Services Repository DAMA Presentation, August 2000
What is Metadata? (Continued) • Information not just data • What is 122599? • Is it an integer number? • A string? • Christmas day 1999? • Repository provides metadata about this: • mmddyy DAMA Presentation, August 2000
What Is Metadata? • Information about your • Systems • Source systems, • Warehouse systems, • Processes • Etc. • Databases • Source databases • Warehouse databases • Table • Columns DAMA Presentation, August 2000
What Is Metadata? • Information about your • Transformations • From source system to data warehouse • From data warehouse to source systems • Calculations • Total-Sales = amount*quantity – discount + shipping&handling DAMA Presentation, August 2000
What Is Metadata? (Continued) OLAP information • Sales cube • TotalSales measure • Time, product, geography dimensions • Other information • Reporting Tools • Modeling Tools DAMA Presentation, August 2000
Metadata Benefits • Roadmap to information in the warehouse • What is in the warehouse • What it means • How to get the information • Who owns it • When it was created • How it was created/generated DAMA Presentation, August 2000
Metadata Benefits • All users that interact with the warehouse will benefit from enhanced information • Analysts can better understand what data is available, and how it was calculated • Maintains a history prior to current processes DAMA Presentation, August 2000
Metadata Benefits (Continued) • Provides a common frame of reference – “what does total sales mean” • Provides a central storage location of warehouse knowledge (helps with turnover, growth) • Allows enterprises to ensure data marts/warehouses are built using common terms and goals DAMA Presentation, August 2000
Open Information Model • The schema for the repository • Common set of core definitions to promote sharing of information among different tools and vendors • Covers basic set of sub-models • Created with help/input from over 20 partners and reviewed by 300 • Microsoft has transferred rights to evolve the OIM to the meta data coalition (MDC) DAMA Presentation, August 2000
MDC OIM Components • Current components • Unified Modeling Language (UML) • Relational database schema • Data transformations • Multidimensional schema (OLAP) • English semantics • Legacy databases • Component descriptions • Components under review • Report definitions • Entity/relationship diagrams • Business engineering DAMA Presentation, August 2000
Sample OIM Constructs ITfmTransformation Transformation – mapping from Source column to Target column TfmTransformationHasSourceColumns TfmTransformationHasTargetColumns TfmObjectSetConsistsOfObject ITfmTransformableObjectSet ITfmTransformableObject (Columns) DAMA Presentation, August 2000
A A B B C C D D Virtual Relationship This This VS DAMA Presentation, August 2000
a b c e d Virtual Properties This This A A VS a b B B c c C C D D e d e d DAMA Presentation, August 2000
Microsoft’s Strategy Impact • Prior to SQL server 7.0, DW was primarily available only to very large companies • Tools were expensive, expertise hard to find • Microsoft has lowered the bar by bundling components (with SQL server for free) and making DW available to many more companies DAMA Presentation, August 2000
Microsoft Warehouse Components • Data transformation services • Extract, transform, load • OLAP services • Multidimensional server and analysis • Repository • Information sharing • English query • English to SQL translation DAMA Presentation, August 2000
Repository Maintenance • Importing information into the repository has always been a challenging task • Information needs to be mapped to the model components • Information capture needs to be automated • Once in the repository, maintaining versions of the information is needed • New information load can not destroy the information already in the repository DAMA Presentation, August 2000
Loading the Repository Data Transformation Services OLAP Services Star Schema Database OLTP Database English Query Application Transform Other Sources OLAP DB Schema DB Schema Semantics Repository Other DAMA Presentation, August 2000
Loading the Repository • DTS can save to repository • OLAP information can be saved to repository with SQL Server 2000 • OLE DB and ODBC data sources and related information can be imported into repository with SQL Server 2000 DAMA Presentation, August 2000
Implementation Issues • DTS doesn’t provide complete source to target mappings for certain transformations • For example • When a query used as source, • Interfaces to many third party tools incomplete or not available • Modeling tools • ERWin • Visio • PowerDesigner • Warehouse tools • Business Objects • Information Advantage DAMA Presentation, August 2000
Repository Features • Extensible metamodel • Version management • Dynamic COM based interface • Batch XIF file based interface • Workspace concept • Some pre-built interfaces to data warehouse tools DAMA Presentation, August 2000
Keys to Success • Ensure repository information is current • Provide access to all users of the warehouse • Tackle implementation in well defined, small steps • Designate someone with the responsibility and authority to manage the repository • Maintain long term focus DAMA Presentation, August 2000
Access to All Users • Show users what they want to see • Provide simple, easy to understand information • Make warehouse information easily available “in context” • Use familiar interface and UI concepts • Web based access • Client/server based access • Provide custom integration to existing tools and interfaces when necessary DAMA Presentation, August 2000
Modes of Access • Administrative Access • Management • Meta-model (schema) changes • Meta-data loading and maintenance • End-user Access • Browsing • Navigation • Impact analysis DAMA Presentation, August 2000
Types of Access • Local and over the LAN access • Metamodel management • Repository bulk loading • Interface from other tools (loading meta-data) • Remote and over the WEB access • Browser based • accessible from anywhere on the globe • Navigation oriented • Interface from/to other tools • May be minor editing capabilities DAMA Presentation, August 2000
Access Control • User Classification • Different Class of Users Need to Access Different Objects in the Repository • Different User Profiles • Different Read/write Access Control • Possible User Classes • Business Executive • Casual User • Database Administrator • Data Warehouse Administrator • Repository Administrator DAMA Presentation, August 2000
Questions? Comments or discussions? Emails: skrahimi@stthomas.edu srahimi@dwsoft.com