AFPOA Virtual Vendor Day
Topic: Data Integration
Gregory J. Vaughan – Executive Consultant, WW Military and Defense Lead, Information Agenda Tiger Team
There’s no “easy button” for this…
• Data integration is a complex problem
• A myopic view of the problem frustrates the desired end state
• Scoping the problem too narrowly reduces the likelihood of success
• Deferring data integration until later forces a revisit of the problem scope
• Data integration presents the greatest risk to IT-related business initiatives
• Data governance is required, but frequently overlooked
• The complexities of data integration require a comprehensive solution
Solution Architecture – General View
[Diagram: operational systems and data (internal databases, external databases, applications, unstructured content, metadata) pass through information integration, data quality, and information services under "Define & Govern" to become trusted information (master data, operational data, data warehouse, data marts). Analytical information (OLAP cubes) then serves users and applications through BI (reports, dashboards, query, OLAP), text analytics, predictive analytics, and optimization.]
The IBM Solution: IBM Information Server
Delivering information you can trust
• Understand: Discover, model, and govern information structure and content
• Cleanse: Standardize, merge, and correct information
• Transform: Combine and restructure information for new uses
• Deliver: Synchronize, virtualize, and move information for in-line delivery
Shared across the platform: Unified Deployment, Unified Metadata Management, Parallel Processing, Rich Connectivity to Applications, Data, and Content
Align business and IT objectives using a single platform that creates trusted information for use in key initiatives
Roles: Business Analysts, Enterprise Architects, Executives, Subject Matter Experts, Data Steward, System Architect, ERP System Manager, DBA, Developer, Data Analysts & Architects
[Diagram labels for Sources and Business Initiatives: legacy, BI apps, SAP, dbs, warehouse, xls, xml, flat files, mdm, z/OS, custom]
InfoSphere Information Analyzer
Analyze source data quality and monitor adherence to integration and quality rules
Requirements
• Perform data quality assessment
• Define business rules to monitor data quality
• Establish stewards for governance of data quality
Benefits
• Identify data quality issues early to reduce project risks
• Monitor quality metrics over time for compliance
• Create business confidence with trusted information
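The idea of defining business rules and monitoring quality metrics can be illustrated with a minimal, generic Python sketch. It is not the Information Analyzer API; the `Rule` class, the `assess` function, the rule names, and the sample records are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Rule:
    """A named data quality rule (hypothetical structure for illustration)."""
    name: str
    check: Callable[[dict], bool]  # returns True when a record passes


def assess(records, rules):
    """Return, per rule, the fraction of records that pass it."""
    total = len(records)
    results = {}
    for rule in rules:
        passed = sum(1 for rec in records if rule.check(rec))
        results[rule.name] = passed / total if total else 1.0
    return results


# Made-up sample data: two bad ZIP codes, one missing email.
records = [
    {"id": 1, "zip": "22202", "email": "a@example.com"},
    {"id": 2, "zip": "2220", "email": ""},
    {"id": 3, "zip": "20301", "email": "c@example.com"},
    {"id": 4, "zip": None, "email": "d@example"},
]

rules = [
    Rule(
        "zip_is_5_digits",
        lambda r: isinstance(r["zip"], str) and len(r["zip"]) == 5 and r["zip"].isdigit(),
    ),
    Rule("email_present", lambda r: bool(r["email"])),
]

scores = assess(records, rules)
for name, score in scores.items():
    print(f"{name}: {score:.0%} of records pass")
```

Running the same rules against each new load and storing the scores gives the "quality metrics over time" that the slide refers to.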
InfoSphere Business Glossary
Create and manage business vocabulary and relationships, and relate them to physical sources
Requirements
• Capture business terms and classifications
• Link business terms and classifications to IT assets
• Identify data stewards and make the glossary accessible
Benefits
• Context for information is available to everyone, immediately
• IT projects are aligned with data governance
• Collaboration increases across business and IT
InfoSphere QualityStage
Standardize, cleanse, and deduplicate data, ensuring a complete, accurate view of information
Requirements
• Resolution of data quality issues
• Standardization of data formats
• Cleanse data
• Manage duplicate data
• Enable ongoing quality
Benefits
• Removes duplicates
• Cross-references matching records
• Survives a single, complete record
• Validates and enriches data
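The standardize, match, and survive sequence can be sketched in a few lines of generic Python. This is an illustration only, not QualityStage itself: it matches on an exact key after standardization rather than probabilistically, and the function names, field names, and survivorship policy (longest non-empty value wins) are assumptions.

```python
from collections import defaultdict


def standardize(rec):
    """Normalize formats so equivalent values compare equal."""
    return {
        "name": " ".join(rec["name"].upper().replace(".", "").split()),
        "phone": "".join(ch for ch in rec["phone"] if ch.isdigit()),
        "city": rec["city"].strip().title(),
    }


def survive(group):
    """Build one record by taking, per field, the longest value in the group."""
    fields = group[0].keys()
    return {f: max((r[f] for r in group), key=len) for f in fields}


def deduplicate(records):
    """Group standardized records on (name, phone) and survive one per group."""
    groups = defaultdict(list)
    for rec in map(standardize, records):
        groups[(rec["name"], rec["phone"])].append(rec)
    return [survive(g) for g in groups.values()]


# Made-up sample data: the first two rows describe the same person.
raw = [
    {"name": "Robert  Smith", "phone": "(703) 555-0100", "city": ""},
    {"name": "robert smith.", "phone": "703-555-0100", "city": " arlington "},
    {"name": "Jane Doe", "phone": "202 555 0199", "city": "Washington"},
]

clean = deduplicate(raw)
for rec in clean:
    print(rec)
```

The two Robert Smith rows collapse into one record that keeps the city from the second row, which is the sense in which a single, complete record "survives".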
InfoSphere Metadata Workbench
Support information governance with traceability on data movement, modeling & BI applications
Requirements
• Handle Change Management processes with measured impact
• Visualize and trace information flows across the enterprise landscape
• Access and report on operational and design metadata
Benefits
• Deliver enterprise audit control information
• Mediate system disruptions
• Govern enterprise assets over time
• Ensure effective collaboration with line of business stakeholders
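Tracing information flows and measuring the impact of a change amounts to walking a lineage graph. The sketch below is a generic Python illustration, not the Metadata Workbench API; the asset names and the `LINEAGE` structure are invented for the example.

```python
from collections import deque

# Hypothetical lineage metadata: each asset maps to the assets it feeds.
LINEAGE = {
    "crm.customer": ["etl.load_customer"],
    "etl.load_customer": ["dw.dim_customer"],
    "dw.dim_customer": ["mart.sales", "report.customer_360"],
    "mart.sales": ["report.quarterly_sales"],
}


def downstream(asset, lineage):
    """Breadth-first list of every asset affected by a change to `asset`."""
    seen = {asset}
    queue = deque([asset])
    affected = []
    while queue:
        node = queue.popleft()
        for nxt in lineage.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                affected.append(nxt)
                queue.append(nxt)
    return affected


impact = downstream("dw.dim_customer", LINEAGE)
print(impact)
```

Reversing the edges of the same graph answers the audit question in the other direction: which sources contributed to a given report.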
InfoSphere Data Architect
Model, visualize, and relate diverse and distributed data assets
Requirements
• Design and manage enterprise models
• Enforce model conformance to enterprise standards
• Leverage industry data models for best practices
Benefits
• Speed design activities
• Populate Business Glossary from model terms
• Validate models for enterprise conformance
InfoSphere FastTrack
Capture design specifications and accelerate translation into data integration projects
Requirements
• Capture business requirements for source-to-target mappings
• Leverage source analysis and business vocabulary
• Generate candidate ETL jobs
Benefits
• Accelerate development of integration processes
• Centralized management of specifications
• Audit design decisions over time
Information Governance Core Disciplines: Security and Privacy (Understand & Define, Secure & Protect, Monitor & Audit)
IBM InfoSphere Optim Data Masking Solution
De-identify sensitive information with realistic but fictional data for testing & development purposes
[Example names shown on slide: JASON MICHAELS, ROBERT SMITH]
Requirements
• Protect confidential data used in test, training & development systems
• Implement proven data masking techniques
• Support compliance with privacy regulations
• Solution supports custom & packaged ERP applications
Benefits
• Protect sensitive information from misuse and fraud
• Prevent data breaches and associated fines
• Achieve better data governance
Personally identifiable information is masked with realistic but fictional data for testing & development purposes.
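One common way to replace real values with realistic but fictional ones is deterministic substitution, where a salted hash of the original value selects the replacement. The Python sketch below illustrates that general technique under stated assumptions; it is not how Optim is implemented, and the name lists, salt, and function names are hypothetical.

```python
import hashlib

# Hypothetical pools of fictional replacement values.
FIRST_NAMES = ["ALEX", "JORDAN", "TAYLOR", "MORGAN"]
LAST_NAMES = ["RIVERA", "CHEN", "OKAFOR", "NOVAK"]


def _pick(value, salt, choices):
    """Choose a replacement deterministically from a salted hash of the value."""
    digest = hashlib.sha256((salt + value).encode("utf-8")).digest()
    return choices[int.from_bytes(digest[:4], "big") % len(choices)]


def mask_name(full_name, salt="demo-salt"):
    """Replace a real name with a fictional one; same input gives same output."""
    first = _pick(full_name, salt + ":first", FIRST_NAMES)
    last = _pick(full_name, salt + ":last", LAST_NAMES)
    return f"{first} {last}"


def mask_ssn(ssn, salt="demo-salt"):
    """Replace an SSN with hash-derived digits in the same NNN-NN-NNNN layout."""
    digest = hashlib.sha256((salt + ssn).encode("utf-8")).hexdigest()
    digits = str(int(digest, 16))[-9:]
    return f"{digits[:3]}-{digits[3:5]}-{digits[5:]}"


masked_name = mask_name("JASON MICHAELS")
masked_ssn = mask_ssn("123-45-6789")
print(masked_name, masked_ssn)
```

Because the mapping is deterministic, the same person masks to the same fictional identity in every table, so joins in the test system still work. The salt must be kept secret, and a small replacement pool like this one would produce collisions on real data.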
Information Governance Core Disciplines: Security and Privacy (Understand & Define, Secure & Protect, Monitor & Audit)
Test Data Management
IBM InfoSphere Optim Test Data Management Solution
Create “right-size” production-like environments for application testing
Requirements
• Create referentially intact, “right-sized” test databases
• Automate test result comparisons to identify hidden errors
• Shorten iterative testing cycles and accelerate time to market
Benefits
• Deploy new functionality more quickly and with improved quality
• Easily refresh & maintain test environments
• Reduce storage and operational costs
[Diagram: a 2 TB production database (or production clone) is subset and masked into smaller Development, Unit Test, Training, and Integration Test environments, with sizes shown as 25 GB, 25 GB, 100 GB, and 50 GB.]
InfoSphere Optim TDM supports data on distributed platforms (LUW) and z/OS. Out-of-the-box subset support for packaged applications ERP/CRM solutions as well as: Other
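A "referentially intact" subset means that every child row kept in the test database still points at a parent row that was also kept. The Python sketch below shows the idea on two in-memory tables; it is a generic illustration rather than Optim's extraction logic, and the table layout, column names, and functions are hypothetical.

```python
def subset(customers, orders, keep_customer):
    """Keep selected parent rows, then only the child rows that reference them."""
    kept_customers = [c for c in customers if keep_customer(c)]
    kept_ids = {c["customer_id"] for c in kept_customers}
    kept_orders = [o for o in orders if o["customer_id"] in kept_ids]
    return kept_customers, kept_orders


def orphans(customers, orders):
    """Child rows whose parent is missing; empty for an intact subset."""
    ids = {c["customer_id"] for c in customers}
    return [o for o in orders if o["customer_id"] not in ids]


# Made-up "production" data.
customers = [
    {"customer_id": 1, "region": "EAST"},
    {"customer_id": 2, "region": "WEST"},
    {"customer_id": 3, "region": "EAST"},
    {"customer_id": 4, "region": "WEST"},
]
orders = [
    {"order_id": 10, "customer_id": 1},
    {"order_id": 11, "customer_id": 2},
    {"order_id": 12, "customer_id": 3},
    {"order_id": 13, "customer_id": 3},
    {"order_id": 14, "customer_id": 4},
]

test_customers, test_orders = subset(customers, orders, lambda c: c["region"] == "EAST")
print(len(test_customers), "customers,", len(test_orders), "orders")
print("orphans:", orphans(test_customers, test_orders))
```

Sampling the orders table on its own would be simpler but could leave rows whose customer is absent, which is exactly the broken foreign key the subset is meant to avoid.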
Best Practices Capabilities & Differentiators
• Single data integration platform with multiple components
• Consistent and repeatable methodology for mitigating risks
• Industry-leading Probabilistic Matching Engine for data standardization jobs
• Native Parallel Processing Engine for scalability
• Shared GUI between major components of the platform
• Centralized repository of critical metadata shared across the platform
• Data integration enablement in an SOA environment
IBM Information Server Federal Customers
• Personnel and recruiting analysis
• Procurement system consolidation
• Real-time data management
• Inventory parts analysis
• Agency data migrations
• Authoritative source
• Personnel record consolidation
• System synchronization