270 likes | 294 Views
Learn about data virtualization with IBM Information Server, a technology that allows you to access data from various sources without creating new databases or disrupting existing ones. Federation, a key component of data virtualization, provides a unified view of resources while retaining their autonomy and integrity.
E N D
IBM Information Server Virtualizing Access to Information
Virtualizing Access to Information • What is Data Virtualization? • Virtual Access using Federation • Federation as a Information Service • Summary
What is Data Virtualization? Providing information in the form that applications and users need, while hiding the complexity of the underlying sources. Data virtualization allows information to be accessed through a common interface that centralizes the control data access.
Complexity and Cost • Complex & costly synchronization • Inconsistent and poor quality data • No feedback on quality of service • Impossible to support business transformation • Heterogeneous, distributed data • Inconsistent islands of information underlie applications • Complications from M&A and departmental purchases Order Proc SupplyChain Procure-ment CRM
BusinessAnalysis BI tools MgmtReports But what if… • Accessing data anywhere in your enterprise • No matter where it resides • Regardless of what format it is in • Regardless of vendor • Without creating new databases and without disruptive changes to existing ones… And all your data appeared to be in a single relational database
What is Federation? Federation is an integration pattern that allows a collection of resources to be viewed and manipulated as if they were a single resource while retaining their autonomy and integrity. It is the technology on which EII is based.
EAI ETL Replication Target /DataWarehouse Interpret Transform Route Database DataSource DataSource … Database Application Application Application load capture, apply transform extract Database How does Federation compare to…
The IBM Solution: IBM Information ServerDelivering information you can trust IBM Information Server Unified Deployment Transform Deliver Understand Cleanse Discover, model, and govern information structure and content Standardize, merge, and correct information Combine and restructure information for new uses Synchronize, virtualize and move information for in-line delivery Unified Metadata Management Parallel Processing Rich Connectivity to Applications, Data, and Content
Virtualizing Access to Information • Why Virtual Access • Virtual Access using Federation • Making Federation into a Service • Summary and References
Federation Transparent • Appears to be one source • Independent of how and where data is stored • Applications continue to work despite of any change in how data is stored Heterogeneous • Accesses data from diverse sources • Relational, Structured, XML, messages, Web, … Extensible • Bring together almost any data source. • Wrapper Development Toolkit High Function • Full query support against all data • Capabilities of sources as well Autonomous • Non-disruptive to data sources, existing applications, systems. High Performance • Optimization of distributed queries
Federated Sources SQL SQL Content Federation Server Classic Federation Server for z/OS II Content Edition Mainframe databases Mainframe files Relational databases XML Web services Packaged applications Web Other Collaboration Systems Content & Imaging Workflow systems • IMS • Adabas • CA-Datacom • CA-IDMS • VSAM • Sequential • DB2 • Informix • Oracle • Sybase • Teradata • Microsoft SQL Server • ODBC • WebSphere BI Adaptors • SAP • PeopleSoft • Siebel • OLE DB • Excel • Flat files • Life sciences • Custom-built • DB2 CM • Family • Domino.doc • Documentum • FileNet • Open Text • Stellent • Interwoven • Hummingbird • WebSphere • FileNet • Lotus Notes • Microsoft Index Server • IBM Lotus Extended Search • Sametime • QuickPlace • Microsoft Exchange Plus partner tools and custom-built connectors extend access to more sources
Nicknames look just like tables to the application Federated execution plans chosen by cost-based optimizer Optimizer decides how to distribute query work between II and remote sources. Cost-based pushdown of operations. Query fragments executed remotely sent via the native client library in the source’s own dialect Support for remote SPs Federation Basic Capabilities WebSphere Federation Server Cost-based optimizer Table SP Nickname Nickname Local + Remote Execution Plans Rel. Wrapper NR. Wrapper Client library Client library Non-SQL SP Remote sources
Select ‘1’from customers Where id=123 Select count(*)from customers_NK Where id=123 'Pushdown' of query operations • The federated server decides whether some or all parts of a query can be "pushed-down", i.e. processed at the remote data source(s). Pushdown-ability depends on • availability of needed functionality at remote source with same semantics • server options (example: is collating sequence at Federated server and remote source the same?) • Example: A remote source can handle an equality predicate, but not count(*). Fed Server Pushdown CompensateCOUNT(*) Data source 500 rows of ‘1’ 500
Select Join vs Local Remote Remote Local caching of remote data • Sending remote requests can be expensive. Why not use previously evaluated query result to answer new queries? • Applications remain intact • Materialized query table (MQT): local table defined by the result of a query • Can include single table, joins, aggregations • Can be indexed, replicated in partitioned environment • Optimizer “routes to” them transparently as appropriate • Can include local tables/views, relational and nonrelational nicknames • Cache tables wizard
Virtualizing Access to Information • Why Virtual Access • Virtual Access using Federation • Federation as a Information Service • Summary and References
What if you could… • Package your information so it could be reused for many purposes by many users • Insulate your users from having to know how your information was produced • Make changes to how information created without breaking the applications that use them • Make it accessible to virtually all your users regardless of their location while maintaining control over who is accessing what data
What is Information Service? … service oriented architecture (SOA)? An IT architectural style that supports service orientation … a service? A repeatable business task – e.g., check customer credit; open new account … Information Service? Publishing consistent, reusable services that make it easier for applications, people and processes to access reliable information without knowing how or where it is stored.
WebSphere Information Services Director • Packages information integration logic as services that insulate developers from underlying sources • Allows these services to be invoked as EJB, JMS, or Web services • Provides load balancing & fault tolerance for requests across multiple Information Servers • Provides foundation infrastructure for Information Services
Operational DB2 DB2 DataStage DataStage QualityStage QualityStage Federation Federation Design Repository Repository Repository WebSphere Information Services Director EJB JMS Web Services WISD Server WISD Services Metadata Services Design service Load balancer Administration service Shared Services Logging service Reporting service Security service WISD Agent WISD Agent SOA Handlers SOA Handlers
Portal Business Process Application ... ... Activity 1 retrieve Activity n portlet portlet portlet portlet Data Federation – Traditional Context ... Consumer Data Federation Pattern Interface: SQL, XQuery Provider Data Virtualization ThroughData Federation Server StructuredData Source MainframeData Source unstructured
Portal Business Process ... ... Activity 1 retrieve Activity n ESB portlet portlet portlet portlet Data Federation – SOA Context Application ... Consumer Interface: SCA (SDO), WSDL/SOAP, JMS/XML, RMI, … Data Federation Pattern Information Service Enablement Provider Data Virtualization ThroughData Federation Server StructuredData Source MainframeData Source unstructured
Information Service Director Service-enabling information tasks Portal All the client knows is the operation they want to performs and how to invoke it Request in CurrentInventory SOAP/http Response out • Check authorization • Start monitoring & logging • Match service request to information task to be done Request Arrives Response Returned Complex heterogeneous query • Route to federation server Query result • Execute query against information sources
Request Arrives Response Returned Information Service Director Complex heterogeneous query Query result Same Information Service – Multiple client bindings Business Process Portal Request in CurrentInventory SOAP/http EJB Response out
Information Service Director Multiple Information Services Business Process Portal Request in UpdateInventory CurrentInventory SOAP/http EJB Response out Return Response Request Arrives • User of Information Services shielded from complexity of the information systems • Information system can evolve without impacting users • Single control point for access to information systems Current Inventory Processing Update Inventory Processing Complex heterogeneous query Complex DataStage Job Query result Job result
Value of Federation for Virtual Data Access • Speed time to market for new applications • Simplify and enrich portal development • Reduce hand-coding by half • Reduce skills requirements • Use familiar SQL programming model and existing tools • Build on a standards-based, strategic integration platform • Enhance value and insight from existing assets and applications • Work within your existing infrastructure • Extend existing warehouses • Combine existing data and content assets in new ways • Facilitate cross-divisional reporting • Increase control over IT costs • Reduce need to rip and replace • Reduce need to manage redundant data
IBM Information On Demand 2006 ibm.com/events/InformationOnDemand Why attend: • Participate in the PREMIER discussion on the future of Information Management • Learn how the transformation to Information as a Service will help you unlock business value and drive competitive advantage • Hear how your peers are realizing ROI • Understand the roadmap to long term strategic advantage • Learn best practices in your industry • Receive the best in technical education and free certification • Extensive opportunities for networking with both your peers and industry experts IBM Information On Demand 2006October 15-20, 2006 Anaheim, California • The premier information management event for business and IT executives, managers, professionals, DBA's and developers. • Select from over 800 sessions: a 2 1/2 day business leadership track with 180 sessions and a 5 day technical track with 650 sessions. • Latest strategy and product announcements • Large Expo Center • Hands on labs • One on ones with executives and specialists • Birds of a Feather roundtables