E190 Heavily Distributed Database Architecture for OLTP Systems

E190 Heavily Distributed Database Architecture for OLTP Systems • Michael Knoll • Manager IT - Federal Express Corporation • Revenue Systems Engineering • mknoll@fedex.com

Agenda How to combine data distribution concepts and strategies with Adaptive Server features to implement very large, scalable OLTP systems. • Why Distribute? • The Nature of Distribution • Products and Features Involved • Core Concepts and Strategies

Why Distribute? Large Database • Need to reduce size for manageability and performance Performance • Eliminate bottlenecks by utilizing more resources Scalability • Need to be able to scale horizontally to meet new requirements and business growth

Why Distribute? Fault Tolerance • Protect from single points of failure (Geography) • Desire to operate in multiple locations or datacenters (Cost) • Take advantage of more, smaller, cheaper hardware (Administration) • More, smaller databases

What kind of distribution are we talking about? Multi-Server • Many SMP machines operating independently Multi-Database • Many databases managed independently Variable Data Distribution • Horizontal, vertical, and replicated

What kind of distribution are we talking about? Batch Applications • “parallel”, partitioned by a key On-line Services • Small distributed transactions routed by a key Locator Services • Low performance data location by brute force

Database Architecture Diagram

Products Involved ASE 12.5 • Important features exist to make it all possible Replication Server 11.5 • Still as reliable as ever Open Client 11.X - 12.X • Custom C/C++ program shells incorporating distribution services

Products Involved BEA Tuxedo • Highly configurable, routed application services MQ Series • Persistent transaction messaging Tibco • Database Adapters for event driven publishing

Key ASE Features • Remote Procedure Calls • Execute Immediate • Distributed Transaction Management • Component Integration Services (CIS, a.k.a. Omni) • Union in View • Data Only Locking • Many Others...

Core Concepts & Strategies • Data Distribution • Transaction Routing • Dynamic Built RPC’s • Multi-database, Multi-server Transaction Coordination • Connection Management • Distributed Access to Central Tables • Physical Table Partitioning • Data Location & “Look Through Databases”

Data Distribution Horizontal Distribution The primary method for spreading the data that makes up the vast majority of the system. • Same schema • Different data • Data is partitioned by a consistent key • Transaction routing is essential

Data Distribution Vertical Distribution The primary method for segregating data that serves different purposes within the system. • Different schema • Different data • Data is not partitioned by a key • Data is placed in separate databases serving different purposes • Connection management is the routing method

Data Distribution Propagation Special purpose data that is required in totality by distributed applications. • Same schema • Same data • Data is replicated from a single master to N replicates • Latency must be tolerable

Transaction/Application Routing On-line Transactions • Data dependant routed to Tuxedo application servers • Routing is a lookup on an xref table • Routing assignment is deterministic Batch Applications • Inputs are pre-split by routing key • Control of flow is managed by 3rd party scheduling product

Transaction/Application Routing Database Targets • Every database in the system is assigned a unique target • We chose to identify them at 3 levels • Site - more specifically, datacenter • Server - multiple servers per datacenter • Database - multiple databases per dataserver Example: 010203 = Site 01, Server 02, Database 03

Transaction/Application Routing Database Target Lookup Routing Id resolves database target

Transaction/Application Routing Hostname lookup Database target resolves SQL Server/Database combination

Dynamic Built RPC’s • Secondary routing • Used when unit-of-work spans servers/databases • Make no assumptions of data location • Use of execute immediate • Use of Distributed Transaction Management (DTM) feature • Typically only your “on-line transactions”

Dynamic Built RPC’s Multi-server transaction without rpc

Dynamic Built RPC’s Multi-server transaction with rpc

Dynamic Built RPC’s Coding example (utility function): CREATE PROCEDURE proc_sel_rpc_info @db_target varchar(6), @rpc_string varchar(60) OUTPUT AS SELECT @rpc_string = rtrim(sql_svr_nm)+ "." + rtrim(phys_db_nm) FROM db_nm_info WHERE db_nm = @db_target

Dynamic Built RPC’s Coding example (exec immediate): CREATE PROC proc_InsShpmtB @db_target varchar(6), @shpmt_id char(17), @ab_nbr char(12), @parm4 char(9), @parm5 char(2), AS DECLARE @rpc_string varchar(60) , @rpc_proc varchar(100), @return_code int exec proc_sel_rpc_info @db_target, @rpc_string OUTPUT select @rpc_proc=@rpc_string + "..proc_InsShpmtA" exec @return_code=@rpc_proc @shpmt_id, @ab_nbr, @parm4, @parm5 return @return_code

Connection Management Methods for database access • All applications, batch and on-line have a single database context for insert/update/delete • Secondary read-only connections (no pooling) • Routed Tuxedo services (provides pooling) • Secondary update connections (no UOW consistency) • Remote procedure calls (DTM w/UOW consistency) • Proxy tables and/or rpc’s (CIS w/UOW consistency)

Distributed Access to Central Tables N:1 Configuration • Proxy tables for the single, centralized base table created at each desired location • Useful for aggregating distributed data • Provides proper isolation as if the table was local • Avoids multiple job steps - makes for easy coding • Can be very contentious

Distributed Access to Central Tables

Physical Table Partitioning • See Russ Wheaton’s E121 afternoon session • Method for managing very large tables often containing transient data • Promotes truncating vs. deleting to purge data • Made to look like a single table to the application • Inserts/Updates are directed to appropriate “partition” • Selects can be directed when the partition key is known. Alternatively a union view can be employed

Data Location • Combines union views and proxy tables or proxy rpc’s • Used in the rare case when the routing key for a transaction is not known • Union views are serial in execution and read only • Where high performance not required, retrieve the data via the union view by way of proxy tables • For performance, only return the location followed by a dynamic generated rpc

Data Location Proxy Tables (1 for each of 18 target databases) create existing table shipment_010101 ( shpmt_id char(17), col1 int, col2 int, … col20 varchar(35) ) external table at “SYB_D1.syb_d1_a1.dbo.shipment” create existing table shipment_010203 ( shpmt_id char(17), col1 int, col2 int, … col20 varchar(35) ) external table at “SYB_D2.syb_d2_a3.dbo.shipment”

Data Location Union view over proxy tables create view shipment_locator as select “010101” “target”, shpmt_id, col1, col2, … , col20 from shipment_010101 union all select “010102 ” “target”, shpmt_id, col1, col2, … , col20 from shipment_010102 union all select “010103 ” “target”, shpmt_id, col1, col2, … , col20 from shipment_010103 union all . . select “010603 ” “target”, shpmt_id, col1, col2, … , col20 from shipment_010603 go

Data Location Proxy rpc’s (1 for each of 18 target databases) create existing table shipment_proc_010101 ( does_exist smallint, _shpmt_id char(17) null ) external procedure at “SYB_D1.syb_d1_a1.dbo.shipment_proc” Procedure at each of 18 target databases create procedure shipment_proc @shpmt_id char(17) as select 1 from shipment where shpmt_id = @shpmt_id

Data Location Union view over proxy rpc’s create view shipment_locator as select “010101” “target”, does_exist, _shpmt_id from shipment_proc_010101 union all select “010102” “target”, does_exist, _shpmt_id from shipment_proc_010102 union all select “010103”, “target”, does_exist, _shpmt_id from shipment_proc_010103 union all . . select “010603”, “target”, does_exist, _shpmt_id from shipment_proc_010603 go

Look Through Database • Contains no data • Contains proxy tables, rpc’s and union views • NOT a high performer • Used when ... • searching for data without routing key • performing queries combining results from multiple locations • single point of access is desired

E190 Heavily Distributed Database Architecture for OLTP Systems