190 likes | 203 Views
Explore loading SSAS partitions directly, using SSIS as a data source, data mining, and performance tuning in SQL Server 2005 Integration Services. Attendees will gain insights on optimization strategies and practical demos.
E N D
Advanced Topics Using Microsoft SQL Server 2005 Integration Services Allan Mitchell – SQLBits – Oct 2007
Introduction • Loading SSAS partitions directly • Using an SSIS Package as a data source • Data Mining in your Packages • Understanding Performance • Q and A
Who am I? • SQL Server MVP • Co author on Wrox book on SSIS • www.SQLDTS.com, www.SQLIS.com • Consultancy company – Konesans (www.konesans.com)
What I will not cover • Finding SSIS • Creating packages • General package design
Loading SSAS Directly • Dimension And Partition Processing • Do not need to load the underlying DB • Quicker ROI • Don’t fully process the Partition/Dimension outside of the ETL process !
Using SSIS as a data source • SSRS • Complex logic (Microsoft.SqlServer.Dts.DtsClient) • Windows App • Populate a web page perhaps • Custom Component • Take the output of one package and use it in another
Data Mining in SSIS • Transforms • Data Mining Query • Insurance Company perhaps • Term Extraction • Textual Extraction (DT_WSTR or DT_NTEXT) • Term lookup • Textual Lookup (DT_WSTR or DT_NTEXT) • Destinations • Data Mining Model training
Performance Tuning • Data Correlation • Use a merge Join over lookup – why? • Remember sorted inputs – Do on source preferably. • Avoid delay between 2 sources hitting.
Performance Tuning • Buffer Strategy • DefaultMaxBufferRows (10000) • DefaultMaxBufferSize (10 MB) • MaxBufferSize and MinBufferSize (100MB and 64KB) • Avoid many small buffers. • Turn on logging of BufferSizeTuning event
The Raw File • Only SSIS can read and write • Only Adapter that does not use a Connection Manager • High Speed • Source and Destination • No datatype conversions required • Eases back pressure
Demo • The Raw File and How To Read
Access Modes and OLEDB Source Adapter • Do not use Table or View as Access Mode • Use SQL Command instead • Openrowset (Overhead + bad exec plan) • Sp_prepare (much better) • (MS say 17* improvement)
Questions? Email: allan.mitchell@konesans.com