Microsoft. Microsoft SQL Server. 2012 Integration Services. Wee-Hyong Tok. Rakesh Parida Matt Masson. Xiaoning Ding. Kaarthik Sivashanmugam



Similar documents
SQL Server Integration Services. Design Patterns. Andy Leonard. Matt Masson Tim Mitchell. Jessica M. Moss. Michelle Ufford

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

SQL Server Integration Services Design Patterns

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

SSIS Training: Introduction to SQL Server Integration Services Duration: 3 days

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

Implementing a Data Warehouse with Microsoft SQL Server

East Asia Network Sdn Bhd

Implementing a Data Warehouse with Microsoft SQL Server

SQL Server Administrator Introduction - 3 Days Objectives

Implementing a Data Warehouse with Microsoft SQL Server 2012

SQL Server 2008 Administration

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

Implementing a Data Warehouse with Microsoft SQL Server 2012

LearnFromGuru Polish your knowledge

Agenda. SSIS - enterprise ready ETL

Implementing a Data Warehouse with Microsoft SQL Server

Implementing and Maintaining Microsoft SQL Server 2008 Integration Services

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

SQL SERVER DEVELOPER Available Features and Tools New Capabilities SQL Services Product Licensing Product Editions Will teach in class room

MS SQL Server 2014 New Features and Database Administration

Implementing a Data Warehouse with Microsoft SQL Server 2012

W I S E. SQL Server 2012 Database Engine Technical Update WISE LTD.

Beginning SQL Server Administration. Apress. Rob Walters Grant Fritchey

Course Outline. Module 1: Introduction to Data Warehousing

AV-005: Administering and Implementing a Data Warehouse with SQL Server 2014

Implementing a Data Warehouse with Microsoft SQL Server 2012

Administering a SQL Database Infrastructure

Administering a SQL Database Infrastructure 20764; 5 Days; Instructor-led

Implementing a Data Warehouse with Microsoft SQL Server 2012

Correct Answer: J Explanation. Explanation/Reference: According to these references, this answer looks correct.

Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)

MODULE FRAMEWORK : Dip: Information Technology Network Integration Specialist (ITNIS) (Articulate to Edexcel: Adv. Dip Network Information Specialist)

Implementing and Administering an Enterprise SharePoint Environment

6231A - Maintaining a Microsoft SQL Server 2008 Database

Microsoft SQL Server 2012 Administration

$99.95 per user. SQL Server 2005 Database Administration CourseId: 152 Skill level: Run Time: 30+ hours (158 videos)

Implementing a Data Warehouse with Microsoft SQL Server 2014

Designing, Optimizing and Maintaining a Database Administrative Solution for Microsoft SQL Server 2008

Beta: Implementing a Data Warehouse with Microsoft SQL Server 2012

Before attending this course, participants should have:

Would-be system and database administrators. PREREQUISITES: At least 6 months experience with a Windows operating system.

PassTest. Bessere Qualität, bessere Dienstleistungen!

Enterprise and Standard Feature Compare

MS Design, Optimize and Maintain Database for Microsoft SQL Server 2008

Explain how to prepare the hardware and other resources necessary to install SQL Server. Install SQL Server. Manage and configure SQL Server.

Pro SQL Server Reporting Services. Third Edition. mm m. Brian McDonald. Shawn McGehee. Rodney Landrum. Apress*

SQL SERVER TRAINING CURRICULUM

Administering Team Foundation Server 2013

Course 20463:Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Microsoft SQL Server 2005 Database

Microsoft SQL Server 2008 Administrator's Pocket Consultant

Microsoft SQL Server Beginner course content (3-day)

MCTS Microsoft SQL Server 2005 Implementation & Maintenance

6231B: Maintaining a Microsoft SQL Server 2008 R2 Database

Maintaining a Microsoft SQL Server 2008 Database

About the Authors About the Technical Reviewers Acknowledgments

MOC 20462C: Administering Microsoft SQL Server Databases

Microsoft SQL Business Intelligence Boot Camp

Course 20462C: Administering Microsoft SQL Server Databases

Administering a SQL Database Infrastructure (MS )

For Sales Kathy Hall

MOC Administering Microsoft SQL Server 2014 Databases

Administering Microsoft SQL Server Databases

SQL Server 2012 Business Intelligence Boot Camp

Administering Microsoft SQL Server Databases

SQL Server 2008 Designing, Optimizing, and Maintaining a Database Session 1

Microsoft SQL Database Administrator Certification

SQL Server Training Course Content

NUTECH COMPUTER TRAINING INSTITUTE 1682 E. GUDE DRIVE #102, ROCKVILLE, MD WEB: TEL:

Database Administrator Certificate Capstone Project Evaluation Checklist

Server 2008 SQL. Administration in Action ROD COLLEDGE MANNING. Greenwich. (74 w. long.)

Administering Microsoft SQL Server Databases

50238: Introduction to SQL Server 2008 Administration

Administering Microsoft SQL Server Databases

Course. Overview. Length: 5 Day(s) Published: English. IT Professionals. Level: Type: Method: Delivery. Enroll now (CAL)

AppFabric. Pro Windows Server. Stephen Kaufman. Danny Garber. Apress. INFORMATIONSBIBLIOTHbK TECHNISCHE. U N! V En SIT AT S R!

(Exam ): Configuring

Microsoft SQL Server 2012 Administration. Real-World Skills for MCSA Certification and Beyond (Exams , , and )

Introduction. Part I: Finding Bottlenecks when Something s Wrong. Chapter 1: Performance Tuning 3

Cass Walker TLG Learning

$99.95 per user. SQL Server 2008/R2 Database Administration CourseId: 157 Skill level: Run Time: 47+ hours (272 videos)

Implementing a SQL Data Warehouse 2016

Microsoft SQL Server for Oracle DBAs Course 40045; 4 Days, Instructor-led

MICROSOFT EXAM QUESTIONS & ANSWERS

SQL Server for Database Administrators Course Syllabus

Course Syllabus. At Course Completion

Data Integration and ETL with Oracle Warehouse Builder: Part 1

System Administration of Windchill 10.2

Microsoft Administering a SQL Database Infrastructure

MOC 20467B: Designing Business Intelligence Solutions with Microsoft SQL Server 2012

10775 Administering Microsoft SQL Server Databases

W I S E. SQL Server 2008/2008 R2 Advanced DBA Performance & WISE LTD.

Administering Microsoft SQL Server Databases MOC 20462

Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Course Outline: Course 6317: Upgrading Your SQL Server 2000 Database Administration (DBA) Skills to SQL Server 2008 DBA Skills

Transcription:

Microsoft Microsoft SQL Server 2012 Integration Services Wee-Hyong Tok Rakesh Parida Matt Masson Xiaoning Ding Kaarthik Sivashanmugam

Contents Foreword Introduction xxi xxiii PART I OVERVIEW Chapter 1 SSIS Overview 3 Common Usage Scenarios for SSIS 4 Consolidation of Data from Heterogeneous Data Sources 4 Movement of Data Between Systems 9 Loading a Data Warehouse 12 Cleaning, Formatting, or Standardization of Data 16 Identification, Capture, and Processing of Data Changes 17 Coordination of Data Maintenance, Processing, or Analysis 18 Evolution of SSIS 20 Setting Up SSIS 21 SQL Server Features Needed for Data Integration 22 SQL Server Editions and Integration Services Features 24 Summary 25 Chapter 2 Understanding SSIS Concepts 27 Control Flow 28 Tasks 28 Precedence Constraints 30 Variables and Expressions 31 Containers 32 Connection Managers 35 What do you think of this book? We want to hear from you! Microsoft is interested in hearing your feedback so we can continually improve our books and learning resources for you. To participate in a brief online survey, please visit: microsoft.com/learning/booksurvey

Packages and Projects 36 Parameters 37 Log Providers 38 Event Handlers 40 Data Flow Source Adapters Destination Adapters Transforms SSIS Catalog Overview 41 41 42 43 44 45 Catalog 46 Folders 46 Environments 46 References 47 Summary 47 Chapter 3 Upgrading to SSIS 2012 49 What's New in SSIS 2012 49 Upgrade Considerations and Planning 50 Feature Changes in SSIS 50 Dependencies and Tools 52 Upgrade Requirements 52 Upgrade Scenarios 53 Unsupported Upgrade Scenarios 54 Upgrade Validation 55 Integration Services Upgrade Upgrade Advisor 55 Performing Upgrade Addressing Upgrade Issues and Manual Upgrade Steps 69 Conversion to Projects after Upgrade 55 61 71 Summary 79 viii Contents

PART II DEVELOPMENT Chapter 4 New SSIS Designer Features 83 The Integration Services Designer 83 Visual Studio 83 Undo and Redo 84 Getting Started Window 85 Toolbox 85 Variables Window 87 Zoom Control 88 Autosave and Recovery 89 Status Icons 89 Annotations 90 Configuration and Deployment 90 Solution Explorer Changes 90 Parameter Tab 92 Visual Studio Configurations 92 Project Compilation 93 Deployment Wizard 94 Project Conversion Wizard 95 Import Project Wizard 96 New Tasks and Data Flow Components 96 Change Data Capture 96 Expression Task 99 DQS Cleansing Transform 100 ODBC Source and Destination 100 Control Flow 100 Expression Adorners 100 Connection Managers 101 Execute SQL Task 101 ix

Chapter 8 Working with Change Data Capture in SSIS 2012 195 CDC in SQLServer 195 Using CDC in SQL Server 196 CDC Scenarios in ETLs 197 Stages in CDC 198 CDC in SSIS 2012 202 CDC State 202 CDC Control Task 205 Data Flow Component: CDC Source 211 CDC Splitter Component 215 CDC for Oracle 217 Introduction 217 Components for Creating CDC for Oracle 219 CDC Service Configuration MMC 219 Oracle CDC Designer MMC 221 MSXDBCDC Database 233 Oracle CDC Service Executable (xdbcdcsvc.exe) 235 Data Type Handling 238 SSIS CDC Components 240 Summary 240 Chapter 9 Data Cleansing Using SSIS 241 Data Profiling Task 241 Fuzzy Lookup Transformation 246 Fuzzy Grouping Transformation 251 Data Quality Services Cleansing Transform 254 Summary 261 xii Contents

PART III DATABASE ADMIN Chapter 10 Configuration in SSIS 265 Configuration Basics 266 How Configurations Are Applied 266 What to Configure 266 Changes in SSIS 2012 267 Configuration in SSIS 2012 267 Parameters 268 Creating Package Parameters 268 Creating Project Parameters 271 API for Creating Parameters 273 Using Parameters 274 Configuring Parameters on the SSIS Catalog 281 Configuring, Validating, and Executing Packages and Projects...281 Configuration Through SSMS 281 Configuration Using SQL Agent, DTExec, and T-SQL 286 SSIS Environments 287 Evaluation Order of Parameters 291 Package Deployment Model and Backward Compatibility 291 Package Deployment Model 292 Best Practices for Configuring SSIS 295 Best Practices with Package Deployment Model 295 Best Practices with Project Deployment Model 298 Summary 300 Chapter 11 Running SSIS Packages 301 Ways to Run SSIS Packages 301 Package Locations 303 Configuring Packages 307 Error Dumps 308 Logging Options 309

Running Packages in the SSIS Catalog 311 Prepare Executions 312 Starting SSIS Package Executions 316 View Executions 319 Executions with T-SQL 320 Running Packages from SQL Agent 321 Create an SSIS Job Step 322 Execute Packages from the SSIS Catalog 323 Running Packages via PowerShell 325 Creating and Running SSIS Packages Programmatically 326 Summary 331 Chapter 12 SSIS T-SQL Magic 333 Overview of SSIS Stored Procedures and Views 333 Integration Services Catalog 334 SSIS Catalog Properties 334 Querying the SSIS Catalog Properties 335 Setting SSIS Catalog Properties 335 SSIS Projects and Packages 336 Deploy an SSIS Project to the SSIS Catalog 336 Learning About the SSIS Projects Deployed to the SSIS Catalog 337 Configuring SSIS Projects 338 Managing SSIS Projects in the SSIS Catalog 341 Running SSIS Packages in the SSIS Catalog 343 SSIS Environments 347 Creating SSIS Environments 348 Creating SSIS Environment Variables 348 Configuring SSIS Projects Using Configuring SSIS Projects Using SSIS Environments 349 Reference Values 350 Package Execution Using SSIS Environments 351 Managing SSIS Environment and Environment Variables 351 Summary 353 xiv Contents

Chapter 13 SSIS PowerShell Magic 355 PowerShell Refresher 355 PowerShell and SQL Server 356 Managing SSIS with PowerShell 359 SSIS Management Object Model 359 PowerShell with SSIS Management Object Model 360 PowerShell and SSIS Using T-SQL 364 Advantages of Using PowerShell with SSIS 366 Summary 366 Chapter 14 SSIS Reports 367 Getting Started with SSIS Reports 367 Data Preparation 369 Monitoring SSIS Package Execution 370 Integration Services Dashboard 370 All Executions Report 372 All Validations and All Operations Reports 373 Using SSIS Reports to Troubleshoot SSIS Package Execution 375 Using the Execution Performance Report to Identify Performance Trends 380 Summary 383 PART IV DEEP-DIVE Chapter 15 SSIS Engine Deep Dive 387 The Control Flow Engine 387 Overview 387 Load 388 Apply Parameters 390 What do you think of this book? We want to hear from you! Microsoft is interested in hearing your feedback so we can continually improve our books and learning resources for you. To participate in a brief online survey, please visit: microsoft.com/learning/booksurvey

Validate 390 Execute 392 The Data Flow Engine 399 Overview 400 Execution Control 403 Backpressure 410 Engine Tuning 413 Summary 416 Chapter 16 SSIS Catalog Deep Dive 417 SSIS Catalog Deep Dive 417 Creating the SSIS Catalog 417 Unit of Deployment to the SSIS Catalog 419 What Is Inside SSISDB? 420 SQL Server Instance Starts Up 422 SSIS Catalog and Logging Levels 424 Understanding the SSIS Package Execution Life Cycle 425 Stopping SSIS Package Executions 428 Using the Windows Application Event Log 428 SSIS Catalog Maintenance and SQL Server Agent Jobs 429 Backup and Restore of the SSIS Catalog 432 Back Up SSISDB 433 Restore SSISDB 434 Summary 436 Chapter 17 SSIS Security 437 Protect Your Package 437 Control Package Access 437 Package Encryption 441 Sensitive Variables and Parameters 443 Package Signing 444 xvi Contents

Security in the SSIS Catalog 445 Security Overview 446 Manage Permissions 448 DDL Trigger 455 Running SSIS with SQL Agent 456 Requirements 456 Create Credentials 456 Create Proxy Accounts 458 Create SQL Agent Jobs 461 Summary 463 Chapter 18 Understanding SSIS Logging 465 Configure Logging Options 465 Choose Containers 466 Select Events 468 Add Log Providers 470 Log Providers 473 Text Files 473 SQLServer 473 SQL Server Profiler 474 Windows Event Log 474 XML Files 475 Logging in the SSIS Catalog 476 Logging Levels 476 Event Logs 478 Event Context Information 479 Advanced Logging Topics 480 Customizing Logging Fields 480 Logging with dtexec Utility 481 Developing Custom Log Providers 481

Chapter 19 Automating SSIS 485 Introduction to SSIS Automation 485 Programmatic Generation of SSIS Packages 485 Metadata-Driven Package Execution 486 Dynamic Package Generation 487 Handling Design-Time Events 488 Samples 490 Metadata-Based Execution 499 Custom Package Runner 500 Using PowerShell with the SSIS Management Object Model 504 Using PowerShell with SQL Agent 507 Alternative Solutions and Samples 510 Samples on Codeplex 510 Third-Party Solutions 511 Summary 515 PART V TROUBLESHOOTING Chapter 20 Troubleshooting SSIS Package Failures 519 Getting Started with Troubleshooting 519 Data Preparation 521 Troubleshooting Failures of SSIS Package Executions 522 Three Key Steps Toward Troubleshooting Failures of SSIS Package Executions 524 Execution Path 528 Finding the Root Cause of Failure 528 Troubleshooting the Execute Package Task and Child Package Executions 531 DiagnosticEx Events 533 Execute Package Task and Execution Path 534 Troubleshooting SSIS Package Execution Failures Scheduled with SQL Agent 536 xviii Contents

Using Callerlnfo to Determine SSIS Package Executions That Are Executed by SQL Agent 539 Using SQL Agent History Tables to Determine the SSIS Job Steps That Failed 539 Summary 540 Chapter 21 SSIS Performance Best Practices 541 Creating a Performance Strategy 542 OVAL Technique 542 Measuring SSIS Performance 544 Measuring System Performance 544 Measuring Performance of Data Flow Tasks 548 Designing for Performance 554 Parallelize Your Design 554 Using SQL Server Optimization Techniques 558 Bulk Loading Your Data 560 Keeping SSIS Operations in Memory 563 Optimizing SSIS Lookup Caching 564 Optimizing SSIS Infrastructure 568 Summary 570 Chapter 22 Troubleshooting SSIS Performance Issues 571 Performance Profiling 571 Troubleshooting Performance Issues 572 Data Preparation 573 Understanding SSIS Package Execution Performance 574 SSIS Package Execution Duration 574 Time Spent at Each Task in the SSIS Package 575 Time Spent at Each Phase of the Data Flow Component 575 Elapsed Time for Data Flow Component Phases (Active Time vs. Total Time) 576 Monitoring SSIS Package Execution Performance 578 xix

Per-Execution Performance Counters 580 Interactive Analysis of Performance Data 581 Summary 590 Chapter 23 Troubleshooting Data Issues 591 Troubleshooting in the Design Environment 591 Row Count Values 591 Data Viewers 592 Data in Error Output 594 Breakpoints and Debug Windows 595 Troubleshooting in the Execution Environment 595 Execution Data Statistics 595 Data Tap 598 Error Dumps 602 Summary 605 Index 607 About the Authors 639 What do you think of this book? We want to hear from you! Microsoft is interested in hearing your feedback so we can continually improve our books and learning resources for you. To participate in a brief online survey, please visit: microsoft.com/learning/booksurvey xx Contents