news from Tom Bacon about Monday's lecture



Similar documents
Oracle. Brief Course Content This course can be done in modular form as per the detail below. ORA-1 Oracle Database 10g: SQL 4 Weeks 4000/-

Oracle Database: SQL and PL/SQL Fundamentals NEW

Oracle Database 11 g Performance Tuning. Recipes. Sam R. Alapati Darl Kuhn Bill Padfield. Apress*

ORACLE DATABASE 11G: COMPLETE

DBMS Questions. 3.) For which two constraints are indexes created when the constraint is added?

Microsoft SQL Database Administrator Certification

MOC 20461C: Querying Microsoft SQL Server. Course Overview

Oracle Database 11g SQL

David Dye. Extract, Transform, Load

UNIVERSITY AUTHORISED EDUCATION PARTNER (WDP)

Oracle Database 12c: Admin, Install and Upgrade Accelerated

SharePlex for SQL Server

D12C-AIU Oracle Database 12c: Admin, Install and Upgrade Accelerated NEW

LearnFromGuru Polish your knowledge

Database Extension 1.5 ez Publish Extension Manual

Oracle Database 10g Express

Oracle Database 10g: Introduction to SQL

Oracle Architecture, Concepts & Facilities

Optimizing the Performance of the Oracle BI Applications using Oracle Datawarehousing Features and Oracle DAC

Oracle Database 12c: Administration Workshop NEW

Oracle Database 12c: Introduction to SQL Ed 1.1

AV-005: Administering and Implementing a Data Warehouse with SQL Server 2014

OLTP Meets Bigdata, Challenges, Options, and Future Saibabu Devabhaktuni

Oracle 11g Database Administration

Oracle Database 12c: Administration Workshop NEW. Duration: 5 Days. What you will learn

Restore and Recovery Tasks. Copyright 2009, Oracle. All rights reserved.

Duration Vendor Audience 5 Days Oracle End Users, Developers, Technical Consultants and Support Staff

AV-004: Administering and Programming with ORACLE

IBM Campaign and IBM Silverpop Engage Version 1 Release 2 August 31, Integration Guide IBM

SQL Databases Course. by Applied Technology Research Center. This course provides training for MySQL, Oracle, SQL Server and PostgreSQL databases.

4. The Third Stage In Designing A Database Is When We Analyze Our Tables More Closely And Create A Between Tables

Percona Server features for OpenStack and Trove Ops

SQL Server Developer Training Program. Topics Covered

OLH: Oracle Loader for Hadoop OSCH: Oracle SQL Connector for Hadoop Distributed File System (HDFS)

IBM Campaign Version-independent Integration with IBM Engage Version 1 Release 3 April 8, Integration Guide IBM

Central Security Server

General DBA Best Practices

Table of Contents Chapter 1 - Getting Started with Oracle Data Relationship Management (DRM) 1

ORACLE DATABASE: ADMINISTRATION WORKSHOP I

COURCE TITLE DURATION. Oracle Database 11g: Administration Workshop I

Instant SQL Programming

Implementing and Maintaining Microsoft SQL Server 2008 Integration Services

Setting Up Your Team-SQL Database for ORACLE 8.05

Oracle Database: SQL and PL/SQL Fundamentals NEW

StreamServe Persuasion SP5 Oracle Database

Oracle Database: SQL and PL/SQL Fundamentals

Oracle Total Recall with Oracle Database 11g Release 2

1. INTRODUCTION TO RDBMS

Oracle 10g PL/SQL Training

Oracle Data Integration Solutions GoldenGate New Features Summary

Oracle SQL. Course Summary. Duration. Objectives

5. CHANGING STRUCTURE AND DATA

Real-time Data Replication

Concepts of Database Management Seventh Edition. Chapter 7 DBMS Functions

ETL Overview. Extract, Transform, Load (ETL) Refreshment Workflow. The ETL Process. General ETL issues. MS Integration Services

SQL Server for developers. murach's TRAINING & REFERENCE. Bryan Syverson. Mike Murach & Associates, Inc. Joel Murach

Oracle EXAM - 1Z Oracle Database 11g Release 2: SQL Tuning. Buy Full Product.

How To Load Data Into An Org Database Cloud Service - Multitenant Edition

RMAN What is Rman Why use Rman Understanding The Rman Architecture Taking Backup in Non archive Backup Mode Taking Backup in archive Mode

Designing Database Solutions for Microsoft SQL Server 2012 MOC 20465

Database security tutorial. Part I

Data Integration and ETL with Oracle Warehouse Builder: Part 1

How to Implement Multi-way Active/Active Replication SIMPLY

1Z0-117 Oracle Database 11g Release 2: SQL Tuning. Oracle

Optimizing Your Data Warehouse Design for Superior Performance

Karl Lum Partner, LabKey Software Evolution of Connectivity in LabKey Server

Chapter 6: Physical Database Design and Performance. Database Development Process. Physical Design Process. Physical Database Design

Expert Oracle. Database Architecture. Techniques and Solutions. 10gr, and 11g Programming. Oracle Database 9/, Second Edition.

High-Volume Data Warehousing in Centerprise. Product Datasheet

Automatic Data Optimization

Implementing a Data Warehouse with Microsoft SQL Server 2012

Topics Advanced PL/SQL, Integration with PROIV SuperLayer and use within Glovia

Designing Database Solutions for Microsoft SQL Server 2012

<Insert Picture Here> Oracle Database Security Overview

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Solving Business Pains with SQL Server Integration Services. SQL Server 2005 / 2008

mylittleadmin for MS SQL Server Quick Start Guide

MySQL Security for Security Audits

DBMS / Business Intelligence, SQL Server

Oracle Database 12c: New Features for Administrators

Objectif. Participant. Prérequis. Pédagogie. Oracle Database 11g - Administration Workshop I Release 2. 5 Jours [35 Heures]

How To Improve Performance In A Database

Oracle Backup and Recover 101. Osborne Press ISBN

<Insert Picture Here> MySQL Security In A Cloudy World

Oracle Database: Introduction to SQL

<Insert Picture Here> Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region

Evaluation Checklist Data Warehouse Automation

Informatica Data Replication FAQs

Oracle Database: SQL and PL/SQL Fundamentals

Change Color for Export from Light Green to Orange when it Completes with Errors (31297)

MS SQL Performance (Tuning) Best Practices:

Oracle Database: Introduction to SQL

W I S E. SQL Server 2008/2008 R2 Advanced DBA Performance & WISE LTD.

SQL Server 2008 Administration

Oracle Database: Introduction to SQL

Physical Design. Meeting the needs of the users is the gold standard against which we measure our success in creating a database.

Transcription:

ECRIC news from Tom Bacon about Monday's lecture I won't be at the lecture on Monday due to the work swamp. The plan is still to try and get into the data centre in two weeks time and do the next migration, go live the following week. I edited down my long list of notes to a few short technical points and will email them in case any are worth mentioning.

ECRIC notes from Tom Bacon for Monday's lecture 1) Approach to data migration. Manual script based for import with no ETL tools used. Extract, transform and load A schema per migration separate from the main destination schema with the following: A load table per file to hold the data received and uploaded via Oracle SQL*Loader. Data types matching main schema tables where appropriate, e.g. some fields concatenated into destination rawsource fields. Text files exported from source system with unit and record separator characters to handle multi-line records. Tighter column typing will pick up field issues at the initial file load stage.

Primary key mappings table to link load table primary key values to sequence generated primary key values that will be used in the main tables. Populate in bulk in advance to allocate a batch of main table primary key values to the migration. Views to map load tables to main schema lookup tables and join through the primary key mapping table to include main table primary IDs. '<migration name>_' columns for load table columns and remaining columns corresponding to main schema. Used to drive inserts into main tables. Multiple views on some load tables and union views to wrap up sub-classes for a single destination table insert. Insert DML run to populate main tables from views. Defensive on primary IDs to prevent attempted duplicate inserts and limited by primary ID range where multiple migration batches are run and main table records from earlier batches may have been deleted. Extra scripts to update main schema lookup tables from source databases where required.

2) The DML auditing approach needed to be speeded up for the bulk loading of data during migration. 3) Using Oracle Transparent Data Encryption to encrypt tablespaces rather than disk encryption. 4) Using Oracle Data Guard for replication (redo log shipping). 5) Need to watch out for character set translation issues when moving and uploading files.

6) Foreign keys should be indexed on child table columns unless the parent table records are not going to be updated or deleted, e.g. a lookup table where records are only ever inserted. Otherwise full scanning the child tables may make parent table updates and deletes very slow. 7) A correlated sub-query based update may not be the best approach. Running several updates to specific values each driven from a single sub-query may be suitable when setting a small number of values such as flag fields and may be much faster with large data volumes. 8) Ruby on Rails Active Record not using bind variables in SQL so limiting statement reuse. Can be an issue with batch processing if there are multiple sessions. Different Oracle settings may help.

9) A CLOB field was moved into a child table to reduce unnecessary CLOB processing by Active Record. Character Large OBject 10) Don't forget about nulls and three-valued logic. If SQL query or DML execute counts aren't what you expect that might be the reason. 11) On Oracle 11g the default profile has a password life time of 180 days so user passwords will expire after this time unless the profile is updated. The default password lock time of 1 day is also better set to unlimited so locked out accounts have to be manually unlocked. 12) Deadlocks seen during de-duplication due to use of a view by Rails which included a few lookup table values. This led to (subexclusive) lookup table row locking in addition to the (exclusive) main table locking so that locking conflicts on the lookup tables were more likely.

13) It is possible to get invalid dates in Oracle date datatype fields from binary inserts (e.g. from Access linked-tables) or to_date function bugs which then cause errors when queried. 14) Schema integration complexity is coming from the changes in ICD classifications over time. 15) Don't forget the rownum pseudocolumn can be used in Oracle update DML to assign unique values to each row, e.g. update tab1 set col1 = rownum ; 16) The size of the DML audit table needs watching. The rest of the main schema will grow during the migrations but the audit table is growing much more quickly. Archiving off and/or partitioning may not be suitable. Useful compression requires the 11g Advanced Compression licensed option. The table is not required in the analysis snapshots.

17) Analysis snapshots will be taken onto separate server(s), e.g. one daily, a few monthly and one yearly. Analysis users need scratch areas in their own schemas, and a common scratch area with quotas on suitable tablespaces. The approach is being discussed.