Partitioning: Tips and Tricks. Arup Nanda Longtime Oracle DBA

Similar documents

Partitioning: What, When, Why & How

<Insert Picture Here> Designing and Developing Highly Scalable Applications with the Oracle Database

Partitioning in Oracle Database 11g. An Oracle White Paper June 2007

Partitioning with Oracle Database 11g Release 2

Scaling To Infinity: Partitioning Data Warehouses on Oracle Database. Thursday 15-November 2012 Tim Gorman

Oracle Database 11g Best Practices for using Partitioning in HA Environments and VLDBs

March 9 th, Oracle Total Recall

Oracle Database 11gNew Features: Best Practices to Improve Scalability, Performance & High Availability

<Insert Picture Here> Oracle Database Directions Fred Louis Principal Sales Consultant Ohio Valley Region

Partitioning under the hood in MySQL 5.5

StreamServe Persuasion SP5 Oracle Database

news from Tom Bacon about Monday's lecture

Oracle DBA Course Contents

Oracle Database 11 g Performance Tuning. Recipes. Sam R. Alapati Darl Kuhn Bill Padfield. Apress*

Exadata for Oracle DBAs. Longtime Oracle DBA

Oracle Database 11g for Data Warehousing

1Z0-117 Oracle Database 11g Release 2: SQL Tuning. Oracle

Oracle Database 10g: Administration Workshop II Release 2

Oracle EXAM - 1Z Oracle Database 11g Release 2: SQL Tuning. Buy Full Product.

Oracle Total Recall with Oracle Database 11g Release 2

PERFORMANCE TIPS FOR BATCH JOBS

RMAN What is Rman Why use Rman Understanding The Rman Architecture Taking Backup in Non archive Backup Mode Taking Backup in archive Mode

A basic create statement for a simple student table would look like the following.

Optimizing the Performance of the Oracle BI Applications using Oracle Datawarehousing Features and Oracle DAC

LOGGING OR NOLOGGING THAT IS THE QUESTION

Lessons Learned while Pushing the Limits of SecureFile LOBs. by Jacco H. Landlust. zondag 3 maart 13

An Esri White Paper February 2011 Best Practices for Storing the ArcGIS Data Reviewer Workspace in an Enterprise Geodatabase for Oracle

5. CHANGING STRUCTURE AND DATA

ETL Process in Data Warehouse. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

low-level storage structures e.g. partitions underpinning the warehouse logical table structures

Contents at a Glance. Contents... v About the Authors... xiii About the Technical Reviewer... xiv Acknowledgments... xv

DBMS Questions. 3.) For which two constraints are indexes created when the constraint is added?

MyOra 3.0. User Guide. SQL Tool for Oracle. Jayam Systems, LLC

Unit Storage Structures 1. Storage Structures. Unit 4.3

Oracle Database 10g: New Features for Administrators

Advanced Oracle SQL Tuning

Analyzing & Optimizing T-SQL Query Performance Part1: using SET and DBCC. Kevin Kline Senior Product Architect for SQL Server Quest Software

Oracle 12c Recovering a lost /corrupted table from RMAN Backup after user error or application issue

Guide to Performance and Tuning: Query Performance and Sampled Selectivity

If you have not multiplexed your online redo logs, then you are only left with incomplete recovery. Your steps are as follows:

Recover Oracle Database upon losing all Control Files

The safer, easier way to help you pass any IT exams. Exam : 1Z Upgrade Oracle9i/10g/11g OCA to Oracle Database 12c OCP.

Oracle Database In-Memory A Practical Solution

Chapter 5: Logical Database Design and the Relational Model Part 2: Normalization. Introduction to Normalization. Normal Forms.

SQL Simple Queries. Chapter 3.1 V3.0. Napier University Dr Gordon Russell

Who am I? Copyright 2014, Oracle and/or its affiliates. All rights reserved. 3

Oracle Backup and Recover 101. Osborne Press ISBN

In This Lecture. SQL Data Definition SQL SQL. Notes. Non-Procedural Programming. Database Systems Lecture 5 Natasha Alechina

CONFIGURING AND OPERATING STREAMED PROCESSING IN PEOPLESOFT GLOBAL PAYROLL IN PEOPLETOOLS 8.48/9

SQL Performance for a Big Data 22 Billion row data warehouse

Revolutionized DB2 Test Data Management

Automatic Data Optimization

Safe Harbor Statement

Backup/Recovery Strategy and Impact on Applications. Jacek Wojcieszuk, CERN IT Database Deployment and Persistancy Workshop October, 2005

CIS 631 Database Management Systems Sample Final Exam

Oracle TDE Tablespace Encryption

Oracle 11gR2 : Recover dropped tablespace using RMAN tablespace point in time recovery

This appendix describes the following procedures: Cisco ANA Registry Backup and Restore Oracle Database Backup and Restore

Access Tutorial 2: Tables

12. User-managed and RMAN-based backups.

Oracle Database Links Part 2 - Distributed Transactions Written and presented by Joel Goodman October 15th 2009

MySQL Cluster Deployment Best Practices

Review your answers, feedback, and question scores below. An asterisk (*) indicates a correct answer.

RAID installation guide for Silicon Image SiI3114

David Dye. Extract, Transform, Load

Toad for Data Analysts, Tips n Tricks

Oracle Database 11g: New Features for Administrators

Exadata: from Beginner to Advanced in 3 Hours. Arup Nanda Longtime Oracle DBA (and now DMA)

DBA Best Practices: A Primer on Managing Oracle Databases. Leng Leng Tan Vice President, Systems and Applications Management

An Oracle White Paper August Automatic Data Optimization with Oracle Database 12c

Oracle Database 12c: New Features for Administrators

Experiment 5.1 How to measure performance of database applications?

SAP HANA PLATFORM Top Ten Questions for Choosing In-Memory Databases. Start Here

ASM and for 3rd Party Snapshot Solutions - for Offhost. Duane Smith Nitin Vengurlekar RACPACK

Oracle Database 10g: Performance Tuning 12-1

Module 14: Scalability and High Availability

Physical Design. Meeting the needs of the users is the gold standard against which we measure our success in creating a database.

Optimizing Performance. Training Division New Delhi

ORACLE 11g RDBMS Features: Oracle Total Recall Oracle FLEXCUBE Enterprise Limits and Collateral Management Release 12.1 [December] [2014]

The Sins of SQL Programming that send the DB to Upgrade Purgatory Abel Macias. 1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

SQL Server Database Coding Standards and Guidelines

Managing Objects with Data Dictionary Views. Copyright 2006, Oracle. All rights reserved.

New Features in MySQL 5.0, 5.1, and Beyond

Top 10 Performance Tips for OBI-EE

Oracle 11g DBA Online Course - Smart Mind Online Training, Hyderabad. Oracle 11g DBA Online Training Course Content

Query Optimization in Oracle 12c Database In-Memory

Oracle Data Recovery Advisor

Data Compression in Blackbaud CRM Databases

1 Introduction. 2 Technical overview/insights into FDAs. 1.1 What is what

Many DBA s are being required to support multiple DBMS s on multiple platforms. Many IT shops today are running a combination of Oracle and DB2 which

Transcription:

Partitioning: Tips and Tricks Arup Nanda Longtime Oracle DBA

Agenda Partitioning primer Choosing a partition strategy Choosing a partition key Solutions to common problems using partitioning Potential issues to watch out for Creative solutions in partitioning 2

3

Local Indexes Index is partitioned exactly as the table Index entries of each part are found in the corresponding partition in index only When table partition is dropped, so is the index partition Example create index in_mytab on mytab (col1) local part1 part2 part3 part4 Table part1 part2 part3 part4 Index 4

Global Indexes Entries for all parts of the table are found all over the index. Usually used for unique indexes Index may be optionally partitioned When table part is dropped, the index needs to be rebuilt. Example CREATE INDEX PK_MYTAB ON MYTAB (COL2) GLOBAL; part1 part2 part3 part4 Table Index Index 5

Global-vs-Local Index Whenever possible, use local index In Primary Key (or Unique) Indexes: If part column is a part of the PK local is possible and should be used e.g. TXN table. PK (TXN_DT, TXN_ID) and part key is (TXN_DT) If not, try to include the column in PKs E.g. if TXN_ID was the PK of TXN, can you make it (TXN_DT, TXN_ID)? Ask some hard design questions Do you really need a PK constraint in the DW? 6

Global indexes can be partitioned The global indexes can themselves be partitioned in any manner, different from the table partitioning scheme create table mytab ( col1 number, col2 date, col3 varachar2, and so on for other columns ) partition by range (col1) ( partition p1 values less than (101), partition p2 values less than (201), partition p2 values less than (301) ) create index pk_mytab on mytab (col2) global partition by hash partitions 4; Global index is hash partitioned while table is range partitioned, on different columns. 7

Different Range Partitioning create table mytab ( ) partition by range (col1) ( partition p1 values less than (101), partition p2 values less than (201), partition p2 values less than (301) ) create index IN1 on MYTAB (col4) global partition by range (col4) (partition p1 values less than (100), partition p2 values less than (maxvalue) ) create index IN1 on MYTAB (col2) global partition by range (col4) Will fail with ORA-14038: GLOBAL partitioned index must be prefixed create index IN1 on MYTAB (col4,col2) global partition by range (col4) 8

Sub-Partitioning Range-Hash Sales Date and Sales Trans ID Range-List Sales Date and Product Code Range-range 2 date columns List-range Product code and then sales date List-list Product code and geographic territory List-Hash Product code and transaction id 9

Global Index Maintenance Global Indexes maintained with the partition operation alter table mypart drop partition p1 update indexes; Or, only global indexes: alter table mypart drop partition p1 update global indexes; 10

11

Referential Partitioning You want to partition CUSTOMERS on ACC_REP column The column is not present on child tables Earlier option: add the column to all tables and update it Difficult and error-prone 11g has referential partitioning CUSTOMERS CUST_ID ACC_REP part SALES SALES_ID CUST_ID FK TOT_AMT LINE_ITEMS SALES_ID FK LINE_ID PRODUCT_ID 12

Referential Partitioning Partition CUSTOMERS as usual create table SALES ( SALES_ID number not null, CUST_ID number not null, TOT_AMT number constraint fk_sales_01 foreign key (cust_id) references customers) partition by reference (fk_sales_01); Partitions of SALES are created with data from CUSTOMERS. CUSTOMERS CUST_ID ACC_REP part SALES SALES_ID CUST_ID FK TOT_AMT LINE_ITEMS SALES_ID FK LINE_ID PRODUCT_ID 13

Addressing Ref Partitions USER_PART_TABLES view has info partitioning_type "REFERENCE" ref_ptn_constraint_name the FK name The partitions will also bear the same name as the parent 14

INTERVAL Partitioning SALES table partitioned on SALES_DT Partitions defined until SEP 2008. Before Oct starts, you have to create the partition If you don't create the part, the INSERT will fail on Oct 1 st. To mitigate the risk, you created the PMAX partition. Undesirable When you finally add the OCT08 partition, you will need to split the PMAX highly undesirable 15

Interval Partitions create table SALES ( sales_id number, sales_dt date ) partition by range (sales_dt) interval (numtoyminterval(1,'month')) store in (TS1,TS2,TS3) ( partition SEP08 values less than (to_date('2008-10-01','yyyy-mm-dd')) ); Creates a partition automatically when a new row comes in Specifies one partition per month This is the first partition. The subsequent partition names are system generated 16

Addressing Interval Partitions USER_PART_TABLES view: partitioning_type "INTERVAL" USER_TAB_PARTITIONS view: high_value shows the upper bound of partition To address a specific partition: select * from SALES partition for (to_date('22-sep-2008','dd-mon-yyyy')); 17

Non-Interval Process To add partitions automatically: http://arup.blogspot.com/2010/11/tool-to-add-range-partitions.html To drop partitions automatically: http://arup.blogspot.com/2010/11/automatic-range-partitiondropping-tool.html 18

Asynch Global Index part1 part2 part3 Table part1 part2 part3 Global Index alter table drop t partition part3 update global indexes; A scheduler job pmo_deferred_gidx _maint_job cleans up Column ORPHANED_ENTRIES in USER_INDEXES view giptab_test1.sql 19

Partial Index part1 part2 part3 part1 part2 part3 SQL> alter table ptab1 modify partition p1 indexing on; SQL> alter table ptab1 modify partition p2 indexing off; SQL> create index in_g2_ptab1 on ptab1 (c1) global indexing partial; Table Local Index partindex_test1.sql 20

Watchout! 21

Date Partition-keys Clear definition helps This will not choose the partition at compile time where sales_date between '1-jan-09' and '31-jan- 09'; This will: where sales_date between TO_DATE(' 2009-01-01 00:00:00', 'SYYYY-MM-DD HH24:MI:SS', 'NLS_CALENDAR=GREGORIAN') and TO_DATE(' 2009-01-31 00:00:00', 'SYYYY-MM-DD HH24:MI:SS', 'NLS_CALENDAR=GREGORIAN') / date_explain1.sql date_explain2.sql exp.sql 22

Partition-wise Joins Works for range partitioned tables Not for hash partitioned Works only for equality operators; not ranges pwj_test1.sql pwj_test2.sql 23

Multicolumn create table mcptab1 ( col1 number(10), col2 number(10) ) partition by range (col1, col2) ( partition p1 values less than (101, 101), partition p2 values less than (201, 201) ) mcpart_test1.sql COL1 COL2 ---------- ---------- 100 100 100 101 100 102 101 100 100 200 100 201 100 202 101 101 101 102 102 100 102 101 102 102 101 200 101 201 101 202 102 200 102 201 102 202 24

Multi-part key Determination Consider 1st column < boundar y value? No = boundar y value? No Place in Partition 1 Yes Yes A Place in Partition 1 Yes < boundar y value? Consider 2nd column Place in Partition 2 A No 2 nd column is considered only when 1 st column is equal to the boundary, not less or not more 25

Subpart Stats Collection The normal method to collect stats begin dbms_stats.gather_table_stats ( ownname=> user, tabname=>'ptest2'); end; Problem: This populates the partition stats but not subpartition stats To collect the subpartition stats, you must use granularity parameter. It has to be either ALL or SUBPARTITION begin dbms_stats.gather_table_stats ( ownname=> user, tabname=>'ptest2, granularity=> SUBPARTITION ); end; subpart_stats1.sql subpart_stats2.sql 26

Partition stats collection The granularity parameter controls the scope for the stats Possible Values 1. AUTO determined by Oracle 2. GLOBAL AND PARTITION global stats and partition-level stats (subpartition level stats are not collected) 3. SUBPARTITION down to subpartition level 4. GLOBAL only global stats 5. ALL global, part and subpart level 6. APPROX_GLOBAL AND PARTITION new in 11g. Global stats are not calculated; but derived from partition stats GRANULARITY Table Partition Partition Subpartition Global Global Statistics Statistics GLOBAL YES NO NO NO PARTITION NO YES YES NO DEFAULT YES YES YES NO SUBPARTITION NO NO YES YES ALL YES YES YES YES 27

Stats for a specific partition only To collect stats for a specific partition (or subpartition) Use the partname parameter begin dbms_stats.gather_table_stats ( ownname=> user, tabname=>'ptest2, partname=> SALES_Q1, ); end; In 11g, the global stats are automatically updated 28

Creative Solutions 29

Partition on Virtual Columns VC: not stored with the table Computed at runtime Can be indexed and partitioned vcpart_test1.sql 30

Partition on Invisible Columns Invisible columns are not visible Need not be entered Can be indexed and partitioned Invcolpart_test1.sql 31

Index Blocks Too Hot to Handle Consider an index on TRANS_ID a monotonically increasing number It may make a handful of leaf blocks experience severe contention This hot area shifts as the access patterns change Solution: Reverse Key Index? 1-20 1-10 11-20 12-20 13-20 14-20 32

Solution: Hash Partitioned Index Index Can be hash-partitioned, regardless of the partitioning status of the table Table SALES is un-partitioned; while index is partitioned This creates multiple segments for the same index, forcing index blocks to be spread on many branches Can be rebuilt: alter index IN_SALES_01 rebuild partition <PartName>; Can be moved, renamed, etc. create index IN_SALES_01 on SALES (SALES_TRANS_ID) global partition by hash (SALES_TRANS_ID) partitions 8 33

Reason Hash partition 1 Hash partition 2 Hash partition 8 1-20 1-20 1-10 11-20 1-10 11-20 12-20 12-20 13-20 13-20 14-20 14-20 34

When Overlap between Logical Modeling and Physical Design Logical Partition Design Physical Last part of logical design and first part of physical design When should partitioning be used In almost all the time for large tables There is no advantage in partitioning small tables, right? Wrong. In some cases small tables benefit too 35

How to Choose Partitioning 36

Why? Common Reasons Easier Administration: Smaller chunks are more manageable Rebuilding indexes partition-by-partition Data updates, does not need counters Performance: full table scans are actually partition scans Partitions can be joined to other partitions Latching 37

More Important Reasons Data Purging DELETEs are expensive REDO and UNDO Partition drops are practically free Local indexes need not be rebuilt Archival Usual approach: insert into archival table select * from main table Partition exchange Local indexes need not be rebuilt 38

Materialized Views Refreshes Partition Exchange Create a temp table Create Indexes, etc. When done, issue: alter table T1 exchange partition sp11 with table tmp1; Data in TMP1 is available Temp Table sp11 sp21 sp31 sp41 sp12 sp13 partition p1 sp22 partition p2 sp32 sp33 partition p3 partition p4 Table 39

Backup Efficiency When a tablespace is read-only, it does not change and needs only one backup RMAN can skip it in backup Very useful in DW databases Reduces CPU cycles and disk space A tablespace can be read only when all partitions in them can be so SQL> alter tablespace Y08M09 read only; 40

Data Transfer Traditional Approach insert into target select * from source@dblink Transportable Tablespace Make it read only Copy the file "Plug in" the file as a new tablespace into the target database Can also be cross-platform Source TST TS1 Target TST TS1 41

Information Lifecycle Management When data is accessed less frequently, that can be moved to a slower and cheaper storage, e.g. from Fiber to SATA Two options: 1. Create a tablespace ARC_TS on cheaper disks ALTER TABLE TableName MOVE PARTITION Y07M08 TABLESPACE ARC_TS; Reads will be allowed; but not writes 2. ASM Approach ALTER DISKGROUP DROP DISK ADD DISK Fully available Fast Disk TS1 Slow Disk ARC_TS 42

How to Decide First, decide on the objectives of partitioning. Multiple objectives possible Objectives Data Purging Data Archival Performance Improving Backups Data Movement Ease of Administration Different Type of Storage Assign priorities to each of these objectives 43

Thank You! My Blog: arup.blogspot.com My Tweeter: arupnanda 44