Agenda Introduzione Enterprise Grid Computing Panoramica architetture: Oracle Real Application Clusters Oracle Data Guard Maximum Availability Architecture Q&A
I Problemi dell IT Isole applicative distinte Configurate per i picchi Scalabilita limitata Affidabilita < 99.x% Scarsa sicurezza a causa della frammentazione Costi in continua crescita Impatto su tutte le linee di business
Un esempio: a Dicembre Order Entry caricato al massimo per processare tutti gli ordini La fatturazione non e usata al 100% Order Entry Fatturazione
Un esempio: poi a Gennaio Order Entry cala dopo il picco stagionale La fatturazione e sovracaricata per la chiusura di fine anno Order Entry Fatturazione
Con il Grid Computing Bilanciamento del carico basato su policy che consentano di ottimizzare entrambi le condizioni di picco di carico Order Entry & Fatturazione
Oracle Grid Computing Application Server Clusters Database Clusters Networked Storage
Oracle Grid Computing Application Server Clusters Database Clusters Networked Storage
Infrastructure Grid Application Server Clusters Real Application Clusters Grid Control Automatic Storage Management
Automatic Storage Management ASM Low Cost Eliminates need for volume manager and file systems Works well with inexpensive, modular storage Gives better storage utilization Easy up to 50% less DBA/Sys Admin work Fault tolerant Raw disk performance Capacity on demand Automatic I/O load balancing
Automatic Storage Management New capability in the Oracle database kernel Provides a vertical integration of the file system and volume manager for simplified management of database files Spreads database files across all available storage for optimal performance Enables simple and non-intrusive resource allocation with automatic rebalancing Virtualizes the storage resources Designed to tolerate failures
ASM How it Works No volumes: just a pool of storage Automatic Storage Management
ASM How it Works No volumes: just a pool of storage Partitions total disk space into uniform sized megabyte units Automatic Storage Management
ASM How it Works No volumes: just a pool of storage Partitions total disk space into uniform sized megabyte units Units assigned to a file are managed and tracked by ASM Automatic Storage Management
ASM How it Works No volumes: just a pool of storage Partitions total disk space into uniform sized megabyte units Units assigned to a file are managed and tracked by ASM Online add/remove of disk with automatic rebalancing Automatic Storage Management
ASM How it Works No volumes: just a pool of storage Partitions total disk space into uniform sized megabyte units Units assigned to a file are managed and tracked by ASM Online add/remove of disk with automatic rebalancing Automatic Storage Management
ASM Disk Groups Administer disk groups not files Number of disk groups remains constant while number of files and disks continually increases Fast Disk Group Slow Disk Group
Mirroring with ASM 3 choices for disk group redundancy External: defers to hardware mirroring Normal: 2-way mirroring High: 3-way mirroring Integration with database removes need for dirty region logging
The Operational Stack TODAY ASM Tables Tables Tablespace Files File System Logical Vol 0010 0010 0010 0010 0010 0010 0010 0010 0010 0010 Tablespace Files File System Logical Vol Oracle ASM Disks Disk Group The best way to lower mgmt costs is to remove complexity
ASM Failure Groups Disk Groups assigned by DBAs A Failure Group is a set of disks sharing a common resource whose failure needs to be tolerated Redundant copies of an extent are stored in separate Failure Groups Controller 1 Controller 2 Failure Groups assigned by DBAs or automatically by ASM Failure Group 1 Failure Group 2 Disk Group
Oracle Grid Computing Application Server Clusters Database Clusters Networked Storage
Tipica architettura Scalabilità limitata dalla capacità dell HW Affidabilità con HW ridondato ERP DW Web CRM
Oracle9i Real Application Clusters: Condivisione di dati e memoria - Scalabilità lineare - Affidabilità 24x7 ERP DW CRM Web
Overall Architecture Data Backup Tape Tape App Server High Speed IPC Interconnect Disk Switch Switch Oracle9i Real Application Clusters Database Disk App Server Disk Disk Shared RAID Disk Subsystem N - Tier Oracle9i Data Guard Failover or DR Site
Full Cache Fusion Oracle9i Cache Fusion incrementa le prestazioni e l affidabilità I dati sono inviati direttamente su un bus di interconnessione ad alta velocità (Gigabit). Minimizzare I/O Node A Node B Data Transfer Request Database buffers Database buffers Database
Transparent Application Failover Select Failover SELECT * FROM emp; Client SELECT * FROM emp; empno name empno name 7369 Smith 7499 Allen 7521 Ward 7566 Jones **** BANG ***** Instance 1 Instance 2 7369 Smith 7499 Allen 7521 Ward 7566 Jones 7654 Martin 7698 Blake Rows Partially Returned When Failover Occurred DB Continues Returning Remaining Rows
Oracle 9i RAC: scalabilità della E-Business Suite Oracle11i E-Business Suite Benchmark 84% Scalability 7,000 6,000 5,000 5,433 6,496 # Users 4,000 3,000 2,000 1,000 0 1,288 2,296 * 4,368 * 1 Node 2 Nodes 4 Nodes 5 Nodes 6 Nodes
Single System Image Management Enterprise Manager 10g permette la gestione dellambiente di RAC come un single system image Le pagine del Cluster forniscono ampie info Aggregazione di stati, prestazioni di tutte le istanze Supporto a operazioni sul database e sui servizi Drill down di pagine per la specifica istanza Drill up alla pagina di cluster Cluster page Visualizzazione HW e configurazione sistema operativo, prestazioni e stati del cluster Drill down di pagine per gli specifici nodi
Enterprise Manager Database Home Page
Enterprise Manager Database Performance Page Viste del database Drill down per le info dell istanza Visualizzazione dei dati del servizio Collezionate dall AWR Visualizzazione servizi top, moduli, azioni Definizioni di soglie Allarmi
Enterprise Manager Cluster Home Page
Enterprise Manager Cluster HW Performance Viste del cluster Visualizzazione dell utilizzo dell HW CPU Memoria Dischi
Oracle Grid Computing Application Server Clusters Database Clusters Networked Storage
Highly Available Application Oracle10gAS Clients Load Balancer Web Cache Application Server Tier OC4J Clusters Database Tier
Oracle Grid Computing Application Server Clusters Database Clusters Networked Storage
Complete Manageability Solution Manage entire infrastructure Manage large number of systems Grid Control
Oracle Database 10g Self-Managing Database Built-in Intelligent Infrastructure Code instrumentation Workload repository Automation of Routine Tasks Automatic disk-based Backup and Recovery Automatic optimizer statistics collection Automatic Memory Management Automatic Storage Management Tools to Empower the DBA Automatic Database Diagnostic Monitor Automatic Tuning Optimizer Lots more
Performance Diagnosis & Troubleshooting Automatic Workload Repository MMON ADDM finds 30 top problems tbs SYSAUX In memory statistics SGA 7:00am 7:30am 8:00am 8:30am WR Schema Snapshot 1 Snapshot 2 Snapshot 3 Snapshot 4 8:30am
Oracle Grid Computing Application Server Clusters Database Clusters Networked Storage
Availability Defined The degree to which an application or service is available when, and with the functionality, users expect
Oracle s Integrated HA Solutions Unplanned Downtime Planned Downtime System Failures Data Failures System Changes Data Changes Real Application Clusters ASM Flashback RMAN & Oracle Secure Backup H.A.R.D Data Guard Streams Online Reconfiguration Rolling Upgrades Online Redefinition Oracle MAA Best Practices
<Insert Picture Here> Oracle Data Guard
Cos è Oracle Data Guard? - La soluzione Oracle per il disaster recovery dei dati - Automatizza la creazione e il mantenimento di una o più copie transazionalmente consistenti del database di produzione - Se il database primario diventa unavailable (disastro, manutenzione), un database di standby può essere attivato e assumere il ruolo di primario - Richiede la medesima release di database e di sistema operativo sia per il sito primario che per quello di standby
Data Guard Architecture Overview Clients Clients Primary Site Standby Site Data Changes Broker Agent Primary Database Data Guard Broker Standby Database Broker Agent
How Does It Work? As primary database is modified, redo data is propagated to standby databases Standby databases kept synchronized with primary Primary database is open and active; standby database is either in recovery or open read-only / readwrite Standby database can be transitioned to the primary role as necessary
Data Guard Configuration Primary Site Standby Site A Standby Site B Primary Database Standby Database Standby Database Managed as a single configuration Primary and standby databases can be Real Application Clusters or single-instance Oracle Up to nine standby databases supported in a single configuration
Production Database Oracle Data Guard Architecture Network Sync or Async Redo Shipping Broker Optional Delay Redo Apply Physical Standby Database Backup DIGITAL DATA STORAGE DIGITAL DATA STORAGE Transform Redo to SQL Logical Standby Database Open for Reports Optional Delay SQL Apply Additional Indexes & MVs
Oracle Data Guard Process Architecture Transactions LGWR (Synchronous/Asynchronous) Oracle Net RFS remote file server Physical/Logical Standby Database MRP/ LSP Managed recovery process Logical standby process Primary Database ARCH (Synchronous) Online Redo Logs FAL fetch archive log SRL Standby Redo Logs ARCH Backup / Reports Transform Redo to SQL for SQL Apply Archived Redo Logs Archived Redo Logs
Data Guard Redo Apply Data Guard Broker Primary Database Optional Delay Physical Standby Database Backup Network DIGITAL DATA STORAGE Sync or Async Redo Shipping Redo Apply Physical Standby Database is a block-for-block copy of the primary database Uses the database recovery functionality to apply changes Can be opened in read-only mode for reporting/queries Can also perform backup, offloading production database
Data Guard SQL Apply Primary Database Data Guard Broker Optional Delay Logical Standby Database Additional Indexes & Materialized Views Network Continuously Open for Reports Sync or Async Redo Shipping Transform Redo to SQL and Apply Logical Standby Database is an open, independent, active database Contains the same logical information (rows) as the production database Physical organization and structure can be very different Can host multiple schemas Can be queried for reports while logs are being applied via SQL Can create additional indexes and materialized views for better query performance
Standby Databases Are Not Idle Standby Server Read-Only / Read-Write Reporting Standby Database Tape Backups Standby database can be used to offload the primary database, increasing the ROI
Cascaded Redo Log Destinations Standby database receives its redo data from another standby database and not from the original primary database Primary database sends a set of redo data to only selected standby databases and not to all standby databases Reduces the load on the primary system, and also reduces network traffic and use of valuable network resources around the primary site Primary Database Redo Data Physical Standby Database Retransmitted Physical Standby Database
Protection from Human Errors and Data Corruptions Primary Site Standby Site Production Database Standby Database Optional Delayed Apply The application of changes received from the primary can be delayed at standby to allow for the detection of user errors and prevent standby to be affected The apply process also revalidates the log records to prevent application of any log corruptions
Switchover and Failover Primary and Standby role transitions Switchover Planned role reversal No database reinstantiation required Used for maintenance of OS or hardware Failover Unplanned failure (e.g. disasters) of primary Primary database must be reinstantiated Initiated using simple SQL / GUI interface Data Guard automates the processes involved
Flexible Data Protection Modes Protection Mode Risk of Data Loss Redo Shipment Maximum Protection Zero Data Loss Double Failure Protection Synchronous redo shipping to 2 sites Maximum Availability Zero Data Loss Single Failure Protection Synchronous redo shipping Maximum Performance Minimal data loss usually 0 to few seconds Asynchronous redo shipping Balance cost, availability, performance, and transaction protection
Example Ease of Use Switchover using Enterprise Manager is now literally two mouse clicks
Switched!
<Insert Picture Here> Oracle Maximum Availability Architecture
Oracle s Integrated HA Solution Set Feature Integration Grid Clusters Automatic Storage Management Flashback RMAN & Flash Recovery Area H.A.R.D Data Guard Online Reconfiguration Rolling Upgrades Online Redefinition Best Practices Blueprint
Maximum Availability Architecture Oracle recommended High Availability architecture HA blueprints based on Oracle technologies: Database, Application Server, Enterprise Manager, Evolves with new Oracle versions and features Based on HA requirements of customers Intended for Oracle customers interested in HA Best practices to define and meet SLA requirements Architectural, configuration and operational best practices Minimize scheduled and unscheduled downtime Efficiently recover from outages and restore fault tolerance Hardware and OS independent Unbreakable Architecture + Best Practices = Maximum Availability
MAA Was Created Based on Real world customer requests and questions: What issues should we consider for choosing the most optimal high availability architecture? What is Oracle s best high availability architecture? How can we manage this high availability environment? What are the performance trade-offs? How do we repair from various outages?
MAA Best Practice Publications Best Practices on: RAC/ Data Guard configuration Redo data transport mechanisms Instance Recovery Switchover/Failover Media recovery SQL Apply configuration Network configuration Integration of HA technologies White papers 1 : MAA detailed Media Recovery Site/Network configuration Fast-Start Checkpointing SQL Apply Best Practices Role Management 1. Ref. http://otn.oracle.com/deploy/availability/htdocs/maa.htm for latest updates
MAA Architecture Overview OracleAS OracleAS WAN Traffic Manager Dedicated Network RAC Primary Site Data Guard RAC Secondary Site
Data Guard and RAC Data Guard and Real Application Clusters are complementary and should be used together for Maximum Availability Architecture Real Application Clusters provides high availability Provides rapid and automatic recovery from node failures or an instance crash Provides increased scalability Data Guard provides disaster protection and prevents data loss By maintaining transactionally consistent copies of primary database Protects against disasters, data corruption and user errors Does not require expensive and complex HW/SW mirroring
Data Guard + RAC Configuration Primary Site Standby Site RAC Broker RAC Primary Database Data Guard Standby Database Data Guard + RAC: end-to-end Data Protection and HA Basis of Maximum Availability Architecture Managed as a single configuration
Real Application Clusters + Data Guard Sito Primario R/W Sito Remoto Standby R/O Data Guard RAC Interconnect LAN/WAN Database A Primary > 100Km. Database A Standby
We encourage you to use The Information Company message at the end of all your presentations.
The preceding is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described for Oracle s products remains at the sole discretion of Oracle.
We encourage you to use The Information Company message at the end of all your presentations.