Informatica Executive Summit, Nov. 3, 2010

Managing Data Growth in the 21st Century: Leveraging Virtualization & Cloud Technology
Tony Pagliarulo, Vice President of IT, EMC

Agenda
- EMC Focus & Strategy
- EMC IT Journey to the Private Cloud
- Data Virtualization Roadmap
- Information Management Governance

About EMC
- Fortune 500 rank: 166
- Revenues (2010 estimate): > $16.9 billion
- Employees (end Q3 2010, worldwide): 47,000
- Countries where EMC does business: > 80
- Total cash and investments (year to date): $10.5 billion
- Quarterly free cash flow (year to date): $2.2 billion
- Market value (October 2010): > $44 billion
- Founded: 1979

EMC's Focus
- EMC is a TECHNOLOGY company
- EMC's focus is IT infrastructure

EMC's Complementary Strategies
- Information Infrastructure: Information Storage, Information Management, Information Protection, Information Security, Information Intelligence
- Virtual Infrastructure: Virtualization (VMware), the Cloud OS

EMC IT at a Glance
- User profiles: 48,000 internal users; 400,000+ customers and partners
- IT environment: 5 data centers, 7 PB storage
- Business applications: 400+ applications and tools
- Virtualization: 6,000+ OS images (worldwide); 71% of all virtualized; 85% of Intel virtualized
- Global support: 80+ countries and 20 languages

EMC IT Current Challenges
We have the same challenges as our customers:
- Globalization
- Performance
- Business value
- Functionality
- Manageability
- Cost of ownership
- Security
- Interoperability

The Digital Universe 2009-2020
- 2009: 0.8 zettabytes
- 2020: 35.2 zettabytes
- Growing by a factor of 44
- Source: IDC Digital Universe Study, sponsored by EMC, May 2010

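The factor of 44 follows directly from the two figures on the slide. The sketch below checks that arithmetic and also derives an implied yearly growth rate. The yearly rate assumes smooth compounding between 2009 and 2020, which is an illustration and not a figure from the IDC study; the function names are hypothetical.

```python
import math

# Figures quoted on the slide (IDC Digital Universe Study, May 2010).
SIZE_2009_ZB = 0.8
SIZE_2020_ZB = 35.2


def growth_factor(start: float, end: float) -> float:
    """Return how many times larger `end` is than `start`."""
    return end / start


def implied_annual_rate(start: float, end: float, years: int) -> float:
    """Compound annual growth rate, ASSUMING smooth yearly compounding."""
    return (end / start) ** (1 / years) - 1


factor = growth_factor(SIZE_2009_ZB, SIZE_2020_ZB)
rate = implied_annual_rate(SIZE_2009_ZB, SIZE_2020_ZB, 2020 - 2009)

print(f"growth factor: {factor:.0f}x")
print(f"implied compound annual growth: {rate:.1%}")
```

Under that compounding assumption the digital universe would grow by roughly 41% per year.
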
IT Infrastructure Today
- 72% maintain, 28% invest
- Complex, inefficient, inflexible, costly

Enter The Cloud

What is Cloud Computing?

The Cloud is...
- Built DIFFERENTLY: dynamic pools of virtualized resources
- Operated DIFFERENTLY: end-to-end service delivery
- Consumed DIFFERENTLY: convenient for IT and for those they support
- A private cloud is one that IT controls

Implications for Today's Data Centers
- Trusted, Controlled, Reliable, Secure
- Multiple incompatible architectures

Implications for Today's Data Centers
- Trusted, Controlled, Reliable, Secure: multiple incompatible architectures
- Dynamic, Cost-Efficient, On-Demand, Flexible: homogeneous x86 architecture

Implications for Today's Data Center
- Today's data center: Trusted, Controlled, Reliable, Secure
- Private cloud: Trusted, Controlled, Reliable, Secure and also Dynamic, Cost-Efficient, On-Demand, Flexible
- Cloud OS: compute, network, storage
- Public cloud: Dynamic, Cost-Efficient, On-Demand, Flexible

The Goal: Global Workload Deployment
Virtualization enables cloud computing
- Virtual applications
- Federation
- Private cloud
- Operating system
- Information security
- Public cloud

EMC IT's Cloud Strategy
- Applications: traditional apps, next-gen cloud apps, SaaS
- Federation (vMotion + VPLEX)
- Private cloud
- Virtualization (vSphere)
- Information security
- Public cloud

Our Journey to the Private Cloud
- Phases: IT Production (lower costs), Business Production (improved quality of service), IT-as-a-Service (improve agility)
- % virtualized milestones: 15%, 30%, 50%, 85%, 95%
- We are here (position marked on the chart)
- Workstreams: VDC optimization, standardization, virtualization
- Governance: cloud enablement, service management
- Platinum, Gold

The Journey to the Private Cloud
- Phases: IT Production (lower costs), Business Production (improve quality of service), IT-as-a-Service (improve agility)
- % virtualized milestones: 15%, 30%, 50%, 85%, 95%
- Applications: application portfolio rationalization, application selection, virtualization of CIO-owned applications
- Infrastructure: data center consolidation, virtualization strategy, virtualization factory
- Governance: establish PMO, design and implement transformation dashboard, implement IT management policies, establish service catalog
- Platinum, Gold

EMC Enterprise Information Architecture
- End-user query tools
- Subject-oriented marts and BI as a Service: Revenue, HR, POC 1, POC 2, BU App 1, BU App 2
- Data federation (PI tool TBD)
- Master data, Global Data Warehouse, enterprise data, rapid prototyping
- Data integration layer: Informatica PowerCenter, Informatica Data Services
- Source systems: Customer Master, Catalyst, Oracle 11i, SAP, PeopleSoft, etc.

Data Virtualization Roadmap

Guiding Principles: Application/Database Layer
- Maintain as few copies of data as possible
  - Master Data Management (Informatica Siperian) as single source of truth
  - Informatica Data Services to enable data federation
  - Subset data using Informatica Applimation
- Transform and replicate data if needed
  - Informatica PowerCenter used to feed the Global Data Warehouse and the subject marts
- Archive data
  - Archive database data using Informatica Applimation
  - Email archiving using EMC SourceOne
  - File system archiving using EMC Rainfinity

Guiding Principles: Storage Layer
- Better utilization using storage optimization techniques
  - File virtualization using EMC Rainfinity
  - Block virtualization using EMC VPLEX
  - Virtual provisioning
- De-duplication technology
  - Source de-duplication using EMC Avamar
  - Target de-duplication using EMC Data Domain
- Object technology for primary storage/backup
  - EMC Atmos as durable distributed object storage

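The general idea behind de-duplication can be sketched with content hashing: identical chunks are stored once and referenced by digest. This is a generic illustration using fixed-size chunks, not a description of how EMC Avamar or EMC Data Domain work internally; the function names and the tiny chunk size are assumptions chosen for readability. Source versus target de-duplication differ in where this hashing runs: on the client before data is sent, or on the backup target after it arrives.

```python
import hashlib


def dedup_store(blobs, chunk_size=4):
    """Split each blob into fixed-size chunks and keep one copy per unique chunk.

    Returns (store, recipes): `store` maps a SHA-256 digest to chunk bytes,
    and each recipe is the ordered list of digests needed to rebuild a blob.
    """
    store = {}
    recipes = []
    for blob in blobs:
        recipe = []
        for i in range(0, len(blob), chunk_size):
            chunk = blob[i:i + chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            store.setdefault(digest, chunk)  # store a chunk only the first time it is seen
            recipe.append(digest)
        recipes.append(recipe)
    return store, recipes


def restore(store, recipe):
    """Rebuild the original bytes from a recipe of chunk digests."""
    return b"".join(store[d] for d in recipe)


# Two hypothetical backups that share most of their content.
backups = [b"AAAABBBBCCCC", b"AAAABBBBDDDD"]
store, recipes = dedup_store(backups)

raw_bytes = sum(len(b) for b in backups)
stored_bytes = sum(len(c) for c in store.values())
print(f"raw: {raw_bytes} bytes, stored after de-duplication: {stored_bytes} bytes")
```

In this toy example the two backups share two of their three chunks, so only four unique chunks are kept.
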
Data ILM: Complete Lifecycle of Data
1. Archive for Performance: archive (relocate) production data to less expensive, virtualized infrastructure.
   - Improve core application performance and operational efficiency
   - Lower total application infrastructure cost
   - Maintain seamless application access to data
2. Archive for Compliance: archive data to online content-addressable storage.
   - Meet compliance requirements while reducing risk and infrastructure cost
   - Maintain application-independent access to archived data in compressed files via ODBC/JDBC
   - Search, browse, and view archived data in compressed files through the Informatica Data Discovery portal
3. Archive for Application Retirement: archive data to online content-addressable storage.
   - Retire legacy applications and eliminate application and RDBMS license and server costs
   - Maintain application-independent access to archived data via ODBC/JDBC
   - Search, browse, and view archived data through the Informatica Data Discovery portal
Diagram label: Nearline Database

Remove Large Amounts of Data Using Data Archive and Application Retirement
EMC Management for Oracle Applications with Informatica
- Diagram components: ALM Server, Production, Informatica Data Archive, Staging Files, File Archive Server, Data Discovery, EMC Centera, BI Tools
- The archive engine relocates all data within identified tables and entities based on the archiving policy definition
- Decrease capital and operating costs by reducing the storage volume of rarely used data
- Retired application data is stored in a highly compressed, immutable file archive format

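Policy-driven archiving can be pictured as relocating the rows that match a policy from a production table to an archive table in one transaction. The sketch below uses Python's built-in sqlite3 module purely as a stand-in. The table names, columns, cutoff date, and the `archive_rows` helper are all hypothetical, and this is not the Informatica Data Archive interface.

```python
import sqlite3


def archive_rows(conn, cutoff):
    """Move orders dated before `cutoff` into the archive table; return rows moved."""
    with conn:  # commits on success, rolls back if either statement fails
        conn.execute(
            "INSERT INTO orders_archive SELECT * FROM orders WHERE order_date < ?",
            (cutoff,),
        )
        moved = conn.execute(
            "DELETE FROM orders WHERE order_date < ?", (cutoff,)
        ).rowcount
    return moved


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, order_date TEXT, amount REAL)")
conn.execute("CREATE TABLE orders_archive (id INTEGER PRIMARY KEY, order_date TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "2005-03-01", 10.0), (2, "2007-08-15", 20.0), (3, "2010-06-30", 30.0)],
)

# Hypothetical policy: anything older than 2008-01-01 is rarely used.
moved = archive_rows(conn, "2008-01-01")
remaining = conn.execute("SELECT COUNT(*) FROM orders").fetchone()[0]
archived = conn.execute("SELECT COUNT(*) FROM orders_archive").fetchone()[0]
print(f"moved {moved} rows; {remaining} remain in production, {archived} archived")
```

Running the copy and the delete inside one transaction keeps a failure from leaving rows duplicated or lost.
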
Reduce Storage Footprint with Data Subset
EMC Management for Oracle Applications with Informatica
- Diagram labels: production replica, Data Subset filter, useful data, Test/Development (EMC Symmetrix DMX-4)
- Review the effect of subsetting before removing the data
- Data integrity and immediate availability for subsetted instances
- Reduce the footprint of module storage by 78%

Oracle 11i Applications: EMC IT Use Case
- Poor performance
- Infrastructure costs
- Resource costs

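Reviewing the effect of a subset before removing data amounts to a dry run: apply the filter, measure what would be kept, and only then act. The sketch below is a generic illustration with made-up rows and sizes; the `Row` type, the `preview_subset` helper, and the fiscal-year filter are assumptions and do not reproduce the 78% figure from the slide.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Row:
    module: str
    fiscal_year: int
    size_mb: float


def preview_subset(rows, keep):
    """Report the footprint before and after applying `keep`, without deleting anything."""
    total = sum(r.size_mb for r in rows)
    kept = sum(r.size_mb for r in rows if keep(r))
    reduction = 1 - kept / total if total else 0.0
    return {"total_mb": total, "kept_mb": kept, "reduction": reduction}


# Hypothetical contents of a production replica.
replica = [
    Row("GL", 2006, 400.0),
    Row("GL", 2008, 300.0),
    Row("AR", 2009, 200.0),
    Row("AR", 2010, 100.0),
]

# Hypothetical filter: test/dev only needs fiscal year 2009 onward.
report = preview_subset(replica, keep=lambda r: r.fiscal_year >= 2009)
print(f"would keep {report['kept_mb']:.0f} of {report['total_mb']:.0f} MB "
      f"({report['reduction']:.0%} smaller)")
```
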
Storage Multiplier Effect (circa 2008)
Oracle 11i multiplier, starting from 1 TB of data (each layer adds to the running total):
- 5 TB: Prod, Splx, SBY, ACT, Bkup (running total 5 TB)
- 5 TB: Prod, Splx, SBY, ACT, Bkup, Mirror (running total 10 TB)
- 5 TB: Prod, Splx, SBY, ACT, Bkup, DR (running total 15 TB)
- 5 TB: Prod, Splx, SBY, ACT, Bkup, Mirror (running total 20 TB)
- 12 TB: Dev, Test, Training, Perf (running total 32 TB)
- 3 TB: Dev, Test, Training, Perf, etc., RAID (running total 35 TB)
- 28 TB: Backup of (Prod, Splx, Dev, Test, etc.) onto EDL with RAID (running total 63 TB)
- 90 TB: Tape backup of Prod (running total 153 TB)

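The running totals on this slide are a cumulative sum of the per-layer figures. A short sketch that reproduces them from the slide's own numbers follows; the shortened layer labels are paraphrased for brevity.

```python
from itertools import accumulate

# (layer, TB added) in the order the slide stacks them, bottom to top.
LAYERS = [
    ("Prod, Splx, SBY, ACT, Bkup", 5),
    ("Mirror", 5),
    ("DR", 5),
    ("Mirror", 5),
    ("Dev, Test, Training, Perf", 12),
    ("Dev, Test, Training, Perf, etc (RAID)", 3),
    ("Backup onto EDL with RAID", 28),
    ("Tape backup of Prod", 90),
]

# Cumulative footprint after each layer is added.
running = list(accumulate(tb for _, tb in LAYERS))

for (label, tb), total in zip(LAYERS, running):
    print(f"{tb:>3} TB  {label:<40} running total {total:>3} TB")
```
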
Data ILM Journey
- 2008: 153 TB. Tape backup (90), EDL backup (30), non-prod (12), disaster recovery (5), production (5)
- 2009: 64 TB. EDL backup (30), non-prod (9), disaster recovery (5), production (5)
- 2010: 40 TB. EDL backup (30), non-prod (9), disaster recovery (5), production (5)
- 2011: deduplication
- Levers: subsetting, decommission of 3 environments, elimination of tape backups, reduced backup retention, archiving
- Oracle 11i multiplier effect: 1 TB, 20 TB (deduplication, archiving, retirement)

Information Mgmt Governance

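The yearly totals on this slide can be turned into reductions relative to the 2008 baseline. The sketch below uses only the three totals that carry an explicit year (153, 64, and 40 TB); the 20 TB figure is left out because the slide does not clearly tie it to a year.

```python
# Total footprint per year, as quoted on the slide (TB).
TOTALS_TB = {2008: 153, 2009: 64, 2010: 40}

baseline = TOTALS_TB[2008]

# Fractional reduction of each year's footprint relative to 2008.
reductions = {year: 1 - tb / baseline for year, tb in TOTALS_TB.items()}

for year, tb in TOTALS_TB.items():
    print(f"{year}: {tb:>3} TB, {reductions[year]:.0%} below the 2008 footprint")
```

By these figures the footprint was a little under three quarters smaller in 2010 than in 2008.
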
However, Impact Will Be Limited Without Enterprise Data Governance
- Customer lifecycle process: lead generation, lead management, opportunity management, order to cash, service and support
- Product lifecycle process: feasibility, design, qualification, general availability, end of life
- Other processes (partner, vendor, etc.)
Data governance:
1. Define: consistent data definitions across a single process (over multiple geos and functions)
2. Integrate: consistent data integration across multiple processes, driving enterprise-wide analytics and insights
3. Sustain: ongoing adjustment of business rules, data cleansing, and sourcing

Data Governance Best Practice
- Data is an enterprise asset and should be governed and secured at the enterprise level
- Data domains: CRM (customer accounts, partner accounts), HR (employee, contractor), ERP (product, item, orders), MDM, Eng/Svc, Business Intelligence (product quality, total customer experience), Other (shadow, NDA, personal)
- Role-based access; compliance & reporting
- Business ownership of data has to come top-down from the highest executives
- IT is a key enabler, but not the owner
- Business users are the data stewards and content architects

www.emc.com/emcit
EMC IT Journey to the Private Cloud: A Practitioner's Guide
http://www.emc.com/collateral/software/white-papers/h7298-it-journey-private-cloud-wp.pdf

Q&A

THANK YOU