IBM Tivoli Storage Productivity Center Reduce the cost and complexity of storing vast amounts of data Ronan MAUJEAN ronan.maujean@fr.ibm.com Information Infrastructure SME
Why Storage Resource Management? Volume of Data Accelerated, uncontrollable, unplanned growth in file and database data. Data is growing exponentially. IDC estimates continued 60% yearly growth of new disk PB shipped Variety of Information Information Technology holds the promise of bringing a variety of new types of information to the people who need it Service Level Expectations Business can t afford lack of availability for end users. Problems have to be avoided or solved quickly Reduced Staff and/or IT budget ROI and Cost reductions are high-priority expectations Velocity of Change IT organizations are under tremendous pressure to deliver the right IT services with flexibility. And as a result, 85% of problems are caused by IT staff changing something. 80% of problems not detected by IT staff until reported Complex heterogeneous environment Resulting from mergers and acquisitions, legacy hardware, technology evolution and purchases from different vendors, environments are complex and difficult to manage 2
Questions on the Information Infrastructure How much storage do I have available for my applications? How do I know the storage is not the bottleneck for user response time issues? How is my storage being used? When do I need to buy more storage? How do I maintain storage service levels? How well are my storage systems performing? How reliable is my SAN? How do I simplify and centralize the management of my storage infrastructure? How do I monitor and centrally manage my replication services? How can I automate the provisioning of my storage systems, databases, file systems and SAN? Which applications, users and databases are the primary consumers of my storage? Is the storage infrastructure available and performing as needed? Which files must be backed up, archived and retained for compliance? 3
Service Delivery Improving Service levels, beating Expectations Visibility Improve visibility of storage usage and data demands to enable better forecasting Control and Automation Increase control and automation of storage processes to improve cycle time and reduce errors Flexible Delivery Options Support flexible delivery options, such as cloud and managed services, to improve services levels Identify up to 15% of allocated storage for reclamation and immediate reuse Accurately predict capacity and performance requirements Increase administrator productivity up to 2X by simplifying disk provisioning Respond more quickly to business changes Improve flexibility and standardize service by moving selected processes and workloads to a secure external cloud environment 4
Methodology of an efficient Storage Resource Management End to End Storage Infrastructure Monitoring Easy access to all the configuration information Control Availability For Migrations, Upgrades, Troubleshooting Performance Management Establish a baseline Control environment with alerts and regular reports Problem determination Place data depending on workload Anticipate future needs Capacity Management for Files and Databases Monitor with alerts and regular reports to prevent out-of-space conditions and anticipate new needs Identify free space to create new volumes Gain Space thanks to Automatic Data Analysis and Usage by users Chargeback A configuration tool All the element managers in a single tool Application of SAN best practices Recommendation Maintain and Increase Service Level Agreements Reduce Costs Facilitate Administration Reduce new hardware purchases Gain time Make your life easier and your results better 5
IBM Tivoli Storage Productivity Center Enable end-to-end storage management with a single tool Extends storage configuration management across the SAN Centralizes management of storage Improve storage utilization, performance and service levels SAN Topology end to end views and management Reduce storage complexity to make your team more productive Storage Reporting across host file systems, data bases and storage Correlation to host usage Ties to Tivoli Storage Manager for complete information life cycle management Based on SMI-S Protocol 6
Storage Management Value Progression TPC TPC Basic Edition TPC For (SSPC) Disk Standard Edition TPC Standard Edition with SERP SSPC and TPC Basic Edition adds Host discovery Fabric zoning for all switches GUI Enterprise security TPC Basic Edition provides the following capabilities: Discovers your storage infrastructure Draws a physical and logical topology map and I/O data path Monitors health/status, Manages alerts/events Reports on device and host capacity utilization TPC for Disk adds performance management for your disk storage devices Monitors near realtime performance statistics for disk systems Reports historical performance trends to help diagnose problems Exports report data for offline processing TPC Standard Edition enables end-to-end heterogeneous storage resource management with advanced analytics Monitors near real-time performance statistics for end-to-end I/O paths Identifies I/O hot spots and recommends data placement for optimized throughput Automates workflow-like provisioning for the endto-end storage infrastructure Manages configuration change history Analyzes file system and file-level capacity IBM Storage Enterprise Resource Planner (SERP) correlates storage resource information with asset management data Integrates TPC information with heterogeneous SRM tools for enterprisewide reporting Correlates with asset management ownership and location data to provides information for chargeback Reports on storage infrastructure power consumption 7 Simple license file upgrade, no new code to install
Storage Management Value Progression TPC Basic Edition (SSPC) TPC For Disk TPC Standard Edition TPC Standard Edition with SERP SSPC and TPC Basic Edition adds Host discovery Fabric zoning for all switches GUI Enterprise security TPC Basic Edition provides the following capabilities: Discovers your storage infrastructure Draws a physical and logical topology map and I/O data path Monitors health/status, Manages alerts/events Reports on device and host capacity utilization TPC For Replication TPC for Replication adds copy services management for IBM disks Practice multi-site disaster recovery procedures Detect mirroring failure and automate failover Automate fail-back after repair Supports FlashCopy, Metro Mirror, and Global Mirror configurations Optional Three Site configuration for IBM DS8000 storage devices Simple license file upgrade, no new code to install 8
TPC Architecture Design Overview TPC for Replication (zos only) Productivity Center For Data Standard Edition Basic Edition IBM Tivoli Storage Productivity Center Productivity Center For Disk Productivity Center For Replication Two Site BC Three Site BC API s for Storage Management Applications Device Server Performs discovery Real time availability, perf. monitoring Configure the SAN and storage devices Data Server Common Schema and Database Time-based data collection engine Collects data on a periodic basis for capacity and performance analysis purposes. (performance data collection requires TPC for Disk and/or Fabric feature(s)) GUI Common Graphical User Interface for all functions New highly scalable topology display and status display. Display and configuration of SMI-S devices COMMON HOST AGENT FRAMEWORK CIMOM VMWare Virtual Infrastructue Interface Subordinate TPC server(s) and RM Server(s) SNMP Storage Resource Agent TPC 4.1 Managed System Managed System 9
End to End Monitoring 10
End-to-End Monitoring All the configuration in a single tool Which Storage Subsystem is providing disks to which server? Which physical disks are behind? What composes my storage subsystem? How is done the path, which ports are used? Inventory of the assets For Files and databases For virtual machines To go to next level of management: reduce cost, facilitate work, increase SLA Migrations Upgrades Troubleshooting Understand configuration quickly Do I have a potential bottleneck? Configuration History 11
Storage Area Network Discovery Discover and manage multi-vendor devices in a single view View connectivity and relationships for physical and virtual storage resources Context-sensitive menus provide instant access to common functions 12
Virtualization Support Topology views show the relationship between the physical computers and virtual machines VMware VMWare ESX system IBM SAN Volume Controllers IBM AIX VIO environments VMWare Virtual Computers Reports account for information reported by the physical computers and the virtual systems totals adjusted to show what is really available 13
Storage Resource Agent New in 4.1 Collect asset and capacity data from supported servers without a resident agent TPC Server Windows AIX Linux Lightweight native code remotely pushed to servers for data collection and removed at completion Improved performance of agent data collection with minimal server impact Simplifies server data collection and reduces administrator burden of managing and monitoring agents 14
Understand environment and monitor with reports All the information in a single place, made easily accessible 15
Understand environment and monitor with end-to-end topology view Asset Reporting for servers, SAN switches and storage subsystems View connectivity and relationships for physical and virtual storage resources Automatic Discovery and management of multi-vendor devices in a single view 16
Monitor Availability of components At the Server level Ping At the Switch level Error Frame, Error rate, port down At the Storage level Controller status, port down Go to next level of Storage Management: Increase SLA 17
Policy-based Group Management Group resources by application, line of business, or any other userdefined way to better manage the infrastructure Incorporate policies, view status, and manage alerts for all elements of the group Manage complex infrastructures more efficiently by grouping resources and viewing status at highest levels to ensure application and resource availability 18
Health Impact Analysis Reports Real-time Monitoring End-to-end view of the entire storage path View performance data for components and the entire path Discover Bottlenecks Reduce time to problem isolation and resolution Computer Data path Fabric Storage device Performance Volume Tabular Data Path details 19
Configuration Change Analytics & Auditing New configuration change management feature allow IT administrator to track, audit, compare and contrast current configurations with previous configurations Automatic or manual detection of changes can be accomplished based on user policies At-a-glance views of all topology information is available with popouts providing detailed change information Allows quicker time to problem isolation -Device removed Change summary Helps minimize potential outage impacts from configuration changes 20
Performance Management 21
Performance Management Establish a baseline Control your environment with alerts and regular reports For example: Response time IO rate Data Rate Top Volumes performance Top Switch Port Data Rate Performance Problem Determination At the SAN Switch level At the Disk level Configuration History Place data depending on workload to increase performance and reduce costs Identify workloads, top volumes to increase Performance Right-Tiering with existing and future data (SAN planner, Storage Optimizer, STAR) To go to next level of management: reduce problems and downtime, increase SLA, reduce costs, facilitate work 22
Get detailed reports on performance metrics and trends TPC monitors and captures critical performance data disk subsystem ports, subsystem arrays, disk volumes for throughput, I/O and cache rates and response times. Inspect performance history switches and ports for throughput rates And tracks historical performance Administrator gets trend analysis to quickly identify possible application workload contention in the storage array Configuration History 23 Place your cursor over the event for more details
Get detailed reports on performance metrics and trends In context Display current performance metrics for a storage subsystem 24
Get detailed reports on performance metrics and trends In context Drill down and display current performance metrics for individual volumes View volume performance at-a-glance 25
Implement Alerts to anticipate problems and new needs Prevent Performance Problems and anticipate new needs by defining condition to be respected Establish performance thresholds in storage systems and SAN fabrics in order to improve performance and service levels. Alerts are generated when thresholds are violated, indicating potential bottlenecks. Automatically monitor for performance thresholds and trigger notification if thresholds are violated. Establish availability conditions (pings, port down, etc) Trigger actions such as Event forwarding, SNMP Trap, Script execution, or email Define automated responses for selected event conditions 26
Capacity Management 27
Capacity Management Monitor Capacity Utilization to prevent out-of-space conditions and anticipate new needs for the entire infrastructure, by Storage subsystem, by Storage Pool, by LUN, by Filesystem, by Database, for each computer or for each storage system. Identify Free Space to create new volumes Gain Space With a view of what is stored (age, file types, largest files..) for a better archive management With quotas for a better control of usage by users For Databases too Chargeback Go to next level of Capacity Management to optimize Space Utilization, facilitate administration, reduce new hardware purchases, reduce costs. 28
Server Asset View Reduce the cost and complexity of storing vast amounts of data Detailed Asset reports about your servers 29
Storage Subsystems Asset View Detailed Asset reports about your storage sub-system configuration 30
Capacity Management Storage Asset & Capacity Reporting Monitors and analyzes capacity utilization by user, department, file system or database Predicts storage growth based on trend analysis Reports on file systems and database storage utilization - Table Space consumption - Data base allocated but unused space - Capacity planning for file system and database growth Ability to create chargeback invoices based on storage utilization Identify storage utilization and usage - At Risk file identification - Provide reports for storage migration - Provide reports for data classification to support Tiered Storage implementations Drive Tivoli Storage Manager to backup at risk files 31
Capacity Management Policy-based Actions Mark files for archive to reclaim wasted space and improve storage utilization Execute scripts based on trigger conditions from received events Drive Tivoli Storage Manager to archive or delete obsolete files or move to other media (tiered storage) 32
Capacity Management Charge Back Analysis Report cost of Storage to the consumers Allow users to be aware of and accountable for their usage Track by: User or Group of Users Computer Database tablespace Integrates with Tivoli Usage and Accounting Manager for comprehensive charge-back reporting 33
Configuration Tool 34
A configuration tool Create, allocate & provision with automatic analysis and recommendations based on workloads LUNs via an easy-to-use wizard interface Assign host ports Assign volumes to subsystem ports Create/assign fabric zone Define RAID level Create/delete volumes Automatic analysis to apply SAN best practices and alerts can be applied to all fabrics, a single fabric, or a single zoneset All the different element managers accessible in a unique window Facilitate administration, gain time, optimize performance and resource utilization, Provide a better answer to business SLA 35
Policy-based Provisioning Integrated SAN Planners Policy and performance based guidance in configuring subsystem volumes and assigning the volumes to hosts. Extension of the Volume Volume Provisioning Advisor (VPA): Includes the DS8000 and DS6000 controllers. User can specify the controllers to find suitable storage space. User to specify controller types, usage of multiple controller types and RAID level. User can override portions of the recommendation and determine if the result matches the requirements. This fabric, computer and subsystem were selected from the topology viewer 36
Storage Performance Optimizer Update Non-intrusive analysis Reports & Recommendations Provides Out-of-band storage optimization engine that lays out a process to optimize storage allocation, migration, and consolidation Automatically identifies hot spots and provides recommendations to improve disk efficiency & end-user application availability Can help reduce service-level response times of resource-constrained applications by an average of 48% up to 90% Output Supports DS8000, DS6000, DS4000, DS5000 & SVC to enable migration of storage pools to optimize data throughput Note: Support is only available for Fibre Channel disk drives 37
Configuration Analysis New monitoring & notification features that provide configuration validation based on best-practices policies Best Practices & Security Best Practices & Security Automated or manual analysis can be applied to all fabrics, a single fabric, or a single zoneset Easy identification: Policy violators are flagged and displayed in the topology viewer Security events notify IT administrator of zone security violations Multiple methods for dynamic alerting; SNMP traps, tech events, emails 38 Customize your own best practices policies via an xml format (simple language)
TPC Answers These Questions What are the current performances? What is the current utilization (allocation)? Where storage is growing? How will growth be accurately forecasted? How much non-essential data is being stored? Which systems to migrate to what tiers? How can storage inventories be kept up-to-date? How much downtime is storage-related? How will storage policies be audited or enforced? 39
IBM Tivoli Storage Productivity Center An open storage infrastructure management solution designed to: Centralize, single point of management and control of storage infrastructure (disk, data) providing asset, capacity, performance and availability management Plan Discover and Configure Designed to improve administrator efficiency & storage utilization Provides analytic reporting on performance impacts and configuration changes Helps reduce the effort and complexities of managing large-scale multi-vendor heterogeneous environments Problem Isolation Analytic reporting Monitor and Automate 40
41