Webinar: Data Center Capacity Management
David Cuthbertson, Director, Square Mile Systems Ltd
david.cuthbertson@squaremilesystems.com
www.squaremilesystems.com

Capacity Management - Why?
1. Standards (TIA-942) have been referenced for the design, implementation and testing of data centers
2. Good components have been chosen, which rarely go wrong
3. So why does the data center sometimes run out of capacity, or delay installation of critical kit?

Square Mile Background
Develop AssetGen toolsets, training and techniques for operational management of complex IT infrastructure
Focus areas
- Connectivity management
- Data center management
- Service and CMDB mapping
- System change impact analysis
- Documentation techniques
- Infrastructure visualisation
Use Visio automation as both a diagramming and a reporting tool
Infrastructure layers (diagram)
- Business Processes: Departmental, Company
- Services: End user, infrastructure, supplier
- Applications: PC, server, mainframe, SOA
- Virtual Infrastructure: Network, Servers, Storage, DBMS
- Hardware Infrastructure: Network, Servers, UPS, Storage, Other
- Fixed Infrastructure: Cabling, Power, Cabinets, Buildings

Best Practices Training
- Practical Data Centre Management (2 days)
  Managing the facility and external teams (ITIL, ISO27001, BS25999)
- How to Map Services and Systems (1 day)
  Communicating change / incident impact and dependencies (ITIL)
- Visio for IT Professionals (2 days)
  Training course on Visio diagramming and automation techniques
- Creating and Maintaining Visio Infrastructure Diagrams
  One-day in-house workshop on Visio diagram automation

Changing Requirements (Device)
The server requires
- 1 power connection
- 1 or 2 network cables
- 1 KVM connection

Today's Requirements (Device)
Upstream devices (diagram): Core Switch, SAN Array, Power PDU, Other
Connections per server
- Edge Switch: 1-6
- SAN Switch: 0-4
- Power Strip: 1-6
- KVM / Video: 1-4
Server total: min 3, max 20+

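As a quick check of the totals on this slide, the four connection ranges can be summed. This is a minimal sketch that assumes the ranges map to the device labels in the order they appear on the slide (edge switch 1-6, SAN switch 0-4, power strip 1-6, KVM/video 1-4); the dictionary and function names are illustrative only.

```python
# Per-server connection ranges as (min, max).
# Assumption: ranges are matched to the device labels in slide order.
CONNECTION_RANGES = {
    "edge_switch": (1, 6),
    "san_switch": (0, 4),
    "power_strip": (1, 6),
    "kvm_video": (1, 4),
}


def connection_totals(ranges):
    """Return (minimum, maximum) total connections for one server."""
    lowest = sum(low for low, _ in ranges.values())
    highest = sum(high for _, high in ranges.values())
    return lowest, highest


min_total, max_total = connection_totals(CONNECTION_RANGES)
print(f"Connections per server: min {min_total}, max {max_total}")
```

Under that mapping the totals come to 3 and 20, in line with the slide's "Min 3, Max 20+".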
Changing Requirements (Environment)
- Density
  More servers in a rack (42 1U servers = 84 network cables)
  More ports on a switch
- Cabling technology
  Cat6A cable takes up more space, fibre used for SAN
- Power
  Multiple power inputs (HP blade server up to 6) and loading
- Cooling
  Must keep within cooling limits and not block airflow
- Risk
  One box may have multiple servers, so errors are multiplied

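The density figure above is simple multiplication, but it helps to make it explicit when planning cable routes. A minimal sketch, assuming every device in the rack has the same number of connections; the function and parameter names are illustrative, and the rack of four blade chassis in the second call is a hypothetical example using the slide's upper bound of six power inputs.

```python
def cables_per_rack(devices: int, cables_per_device: int) -> int:
    """Total cables of one type that must be routed into a rack."""
    if devices < 0 or cables_per_device < 0:
        raise ValueError("counts must be non-negative")
    return devices * cables_per_device


# Slide example: 42 x 1U servers with 2 network cables each.
network_cables = cables_per_rack(42, 2)

# Hypothetical rack of 4 blade chassis with 6 power inputs each.
power_cords = cables_per_rack(4, 6)

print(network_cables, power_cords)
```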
Changing Requirements (Business)
- Centralised planning of changes
  Local knowledge of connectivity resources required
- Reduce the costs of change
  Less manual effort, optimise use of hardware assets
- Faster turnaround of tasks
  Use of workflow & virtual systems (VLAN, storage, servers)
- Out-tasking physical changes
  Formal instructions / work orders to 3rd parties
- Reduce risk of disruption
  More resilience and alternate paths required

Why Manage Capacity?
The Data Center team can't predict hosting and other requirements!
- Highlight investment required
- React to changes
Diagram labels: Business, IT Computing Needs, Infrastructure, Data Center

How Are Most Data Centers Managed?
- Informal / formal processes
  Site survey, pre-installation checks, audits
- Ownership is often assigned locally
  Create knowledge sets as individuals or within teams
  MS Office - Excel, Visio, Word, Notes, Sharepoint, Access
- Specialist toolsets
  AssetGen, Aperture, Nlyte and others
- Or give the problem to someone else
  Host, outsource, out-task

Capacity Management in Practice
1. How you decide where to put equipment
2. When to say no (or yes)
   - Exceeds technical design or operational limits
   - Doesn't conform to the capacity management plan
   - Not optimal use of available resource
3. Establishing authority and ownership
   - Allocation of resources and funding
   - Decommissioning and moving
4. Confidence in service provision
   - Everything is working within design limits
   - Failover or resilience will work as required
(Image: HP ProLiant DL380 G5)

Practical Issues
- Who owns the problem of creating and maintaining an end-to-end data centre capacity management system?
  Facilities? IT Data Centre teams? Platform teams? Service management? Development teams?
- Where do you start?
  People, Process, Toolsets
- Is this going to be solved by ITIL configuration management and a CMDB? NO!

Capacity Management - What do I Mean?
- Demand Management
  Requests / Reservation, Reviews
- Performance Management
  Monitoring & Analysis, Tuning / Optimising
- Modelling
  Placement options, current limits
  Heat flow, resilience, fail over
- Capacity Communication
  Capacity plan, energy usage
  Reporting (Current and Future)
- Workflow System
  Life cycle controls, Resource management
- Monitoring Systems
  Monitoring of current state, Alerting
- Infrastructure Documentation
  Space, Power, Cooling, Connectivity, Inventory, Projects

Allocating a Server Location
- Building, Room, Row, Rack or tile
- Height
- Weight
- Power
- Network connectivity
- SAN connectivity
- Heat
- Cost

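The placement factors listed on this slide can be treated as a set of per-rack checks. The following is a minimal sketch, not a description of any particular tool: the `Rack` and `ServerRequest` classes, their field names and all the example figures are hypothetical, heat is approximated by assuming that electrical power drawn equals heat to be removed, and cost is left out because it depends on local charging models.

```python
from dataclasses import dataclass


@dataclass
class Rack:
    name: str
    free_u: int             # free rack units (height)
    free_weight_kg: float
    free_power_w: float
    free_lan_ports: int
    free_san_ports: int
    free_cooling_w: float   # heat the rack position can still remove


@dataclass
class ServerRequest:
    height_u: int
    weight_kg: float
    power_w: float
    lan_ports: int
    san_ports: int


def shortfalls(rack, req):
    """List every resource the rack cannot supply for this request."""
    checks = [
        ("height", rack.free_u, req.height_u),
        ("weight", rack.free_weight_kg, req.weight_kg),
        ("power", rack.free_power_w, req.power_w),
        ("network", rack.free_lan_ports, req.lan_ports),
        ("san", rack.free_san_ports, req.san_ports),
        # Assumption: power drawn is treated as heat to be removed.
        ("heat", rack.free_cooling_w, req.power_w),
    ]
    return [name for name, available, needed in checks if needed > available]


def candidate_racks(racks, req):
    """Names of racks that satisfy every check."""
    return [rack.name for rack in racks if not shortfalls(rack, req)]


# Hypothetical example data.
racks = [
    Rack("01-03", 10, 300, 2000, 4, 2, 2000),
    Rack("01-04", 1, 300, 2000, 4, 2, 2000),
    Rack("02-05", 6, 200, 300, 8, 0, 3000),
]
req = ServerRequest(height_u=2, weight_kg=25, power_w=450, lan_ports=2, san_ports=2)

print(candidate_racks(racks, req))
for rack in racks:
    print(rack.name, shortfalls(rack, req))
```

Returning the list of shortfalls, rather than a plain yes or no, gives the requester a reason when a location is refused.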
Define Your Provisioning Lifecycle
- Business: Business Requirement, Proceed Authorised
- Projects: Budgetary Response, Detailed Plans, Project Start
- Data Center: Log Request, Placement, Build Phase, Project Completion, Update all systems

Define the Roles/Responsibilities
- Activities (columns): Design, Install, Change, Maintain, Monitor, Document
- Infrastructure (rows): Power, Cooling, Racks, Cabling, Servers, Storage, Network, Mid-range

Define The Process Flow
Swim chart
- Lanes: Projects, Design Team, Deployment Planning, Build Team, Change Mgmt
- Steps and documents: Request, Request form, Std Components (Yes / No), Design Review, Outline Design Doc, Forward booking schedule, Book Contractor, Update Capacity Plan, Allocated dates, Deployment design, QA Check, Change from reserved to allocated, Accept/Reject, Release, Handover, Detailed Plan, Schedule Change, Confirm Contractors

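The "Change from reserved to allocated" step in the swim chart implies that each unit of capacity carries a status that only moves along agreed paths. A minimal sketch of that idea follows. The state names beyond "reserved" and "allocated", the transition table and the `CapacitySlot` class are assumptions made for illustration, not part of the process shown on the slide.

```python
from enum import Enum


class SlotState(Enum):
    FREE = "free"            # assumed starting state
    RESERVED = "reserved"
    ALLOCATED = "allocated"


# Assumed transition rules: a rejected reservation and a decommissioned
# allocation both return the slot to FREE.
ALLOWED = {
    (SlotState.FREE, SlotState.RESERVED),       # request booked into the plan
    (SlotState.RESERVED, SlotState.ALLOCATED),  # design accepted
    (SlotState.RESERVED, SlotState.FREE),       # design rejected
    (SlotState.ALLOCATED, SlotState.FREE),      # equipment decommissioned
}


class CapacitySlot:
    """A rack position whose status changes only along allowed paths."""

    def __init__(self, rack, position_u):
        self.rack = rack
        self.position_u = position_u
        self.state = SlotState.FREE

    def move_to(self, new_state):
        if (self.state, new_state) not in ALLOWED:
            raise ValueError(
                f"{self.rack} U{self.position_u}: "
                f"{self.state.value} -> {new_state.value} is not allowed"
            )
        self.state = new_state


slot = CapacitySlot("01-03", 12)
slot.move_to(SlotState.RESERVED)
slot.move_to(SlotState.ALLOCATED)
print(slot.state.value)
```

Refusing a direct jump from free to allocated forces every installation through the reservation step, which is what keeps the capacity plan current.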
CFD Heat Flow Modelling

Making Capacity Management Easier
- Develop common processes and standards for changes
  - Naming and labelling
  - Easier project / operations handover
  - Reservation of capacity and connectivity in operational systems
- Focus on reducing the number of data sets
  - Spreadsheets
  - Databases
  - Diagrams
  - Documents
  - Monitoring systems
  - Workflow tools

AssetGen Techniques
Coordinated database, multiple viewpoints, automated diagramming
- Data covered: Services, Software, Servers, Storage, Cabinets, Networks, Cabling, Power, Voice
- Reports: Capacity and connectivity reports, change impact analysis and audit trails
- Excel: Spreadsheet outputs
- Visio: Service and architecture diagrams
- Visio: LAN/SAN/WAN/Power diagrams
- Visio: Rack and floor plans

Visio Diagrams From AssetGen
- Diagram types shown: Rack Position, Service impact, Floor Plan, H/W Build, Power Supply, Network Connections
- Example chassis: BLADE_BIRM01
- Switches: BLADE-BIRM01.BLADE-SW1, BLADE-BIRM01.BLADE-SW2
- Blades: UK BIRM01_BLADE-01, UK BIRM01_BLADE-02, UK BIRM01_BLADE-03, UK BIRM01_BLADE-04, UK BIRM01_BLADE-05, UK_BIRM01_BLADE-09, UK_BIRM01_BLADE-10, UK_BIRM01_BLADE-12

Data Graphics - Rack Function
Colour coding
(Figure: floor plan of racks 01-01 to 03-10, each colour coded by function: Network, Server, Storage, Mixed or Unallocated)
Visio 2007 linked to structured data

Data Graphics - Racks by Power Feed
Bar chart
(Figure: floor plan of racks 01-01 to 03-10, each with bars comparing Breaker and Actual values per power feed)
Visio 2007 linked to structured data

Data Graphics - Racks Exceeding Power
Icons show where power / cooling loads exceed design limits

Rack:   01-01  01-02  01-03  01-04  01-05  01-06  01-07  01-08  01-09  01-10
Value:   1095   2855   3705   5655   2755   2055   7455   1705   1455   7055

Rack:   02-01  02-02  02-03  02-04  02-05  02-06  02-07  02-08  02-09  02-10
Value:   1095    505   2205   1005   7105   7205   1555   2255   2505   7105

Rack:   03-01  03-02  03-03  03-04  03-05  03-06  03-07  03-08  03-09  03-10
Value:   2000      0    350    350      0      0      0    350      0      0

Visio 2007 linked to structured data

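The icons on this slide amount to a threshold test per rack. A minimal sketch of that test follows, using the per-rack figures from the slide. Several things here are assumptions: the figures are treated as watts, the last ten numbers on the slide are taken to belong to row 03, and the 5000 W design limit is a hypothetical value chosen only to make the example run.

```python
# Per-rack figures read from the slide, one list per row of racks.
# Assumptions: values are watts; the final ten figures belong to row 03.
ROWS = {
    "01": [1095, 2855, 3705, 5655, 2755, 2055, 7455, 1705, 1455, 7055],
    "02": [1095, 505, 2205, 1005, 7105, 7205, 1555, 2255, 2505, 7105],
    "03": [2000, 0, 350, 350, 0, 0, 0, 350, 0, 0],
}

# Build rack names such as "01-04" from the row and the position in the row.
RACK_LOAD = {
    f"{row}-{position:02d}": load
    for row, loads in ROWS.items()
    for position, load in enumerate(loads, start=1)
}

DESIGN_LIMIT_W = 5000  # hypothetical design limit, not taken from the slide


def racks_over_limit(loads, limit):
    """Return the names of racks whose load exceeds the limit, sorted."""
    return sorted(rack for rack, load in loads.items() if load > limit)


over = racks_over_limit(RACK_LOAD, DESIGN_LIMIT_W)
print(over)
```

With a real design limit in place of the hypothetical one, the same list could drive the icon shown against each rack.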
Techniques Used by AssetGen
- One update for a component affects
  - Inventory
  - Connectivity
  - Capacity reports - rack space, LAN, SAN, Power
- One repository to collate technical and commercial data

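The "one update" principle can be illustrated with a single list of component records from which every report is derived. This is a sketch of the general idea only and does not represent AssetGen's data model; the record fields, component names and figures are all hypothetical.

```python
from collections import defaultdict

# One record per component; every report below is derived from this list.
# All names and figures are hypothetical.
components = [
    {"name": "srv-01", "rack": "01-03", "height_u": 2, "power_w": 450, "lan_ports": 2, "san_ports": 2},
    {"name": "srv-02", "rack": "01-03", "height_u": 1, "power_w": 300, "lan_ports": 2, "san_ports": 0},
    {"name": "san-01", "rack": "01-08", "height_u": 4, "power_w": 800, "lan_ports": 2, "san_ports": 4},
]

CAPACITY_FIELDS = ("height_u", "power_w", "lan_ports", "san_ports")


def inventory(records):
    """Inventory view: which component sits in which rack."""
    return sorted((rec["name"], rec["rack"]) for rec in records)


def capacity_by_rack(records):
    """Capacity view: rack space, power, LAN and SAN ports used per rack."""
    totals = defaultdict(lambda: dict.fromkeys(CAPACITY_FIELDS, 0))
    for rec in records:
        for field in CAPACITY_FIELDS:
            totals[rec["rack"]][field] += rec[field]
    return dict(totals)


before = capacity_by_rack(components)

# A single update: srv-02 moves to another rack.
components[1]["rack"] = "01-04"

after = capacity_by_rack(components)
print(before["01-03"]["power_w"], after["01-03"]["power_w"], after["01-04"]["power_w"])
print(inventory(components))
```

Because both views read the same records, the move shows up in the inventory and in the capacity figures without a second data set to reconcile.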
Capacity Management Review
- Demand Management
  Requests / Reservation, Reviews
- Performance Management
  Monitoring & Analysis, Tuning / Optimising
- Modelling
  Placement options, current limits
  Heat flow, resilience, fail over
- Capacity Communication
  Capacity plan, energy usage
  Reporting (Current and Future)
- Workflow System
  Life cycle controls, Resource management
- Monitoring Systems
  Monitoring of current state, Alerting
- Infrastructure Documentation
  Space, Power, Cooling, Connectivity, Inventory, Projects

Webinar Summary
- A better understanding of capacity management issues and techniques
- Gained a brief understanding of Square Mile techniques and AssetGen technology
  - See the videos on the AssetGen web site
- Next webinars are
  - Integrating Excel/Visio with data centre management toolsets
  - The Easy Way to Build a CMDB
  - Spreadsheet Chaos! The problem of infrastructure documentation

Data Centre Capacity Management
David Cuthbertson, Square Mile Systems Ltd
david.cuthbertson@squaremilesystems.com
+44 (0)870 950 4651
www.squaremilesystems.com
www.assetgen.com