BlueArc unified network storage systems 7th TF-Storage Meeting Scale Bigger, Store Smarter, Accelerate Everything
BlueArc s Heritage Private Company, founded in 1998 Headquarters in San Jose, CA Highest Performing NAS Server in the Industry Proven in the most demanding C HPC environments 6 th Generation Product Patented Hardware Accelerated Architecture Doubled performance and scale with each release
Storage Challenges Everyone needs to: Increase Delivered Performance and Capacity Scale for breakthrough differentiation Control Complexity and Simplify Administration Connect and manage thousands of elements Automate administration and optimization with smarter tools Achieve Cost Effectiveness and Reliability at Scale Optimize for the right technology use in the right place Meet space, power and cooling constraints 3 2010 BlueArc Corp. Proprietary and Confidential
Administration Work Flow in Higher Education / Academic Research Control Costs Optimize and consolidate Manage complexity $$$ and simplify administration Theoretical Models Instruments, Experiments Historical data Exponential Growth Combining new and existing data for analysis Many Researchers That Collaborate Information needs to be shared quickly without problems Long-term Archive Permanent Storage Campus / Worldwide Researchers Expedite Analysis Need to match performance of faster compute cluster Computational Cluster with Workspace Storage 4 2010 BlueArc Corp. Proprietary and Confidential
BlueArc s Technology Framework - Removes Data Storage Barriers Platforms SiliconFS File System Software Storage Ecosystem SAN 5 2010 BlueArc Corp. Proprietary and Confidential
BlueArc s File System: SiliconFS Common point of integration for all elements of a storage solution The central engine that manages all data movement Simultaneous access to native CIFS, NFS and iscsi Maps storage assets to applications, virtual clients or data life cycle Comprehensive virtualization tool set simplifies administration of file system Spans physical nodes, virtual servers and external devices Has advanced Metadata Optimization 6 2010 BlueArc Corp. Proprietary and Confidential
Metadata Optimization NEW! Cost-efficient performance is achieved by moving disk-intensive metadata ops to a high-speed storage tier Metadata operations can be 50% of all FS operations Lower cost SATA or NL-SAS for user data, 15K SAS or SSD for metadata Traditional file system all data on single storage Tier Multi tier file system Meta data and User data split Tier 1 High Speed SAS Disks Meta Data & User Data Meta Data Small Reads & Writes User Data Larger Reads & Writes Tier 0 High Speed SSD or SAS Disks Tier 1 Lower Cost NL-SAS or SATA Disks 7 2010 BlueArc Corp. Proprietary and Confidential
What is Metadata Focus on two types of metadata: Administrative metadata - This is essentially technical information about the data UNIX and/or Windows properties such as permissions/acls Ownership attributes Creation, modification and access dates and times Directories and their properties. Structural metadata - This is information about how the data is organized on the storage media. In the context of BlueArc SiliconFS this is object data as well as free space bitmaps and other object properties For more information about the structure and technology behind BlueArc file system see the SiliconFS whitepaper. 8 2010 BlueArc Corp. Proprietary and Confidential
Benefits of Metadata Optimization Nearly every file system operation requires metadata Maintaining fast access to this metadata has a direct impact on performance. BlueArc servers use dedicated Metadata caches with very aggressive caching algorithms Metadata Optimization helps with: Minimize impact of metadata cache miss: Accelerated service of cache requests Quicker access to metadata on fast storage Maximize writing metadata to storage: Faster metadata writes with more efficient checkpoint process Higher efficiency when writing metadata & data in parallel to different tiers 9 2010 BlueArc Corp. Proprietary and Confidential
Where will Metadata Optimization help? Metadata Optimization can benefit many different applications & workloads. Heavy metadata workloads with a lot of metadata cache miss Write workloads that are metadata intensive BlueArc Replication, Tape Backup, and Data Migrator Directory listings with cold cache (significant benefit on directories with many files) Aged file systems (fragmented free space can cause more metadata random write IO) 10 2010 BlueArc Corp. Proprietary and Confidential
Implementing Metadata Optimization A Metadata Optimized Storage Pool contains two tiers of disk LUNs are designated as either tier 0 or tier 1 when the pool is created Tier 0 is considered fast storage, such as SSD or SAS drives Tier 1 is slower and denser storage, such as NLSAS/SATA Metadata is nearly always required for file system operations, separating it onto fast storage can greatly improve the responsiveness of the server. Isolating metadata IO (particularly write IO) from data IO can also improve performance of tier 1 storage due to the reduced workload on the disks. 12 2010 BlueArc Corp. Proprietary and Confidential
SAN Platforms Storage Ecosystem Titan 3200 Platforms Titan 3100 Information Services Mercury 100 SiliconFS File System Mercury 50 2010 BlueArc Corp. Proprietary and Confidential 13
Platform comparison Mercury 50 Mercury 100 Titan 3100 Titan 3200 Product Class Lower Mid-range Mid-range High End Cluster Nodes 2 Up to 4 Up to 8 Max Storage Capacity 4 PB 8 PB 8 PB Max File System Size 256 TB 256 TB 256 TB NFS Throughput 700 MB/s 1100 MB/s 1200 MB/s Performance (Ops/Sec) 40,137 SPECsfs 2008 72,921 SPECsfs 2008 96,428 SPECsfs 97 High End Up to 8 16 PB 256 TB 1500 MB/s 194,909 SPECsfs 97 Storage Options Software / File Services Multiple options from several manufacturers are available with each platform All software and file system options (NFS, CIFS, iscsi) available 2010 BlueArc Corp. Proprietary and Confidential
Software Platforms SiliconFS File System Software Storage Ecosystem SAN 15 2010 BlueArc Corp. Proprietary and Confidential
BlueArc Software Data Management Data Protection Virtualization Intelligent Tiered Storage Policy Based Data Migration and Replication Support with External Storage Devices Dynamic Read Caching Active/Active and Campus Clustering Instant File System Recovery from Snapshots Synchronous and Asynchronous Replication Backup and Recovery iscsi block access Global Name Space Secure virtual storage servers Virtual Volumes Storage pools with thin provisioning 16 2010 BlueArc Corp. Proprietary and Confidential
DATA MANAGEMENT 17 2010 BlueArc Corp. Proprietary and Confidential
Intelligent Tiered Storage Cluster or Server Farm The seamless migration of data across storage tiers within a single infrastructure to ease management and reduce cost Network Storage Cluster SAN Tier 3 Deduplication Compliance Existing NAS Automatic and transparent data migration between tiers Rules based policy engine reduces manual intervention Third party or external storage devices as an integrated tier Tier 0 Solid State Cache Tier 1 High Performance Tier 2 High Capacity Reduce dependence on high performance tier for peak demand 18 2010 BlueArc Corp. Proprietary and Confidential
Dynamic Read Caching Read Caching Dynamically copies files to a high performance storage tier for use across physical or virtual servers during periods of peak demand. Copy1 Copy2 Copy3 Copy4 Aggregated bandwidth and faster response time for read intensive applications Less capacity for faster tiers required to maintain performance reducing cost Files can be cached from external devices within the namespace including de-dupe or stranded filers Fast Slow Policy based management eliminates manual intervention Copy Ideal for vertical markets with read intensive applications or unpredictable demand like ediscovery, Entertainment, Internet Services, and Life Sciences 22 2010 BlueArc Corp. Proprietary and Confidential
25 2010 BlueArc Corp. Proprietary and Confidential DATA PROTECTION
Clustering High Availability (HA) clustering Two-node Active/Active configuration or N-way (three to eight nodes) clustered configuration Scale one node at a time to quickly and flexibly grow with the business 26 2010 BlueArc Corp. Proprietary and Confidential
Snapshots Snapshots for point-in-time copies of the file system, policy based automation, manually, scripted or event driven File System Recover from Snapshots Rollback to intact file system even if live file system is inconsistent Depending on the size of the file system, rollback can be nearly instantaneous 28 2010 BlueArc Corp. Proprietary and Confidential
Snapshots Snapshot View #2 Cumulative History BlueArc snapshot methods use references to create a cumulative file system history with multiple views without data duplication Snapshot View #1 Live File System Highly efficient ~1 second granularity with no performance or storage penalty Object-based file system Hardware acceleration Instantaneous data recovery Can be integrated into data migration, replication, or backup policies 29 2010 BlueArc Corp. Proprietary and Confidential
File Based Replication Multi-stream, concurrent replication streams deliver high performance Leverages inherent snapshot and NDMP (backup and recovery) capabilities Replication of the file system, directory, virtual volume, or snapshot level for granularity Preserves user/group permissions and quota information Policy-based management and scheduling 30 2010 BlueArc Corp. Proprietary and Confidential
High Speed Replication Allows administrators to create one or more identical copies of data while keeping the source and targets synchronized. Complements existing file-based replication Disaster recovery Data archival ( remote snapshots ) Asynchronous replication over standard IP networks Server to server over LAN or WAN LAN-free replication if source and target are both connected to the same server Snapshot based file replication technology Object based for optimal performance Full or incremental copies Multiple concurrent streams Only copies changed blocks Preserves NFS and CIFS file attributes Preserves NFS handles Data compression over WAN (future) Automated rule based scheduler One time, recurring, continuous 32 2010 BlueArc, Corp. Proprietary and Confidential.
Backup LAN-free backup to tape via NDMP Integration with all major 3 rd party data protection suites 34 2010 BlueArc Corp. Proprietary and Confidential
35 2010 BlueArc Corp. Proprietary and Confidential VIRTUALIZATION
Virtualization Framework Virtual File System Global Name Space with single root Virtual File System unifies directory structure and presents a single logical view Virtual Servers Network Storage Cluster Multiple Virtual Servers per Node Virtual Servers allocate server resources for performance and high availability Virtual Storage Pools Storage Pool File System File System Storage Pool File System Virtual Volumes Multiple dynamic Virtual Volumes per File System Multiple File Systems Per Storage Pool Virtual Storage pools simplify storage provisioning for applications and workgroups Virtual Tiered Storage Parallel RAID Striping Virtual tiered storage optimizes performance, high availability and disk utilization across physical disk arrays 36 2010 BlueArc Corp. Proprietary and Confidential
Virtual File System - Global Namespace Company Geography Department CIFS and NFS Clients Creates a global namespace with a virtual unified directory structure for multiple file systems within a single Titan or across multiple Titans in a cluster. File System One File System Two Simplified Management Single, unified directory structure Logical, user friendly names Persistent after physical infrastructure changes Alter and migrate file systems without user disruption Universal access to data with NFS and CIFS support 37 2010 BlueArc Corp. Proprietary and Confidential
Secure Virtual Servers Security Policies Virtual Servers VLAN-1 Network-1 VLAN-2 VLAN-3 Network-2 VLAN-4 Company A Company B Company C Company D Mercury or Titan Cluster Enables single Titan/Mercury or clusters to span multiple network domains and assign different security policies to each Virtual Server Sets Security Profile by EVS Authentication Permissions Name services File System Access Control Very useful for multi-tenant environments or large enterprises with multiple independent departments 38 2010 BlueArc Corp. Proprietary and Confidential
Virtual Volumes Creates a logical sub-set of a file system independent of physical disk type, location or capacity. Enhanced Security Virus scanning, access based enumeration, auditing, file blocking, etc. Simplified Management Dynamic expansion or contraction via quotas or policies. Increased Storage Utilization Allows oversubscription at volume or file system level for thin provisioning Storage added as needed without over provisioning 39 2010 BlueArc Corp. Proprietary and Confidential
Virtual Storage Pools File System1 File System2 Logical Storage Pool RAID Sets File System3 Un-allocated Free Space Creates a logical storage pool of shared storage that one or more file systems and servers can pull from. Lower Capacity Costs Dynamically allocates storage to file systems and manages free space, optimizing disk utilization Simplified Management Virtualizes RAID sets Easier provisioning as file systems expand as needed by drawing from a common storage pool Improved Performance Increases throughput by aggregating the performance of multiple drives 40 2010 BlueArc Corp. Proprietary and Confidential
Storage Ecosystem Platforms SiliconFS File System Software Storage Ecosystem SAN 41 2010 BlueArc Corp. Proprietary and Confidential
Storage Ecosystem Clustering and Global Namespace Switched fabric SAN enables shared storage SAN Tier 3 De-dupe Archive / Compliance Competitive Filers Integrated third party storage devices Tier 0 Solid State Cache Tier 1 High Performance FC Disk SAS 15k Disk Tier 2 High Capacity SATA Disk Nearline SAS Disk Tier 4 Tape or VTL Archive Encryption Best in class, tiered storage optimized for different tasks 42 2010 BlueArc Corp. Proprietary and Confidential
BlueArc for Higher Education: Storage Solutions for Your Needs Exponential growth in data BlueArc scales to Petabytes of data, with easy upgrades Increasing pressure to reach conclusions faster High bandwidth cluster connection, automatic load balancing Many researchers that collaborate Stability and reliable high performance through crunch times Results available in single name space for easy sharing Control costs Automatic and policy based administration Consolidate and leverage existing storage systems Save on power and cooling with fewer systems 43 2010 BlueArc Corp. Proprietary and Confidential
Thank You! For more information: www.bluearc.com