The Oklahoma PetaStore: A Business Model for Big Data on a Small Budget
|
|
|
- Blake Hicks
- 10 years ago
- Views:
Transcription
1 The : A Business Model for Big Data on a Small Budget Patrick Calhoun, Petascale Storage Administrator Henry Neeman, Director OU Supercomputing Center for Education & Research (OSCER) University of Oklahoma Information Technology XSEDE [14] Wednesday July
2 Co-authors David Akin, OU Joshua Alexander, OU Brett Zimmerman, OU Fred Keller, OU Brandon George, OU 2
3 Outline We ll have time for these: Context Business Model Technology We might have time for this: User Interface We won t have time for these, but feel free to look at the slides on your own: Implementation Maintenance Sociology Please feel free to ask questions at any time. We like interacting. 3
4 Context
5 Overview (6) IBM System x Faculty & Staff 12 projects / 10 departments OSCER-RELATED FUNDING TO DATE: $259M total, $145M to OU DCS 9900 TS
6 Large Data Volume Choices I ve got tens of TB of data (or hundreds of TB or PB or ). Why can t I just buy a bunch of USB drives at my local big box store (or online)? 6
7 Large Data Volume Choices You can enter a NASCAR race on a riding lawnmower, but: you probably won t win; you probably will get killed
8 Why Not Roll-Your-Own? If a research team s data sizes are small, roll-your-own is perfectly reasonable: USB disk drives are cheap: 4 TB USB 3.0 = $139 (pricewatch.com 7/15/2014). Buy two and copy everything to both drives (getting user compliance on the secondary copy isn t necessarily trivial). Slightly bigger than that: can do a small, cheap RAID enclosure for mirroring or RAID6 (RAID5 probably isn t robust enough for large drives, given rebuild times), BUT: Price per TB starts going way up. Need much more expertise to configure and manage. Risk is higher because a failed system loses lots of data or buy two, doubling your costs. 8
9 Jargon Backup Nightly incremental (just the files that are new or have changed in the past 24 hours), AND Occasional full dump (every week, every month, whatever) Archive Write Once, Read Seldom if Ever 9
10 How Do Researchers Behave in the Wild? Territoriality Affordability No data management strategy Why? 10
11 Territoriality Some researchers like to hug their toys because they don t trust others (a) to provide shared resources to a large community, while simultaneously (b) serving each user s specific needs well (and at high priority)
12 Affordability Some researchers perceive roll-your-own as cheaper than a central resource even when it s actually more expensive because of non-obvious (non-hardware) costs. Space, power, cooling: rack-in-a-closet isn t plausible any more. Labor: requires expertise far beyond a typical grad student. Maintenance: not cheap, especially after 3 to 4 years. But they have to stretch their research funds as far as possible. 12
13 No Data Mgmt Strategy Some research teams store their research data is on a single hard drive in the PC under a grad student s desk, in which case: The faculty member doesn t know what format the data is in, where to find it, nor how to read it so when the grad student graduates, the data essentially becomes unuseable. May be rarely if ever backed up. 13
14 Why? Some researchers perceive their administrations (especially but not only central IT) as barriers to their progress, instead of partners in their progress. In some cases, this is based on direct negative experience and/or advice/anecdotes from colleagues. For some users, the bulk of their hands-on computing experience is with personal computing (PCs, laptops, tablets, phones), which typically are relatively straightforward to manage with tiny capital, labor and expertise cost (e.g., increase phone storage by inserting MicroSD card; install software with a few taps for a few dollars or free). Grad student labor is (relatively) cheap. At some institutions, faculty incentives are based on: graduating students, publishing papers, getting external funding NOT on having well-managed IT resources. 14
15 How to Be, and Seem, Cheaper? Distribute the costs among multiple entities. That way, no one has to bear the whole burden. Therefore, the cost for each becomes affordable. Find ways to leverage the funding to get other funding. 15
16 Business Model
17 OneOCII The is part of a statewide initiative known as the OneOklahoma Cyberinfrastructure Initiative, co-led by: University of Oklahoma (OU, Norman) Oklahoma State University (OSU, Stillwater) Langston University (Langston) Tandy Supercomputing Center (part of the Oklahoma Innovation Institute, a non-profit in Tulsa) OneNet (Oklahoma s research, education & government network) 17
18 OK PetaStore Technology Strategy Distribute the costs among a research funding agency, the institution, and the research teams. Archive, not live storage: Write once, read seldom if ever. Independent, standalone system; not part of a cluster. Spend grant funds on many media slots but few media (tape cartridges, disk drives). Most of the media that the grant has purchased have been allocated to the research projects in the proposal. Media slots are available on a first come first serve basis. Software cost should be a small fraction of total cost. Under the OneOklahoma Cyberinfrastructure Initiative, this is also true for academic institutions statewide (and also many non-academic institutions). Maximize media longevity. 18
19 Business Model Grant: hardware, software, 3 year warranties on everything Institution (CIO + VPR): space, power, cooling, labor, maintenance after the 3 year warranty period Researchers: media (tape cartridges, disk drives) Compared to roll-your-own disk, for researchers PetaStore tape is: cheaper more reliable less labor requires less training (~1 hour) can be faster (~200 MB/sec to write, ~140 MB/sec to read) Compared to roll-your-own disk, PetaStore disk is: more expensive, but otherwise like tape 19
20 Business Model Grant: hardware, software, 3 year warranties on everything Institution (CIO + VPR): space, power, cooling, labor, maintenance after the 3 year warranty period Researchers: media (tape cartridges, disk drives) Compared to roll-your-own disk, for researchers PetaStore tape is: Cheaper (33% cheaper per TB raw 7/15/2014) By the way, LTO-5 tape is also faster than SATA disk drives (140 MB/sec vs 25-50). LTO-5 has an unrecoverable read error ( bit rot ) rate of 1 in bits, compared to for SATA and for SAS. 20
21 NSF MRI Grant Acquisition of Extensible Petascale Storage for Data Intensive Research National Science Foundation grant no. OCI /1/2010-9/30/2013, no cost extension to 9/30/
22 NSF MRI Grant: Summary OU was awarded a National Science Foundation (NSF) Major Research Instrumentation (MRI) grant in It features 15 faculty and staff from 12 projects in 10 departments. We ve purchased and deployed a combined disk/tape bulk storage archive from IBM: the NSF budget paid for most of the hardware and software, plus warranties/maintenance for 3 years; OU cost share and institutional commitment pay for space, power, cooling and labor, as well as maintenance after the 3 year project period; individual users (e.g., faculty across Oklahoma) pay for the media (disk drives and tape cartridges). 22
23 Data Management Plans Beginning mid-january 2011, ALL proposals to the NSF had to have 2-page data management plans. (The plan could be an argument that no data management plan is needed). National Institutes of Health have a similar requirement. OSCER has worked with the Office of the VP for Research to create boilerplate text that includes a description of the PetaStore. This doesn t address issues such as metadata, provenance, etc, but it does cover physical data management. 23
24 Longevity The current PetaStore system will end-of-life roughly Faculty may not have funds for purchasing more media in PetaStore II. How to handle the tape? 24
25 Longevity Strategy PetaStore II has to be backward-compatible with PetaStore I, in the sense of allowing LTO, including LTO-5 and LTO-6 (could also allow non-lto, if desired). Tape cartridges are good for the earliest of: 15 years 5000 load/unload cycles 200 complete tape read/writes So far, only 6 tape cartridges (< 1%) are in danger of wearing out in less than 15 years. PetaStore II must include a couple of LTO-6 drives, which can read and write both LTO-6 and LTO-5. 25
26 Longevity Transition 1. Acquire PetaStore II, including a small amount of LTO-7 media. 2. Put PetaStore II into full production. 3. Put PetaStore I into read-only mode. 4. Copy a modest amount of data off tape cartridges on PetaStore I to PetaStore II. 5. Empty those PetaStore I tape cartridges. 6. Export them from PetaStore I and import them into PetaStore II. 7. Repeat steps 4-6 until all tape cartridges have been moved over. 8. Decommission PetaStore I. 9. May want to copy old data from new media to old media. 26
27 Technology
28 Hardware (6) IBM System x Faculty & Staff 12 projects / 10 departments OSCER-RELATED FUNDING TO DATE: $259M total, $145M to OU DCS 9900 TS
29 Hardware & Software Hardware Disk: IBM DCS9900 (rebranded DDN S2A9900) Tape: IBM TS3500 Software Disk: IBM s General Parallel File System (GPFS) Tape: IBM s Tivoli Storage Manager (TSM) 29
30 Hardware: Disk IBM DCS9900 (rebranded DataDirect Networks S2A9900) 2 controllers 20 enclosures of 60 disk drive slots each (1200 slots total) NOT EXPANDABLE Initially purchased 300 disk drives (minimum allowed) ~477 TB useable Currently at 530 disk drives ~842 TB useable Cost saving strategy: maximize the ratio of disk drive slots per controller (which are expensive) Peak speed 5.4 GB/sec, benchmarked at ~4 GB/sec (idealized test) faster than our then-cluster s parallel filesystem, similar to current cluster s speed Speed wasn t a goal: it s an archive! 30
31 Hardware: Tape IBM TS x LTO-5 tape drives 2859 tape cartridge slots Initially 100 tape cartridges, has grown to 960 so far Expandable to over 22,600 tape cartridge slots (over 55 PB at LTO-6 coming soon!) Planning to buy 2 x LTO-6 tape drives soon LTO-X can write LTO-(X-1) and read LTO-(X-2) 31
32 Software: Disk IBM s General Parallel FileSystem (GPFS) Charged per server, not per tape slot or per TB VERY VANILLA! NOTE: High Energy Physics has a separate Lustre partition (200 of the 530 disk drives). 32
33 Software: Tape Tivoli Storage Manager (TSM) Integrates well with GPFS. Originally designed for backups archive capability added on later. Not ideal to manage billions of files in an archive. Priced per server, and we have only 6 servers. Most tape software has one or both of the following: per-cartridge-slot activation upcharge; per-tb capacity charge. These charges would have wrecked the project. Not IBM s first choice for archive software. They d prefer us to use HPSS (common at national centers). But, HPSS s cost would have consumed the entire budget. 33
34 Tape Software Summary We chose a terribly risky software strategy because all of the alternatives to this high risk would have guaranteed failure. We got very lucky: It actually works! Configuring the software took weeks of hard labor, including a 2 week onsite intervention by an IBM expert. BUT: Now that we know how to do this, we can help others. 34
35 User Interface
36 Interface Methods Linux Shell on Computing Cluster SCP/SFTP (GUI or character terminal) GridFTP and GlobusOnline 36
37 Filesystem Layout Consistency All files belong in the directory tree /archive/ /archive/... contains the same data, regardless of interface Transparency Data redundancy and target media types are obvious in the path. Leverage existing skills Users LITERALLY use standard POSIX and GNU commands (Plus some optional supplemental commands) 37
38 Duplication Policies Comparable base path names to our computing cluster. /home/username; /scratch/username, /work/username, /work/project /archive/username/disk_1copy_unsafe /archive/username/disk_1copy_tape_1copy /archive/username/tape_1copy_unsafe /archive/username/tape_2copies project often substitutes for username The user has to choose the policy for each file (or collection of files), and has to type the implication of a dicey choice. 38
39 Offsite Copies These two policies can benefit from off-site copies: disk_1copy_tape_1copy tape_2copies Periodic export and reclamation Weekly SneakerNet from South Campus to OU IT s Disaster Recovery data center (~5 miles). In Oklahoma, natural disasters tend to be highly localized: Tornadoes (common) Flash floods (occasional) Ice storms (non-disruptive of storage, especially tape) Earthquakes, but nothing strong in the past 15+ years 39
40 Current Groups Using Currently, 26 research groups have capacity on the PetaStore (plus OSCER itself, which consumes 10% of the original disk space for a landing pad for files that are to be on tape only). Of these, 9 are from the grant proposal and 14 aren t. Footprints (as of Apr ): disk_1copy_unsafe: 11.5 TB disk_1copy_tape_1copy: 40.4 TB (per copy) tape_1copy_unsafe: 60.4 TB tape_2copies: TB (per copy) 40
41 Weird Constraints File Sizes: prefer GB, accept 1 GB Keeps the number of files manageable (avoid flakiness, excessive database traversal times, shoeshining ) Retrieval time ~4 minutes for 10 GB, ~22 minutes for 100 GB (excluding time pending in the queue until a drive is available, if any) File Types: Unless individual files are GB AFTER COMPRESSION, we prefer that they be zip files or gzipped tar files. Compression is a good thing. Replacing many small files with one big file is a good thing. NOTE: No autocompression when copying from disk to tape. 41
42 Background slides
43 NSF MRI Research Projects 1. Numerical Prediction and Data Assimilation for Convection Storms, Tornadoes and Hurricanes: Xue, Meteorology and Center for Analysis & Prediction of Storms (CAPS) 2. ATLAS Tier 2 High Energy Physics: Strauss, Skubic, Severini, Physics & Astronomy, Oklahoma Center for High Energy Physics 3. Earth Observations for Biogeochemistry, Climate and Global Health: Xiao, Botany & Microbiology, Center for Spatial Analysis 4. Adaption of Robust Kernel Methods to Geosciences: Trafalis, Industrial Engr; Richman, Leslie, Meteorology 5. 3D Synthetic Spectroscopy of Astrophysical Objects: Baron, Physics & Astronomy 6. Credibility Assessment Research Initiative: Jensen, Management Information Systems, Center for Applied Social Research 43
44 NSF MRI Research Projects 7. Developing Spatiotemporal Relational Models to Anticipate Tornado Formation: McGovern, Computer Science (CS), Interaction, Discovery, Exploration, Adaptation (IDEA) Lab 8. Coastal Hazards Modeling: Kolar, Dresback, Civil Engineering & Environmental Science (CEES), Natural Hazards Center 9. High Resolution Polarimetric Radar Studies Using OU- PRIME Radar: Palmer, Meteorology & Atmospheric Radar Research Center 10. Perceptual and cognitive capacity: Modeling Behavior and Neurophysiology: Wenger, Psychology 11. Multiscale Transport in Micro- and Nano-structures: Papavassiliou, Chemical, Biological & Materials Engr 12. Electron Transfer Cofactors and Charge Transport: Wheeler, Chemistry & Biochemistry 44
45 Implementation
46 Storage Admin Labor Cost How much labor does Patrick average per month? May 2011 (delivery) Feb 2011 (full production): ~140 hours per month (~80% FTE) Ongoing maintenance labor: ~9 hours per month Ongoing user training labor: ~75 hours per month 46
47 TSM Server Setup TSM servers are set up as follows: Vanilla RHEL Linux 5.7 (May upgrade to RHEL 6.2+) Can be configured either as a GPFS server (expensive) or GPFS client (cheap) guess which we picked... LTO tape drives in the TS3500 use the lin_tape kernel module lin_tape handles tape drive multipathing 47
48 GPFS Server Setup GPFS servers are set up as follows: Vanilla RHEL Linux 5.7 (May upgrade to RHEL 6.2+) Use Linux Device Mapper Multipath for GPFS NSD LUNs Each NSD is owned by one primary and one secondary GPFS Server. 48
49 Minor Components Required In addition to the DCS9900 and TS3500 SAN: 8 Gb FibreChannel SAN (campus FC backbone) GPFS Servers: 4 x IBM System x3650 TSM Servers: 2 x IBM System x3650 (Active-Passive) Separate Data and Management Networks: 10 Gb, GigE Client Systems: GPFS/NFS/SFTP 49
50 SAN Layout Dual (Redundant-path) Fabrics Separate physical interface for Disk and Tape Zoning: Exactly 2 endpoints per zone LOTS OF ZONES 32 Zones for GPFS Servers <-> DCS Zones for TSM Servers <-> TS Zones for TSM Servers <-> DCS9900 HBA: QLogic QLE2562-8Gb FC Dual-port HBA for System x 50
51 SAN Zones 51
52 Ethernet Connectivity One public network for data (10 Gb) One private network for management (GigE) 52
53 Optional Client Servers Lustre Servers for opt-out of HSM (Unsupported) Remote mount via sftp. (Unsupported) Allows for encrypted filesystem support, for example. 53
54 Maintenance
55 Monitoring DDN s UNSUPPORTED s2mon Custom scripts to check statuses User Reports 55
56 Exception Handling Historically due to implementation oversights. Maximum Number of used Scratch tapes Led to insufficient available tapes, failed migrations. TSM Log Backup aging policy Led to insufficient available tapes, failed migrations. No Disk Quota bound on number of inodes Led to failed stat queries, locked systems. Inconsistent tape drive enumeration Led to broken TSM paths, inaccessible user data. Stale File Handle. No data lost. 56
57 Sociology
58 User Training Takes about an hour We currently train each new user one-on-one before letting them on. 58
59 User Training Orientation Outline: Description and Intent of the Inquisition of user s use case System Rules: 1. Files MUST be 1 GB or larger. 2. Files SHOULD NOT exceed 100 GB. 3. All media must be purchased through our approved channels. The 4 Duplication Policies (and directory names) The 3 Interfaces (Compute Cluster, SCP/SFTP, gridftp) Supplemental Commands How to Zip or tar+gzip a collection of files Any other training for this user s use case 59
60 Use Agreement Before a user can buy media and/or log in, they have to sign and submit a PetaStore Use Agreement. 60
61 Use Agreement 1. I will not store on the PetaStore any files that are subject to the US federal Health Insurance Portability and Accountability Act (HIPAA). 2. I will not store on the PetaStore any files that are subject to the US federal Family Educational Rights and Privacy Act (FERPA). 3. I will not store on the PetaStore any files that are classified. 4. If any of the files that I store on the PetaStore are subject to one or more agreements with any Institutional Review Board (IRB), including but not limited to the IRB of the University, then I will take full responsibility for ensuring full compliance with such agreement(s). 61
62 Use Agreement 5. If I am collaborating with colleagues who are at institutions outside of the United States of America (that is, outside of both US states and US territories), then I will take full responsibility for ensuring that those colleagues do not access the PetaStore themselves, but rather I and/or or other members of my team who are at US institutions will access the PetaStore on behalf of the entire team. 6. I understand that, if and when I cease to be employed by and/or a student at an institution in the United States of America, then access to my files on the PetaStore will be available only to those of my collaborators who are employed by and/or students at US institutions. 62
63 Use Agreement 7. I will take full responsibility for ensuring that my use of the PetaStore is in full compliance with the most current version of the University s Acceptable Use Policy, currently accessible at Policy.pdf. 8. If I am one of the Principal/Co-Principal investigators of a team, then I will take full responsibility for ensuring that any student members of the team are likewise in full compliance. 63
64 Use Agreement 9. I understand that the ability of the University to provide the PetaStore is contingent on continued National Science Foundation funding and cooperation; that the University provides the PetaStore on an as-is basis, and while every reasonable and good faith effort will be made to ensure the reliability and availability of the PetaStore and of the files stored on it, the University makes no guarantees with respect to its reliability or continued availability. 10. In the event that the University ceases providing the PetaStore or any comparable resource, then I will take full responsibility for transferring any and all relevant files to other storage resources, and in a timely manner. 64
65 Use Agreement 11. I will take full responsibility for ensuring that I keep abreast of and comply with changes to any of the relevant laws, policies and circumstances described above. 65
66 Acknowledgements NSF MRI Participants and External Advisory Group OSCER Operations Team: Brandon George, David Akin, Brett Zimmerman, Joshua Alexander OU CIO/VPIT Loretta Early, Asst VPIT Eddie Huebsch OU VP for Research Kelvin Droegemeier OU IT: Fred Keller, Gensheng Qian, cable crew, etc. IBM: Jim Herzig (now retired), Mike Kane (now at Verizon), Frank Lee, Tu Nguyen, Ray Paden 66
67 Acknowledgements Portions of this material are based upon work supported by the National Science Foundation under the following grant: Grant No. OCI , MRI: Acquisition of Extensible Petascale Storage for Data Intensive Research. 67
68 Thanks for your attention! QUESTIONS? 68
PADS GPFS Filesystem: Crash Root Cause Analysis. Computation Institute
PADS GPFS Filesystem: Crash Root Cause Analysis Computation Institute Argonne National Laboratory Table of Contents Purpose 1 Terminology 2 Infrastructure 4 Timeline of Events 5 Background 5 Corruption
Availability and Disaster Recovery: Basic Principles
Availability and Disaster Recovery: Basic Principles by Chuck Petch, WVS Senior Technical Writer At first glance availability and recovery may seem like opposites. Availability involves designing computer
PetaLibrary Storage Service MOU
University of Colorado Boulder Research Computing PetaLibrary Storage Service MOU 1. INTRODUCTION This is the memorandum of understanding (MOU) for the Research Computing (RC) PetaLibrary Storage Service.
Archive Data Retention & Compliance. Solutions Integrated Storage Appliances. Management Optimized Storage & Migration
Solutions Integrated Storage Appliances Management Optimized Storage & Migration Archive Data Retention & Compliance Services Global Installation & Support SECURING THE FUTURE OF YOUR DATA w w w.q sta
IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE
White Paper IBM TSM DISASTER RECOVERY BEST PRACTICES WITH EMC DATA DOMAIN DEDUPLICATION STORAGE Abstract This white paper focuses on recovery of an IBM Tivoli Storage Manager (TSM) server and explores
Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms. Cray User Group Meeting June 2007
Performance, Reliability, and Operational Issues for High Performance NAS Storage on Cray Platforms Cray User Group Meeting June 2007 Cray s Storage Strategy Background Broad range of HPC requirements
A Comparative TCO Study: VTLs and Physical Tape. With a Focus on Deduplication and LTO-5 Technology
White Paper A Comparative TCO Study: VTLs and Physical Tape With a Focus on Deduplication and LTO-5 Technology By Mark Peters February, 2011 This ESG White Paper is distributed under license from ESG.
June 2009. Blade.org 2009 ALL RIGHTS RESERVED
Contributions for this vendor neutral technology paper have been provided by Blade.org members including NetApp, BLADE Network Technologies, and Double-Take Software. June 2009 Blade.org 2009 ALL RIGHTS
Globus and the Centralized Research Data Infrastructure at CU Boulder
Globus and the Centralized Research Data Infrastructure at CU Boulder Daniel Milroy, [email protected] Conan Moore, [email protected] Thomas Hauser, [email protected] Peter Ruprecht,
Every organization has critical data that it can t live without. When a disaster strikes, how long can your business survive without access to its
DISASTER RECOVERY STRATEGIES: BUSINESS CONTINUITY THROUGH REMOTE BACKUP REPLICATION Every organization has critical data that it can t live without. When a disaster strikes, how long can your business
Ultra-Scalable Storage Provides Low Cost Virtualization Solutions
Ultra-Scalable Storage Provides Low Cost Virtualization Solutions Flexible IP NAS/iSCSI System Addresses Current Storage Needs While Offering Future Expansion According to Whatis.com, storage virtualization
Solution Brief: Creating Avid Project Archives
Solution Brief: Creating Avid Project Archives Marquis Project Parking running on a XenData Archive Server provides Fast and Reliable Archiving to LTO or Sony Optical Disc Archive Cartridges Summary Avid
Implementing a Digital Video Archive Using XenData Software and a Spectra Logic Archive
Using XenData Software and a Spectra Logic Archive With the Video Edition of XenData Archive Series software on a Windows server and a Spectra Logic T-Series digital archive, broadcast organizations have
How To Backup At Qmul
TSM Backup and Restore Strategy and Overview (Draft) Prepared by: Trevor Leigh Version: 1.1 Page 1 of 13 Document Owner: Name/Position Steve Wicks, Servers & Storage Manager Revision History Version Description
Disaster Recovery Strategies: Business Continuity through Remote Backup Replication
W H I T E P A P E R S O L U T I O N : D I S A S T E R R E C O V E R Y T E C H N O L O G Y : R E M O T E R E P L I C A T I O N Disaster Recovery Strategies: Business Continuity through Remote Backup Replication
Large File System Backup NERSC Global File System Experience
Large File System Backup NERSC Global File System Experience M. Andrews, J. Hick, W. Kramer, A. Mokhtarani National Energy Research Scientific Computing Center at Lawrence Berkeley National Laboratory
<Insert Picture Here> Refreshing Your Data Protection Environment with Next-Generation Architectures
1 Refreshing Your Data Protection Environment with Next-Generation Architectures Dale Rhine, Principal Sales Consultant Kelly Boeckman, Product Marketing Analyst Program Agenda Storage
PARALLELS CLOUD STORAGE
PARALLELS CLOUD STORAGE Performance Benchmark Results 1 Table of Contents Executive Summary... Error! Bookmark not defined. Architecture Overview... 3 Key Features... 5 No Special Hardware Requirements...
Tiered Adaptive Storage
Tiered Adaptive Storage An engineered system for large scale archives, tiered storage, and beyond Harriet G. Coverston Versity Software, Inc. St. Paul, MN USA [email protected] Scott Donoho,
Virtual Tape Systems for IBM Mainframes A comparative analysis
Virtual Tape Systems for IBM Mainframes A comparative analysis Virtual Tape concepts for IBM Mainframes Mainframe Virtual Tape is typically defined as magnetic tape file images stored on disk. In reality
Setting Up and Using Tivoli Storage Manager with Storage Director
Setting Up and Using Tivoli Storage Manager with Storage Director Contents Preface... 3 Install Tivoli... 3 Setup... 3 Storage Director Setup... 3 TSM Setup... 4 Adding the TSM Clients to the configuration...
Cloud Storage. Parallels. Performance Benchmark Results. White Paper. www.parallels.com
Parallels Cloud Storage White Paper Performance Benchmark Results www.parallels.com Table of Contents Executive Summary... 3 Architecture Overview... 3 Key Features... 4 No Special Hardware Requirements...
Tandberg Data AccuVault RDX
Tandberg Data AccuVault RDX Binary Testing conducts an independent evaluation and performance test of Tandberg Data s latest small business backup appliance. Data backup is essential to their survival
The Archival Upheaval Petabyte Pandemonium Developing Your Game Plan Fred Moore President www.horison.com
USA The Archival Upheaval Petabyte Pandemonium Developing Your Game Plan Fred Moore President www.horison.com Did You Know? Backup and Archive are Not the Same! Backup Primary HDD Storage Files A,B,C Copy
STORAGE CENTER. The Industry s Only SAN with Automated Tiered Storage STORAGE CENTER
STORAGE CENTER DATASHEET STORAGE CENTER Go Beyond the Boundaries of Traditional Storage Systems Today s storage vendors promise to reduce the amount of time and money companies spend on storage but instead
Data Management & Storage for NGS
Data Management & Storage for NGS 2009 Pre-Conference Workshop Chris Dagdigian BioTeam Inc. Independent Consulting Shop: Vendor/technology agnostic Staffed by: Scientists forced to learn High Performance
Implementing Offline Digital Video Storage using XenData Software
using XenData Software XenData software manages data tape drives, optionally combined with a tape library, on a Windows Server 2003 platform to create an attractive offline storage solution for professional
HP StorageWorks P2000 G3 and MSA2000 G2 Arrays
HP StorageWorks P2000 G3 and MSA2000 G2 Arrays Family Data sheet How can the flexibility of the HP StorageWorks P2000 G3 MSA Array Systems help remedy growing storage needs and small budgets? By offering
Presents. Attix5 Technology. An Introduction
Presents Attix5 Technology An Introduction January 2013 1. Global Block Level Deduplication. Attix5 Feature Top 10 Things That Matter When Attix5 is first installed on a target server a full backup is
Keys to optimizing your backup environment: Legato NetWorker
Keys to optimizing your backup environment: Legato NetWorker Natalie Mead Storage Consultant GlassHouse Technologies [email protected] Introduction Audience Profile Storage Management Interdependence
A Better Approach to Backup and Bare-Metal Restore: Disk Imaging Technology
A Better Approach to Backup and Bare-Metal Restore: Disk Imaging Technology Acronis True Image Enterprise Server for Windows Acronis True Image Server for Windows Acronis True Image Server for Linux Another
Considerations when Choosing a Backup System for AFS
Considerations when Choosing a Backup System for AFS By Kristen J. Webb President and CTO Teradactyl LLC. October 21, 2005 The Andrew File System has a proven track record as a scalable and secure network
Archival Storage At LANL Past, Present and Future
Archival Storage At LANL Past, Present and Future Danny Cook Los Alamos National Laboratory [email protected] Salishan Conference on High Performance Computing April 24-27 2006 LA-UR-06-0977 Main points of
Many government agencies are requiring disclosure of security breaches. 32 states have security breach similar legislation
Is it safe? The business impact of data protection. Bruce Master IBM LTO Program Linear Tape-Open, LTO, LTO Logo, Ultrium and Ultrium Logo are trademarks of HP, IBM and Quantum in the US and other countries.
SAN TECHNICAL - DETAILS/ SPECIFICATIONS
SAN TECHNICAL - DETAILS/ SPECIFICATIONS Technical Details / Specifications for 25 -TB Usable capacity SAN Solution Item 1) SAN STORAGE HARDWARE : One No. S.N. Features Description Technical Compliance
SOP Common service PC File Server
SOP Common service PC File Server v0.6, May 20, 2016 Author: Jerker Nyberg von Below 1 Preamble The service PC File Server is produced by BMC-IT and offered to Uppsala University. It is especially suited
Maurice Askinazi Ofer Rind Tony Wong. HEPIX @ Cornell Nov. 2, 2010 Storage at BNL
Maurice Askinazi Ofer Rind Tony Wong HEPIX @ Cornell Nov. 2, 2010 Storage at BNL Traditional Storage Dedicated compute nodes and NFS SAN storage Simple and effective, but SAN storage became very expensive
BlueArc unified network storage systems 7th TF-Storage Meeting. Scale Bigger, Store Smarter, Accelerate Everything
BlueArc unified network storage systems 7th TF-Storage Meeting Scale Bigger, Store Smarter, Accelerate Everything BlueArc s Heritage Private Company, founded in 1998 Headquarters in San Jose, CA Highest
The Microsoft Large Mailbox Vision
WHITE PAPER The Microsoft Large Mailbox Vision Giving users large mailboxes without breaking your budget Introduction Giving your users the ability to store more e mail has many advantages. Large mailboxes
IBM PROTECTIER: FROM BACKUP TO RECOVERY
SOLUTION PROFILE IBM PROTECTIER: FROM BACKUP TO RECOVERY NOVEMBER 2011 When it comes to backup and recovery, backup performance numbers rule the roost. It s understandable really: far more data gets backed
Violin Memory Arrays With IBM System Storage SAN Volume Control
Technical White Paper Report Best Practices Guide: Violin Memory Arrays With IBM System Storage SAN Volume Control Implementation Best Practices and Performance Considerations Version 1.0 Abstract This
Implementing an Automated Digital Video Archive Based on the Video Edition of XenData Software
Implementing an Automated Digital Video Archive Based on the Video Edition of XenData Software The Video Edition of XenData Archive Series software manages one or more automated data tape libraries on
XenData Product Brief: SX-550 Series Servers for LTO Archives
XenData Product Brief: SX-550 Series Servers for LTO Archives The SX-550 Series of Archive Servers creates highly scalable LTO Digital Video Archives that are optimized for broadcasters, video production
Storage Switzerland White Paper Storage Infrastructures for Big Data Workflows
Storage Switzerland White Paper Storage Infrastructures for Big Data Workflows Sponsored by: Prepared by: Eric Slack, Sr. Analyst May 2012 Storage Infrastructures for Big Data Workflows Introduction Big
Evaluation of Enterprise Data Protection using SEP Software
Test Validation Test Validation - SEP sesam Enterprise Backup Software Evaluation of Enterprise Data Protection using SEP Software Author:... Enabling you to make the best technology decisions Backup &
The safer, easier way to help you pass any IT exams. Exam : 000-115. Storage Sales V2. Title : Version : Demo 1 / 5
Exam : 000-115 Title : Storage Sales V2 Version : Demo 1 / 5 1.The IBM TS7680 ProtecTIER Deduplication Gateway for System z solution is designed to provide all of the following EXCEPT: A. ESCON attach
Protecting enterprise servers with StoreOnce and CommVault Simpana
Technical white paper Protecting enterprise servers with StoreOnce and CommVault Simpana HP StoreOnce Backup systems Table of contents Introduction 2 Technology overview 2 HP StoreOnce Backup systems key
EMC Unified Storage for Microsoft SQL Server 2008
EMC Unified Storage for Microsoft SQL Server 2008 Enabled by EMC CLARiiON and EMC FAST Cache Reference Copyright 2010 EMC Corporation. All rights reserved. Published October, 2010 EMC believes the information
Keys to Successfully Architecting your DSI9000 Virtual Tape Library. By Chris Johnson Dynamic Solutions International
Keys to Successfully Architecting your DSI9000 Virtual Tape Library By Chris Johnson Dynamic Solutions International July 2009 Section 1 Executive Summary Over the last twenty years the problem of data
TSM (Tivoli Storage Manager) Backup and Recovery. Richard Whybrow Hertz Australia System Network Administrator
TSM (Tivoli Storage Manager) Backup and Recovery Richard Whybrow Hertz Australia System Network Administrator 2 Preparation meets success 3 Hertz Service Delivery Hertz has over 220 car hire locations
Data Backup Options for SME s
Data Backup Options for SME s As an IT Solutions company, Alchemy are often asked what is the best backup solution? The answer has changed over the years and depends a lot on your situation. We recognize
EMC BACKUP MEETS BIG DATA
EMC BACKUP MEETS BIG DATA Strategies To Protect Greenplum, Isilon And Teradata Systems 1 Agenda Big Data: Overview, Backup and Recovery EMC Big Data Backup Strategy EMC Backup and Recovery Solutions for
Storage Solutions For the DIY-types
M U L T I - T I E R S T O R A G E Joe Little, Electrical Engineering, Stanford University ( revision 1. 0) ABSTRACT The Electrical Engineering department at Stanford University has often spearheaded the
The IntelliMagic White Paper: Storage Performance Analysis for an IBM Storwize V7000
The IntelliMagic White Paper: Storage Performance Analysis for an IBM Storwize V7000 Summary: This document describes how to analyze performance on an IBM Storwize V7000. IntelliMagic 2012 Page 1 This
Large Scale Storage. Orlando Richards, Information Services [email protected]. LCFG Users Day, University of Edinburgh 18 th January 2013
Large Scale Storage Orlando Richards, Information Services [email protected] LCFG Users Day, University of Edinburgh 18 th January 2013 Overview My history of storage services What is (and is not)
Comparison of Native Fibre Channel Tape and SAS Tape Connected to a Fibre Channel to SAS Bridge. White Paper
Comparison of Native Fibre Channel Tape and SAS Tape Connected to a Fibre Channel to SAS Bridge White Paper Introduction IT Managers may approach backing up Storage Area Networks (SANs) with the idea of
IBM System Storage Portfolio Overview
IBM System Storage Portfolio Overview Daniel Ndirangu: Storage Sales Specialist Email Address: [email protected] The Business Challenge Every two days now, we create as much information as we did from
EMC Backup Storage Solutions: The Value of EMC Disk Library with TSM
A Detailed Review Abstract The white paper describes how the EMC Disk Library can enhance an IBM Tivoli Storage Manager (TSM) environment. It describes TSM features, the demands these features place on
Hitachi NAS Platform and Hitachi Content Platform with ESRI Image
W H I T E P A P E R Hitachi NAS Platform and Hitachi Content Platform with ESRI Image Aciduisismodo Extension to ArcGIS Dolore Server Eolore for Dionseq Geographic Uatummy Information Odolorem Systems
XenData Video Edition. Product Brief:
XenData Video Edition Product Brief: The Video Edition of XenData Archive Series software manages one or more automated data tape libraries on a single Windows 2003 server to create a cost effective digital
Considerations when Choosing a Backup System for AFS
Considerations when Choosing a Backup System for AFS By Kristen J. Webb President and CTO Teradactyl LLC. June 18, 2005 The Andrew File System has a proven track record as a scalable and secure network
Enterprise Backup Solution Vendor Questions
Enterprise Backup Solution Vendor Questions What is the size of a single full back up? If Backups comprise 28% of the 19TB, can we assume that a single full (not compressed) is approximately 6TB? The approximate
Object Oriented Storage and the End of File-Level Restores
Object Oriented Storage and the End of File-Level Restores Stacy Schwarz-Gardner Spectra Logic Agenda Data Management Challenges Data Protection Data Recovery Data Archive Why Object Based Storage? The
Online Storage Replacement Strategy/Solution
I. Current Storage Environment Online Storage Replacement Strategy/Solution ISS currently maintains a substantial online storage infrastructure that provides centralized network-accessible storage for
Stretching A Wolfpack Cluster Of Servers For Disaster Tolerance. Dick Wilkins Program Manager Hewlett-Packard Co. Redmond, WA dick_wilkins@hp.
Stretching A Wolfpack Cluster Of Servers For Disaster Tolerance Dick Wilkins Program Manager Hewlett-Packard Co. Redmond, WA [email protected] Motivation WWW access has made many businesses 24 by 7 operations.
Effective Storage Management for Cloud Computing
IBM Software April 2010 Effective Management for Cloud Computing April 2010 smarter storage management Page 1 Page 2 EFFECTIVE STORAGE MANAGEMENT FOR CLOUD COMPUTING Contents: Introduction 3 Cloud Configurations
Enhancements of ETERNUS DX / SF
shaping tomorrow with you ETERNUS - Business-centric Storage Enhancements of ETERNUS DX / SF Global Product Marketing Storage ETERNUS Business-centric Storage Agenda: 1 Overview of the top 3 innovations
PrimeArray Data Storage Solutions Network Attached Storage (NAS) iscsi Storage Area Networks (SAN) Optical Storage Systems (CD/DVD)
Fall 2008 PrimeArray Data Storage Solutions Network Attached Storage (NAS) iscsi Storage Area Networks (SAN) Optical Storage Systems (CD/DVD) AutoStor iscsi SAN solution. See pages 8 and 9 for more information.
How to recover a failed Storage Spaces
www.storage-spaces-recovery.com How to recover a failed Storage Spaces ReclaiMe Storage Spaces Recovery User Manual 2013 www.storage-spaces-recovery.com Contents Overview... 4 Storage Spaces concepts and
Mass Storage System for Disk and Tape resources at the Tier1.
Mass Storage System for Disk and Tape resources at the Tier1. Ricci Pier Paolo et al., on behalf of INFN TIER1 Storage [email protected] ACAT 2008 November 3-7, 2008 Erice Summary Tier1 Disk
WHITE PAPER BRENT WELCH NOVEMBER
BACKUP WHITE PAPER BRENT WELCH NOVEMBER 2006 WHITE PAPER: BACKUP TABLE OF CONTENTS Backup Overview 3 Background on Backup Applications 3 Backup Illustration 4 Media Agents & Keeping Tape Drives Busy 5
WHITE PAPER. HIPPA Compliance and Secure Online Data Backup and Disaster Recovery
WHITE PAPER HIPPA Compliance and Secure Online Data Backup and Disaster Recovery January 2006 HIPAA Compliance and the IT Portfolio Online Backup Service Introduction October 2004 In 1996, Congress passed
Reduce your data storage footprint and tame the information explosion
IBM Software White paper December 2010 Reduce your data storage footprint and tame the information explosion 2 Reduce your data storage footprint and tame the information explosion Contents 2 Executive
WHITEPAPER: Understanding Pillar Axiom Data Protection Options
WHITEPAPER: Understanding Pillar Axiom Data Protection Options Introduction This document gives an overview of the Pillar Data System Axiom RAID protection schemas. It does not delve into corner cases
Backup and Recovery Solutions for Exadata. Ľubomír Vaňo Principal Sales Consultant
Backup and Recovery Solutions for Exadata Ľubomír Vaňo Principal Sales Consultant Fundamental Backup and Recovery Data doesn t exist in most organizations until the rule of 3 is complete: Different Media
Protecting Microsoft SQL Server with an Integrated Dell / CommVault Solution. Database Solutions Engineering
Protecting Microsoft SQL Server with an Integrated Dell / CommVault Solution Database Solutions Engineering By Subhashini Prem and Leena Kushwaha Dell Product Group March 2009 THIS WHITE PAPER IS FOR INFORMATIONAL
SAN Conceptual and Design Basics
TECHNICAL NOTE VMware Infrastructure 3 SAN Conceptual and Design Basics VMware ESX Server can be used in conjunction with a SAN (storage area network), a specialized high speed network that connects computer
Linux System Administration
System Backup Strategies Objective At the conclusion of this module, the student will be able to: describe the necessity for creating a backup regimen describe the advantages and disadvantages of the most
EMC Disk Library with EMC Data Domain Deployment Scenario
EMC Disk Library with EMC Data Domain Deployment Scenario Best Practices Planning Abstract This white paper is an overview of the EMC Disk Library with EMC Data Domain deduplication storage system deployment
EMC Invista: The Easy to Use Storage Manager
EMC s Invista SAN Virtualization System Tested Feb. 2006 Page 1 of 13 EMC Invista: The Easy to Use Storage Manager Invista delivers centrally managed LUN Virtualization, Data Mobility, and Copy Services
An Oracle White Paper July 2012. Expanding the Storage Capabilities of the Oracle Database Appliance
An Oracle White Paper July 2012 Expanding the Storage Capabilities of the Oracle Database Appliance Executive Overview... 2 Introduction... 2 Storage... 3 Networking... 4 Determining the best Network Port
RFP - Equipment for the Replication of Critical Systems at Bank of Mauritius Tower and at Disaster Recovery Site. 06 March 2014
RFP - Equipment for the Replication of Critical Systems at Bank of Mauritius Tower and at Disaster Recovery Site Response to Queries: 06 March 2014 (1) Please specify the number of drives required in the
RAID Made Easy By Jon L. Jacobi, PCWorld
9916 Brooklet Drive Houston, Texas 77099 Phone 832-327-0316 www.safinatechnolgies.com RAID Made Easy By Jon L. Jacobi, PCWorld What is RAID, why do you need it, and what are all those mode numbers that
Unitrends Recovery-Series: Addressing Enterprise-Class Data Protection
Solution Brief Unitrends Recovery-Series: Addressing Enterprise-Class Data Protection 2 Unitrends has leveraged over 20 years of experience in understanding ever-changing data protection challenges in
XenData Product Brief: SX-250 Archive Server for LTO
XenData Product Brief: SX-250 Archive Server for LTO An SX-250 Archive Server manages a robotic LTO library creating a digital video archive that is optimized for broadcasters, video production companies,
UPSTREAM for Linux on System z
PRODUCT SHEET UPSTREAM for Linux on System z UPSTREAM for Linux on System z UPSTREAM for Linux on System z is designed to provide comprehensive data protection for your Linux on System z environment, leveraging
Audit4 Installation Requirements
Audit4 version 8.1+ 2015 Copyright 2012 S4S Pty Ltd Audit4 Support Matrix 2015 The following table provides details on the operating system and database engine support for Audit4 as at March 2015. Operating
Backup & Disaster Recovery Options
Backup & Disaster Recovery Options Since businesses have become more dependent on their internal computing capability, they are increasingly concerned about recovering from equipment failure, human error,
Creating a Cloud Backup Service. Deon George
Creating a Cloud Backup Service Deon George Agenda TSM Cloud Service features Cloud Service Customer, providing a internal backup service Internal Backup Cloud Service Service Provider, providing a backup
