AFS File Servers Dos and Don'ts



Similar documents
How To Monitor Your Computer With Nagiostee.Org (Nagios)

The Panasas Parallel Storage Cluster. Acknowledgement: Some of the material presented is under copyright by Panasas Inc.

Distributed Backup And Disaster Recovery for AFS. A work in progress Steve Simmons Dan Hyde scs@umich.edu drh@umich.edu University of Michigan

Demand Attach / Fast-Restart Fileserver

InterWorx Clustering Guide. by InterWorx LLC

VMware vsphere 5.1 Advanced Administration

Moving Virtual Storage to the Cloud. Guidelines for Hosters Who Want to Enhance Their Cloud Offerings with Cloud Storage

NEWSTAR ENTERPRISE and NEWSTAR Sales System Requirements

Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF

Moving Virtual Storage to the Cloud

IBRIX Fusion 3.1 Release Notes

Server Virtualization with Windows Server Hyper-V and System Center

Linux + Windows 95 mini HOWTO

AFS Usage and Backups using TiBS at Fermilab. Presented by Kevin Hill

LBNC and IBM Corporation Document: LBNC-Install.doc Date: Path: D:\Doc\EPFL\LNBC\LBNC-Install.doc Version: V1.0

COSC 6374 Parallel Computation. Parallel I/O (I) I/O basics. Concept of a clusters

Linux Software Raid. Aug Mark A. Davis

VMware vsphere 5.0 Boot Camp

Parallels Cloud Server 6.0

Using Multipathing Technology to Achieve a High Availability Solution

Filesystems Performance in GNU/Linux Multi-Disk Data Storage

Cluster Configuration Manual Cluster configuration on Database Servers

Course Outline. Create and configure virtual hard disks. Create and configure virtual machines. Install and import virtual machines.

Storage Architectures for Big Data in the Cloud

Parallels Cloud Server 6.0

Server Virtualization with Windows Server Hyper-V and System Center

GlusterFS Distributed Replicated Parallel File System

capacity management for StorageWorks NAS servers

Chapter 5: Operating Systems Part 1

Considerations when Choosing a Backup System for AFS

Hyper-V vs ESX at the datacenter

LUSTRE FILE SYSTEM High-Performance Storage Architecture and Scalable Cluster File System White Paper December Abstract

Zadara Storage Cloud A

ZOTAC RAIDbox User s Manual

Date: March Reference No. RTS-CB 018

Server Virtualization with Windows Server Hyper-V and System Center

Deploying and Optimizing SQL Server for Virtual Machines

Dedicated Hosting. The best of all worlds. Build your server to deliver just what you want. For more information visit: imcloudservices.com.

Red Hat Cluster Suite

Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft. dcache Introduction

OSG Hadoop is packaged into rpms for SL4, SL5 by Caltech BeStMan, gridftp backend

Lessons learned from parallel file system operation

Windows Server 2012 R2 Hyper-V: Designing for the Real World

NEXTGEN v5.8 HARDWARE VERIFICATION GUIDE CLIENT HOSTED OR THIRD PARTY SERVERS

OpenAFS Backup Strategies

LISTSERV in a High-Availability Environment DRAFT Revised

Shared Parallel File System

Optimizing Ext4 for Low Memory Environments

PARALLELS SERVER 4 BARE METAL README

Installing a Second Operating System

Google File System. Web and scalability

10GbE Ethernet for Transfer Speeds of Over 1150MB/s

Update: About Apple RAID Version 1.5 About this update

Enabling Technologies for Distributed Computing

VMware vsphere 5 Quick Start Guide

Scalable Windows Server File Serving Clusters Using Sanbolic s Melio File System and DFS

RAID Utility User s Guide Instructions for setting up RAID volumes on a computer with a MacPro RAID Card or Xserve RAID Card.

StarWind iscsi SAN Software Hands- On Review

Enterprise Manager Performance Tips

Network Attached Storage. Jinfeng Yang Oct/19/2015

Cloud Server. Parallels. Key Features and Benefits. White Paper.

Setup for Microsoft Cluster Service ESX Server and VirtualCenter 2.0.1

Data recovery Data management Electronic Evidence

Considerations when Choosing a Backup System for AFS

Dragon Medical Enterprise Network Edition Technical Note: Requirements for DMENE Networks with virtual servers

EMC ViPR Controller. Service Catalog Reference Guide. Version 2.3 XXX-XXX-XXX 01

OS6 N2520 N2560 N4520 N4560 FWRN Build OS

High Availability Storage

1 Storage Devices Summary

JOB ORIENTED VMWARE TRAINING INSTITUTE IN CHENNAI

Outline. MCSA: Server Virtualization

Filing Systems. Filing Systems

CS 377: Operating Systems. Outline. A review of what you ve learned, and how it applies to a real operating system. Lecture 25 - Linux Case Study

Storage and High Availability with Windows Server

Server-Virtualisierung mit Windows Server Hyper-V und System Center MOC 20409

VMware for Bosch VMS. en Software Manual

Parallels. Clustering in Virtuozzo-Based Systems

RAID Made Easy By Jon L. Jacobi, PCWorld

Server Virtualization with Windows Server Hyper-V and System Center

Large Scale file storage with MogileFS. Stuart Teasdale Lead System Administrator we7 Ltd

Sheepdog: distributed storage system for QEMU

Ultimate Guide to Oracle Storage

Hardware and Software Requirements for Installing California.pro

WHITE PAPER. Permabit Albireo Data Optimization Software. Benefits of Albireo for Virtual Servers. January Permabit Technology Corporation

Distributed Storage Systems

Software to Simplify and Share SAN Storage Sanbolic s SAN Storage Enhancing Software Portfolio

Virtual server management: Top tips on managing storage in virtual server environments

Virtual Server Agent v9 with VMware. March 2011

Windows Host Utilities Installation and Setup Guide

Installing and Configuring Windows Server Module Overview 14/05/2013. Lesson 1: Planning Windows Server 2008 Installation.

Enabling Technologies for Distributed and Cloud Computing

XFS File System and File Recovery Tools

Transcription:

AFS File Servers Dos and Don'ts Alf Wachsmann Stanford Linear Accelerator Center alfw@slac.stanford.edu AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 1

AFS File Server... Operating System provides disk space /vicepa, /vicepb, /vicepc,..., /vicepiu Should be partitions but can be anything on namei servers Up to 255 vice partitions per server Currently a 2TB size limit per partition File server stores AFS volumes inside these partitions No limitation on number of volumes per partition (well, it's 4,294,967,295 and VLDB <2GB) Currently a ~2TB size limit per volume With namei file server, stuff in partitions is readable With inode file server, stuff is not easily accessible AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 2

...AFS File Server Directories reside in volumes. How many files can you have per directory in AFS? You can have 64,000 files in an AFS directory if the filenames are all less than 16 characters long. If the filenames are between 16 and 32 characters than this number decreases. There are 64,000 slots per directory. Each file < 16 Characters takes 1 slot. Each file > 16 and <32 takes 2 slots, etc... AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 3

Failure Analysis Components that can (and will) fail on an AFS file server: Disk with (parts of) /vicepxy partition(s) Entire RAID array File system with /vicepxy partition(s) File server hardware Let's look at each component and understand what AFS can do for you and what you can do for AFS AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 4

Single-Disk Failure AFS cannot bundle up a bunch of disks and use them Need support from host the disks are connected to (e.g. PVFS (bad idea!)) Directly connect the disks to computers and make the computers AFS file servers Handle disk failures with RAID5 or RAID6 SAN works well Bunch of disks in a computer: Use software RAID5 Use LVM Put a journaling file system on the partitions AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 5

Multi-Disk Failure If entire RAID unit fails, AFS clients use RO clones of volumes on other servers (doesn't work for RW data) Side Note: In case you lose a RW volume, a RO volume can easily be converted to a RW volume with: vos convertrotorw -server <machine name> \ -partition <partition name> \ -id <volume name or ID> This works only with namei file servers! AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 6

File Systems for /vicepxy... /vicepxy should be a partition with a file system With namei file servers, /vicepxy can be directories or symbolic links Many things are possible: Network File System (NFS, AFS) Clustered File Systems (GFS, Panasas, Ibrix, etc.) Parallel File Systems (PVFS2, GPFS, Luster, etc.) Needs /vicepxy/alwaysattach to make sure it gets mounted by the file server if it's not a real mountpoint These translations are NOT A GOOD IDEA Use a local file system Use a journaled file system AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 7

...File Systems for /vicepxy There is a 2TB size limit for /vicepxy If OS can resize partitions, AFS can use them Theoretical limit for AFS volumes is ~2TB Large volumes need more time to move/release Files in AFS can be larger than 2GB (with newer AFS server and client software) AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 8

Server Failure General advise: use more and smaller AFS file servers instead of few big ones Big in terms of size of the computer and Big in terms of amount of disk space Old Transarc recommendation: 200 AFS clients per AFS file server SLAC sees no problem with 2000-3000 clients per server (2CPU Sun V240s, E280Rs; up to 8 partitions with ~60GB each) Support for maximum of 104,000 clients per server Problem: If server fails, direct attached partitions become unaccessible AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 9

Server Fail-Over Normal Mode: LUN1 /vicepaa /vicepab /vicepac LUN2 /vicepba /vicepbb /vicepbc FileServer1 FileServer2 AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 10

Server Fail-Over Normal Mode: Failure Mode: LUN1 /vicepaa /vicepab /vicepac LUN2 /vicepba /vicepbb /vicepbc FileServer1 FileServer2 AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 11

Tolerating Server Failures Disk array attached to two servers 2 LUNs zoned to be visible to both servers First server uses first LUN for its /vicepaa - /vicepax partitions Second server uses second LUN for its /vicepba - /vicepby partitions In case first server fails: second server mounts partitions in first LUN Run vos salvage on these new partitions Run vos syncvldb on these new partitions Run vos syncserv on these new partitions AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 12

Server Failure Gotcha If the file server goes down completely, clients will switch to other replicas and mark RW volumes as unavailable properly If the file server is up and answering Rx pings but not serving data, clients will not switch over and instead will just hang Make sure a AFS file server goes completely down! AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 13

File Server Performance Many tweaks possible to improve performance Complicated and very specific to your situation Costly operations at AFS file server startup: Running salvage Shut down your file servers cleanly (if possible) Attaching volumes Don't have too many volumes per single file server AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 14

Monitoring Use the monitoring tools that come with AFS to check on your file servers xstat_fs_test afsmonitor scout udebug AFS:::Monitor Perl module Watch out for bug with AFS on 64 bit Linux systems: Some counters are always 0 Use general purpose monitoring tools like Nagios AFS&Krb BPW '06 AFS File Servers Dos and Don'ts - A. Wachsmann 15