Benchmarking FreeBSD. Ivan Voras <ivoras@freebsd.org>

Similar documents

AIX NFS Client Performance Improvements for Databases on NAS

Configuring Apache Derby for Performance and Durability Olav Sandstå

Performance and scalability of a large OLTP workload

Virtuoso and Database Scalability

Performance And Scalability In Oracle9i And SQL Server 2000

Storage benchmarking cookbook

Sawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices

Secure Web. Hardware Sizing Guide

Chronicle: Capture and Analysis of NFS Workloads at Line Rate

Best Practices for Monitoring Databases on VMware. Dean Richards Senior DBA, Confio Software

Database Hardware Selection Guidelines

Virtualization and Performance NSRC

Cloud Computing through Virtualization and HPC technologies

Fixed Price Website Load Testing

Configuration Maximums VMware Infrastructure 3

Performance Characteristics of VMFS and RDM VMware ESX Server 3.0.1

Deep Dive: Maximizing EC2 & EBS Performance

MySQL performance in a cloud. Mark Callaghan

Products for the registry databases and preparation for the disaster recovery

PERFORMANCE TUNING ORACLE RAC ON LINUX

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

Agility Database Scalability Testing

HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief

Understanding Flash SSD Performance

Comparing the Network Performance of Windows File Sharing Environments

CHAPTER 17: File Management

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

Outline. Failure Types

AIX 5L NFS Client Performance Improvements for Databases on NAS

DELL TM PowerEdge TM T Mailbox Resiliency Exchange 2010 Storage Solution

Accelerating Server Storage Performance on Lenovo ThinkServer

Performance test report

UBUNTU DISK IO BENCHMARK TEST RESULTS

Performance Baseline of Oracle Exadata X2-2 HR HC. Part II: Server Performance. Benchware Performance Suite Release 8.4 (Build ) September 2013

Magento & Zend Benchmarks Version 1.2, 1.3 (with & without Flat Catalogs)

Scaling Analysis Services in the Cloud

Hardware Performance Optimization and Tuning. Presenter: Tom Arakelian Assistant: Guy Ingalls

Investigation of storage options for scientific computing on Grid and Cloud facilities

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

MAGENTO HOSTING Progressive Server Performance Improvements

Microsoft Exchange Server 2003 Deployment Considerations

HADOOP MOCK TEST HADOOP MOCK TEST I

System Requirements Table of contents

Intel Solid- State Drive Data Center P3700 Series NVMe Hybrid Storage Performance

Pivot3 Reference Architecture for VMware View Version 1.03

Enterprise Manager Performance Tips

Maximizing SQL Server Virtualization Performance

DMS Performance Tuning Guide for SQL Server

Performance Report Modular RAID for PRIMERGY

Google File System. Web and scalability

Audit & Tune Deliverables

5,100 PVS DESKTOPS ON XTREMIO

Distributed File Systems

W H I T E P A P E R : T E C H N I C A L. Understanding and Configuring Symantec Endpoint Protection Group Update Providers

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

Isilon IQ Scale-out NAS for High-Performance Applications

Best Practices for Optimizing SQL Server Database Performance with the LSI WarpDrive Acceleration Card

Using VMware VMotion with Oracle Database and EMC CLARiiON Storage Systems

White Paper. Recording Server Virtualization

Lecture 5: GFS & HDFS! Claudia Hauff (Web Information Systems)! ti2736b-ewi@tudelft.nl

Performance Evaluation of VMXNET3 Virtual Network Device VMware vsphere 4 build

VMWARE WHITE PAPER 1

Benchmarking Hadoop & HBase on Violin

Taking Linux File and Storage Systems into the Future. Ric Wheeler Director Kernel File and Storage Team Red Hat, Incorporated

How swift is your Swift? Ning Zhang, OpenStack Engineer at Zmanda Chander Kant, CEO at Zmanda

Microsoft Dynamics NAV 2013 R2 Sizing Guidelines for On-Premises Single Tenant Deployments

Performance in a Gluster System. Versions 3.1.x

SLIDE 1 Previous Next Exit

Cloud Performance Benchmark Series

WITH A FUSION POWERED SQL SERVER 2014 IN-MEMORY OLTP DATABASE

How to Choose your Red Hat Enterprise Linux Filesystem

Boost SQL Server Performance Buffer Pool Extensions & Delayed Durability

A Scalability Study for WebSphere Application Server and DB2 Universal Database

SCI Briefing: A Review of the New Hitachi Unified Storage and Hitachi NAS Platform 4000 Series. Silverton Consulting, Inc.

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

VirtualCenter Database Performance for Microsoft SQL Server 2005 VirtualCenter 2.5

High Performance SQL Server with Storage Center 6.4 All Flash Array

THE EXPAND PARALLEL FILE SYSTEM A FILE SYSTEM FOR CLUSTER AND GRID COMPUTING. José Daniel García Sánchez ARCOS Group University Carlos III of Madrid

Informatica Data Director Performance

Cisco UCS and Fusion- io take Big Data workloads to extreme performance in a small footprint: A case study with Oracle NoSQL database

PARALLELS CLOUD SERVER

IMPLEMENTING GREEN IT

PARALLELS SERVER BARE METAL 5.0 README

SYSTEM SETUP FOR SPE PLATFORMS

How To Store Data On An Ocora Nosql Database On A Flash Memory Device On A Microsoft Flash Memory 2 (Iomemory)

PSAM, NEC PCIe SSD Appliance for Microsoft SQL Server (Reference Architecture) September 11 th, 2014 NEC Corporation

Using Power to Improve C Programming Education

Deploying and Optimizing SQL Server for Virtual Machines

Load Testing Analysis Services Gerhard Brückl

S 2 -RAID: A New RAID Architecture for Fast Data Recovery

Performance Analysis of Web based Applications on Single and Multi Core Servers

Improve Business Productivity and User Experience with a SanDisk Powered SQL Server 2014 In-Memory OLTP Database

Panasas at the RCF. Fall 2005 Robert Petkus RHIC/USATLAS Computing Facility Brookhaven National Laboratory. Robert Petkus Panasas at the RCF

Pricing Guide. Overview FD Enterprise License SaaS Packages Dedicated SaaS Shared SaaS. Page 2 Page 3 Page 4 Page 5 Page 8

A Filesystem Layer Data Replication Method for Cloud Computing

Best Practices for Deploying SSDs in a Microsoft SQL Server 2008 OLTP Environment with Dell EqualLogic PS-Series Arrays

Selling Virtual Private Servers. A guide to positioning and selling VPS to your customers with Heart Internet

On Benchmarking Popular File Systems

Transcription:

Benchmarking FreeBSD Ivan Voras <ivoras@freebsd.org>

What and why? Everyone likes a nice benchmark graph :) And it's nice to keep track of these things The previous major run comparing FreeBSD to Linux was done by Kris Kennaway in 2008. There are occasional benchmarks appearing on blogs and mailing lists which show interesting results...

Target audience Mostly developers The results are decent, but not rosy Significant space for improvement Some system administrator material What to do, what not to do Tuning? Avoid benchmarking? Do you need it?

Purpose of benchmarking... To check your system for configuration bugs and obvious problems (hw/sw) To check your capacity for running an application (web server, db server, etc.) To compare your hardware to others' To compare your favorite application / operating system to others...

Repeatability The best benchmarks are performed in a way anyone can repeat them Both result verification (error checking) and a way to compare their own setup to the one tested Repeatability is a good thing...

Test hardware 2x IBM x3250 M3 Xeon E3-1220 3.1 GHz, 4-core (no HTT) 32 GB RAM 4x SATA drive RAID0 1 Gbps NIC (2 ports) Directly connected NICs

Test software FreeBSD 9.1 and 10-CURRENT (HEAD) CentOS 6.3 PostgreSQL 9.2.3 blogbench 1.1 bonnie++ 1.97 filebench 1.4.8 Mdcached 1.0.7

Accuracy and precision Accurate = if it measures what we think it measures (or is it off the mark) Precise = is the measured value good approximation of the real one (or is it affected by noise)

Systematic and random errors Systematic = we're not measuring what we thing we're measuring (the method of benchmarking is wrong, even though the results may be repeatable and look correct). Random = affected by noise outside our control, results in non-repeatable measurements.

Expressing precision Find meaningful precision (or: avoid false precision) 412.567 MB/s 412.5 MB/s 412 MB/s 410 MB/s

Example: hard drive performance (Systematic: you are not measuring what you think are measuring) File systems vs (spinning) drives 350 300 250 MB/s 200 150 100 50 0 outside middle inside diskinfo -vt /dev/da0

File systems are hard to measure Data location-dependant performance Data structure (re)use-dependant performance (newfs vs old system) Double for flash drives Whole drive Noise MB/s 400 350 300 250 200 150 outside middle inside 100 50 0 1 2 3 4 5 6 7 8

Speaking of noise...

Reducing the difference Create a partition spanning the minimal required disk space (e.g. 350 3x-4x the RAM size) 300 250 MB/s 200 150 outside middle inside 100 50 0 partition whole drive

bonnie++ File system bandwidth & file op latency 600 500 MB/s 400 300 200 100 0 Write Rewrite Read CentOS 6.3 FreeBSD 9.1 FreeBSD 10

File systems are complex beasts Think about it it's a database... Data layout issues (esp. on rotating rust) Concurrent access issues miscellaneous features such as attributes, security, reliability, TRIM NFS is also a horror but for different reasons

Blogbench Stresses file system parallelism and random disk IO Creates a tree of small-ish files (2 KiB 64 KiB) Reads and writes them Atomic renames for (some) writes Multithreaded

Blogbench results 1,900,000 2000000 1800000 1600000 1400000 1200000 1000000 800000 600000 400000 200000 0 Linux read FreeBSD 9.1 read FreeBSD 10 read 700,000 4500 4000 3500 3000 2500 2000 1500 1000 500 0 Linux write FreeBSD 9.1 write FreeBSD 10 write

Blogbench FreeBSD 9.1 vs 10 x read-91 + read-10 +----------------------------------------------------------------------+ xx x * x x*x + * x +* + + + + + AM M A +----------------------------------------------------------------------+ N Min Max Median Avg Stddev x 11 509450 695303 600394 593580.45 60234.934 + 11 577355 893889 695303 708560.45 102923.93 Difference at 95.0% confidence 114980 +/- 75005.3 19.3706% +/- 12.6361% (Student's t, pooled s = 84325.5) x write-91 + write-10 +----------------------------------------------------------------------+ + + + + ++* * x+ x x x x * x x+ x M A M +----------------------------------------------------------------------+ N Min Max Median Avg Stddev x 11 1299 2173 1734 1716.6364 287.0642 + 11 1111 2104 1299 1416 296.7814 Difference at 95.0% confidence -300.636 +/- 259.694-17.5131% +/- 15.128% (Student's t, pooled s = 291.963)

Why is Blogbench slow on FreeBSD? A multithreaded mix of file operations On FreeBSD (UFS): open(o_write) write() rename() etc... block each other (exclusive lock) + writers block readers So called write bias of FreeBSD

Why is Blogbench slow on FreeBSD? 884 100130 blogbench 1.472372 CALL open(0x7ffffebf5770,0x601<o_wronly O_CREAT O_TRUNC>,0x180<S_IRUSR S_IWUSR>) 884 100190 blogbench 1.472381 CALL close(0x2a) 884 100162 blogbench 1.472382 CALL read(0x31,0x602f60,0x10000) 884 100141 blogbench 1.472386 CALL read(0x32,0x602f60,0x10000) 884 100190 blogbench 1.472387 CALL open(0x7ffff73b9ba0,0<o_rdonly>,<unused>0) 884 100162 blogbench 1.472395 CALL read(0x31,0x602f60,0x10000) 884 100130 blogbench 1.472398 CALL write(0x2a,0x612f60,0x1df2) 884 100141 blogbench 1.472402 CALL read(0x32,0x602f60,0x10000) 884 100162 blogbench 1.472407 CALL close(0x31) 884 100130 blogbench 1.472412 CALL close(0x2a) 884 100190 blogbench 1.472403 CALL open(0x7ffff73b9ba0,0<o_rdonly>,<unused>0) 884 100162 blogbench 1.472418 CALL open(0x7ffffabd5ba0,0<o_rdonly>,<unused>0) 884 100130 blogbench 1.472421 CALL rename(0x7ffffebf5770,0x7ffffebf5ba0) 884 100141 blogbench 1.472415 CALL close(0x32) 884 100190 blogbench 1.472423 CALL read(0x2a,0x602f60,0x10000) 884 100141 blogbench 1.472430 CALL open(0x7ffffd5eaba0,0<o_rdonly>,<unused>0) 884 100162 blogbench 1.472433 CALL read(0x31,0x602f60,0x10000) 884 100190 blogbench 1.472438 CALL read(0x2a,0x602f60,0x10000) 884 100130 blogbench 1.472442 CALL open(0x7ffffebf5770,0x601<o_wronly O_CREAT O_TRUNC>,0x180<S_IRUSR S_IWUSR>) 884 100190 blogbench 1.472443 CALL close(0x2a) 884 100141 blogbench 1.472451 CALL open(0x7ffffd5eaba0,0<o_rdonly>,<unused>0) 884 100162 blogbench 1.472451 CALL read(0x31,0x602f60,0x10000) 884 100190 blogbench 1.472450 CALL open(0x7ffff73b9ba0,0<o_rdonly>,<unused>0) 884 100141 blogbench 1.472460 CALL read(0x2a,0x602f60,0x10000) 884 100130 blogbench 1.472464 CALL write(0x32,0x612f60,0x13f2) 884 100190 blogbench 1.472465 CALL open(0x7ffff73b9ba0,0<o_rdonly>,<unused>0) 884 100162 blogbench 1.472461 CALL close(0x31) 884 100190 blogbench 1.472475 CALL read(0x31,0x602f60,0x10000) 884 100162 blogbench 1.472477 CALL open(0x7ffffabd5ba0,0<o_rdonly>,<unused>0) 884 100141 blogbench 1.472480 CALL read(0x2a,0x602f60,0x10000) 884 100130 blogbench 1.472478 CALL close(0x32)

Why is Blogbench slow on FreeBSD? 800000 close() 700000 read() (not distinguishing between open(o_rd) and open(o_wr) ) 600000 500000 open() 400000 300000 write() 200000 100000 0

PostgreSQL pgbench Initialized with -s 1000 100,000,000 records, 16 GB size (fits in RAM, for RO) PostgreSQL configuration: 8 GB shared buffers 32 MB work memory autovacuum off 30 checkpoint segments

PostgreSQL / pgbench caveats WAL logging means: data is written to checkpoint segments, restarts are needed between WRITE benchmarks data is transferred to proper storage almost unpredictably, long-ish runs are needed for repeatable benchmarks Autovacuum can run almost unpredictably

PostgreSQL results 60000 50000 40000 30000 20000 10000 0 1 2 4 8 Linux disk RO avg Linux ramfs RW avg FreeBSD disk RW avg FreeBSD remote tmpfs RO avg 12 16 24 Linux disk RW avg Linux remote ramfs RO avg FreeBSD tmpfs RO avg 32 48 64 Linux ramfs RO avg FreeBSD disk RO avg FreeBSD tmpfs RW avg 128

PostgreSQL result notes On Linux, there's almost no difference between ext4 when data is fully cached (warmed) and ramfs On FreeBSD, tmpfs is faster 60000 50000 40000 30000 20000 10000 0 1 2 4 8 12 16 24 32 48 64 128 Linux disk RO avg FreeBSD disk RO avg Linux ramfs RO avg FreeBSD tmpfs RO avg

PostgreSQL result notes Linux is consistently 5%-10% better 1200 1000 800 600 400 200 0 1 2 4 8 12 16 24 32 48 64 128 Linux disk RW avg FreeBSD disk RW avg

PostgreSQL result notes Remote access doesn't help much 60000 50000 40000 30000 20000 Scheduler issue 10000 0 1 2 4 Linux ramfs RO avg 8 12 Linux remote ramfs RO avg 16 24 FreeBSD tmpfs RO avg 32 48 64 FreeBSD remote tmpfs RO avg 128

However... 200000 180000 160000 140000 120000 100000 80000 60000 40000 20000 Different benchmark, by Florian Smeets 40-core, 80-thread system, 256 GB RAM scale 100 10,000,000 records Linux wins after 32 threads 0 Threads 2 Threads 4 Threads 8 Threads 10 Threads 16 Threads 20 Threads 24 Threads 32 FreeBSD-VMC-233854-postgres-9.2-devel Linux-kernel-3.3.0-glibc-2.15-postgres-9.2-devel FreeBSD-head-r233892

Filebench Dubious correctness of the port... fileserver and webproxy profiles fileserver: Emulates simple file-server I/O activity. This workload performs a sequence of creates, deletes, appends, reads, writes and attribute operations on a directory tree. 50 threads are used by default. The workload generated is somewhat similar to SPECsfs. webproxy: Emulates I/O activity of a simple web proxy server. A mix of create-write-close, open-read-close, and delete operations of multiple files in a directory tree and a file append to simulate proxy log. 100 threads are used by default. Local drive, NFSv3 and NFSv4

fileserver profile 900 800 700 600 500 400 NFSv4 is slow even on Linux Should not be possible 300 200 100 0 Linux local MB/s Linux NFSv3 MB/s Linux NFSv4 MB/s FreeBSD local MB/s FreeBSD NFSv3 MB/s FreeBSD NFSv4 MB/s

webproxy profile 800 700 600 500 Possible, but improbable 400 300 200 100 0 Linux local MB/s Linux NFSv3 MB/s Linux NFSv4 MB/s FreeBSD local MB/s FreeBSD NFSv3 MB/s FreeBSD NFSv4 MB/s

Bullet Cache My own cache server, presented at BSDCan 2012 :) Lots of small (32 byte 128 byte) TCP command / response transactions Multithreaded + non-blocking IO (nearly 2,000,000 TPS over Unix sockets on medium-end hardware)

Linux support incomplete, lacks epoll() Within measurement error, 9.1 == 10 600000 500000 400000 300000 200000 100000 0 CentOS 6.3 FreeBSD 9.1 FreeBSD 10 100 conn 200 conn 300 conn 400 conn 500 conn

Result notes TCP is fairly concurrent in FreeBSD, multiple TCP streams do not block each other > 470,000 PPS per direction Over Unix sockets (local): 1,020,000 TPS

On tuning... The year is 2013 and FreeBSD actually auto-tunes reasonably well Example #1: vfs.read_max Example #2: hw.em.txd/rxd Example #3: kern.maxusers + hi/lobufspace Example #4: vm.pmap.shpgperproc

Why no CPU benchmarks? You are not going to influence raw CPU performance from the OS (except in edge cases / misconfiguration) You can test the quality of the libc and libm implementations... but be sure that is what you want to You can also test the quality of compiler optimizations... if you want to.

(LLVM vs GCC) (do not draw conclusions based on this) (Phoronix benchmark, Botan/KASUMI) Botan v1.10.3 Test: KASUMI Mbytes/s, More Is Better OpenBenchmarking.org GCC 4.7.2 SE +/- 0.00 39.35 GCC 4.8.0 SE +/- 0.00 37.91 LLVM Clang 3.2 SE +/- 0.00 39.34 LLVM Clang 3.3 SVN SE +/- 0.00 65.75 15 30 45 60 75 1. (CXX) g++ options: -m64 -ldl -lpthread -lrt Powered By Phoronix Test Suite 4.4.1

Why NOT benchmark? When is benchmarking important? Is performance more important than features? Is it cheaper to buy another machine than to reconfigure / develop a better OS? The future is actually quite good During 7.x timeframe: 8 CPU scalability During 10.x timeframe: 32 CPU scalability

Continuous benchmarking? Some talks were had... It would probably be easier to set up now after the developer cluster has been updated...

The End Benchmarking FreeBSD Ivan Voras <ivoras@freebsd.org>