Enterprise Applications



Similar documents
Thread level parallelism

Binary search tree with SIMD bandwidth optimization using SSE

EMC XtremSF: Delivering Next Generation Storage Performance for SQL Server

Benchmarking Cassandra on Violin

In-Memory Databases Algorithms and Data Structures on Modern Hardware. Martin Faust David Schwalb Jens Krüger Jürgen Müller

Performance And Scalability In Oracle9i And SQL Server 2000

Lecture 11: Multi-Core and GPU. Multithreading. Integration of multiple processor cores on a single chip.

BENCHMARKING CLOUD DATABASES CASE STUDY on HBASE, HADOOP and CASSANDRA USING YCSB

Lecture 23: Multiprocessors

COSC 6374 Parallel Computation. Parallel I/O (I) I/O basics. Concept of a clusters

Petabyte Scale Data at Facebook. Dhruba Borthakur, Engineer at Facebook, SIGMOD, New York, June 2013

RAID. RAID 0 No redundancy ( AID?) Just stripe data over multiple disks But it does improve performance. Chapter 6 Storage and Other I/O Topics 29

MySQL performance in a cloud. Mark Callaghan

WITH A FUSION POWERED SQL SERVER 2014 IN-MEMORY OLTP DATABASE

Chapter Introduction. Storage and Other I/O Topics. p. 570( 頁 585) Fig I/O devices can be characterized by. I/O bus connections

Distributed Architecture of Oracle Database In-memory

DELL s Oracle Database Advisor

The team that wrote this redbook Comments welcome Introduction p. 1 Three phases p. 1 Netfinity Performance Lab p. 2 IBM Center for Microsoft

PSAM, NEC PCIe SSD Appliance for Microsoft SQL Server (Reference Architecture) September 11 th, 2014 NEC Corporation

1 Storage Devices Summary

EMC XtremSF: Delivering Next Generation Performance for Oracle Database


MapReduce and Hadoop Distributed File System V I J A Y R A O

Scalable Internet Services and Load Balancing

The IBM Cognos Platform for Enterprise Business Intelligence

Precise and Accurate Processor Simulation

Putting Apache Kafka to Use!

2009 Oracle Corporation 1

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

Concept of Cache in web proxies

Express5800 Scalable Enterprise Server Reference Architecture. For NEC PCIe SSD Appliance for Microsoft SQL Server

Using Synology SSD Technology to Enhance System Performance Synology Inc.

Price/performance Modern Memory Hierarchy

Parallel Computing 37 (2011) Contents lists available at ScienceDirect. Parallel Computing. journal homepage:

Web Server Software Architectures

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

Lecture 36: Chapter 6

PART IV Performance oriented design, Performance testing, Performance tuning & Performance solutions. Outline. Performance oriented design

Unstructured Data Accelerator (UDA) Author: Motti Beck, Mellanox Technologies Date: March 27, 2012

MapReduce and Hadoop Distributed File System

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

Database Management Systems

Why Computers Are Getting Slower (and what we can do about it) Rik van Riel Sr. Software Engineer, Red Hat

How To Create A Multi Disk Raid

Improve Business Productivity and User Experience with a SanDisk Powered SQL Server 2014 In-Memory OLTP Database

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

One Server Per City: C Using TCP for Very Large SIP Servers. Kumiko Ono Henning Schulzrinne {kumiko, hgs}@cs.columbia.edu

Study Plan Masters of Science in Computer Engineering and Networks (Thesis Track)

Evaluation Report: Accelerating SQL Server Database Performance with the Lenovo Storage S3200 SAN Array

Using Synology SSD Technology to Enhance System Performance Synology Inc.

Memory-Centric Database Acceleration

Performance Counters. Microsoft SQL. Technical Data Sheet. Overview:

Dell Microsoft Business Intelligence and Data Warehousing Reference Configuration Performance Results Phase III

Parallel Programming Survey

Improving the Database Logging Performance of the Snort Network Intrusion Detection Sensor

Configuring Apache Derby for Performance and Durability Olav Sandstå

EMC VFCACHE ACCELERATES ORACLE

Maximum performance, minimal risk for data warehousing

Lab Validation Report

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS

Using Synology SSD Technology to Enhance System Performance. Based on DSM 5.2

Virtuoso and Database Scalability

Removing Performance Bottlenecks in Databases with Red Hat Enterprise Linux and Violin Memory Flash Storage Arrays. Red Hat Performance Engineering

Capacity Planning for Microsoft SharePoint Technologies

Agenda. Enterprise Application Performance Factors. Current form of Enterprise Applications. Factors to Application Performance.

HP ProLiant BL660c Gen9 and Microsoft SQL Server 2014 technical brief

Making Multicore Work and Measuring its Benefits. Markus Levy, president EEMBC and Multicore Association

Data Center Storage Solutions

EMC Unified Storage for Microsoft SQL Server 2008

RevoScaleR Speed and Scalability

SAP HANA - Main Memory Technology: A Challenge for Development of Business Applications. Jürgen Primsch, SAP AG July 2011

Systems Infrastructure for Data Science. Web Science Group Uni Freiburg WS 2014/15

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

OBSERVEIT DEPLOYMENT SIZING GUIDE

HP Smart Array Controllers and basic RAID performance factors

Petascale Software Challenges. Piyush Chaudhary High Performance Computing

OC By Arsene Fansi T. POLIMI

Operating System Impact on SMT Architecture

SQL Server 2008 Performance and Scale

MAGENTO HOSTING Progressive Server Performance Improvements

<Insert Picture Here> Best Practices for Extreme Performance with Data Warehousing on Oracle Database

IBM CELL CELL INTRODUCTION. Project made by: Origgi Alessandro matr Teruzzi Roberto matr IBM CELL. Politecnico di Milano Como Campus

Topics in basic DBMS course

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 5 - DBMS Architecture

Accelerating Database Applications on Linux Servers

Comprehending the Tradeoffs between Deploying Oracle Database on RAID 5 and RAID 10 Storage Configurations. Database Solutions Engineering

Reduce Latency and Increase Application Performance Up to 44x with Adaptec maxcache 3.0 SSD Read and Write Caching Solutions

ANALYTICS BUILT FOR INTERNET OF THINGS

Azul Compute Appliances

SMB Direct for SQL Server and Private Cloud

COSC 6374 Parallel Computation. Parallel I/O (I) I/O basics. Concept of a clusters

Transcription:

Enterprise Applications Chi Ho Yue Sorav Bansal Shivnath Babu Amin Firoozshahian EE392C Emerging Applications Study Spring 2003

Functionality Online Transaction Processing (OLTP) Users/apps interacting with database in real-time Online Analytical Processing(OLAP) / data mining Experts doing offline data analysis Web servers Serves static HTML / dynamically generated pages File servers Provide access to stored data over the network Video servers Special type of file servers

Data Set and Working Set OLTP: few GB OLAP: few hundred TB / 1-2 PB Web servers 10k-500k hits/hr. Around 100 concurrent connections BBC sees 200 hits/sec. Peak: 2000 hits/sec File size: frequently 4KB, avg. 18KB File/video servers Huge amounts of stored data (TB/PB) Request size widely varies

Computational Complexity OLAP/data mining Could get complex: e.g., sort, pattern-matching OLTP Usually minimal, depends on applications Mostly transaction code Web server: 1 parent, 40 child processes Each child process parses request, retrieves content, writes to TCP connection Tree like path (1-out of-5 instructions is a branch) File & video servers Algorithms like : scheduling, pre-fetching, data distribution, and fault tolerance

Memory Access Behavior OLTP and OLAP/mining Indexed and sequential access patterns Good spatial locality, limited temporal locality Large memory footprints conflict misses File and video servers Memory largely used as caches and buffers Large spatial locality in video servers Good pre-fetching behavior for video servers

Webservers: Strong Locality of Access Patterns Server side temporal Locality: LFU caching Works best

I/O Accesses OLTP I/O bus on multithread SMP memory system, bottleneck for scalability High bandwidth for simultaneous accesses OLAP/mining: I/O is not primary bottleneck High performance disk arrays (RAID) Exploit TLP to hide latency File and video servers Huge volumes of storage with high bandwidth to stored data Fast network connections

Types of Parallelism OLTP and OLAP High TLP: thread per query/transaction Limited ILP: more dependencies, less loops Web servers: TLP (24-40 child threads run concurrently) Limited ILP (studies revealed IPC on Pentium pro 1.5 times better than on Pentium) File and video servers Limited ILP, mostly control code

Architectural Requirements OLAP High TLP SMP, SMT work well Need high performance disk arrays to hide I/O latency Memory stalls are biggest bottleneck: large footprint OLTP SMP systems: parallel processing of shared data Precise exceptions, atomicity, coherence, fault tolerance Web Server Cache Size impacts performance greatly I/O bandwidth Branch Prediction/Speculation does not work too well File and video servers Huge storage High bandwidth to stored data

Benchmarks OLTP TPC-C : throughput for a mixture of read-only and update intensive transactions OLAP TPC-R, TPC-H : models a decision support system in a manufacturing application; 22 complex SQL queries Throughput, price/performance Web servers SPECweb96, SPECweb99, Webstone File servers SPEC SFS : Throughput vs. Response time Postmark : pool of dynamic and small files

Scaling Trends OLTP, Web Servers Increasing number of simultaneous requests OLAP Increasing data sets Trying to close gap between data and decision What to do? Distributed Servers RAID have many disks per processor Bus bandwidth needs to scale

Emerging Applications Streaming applications Processing continuous streams instead of stored data Monitoring applications: network monitoring, intrusion detection, stock monitoring Run-time profiling and adaptivity is key Sensor data management Traffic monitoring and military applications Peer-to-peer systems

Conclusion Lots of TLP Mostly control code limited ILP Bottleneck Cache misses (greatly impacts performance) I/O bandwidth (disk to memory) Network bandwidth (memory to network) Branch mis-prediction rate (tree-like path) Speculative and OOO execution would be less useful