Scalable Internet Services and Load Balancing



Similar documents
Scalable Internet Services and Load Balancing

Database Scalability and Oracle 12c

Lecture 3: Scaling by Load Balancing 1. Comments on reviews i. 2. Topic 1: Scalability a. QUESTION: What are problems? i. These papers look at

Testing & Assuring Mobile End User Experience Before Production. Neotys

CS514: Intermediate Course in Computer Systems

1. Comments on reviews a. Need to avoid just summarizing web page asks you for:

Why SSL is better than IPsec for Fully Transparent Mobile Network Access

Amazon Web Services Primer. William Strickland COP 6938 Fall 2012 University of Central Florida

Agenda. Distributed System Structures. Why Distributed Systems? Motivation

Web Caching and CDNs. Aditya Akella

Large-Scale Web Applications

Lustre Networking BY PETER J. BRAAM

In Memory Accelerator for MongoDB

Netsweeper Whitepaper

TRILL Large Layer 2 Network Solution

Internet Content Distribution

Application Delivery Networking

Load Balancing and Sessions. C. Kopparapu, Load Balancing Servers, Firewalls and Caches. Wiley, 2002.

Introduction: Why do we need computer networks?

Server Traffic Management. Jeff Chase Duke University, Department of Computer Science CPS 212: Distributed Information Systems

Cluster Computing. ! Fault tolerance. ! Stateless. ! Throughput. ! Stateful. ! Response time. Architectures. Stateless vs. Stateful.

INCREASING THE CLOUD PERFORMANCE WITH LOCAL AUTHENTICATION

How To Understand The Power Of The Internet

White Paper A10 Thunder and AX Series Load Balancing Security Gateways

Private Cloud Storage for Media Applications. Bang Chang Vice President, Broadcast Servers and Storage

DATA COMMUNICATOIN NETWORKING

SiteCelerate white paper

How To Balance A Load Balancer On A Server On A Linux (Or Ipa) (Or Ahem) (For Ahem/Netnet) (On A Linux) (Permanent) (Netnet/Netlan) (Un

DEPLOYMENT GUIDE Version 1.1. DNS Traffic Management using the BIG-IP Local Traffic Manager

Understanding Slow Start

A Standard Modest WebSite

Building a Highly Available and Scalable Web Farm

Datacenter Operating Systems

安 瑞 科 技 物 聯 網 對 應 用 交 付 器 (ADC) 的 需 求 及 應 用 實 例 徐 乃 丁 博 士 研 發 副 總 裁 / 技 術 長

Mirror File System for Cloud Computing

Data Centers and Cloud Computing. Data Centers

High-Performance DNS Services in BIG-IP Version 11

F5 and Oracle Database Solution Guide. Solutions to optimize the network for database operations, replication, scalability, and security

Cloud Computing. Chapter 8 Virtualization

Exploiting Remote Memory Operations to Design Efficient Reconfiguration for Shared Data-Centers over InfiniBand

Load Balancing Security Gateways WHITE PAPER

Lightweight DNS for Multipurpose and Multifunctional Devices

Cisco Application Networking for IBM WebSphere

Cloud Computing. Chapter 4 Infrastructure as a Service (IaaS)

NoSQL Data Base Basics

Network Virtualization and Data Center Networks Data Center Virtualization - Basics. Qin Yin Fall Semester 2013

Application layer Protocols application transport

First Midterm for ECE374 02/25/15 Solution!!

Topics. 1. What is load balancing? 2. Load balancing techniques 3. Load balancing strategies 4. Sessions 5. Elastic load balancing

Mobile Performance Testing Approaches and Challenges

A Reliable and Fast Data Transfer for Grid Systems Using a Dynamic Firewall Configuration

Enterprise Applications

Network Attached Storage. Jinfeng Yang Oct/19/2015

Avaya P333R-LB. Load Balancing Stackable Switch. Load Balancing Application Guide

The Sierra Clustered Database Engine, the technology at the heart of

Global Server Load Balancing

Directory Enabled Distributed Packet Filtration System

Content Delivery Networks

Intel Ethernet Switch Load Balancing System Design Using Advanced Features in Intel Ethernet Switch Family

Looking Ahead The Path to Moving Security into the Cloud

ScaleArc for SQL Server

Monitoring applications to increase security in 40G and 100G networks

Stateful Inspection Technology

Overview - Using ADAMS With a Firewall

ScaleArc idb Solution for SQL Server Deployments

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

Overview - Using ADAMS With a Firewall

LSI SAS inside 60% of servers. 21 million LSI SAS & MegaRAID solutions shipped over last 3 years. 9 out of 10 top server vendors use MegaRAID

Key Components of WAN Optimization Controller Functionality

Content Delivery Networks

Hewlett Packard - NBU partnership : SAN (Storage Area Network) или какво стои зад облаците

NLoad Balancing Stackable Switch

Tier Architectures. Kathleen Durant CS 3200

Getting More Performance and Efficiency in the Application Delivery Network

Concierge SIEM Reporting Overview

Scala Storage Scale-Out Clustered Storage White Paper

SVN5800 Secure Access Gateway

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

Apache Hadoop. Alexandru Costan

Router Architectures

Avaya P330 Load Balancing Manager User Guide

DDoS Attacks & Mitigation

Small is Better: Avoiding Latency Traps in Virtualized DataCenters

ZEN LOAD BALANCER EE v3.04 DATASHEET The Load Balancing made easy

3/21/2011. Topics. What is load balancing? Load Balancing

Alteon Web OS. Intelligent Internet. What s New in Alteon Web OS Alteon Web OS Benefits. Product Brief

DNS (Domain Name System) is the system & protocol that translates domain names to IP addresses.

Direct NFS - Design considerations for next-gen NAS appliances optimized for database workloads Akshay Shah Gurmeet Goindi Oracle

CitusDB Architecture for Real-Time Big Data

WAN Optimization, Web Cache, Explicit Proxy, and WCCP. FortiOS Handbook v3 for FortiOS 4.0 MR3

I N T E R S Y S T E M S W H I T E P A P E R F O R F I N A N C I A L SERVICES EXECUTIVES. Deploying an elastic Data Fabric with caché

Deploying IPv6, Now. Christian Huitema. Architect Windows Networking & Communications Microsoft Corporation

High Performance Cluster Support for NLB on Window

Cisco and Visual Network Systems: Implement an End-to-End Application Performance Management Solution for Managed Services

Load-Balancing Introduction (with examples...)

Technology Insight Series

Scalability of web applications. CSCI 470: Web Science Keith Vertanen

Business Case for a DDoS Consolidated Solution

Monitoring Traffic manager

Cray: Enabling Real-Time Discovery in Big Data

Transcription:

Scalable Services and Load Balancing Kai Shen Services brings ubiquitous connection based applications/services accessible to online users through Applications can be designed and launched quickly and immediately open to many many potential users Trends: Centralization of applications/services Scalability requirements: many simultaneous user accesses; large amount of hosted data, 4/8/2013 CSC 257/457 - Spring 2013 1 4/8/2013 CSC 257/457 - Spring 2013 2 Step 1 Crawling Crawling get all these Web pages out there: First retrieve some root pages; Parse their content and follow hyperlinks to retrieve more pages; Depth first search or breadth first search? Performance Analysis for Crawling What are the resources involved? CPU processing for TCP/HTTP protocol handling and the parsing of page content writing to disk storage network bandwidth to remote web sites Assume average page size 10KB raw processing power of a single CPU 1000 pages/sec I/O to a single disk 100 seeks/sec up to 100 pages/sec network bandwidth from/to the T1 link (1.5Mbit/s) 12 pages/sec T3 link (45Mbit/s) 360 pages/sec 4/8/2013 CSC 257/457 - Spring 2013 3 4/8/2013 CSC 257/457 - Spring 2013 4 CSC 257/457 - Spring 2013 1

Challenge Parallel Crawling Avoid redundant crawling Indexing Step 2 Indexing crawled raw web pages are not easy to search. we index them to formats that are easy to search. As part of indexing, we need to give each page an ID using a hash function. Computer: Page #123 Page #357 Networks: Page #124 Page #468 Fast intersection? 4/8/2013 CSC 257/457 - Spring 2013 5 4/8/2013 CSC 257/457 - Spring 2013 6 Step 3 Online Search Partitioning and Replication Index server Index servers (partition 1) Firewall Web server/ Query handler Local-area network Local-area network Index servers (partition 2) Page server Web server/ Query handlers Scalability, reliability 4/8/2013 CSC 257/457 - Spring 2013 7 Page servers 4/8/2013 CSC 257/457 - Spring 2013 8 CSC 257/457 - Spring 2013 2

Load Balancing over Servers Popular sites like Google or Facebook receive tens or hundreds of millions of hits per day. A large number of replicated servers are used at these sites. Key question: how to balance client requests over these servers? Goals: Performance Ease of deployment 4/8/2013 CSC 257/457 - Spring 2013 9 Load Balancing on Servers Technique 1 DNS Rotation IP address of Facebook.com? IP address of Facebook.com? DNS server 4/8/2013 CSC 257/457 - Spring 2013 10 Discussions on DNS Rotation Advantages Require almost no change on the existing architecture Load Balancing on Servers Technique 2 Cooperative Offloading Problems DNS Caching Rigid load balancing policy can t balance based on runtime load changes slow or no adjustment in response to failures 4/8/2013 CSC 257/457 - Spring 2013 11 4/8/2013 CSC 257/457 - Spring 2013 12 CSC 257/457 - Spring 2013 3

Discussions on Cooperative Offloading Can be combined with the DNS rotation. Advantages: More flexible policy is possible Be more responsive to runtime workload and server failures (to a certain degree) Problems: Need software changes on servers Longer delay 4/8/2013 CSC 257/457 - Spring 2013 13 Cooperative Offloading with TCP Handoff [Pai et al. ASPLOS1998] What does 1.3 do? What does 1.4 do? What does client do? 1.3 1.3 1.4 for CNN.com Where is the TCP state? All packets in a TCP connection must offload to one server! 4/8/2013 CSC 257/457 - Spring 2013 14 Cooperative Offloading vs. TCP Handoff Software changes on the servers Delays Load Balancing on Servers Technique 3 Load Balancing 1.1 1.2 Firewall LB 1.2 1.1 128.111.1.1 All packets in a TCP connection must offload to one server. for CNN.com 4/8/2013 CSC 257/457 - Spring 2013 15 4/8/2013 CSC 257/457 - Spring 2013 16 CSC 257/457 - Spring 2013 4

More About Load Balancing Summary Load balancing policies in LB routers (Goal: transparency, plugand play) Simple rotation Least number of active requests Shortest response time How deep do we look into the network protocol stack? Network layer (IP)? Transport layer (TCP/UDP)? Application layer? Scalable servers partitioning replication Load balancing for servers DNS rotation Cooperative offloading (w. TCP handoff) Load balancing router Changes required on the components: DNS server?? Web server?? Client???? 4/8/2013 CSC 257/457 - Spring 2013 17 4/8/2013 CSC 257/457 - Spring 2013 18 CSC 257/457 - Spring 2013 5