Yahoo! Communities Architectures Ian Flint

Similar documents
Maintaining Non-Stop Services with Multi Layer Monitoring

Scalable Architecture on Amazon AWS Cloud

Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid clouds.

GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project

Oracle Identity Analytics Architecture. An Oracle White Paper July 2010

Vistara Lifecycle Management

How Comcast Built An Open Source Content Delivery Network National Engineering & Technical Operations

Analyzing large flow data sets using. visualization tools. modern open-source data search and. FloCon Max Putas

Architecting ColdFusion For Scalability And High Availability. Ryan Stewart Platform Evangelist

VPS Cloud Hosting. Call (02)

A SURVEY ON AUTOMATED SERVER MONITORING

The Benefits of Verio Virtual Private Servers (VPS) Verio Virtual Private Server (VPS) CONTENTS

MySQL Enterprise Monitor

Ganglia & Nagios. Maciej Lasyk 11. Sesja Linuksowa Wrocław, /25. Maciej Lasyk, Ganglia & Nagios

Building Success on Acquia Cloud:

AGENDA: INTRODUCTION: 1. How is our cloud monitoring setup? 2. Which are the tools used? 3. How do we access monitoring dashboard?

Managing your Red Hat Enterprise Linux guests with RHN Satellite

Linux Server Support by Applied Technology Research Center. Proxy Server Configuration

ENTERPRISE-CLASS MONITORING SOLUTION FOR EVERYONE ALL-IN-ONE OPEN-SOURCE DISTRIBUTED MONITORING

Cloud Based Application Architectures using Smart Computing

SCALABLE DATA SERVICES

Monitoring Remedy with BMC Solutions

XpoLog Center Suite Log Management & Analysis platform

Best of Breed of an ITIL based IT Monitoring. The System Management strategy of NetEye

Winning the J2EE Performance Game Presented to: JAVA User Group-Minnesota

DTWMS Required Software Engineers. 1. Senior Java Programmer (3 Positions) Responsibilities:

ManageEngine (division of ZOHO Corporation) Infrastructure Management Solution (IMS)

XpoLog Center Suite Data Sheet

TPAf KTl Pen source. System Monitoring. Zenoss Core 3.x Network and

Practical Load Balancing

.NET UI Load Balancing & Clustering

Red Hat Network Satellite Management and automation of your Red Hat Enterprise Linux environment

Client Overview. Engagement Situation

PANDORA FMS NETWORK DEVICES MONITORING

Project Manager 1 Post == Experience years in Project Management in reputed company, Salary Rs.1,20,000/-

Wikimedia architecture. Mark Bergsma Wikimedia Foundation Inc.

How To Monitor Mysql With Zabbix

Amazon Web Services Primer. William Strickland COP 6938 Fall 2012 University of Central Florida

Siemens SAP Cloud - Infrastructure as a Service

ScienceLogic vs. Open Source IT Monitoring

XTM Web 2.0 Enterprise Architecture Hardware Implementation Guidelines. A.Zydroń 18 April Page 1 of 12

Enterprise Application Monitoring with

Investor Newsletter. Storage Made Easy Cloud Appliance High Availability Options WHAT IS THE CLOUD APPLIANCE?

Database Services for CERN

PANDORA FMS NETWORK DEVICE MONITORING

Open Source in the Data Centre. John Ferlito Bulletproof Networks

GroupWise SMTP Infrastructure Design:

Internet Services. CERN IT Department CH-1211 Genève 23 Switzerland

A Modular Architecture Using Open Source Components

Cost-Effective Business Intelligence with Red Hat and Open Source

Nagios and Cloud Computing

OpenITSM - IT Service Management with Open Source Software

COMPARING NETWORK AND SERVER MONITORING TOOLS

BlackBerry Enterprise Server for Microsoft Exchange Version: 5.0 Service Pack: 2. Feature and Technical Overview

ActiveVOS Server Architecture. March 2009

Application Discovery Manager User s Guide vcenter Application Discovery Manager 6.2.1

Red Hat Satellite Management and automation of your Red Hat Enterprise Linux environment

Mark Bennett. Search and the Virtual Machine

Jenkins User Conference Herzelia, July #jenkinsconf. Testing a Large Support Matrix Using Jenkins. Amir Kibbar HP

/ Administrator Training PAT-2012

.NET UI Load Balancing & Failover

OMNITURE MONITORING. Ensuring the Security and Availability of Customer Data. June 16, 2008 Version 2.0

USING OPEN SOURCE SOFTWARE IN DAILY ISP OPERATIONS

Application Performance Management for Enterprise Applications

XpoLog Competitive Comparison Sheet

ZABBIX. An Enterprise-Class Open Source Distributed Monitoring Solution. Takanori Suzuki MIRACLE LINUX CORPORATION October 22, 2009

SOLUTIONS INC. BACK-IT UP. Online Backup Solution

EMC Data Domain Boost for Oracle Recovery Manager (RMAN)

Techniques for implementing & running robust and reliable DB-centric Grid Applications

Cloud Storage in a PaaS World

High Availability Solutions for the MariaDB and MySQL Database

Scalability and BMC Remedy Action Request System TECHNICAL WHITE PAPER

Advanced Linux System Administration Knowledge GNU/LINUX Requirements

Security Compliance and Data Governance: Dual problems, single solution CON8015

Cloud Optimize Your IT

Clustered Database Reporting Solution utilizing Tivoli

Your eyes in the network

CHAPTER 4 PERFORMANCE ANALYSIS OF CDN IN ACADEMICS

Red Hat Global File System for scale-out web services

Xythos WebFile Server Architecture A Technical Guide to the Core Technology, Components, and Design of the Xythos WebFile Server Platform

BASICS OF SCALING: LOAD BALANCERS

PANDORA FMS OFFICIAL TRAINING

Red Hat Network Satellite (On System z) 18-JUNE CAVMEN Meeting

America s Most Wanted a metric to detect persistently faulty machines in Hadoop

How To Use Mindarray For Business

OpenITSM - IT Service Management with Open Source Software

ATUL TRIPATHI. Address: A-71 Pandav Nagar, New Delhi. atul.trp@gmail.com Mobile :

Virtualization Standards for Business Continuity: Part 1

TECHNOLOGY WHITE PAPER Jun 2012

GRAVITYZONE HERE. Deployment Guide VLE Environment

Vembu NetworkBackup v3.1.1 GA

MySQL Strategy. Morten Andersen, MySQL Enterprise Sales. Copyright 2014 Oracle and/or its affiliates. All rights reserved.

Monitoring, Managing and Supporting Enterprise Clouds with Oracle Enterprise Manager 12c Name, Title Oracle

DIR Contract Number DIR-TSO-2621 Appendix C Pricing Index

Rob Zoeteweij CUSTOMER CASE CONFIGURATION MANAGEMENT PROVISIONING & AUTOMATED PATCHING

USING JE THE BE NNIFE FITS Integrated Performance Monitoring Service Availability Fast Problem Troubleshooting Improved Customer Satisfaction

Datasheet FUJITSU Cloud Monitoring Service

Cisco PIX vs. Checkpoint Firewall

S3 Monitor Design and Implementation Plans

Learn Oracle WebLogic Server 12c Administration For Middleware Administrators

Transcription:

Yahoo! Communities Architectures Ian Flint November 9, 2007 1

Agenda What makes Yahoo! Yahoo!? Hardware Infrastructure Software Infrastructure Operational Infrastructure Process Examples 2

What makes Yahoo! Yahoo!? What do these sites have in common? Del.icio.us Flickr Yahoo! Groups Yahoo! Mail Bix 3

What makes Yahoo! Yahoo!? Accountability at the property level Architecture Application Operations Infrastructure Decisions Incubator Environment Properties function independently on a common hardware platform Highly cost-conscious Open-source attitude 4

What makes Yahoo! Yahoo!? Standards at the infrastructure level Hardware/Software platform Configuration Management Operational tools and best practices Executive Involvement Cost Robustness Redundancy 5

Hardware Infrastructure Common Platform 6

Hardware Infrastructure Shared Components Network, Data Center, NAS Centrally managed by infrastructure team Load Balancing DSR is preferred model Proxy load balancing only where necessary 7

Hardware Infrastructure Hardware (x86, RAID/SCSI) Jointly managed by properties and ops Hardware Selection Price/Performance is a constant consideration Supply chain and provisioning cost Reliability vs. Price Single-Homed hosts (even databases) Pooling across multiple switches Fast Failover to mitigate risk of switch failure 8

Hardware Infrastructure Example Layered Infrastructure Hosts distributed across multiple racks for power/network redundancy at the pool level Really Big Load Balancers doing DSR Router Load Balancer Aggregation Switch Rack Switches Hosts The Internet 9

Software Infrastructure Shared Repository 10

Software Infrastructure OS (FreeBSD, moving to RHEL) Databases (MySQL, Oracle) Development Platforms PHP (most properties) C/C++ (primary infrastructure platform) Java Python 11

Software Infrastructure Installable components Managed through yinst package manager Stored on common distribution server Examples: yapache, yts, yfor, ymon, yiv, vespa 12

Software Infrastructure More about yinst Robust Package Manager Installation, Versioning, Scripting Implementation Software installed on distribution cluster (package repository) Hosts then pull software (via proxies) Software stored under a common root Used for everything from perl modules to common components to applications 13

Software Infrastructure Shared Infrastructure enables rapid integration of acquisitions UDB SSO External Infrastructure Akamai CDN and DNS Gomez & Keynote SDS YMDB 14

Software Infrastructure - Bix The Internet Akamai (CDN) YTS (Reverse Proxy) YTS (Reverse Proxy) Akamai (CDN) Media Media Servers Media Servers Servers Yfor (failover resolver) Web Web Servers Web Servers Servers Primary Colo Backup Colo Web Web Servers Web Servers Yfor (failover resolver) Media Media Servers Media Servers Servers Global Server Load Balancing between sites YTS provides Reverse Proxy and Connection Management Yfor provides fast failover from colo to colo Media is served via a content delivery network for performance and to reduce load on servers 15

Software Infrastructure - Bix Yfor Failover Resolver used for fast failover of database connections Dual Master MySQL setup for write hosts Media storage on NetApp NAS device, with snapmirroring to backup data center 16

Software Infrastructure - Bix Yapache reverse proxy in front of Tomcat instance PHP used to access Yahoo shared services Static files served from disk Fairly standard Java environment (Spring, Hibernate, ehcache, c3po, log4j, etc.) Web Server Yapache (Yahoo-improved Apache) PHP Yahoo Shared Services Spring Lucene Hessian log4j Mod_proxy (Reverse Proxy) Tomcat Bix Application Static Files Hibernate ehcache c3po MySQL Connector 17

Software Infrastructure - Groups Inbound Groups mail hits a qmail cluster Mail filtered against real-time blacklist Mail forwarded to second qmail cluster Proprietary anti-spam algorithms applied Mail forwarded to group members Mail stored on archive servers Oracle RAC clusters store metadata Periodic Electric Potato measures QoS 18

Software Infrastructure Groups Dynamic content served via web pool running python/c++ application CSS and images served via a squid-fronted pool Group photos on Y! photos infrastructure backed by Yahoo! Media DB (YMDB) Database feature implemented as sleepycat DB hosted on message store Calendar feature implemented via API calls to calendar.yahoo.com 19

Operational Infrastructure Managing the Platform 20

Operational Infrastructure Common Monitoring Infrastructure Nagios Main monitor for clusters Numerous standard plugins Standards/Best Practices around custom plugins Ywatch Basic monitoring of machines over SNMP Heartbeats plus fundamental metrics (IO, CPU, Disk, etc.) Ymon NRPE/NSCA on steroids Automated forwarding of active and passive checks Scripted setup Drraw Data Visualization Deep integration with Nagios and ymon 21

Operational Infrastructure Rollup Monitoring Clusters rolled up to centralized monitoring console Prioritization and correlation of events Internal Site QOS Monitoring QOS monitoring for sites Response time and availability The OC 24x7, worldwide operations center Provides tier 1 and 2 support Centralized CMDB Configuration Management DB manages every device Contact info, escalations, and runbooks included 22

Operational Infrastructure Example Application Servers perform checks which are registered by Nagios as passive checks Metrics are aggregated by metrics module On-demand graphing is provided by drraw Nagios alerts are forwarded to central ywatch console SNMP-Based Checks Media Media Media Servers Servers Servers yapache Web Web Servers Web Servers Servers ymon server ymond ywatch server DB Servers Web Web Servers Servers metrics aggregator nagios grapher (rrdtool/drraw) forwarding agent yapache listener Ops Console 23

Processes and Standards Keeping it sane 24

Process and Standards Hardware Review Committee Strong emphasis on economics Personal attention from David Filo Software Review Committee Thinking through major licensing decisions Business Continuity Planning Required of all properties Must have and test backup data center Paranoids Ongoing site scans Enforcement of standards 25

Questions? 26