How SoundCloud scales



Similar documents
Scalable Architecture on Amazon AWS Cloud

What is Cloud Computing? Tackling the Challenges of Big Data. Tackling The Challenges of Big Data. Matei Zaharia. Matei Zaharia. Big Data Collection

HYPER-CONVERGED INFRASTRUCTURE STRATEGIES

Understanding AWS Storage Options

Ground up Introduction to In-Memory Data (Grids)

Tech Radar - May 2015

So What s the Big Deal?

In Memory Accelerator for MongoDB


SCALABILITY. Hodicska Gergely. Web Engineering Manager as Ustream. May 7, 2012

Hadoop: Distributed Data Processing. Amr Awadallah Founder/CTO, Cloudera, Inc. ACM Data Mining SIG Thursday, January 25 th, 2010

Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013

Performance and Scalability Overview

Department of Computer Science University of Cyprus EPL646 Advanced Topics in Databases. Lecture 14

Cloud Computing and Amazon Web Services. CJUG March, 2009 Tom Malaher

High Availability Using MySQL in the Cloud:

EMC BACKUP MEETS BIG DATA

Managing MySQL Scale Through Consolidation

Welcome to Virtual Developer Day MySQL!

Scalable Web Application

MakeMyTrip CUSTOMER SUCCESS STORY

The Flash Transformed Data Center & the Unlimited Future of Flash John Scaramuzzo Sr. Vice President & General Manager, Enterprise Storage Solutions

Intro to AWS: Storage Services

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)

WOLKEN KOSTEN GELD GUSTAVO ALONSO SYSTEMS GROUP ETH ZURICH

MongoDB and Couchbase

London: capital of debt. Reducing the health consequences of personal debt

Can Flash help you ride the Big Data Wave? Steve Fingerhut Vice President, Marketing Enterprise Storage Solutions Corporation

High Performance MySQL Choices in Amazon Web Services: Beyond RDS. Andrew Shieh, SmugMug Operations smugmug.

The Power of Pentaho and Hadoop in Action. Demonstrating MapReduce Performance at Scale

Introduction to Polyglot Persistence. Antonios Giannopoulos Database Administrator at ObjectRocket by Rackspace

High Availability of VistA EHR in Cloud. ViSolve Inc. White Paper February

The Flash-Transformed Financial Data Center. Jean S. Bozman Enterprise Solutions Manager, Enterprise Storage Solutions Corporation August 6, 2014

Performance and Scalability Overview

Ø Teaching Evaluations. q Open March 3 through 16. Ø Final Exam. q Thursday, March 19, 4-7PM. Ø 2 flavors: q Public Cloud, available to public

Big data blue print for cloud architecture

Open source large scale distributed data management with Google s MapReduce and Bigtable

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam

OTM in the Cloud. Ryan Haney

Offerte del 10 maggio 2013

Building. Applications. in the Cloud. Concepts, Patterns, and Projects. AAddison-Wesley. Christopher M. Mo^ar. Cape Town Sydney.

Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh

STeP-IN SUMMIT June 18 21, 2013 at Bangalore, INDIA. Performance Testing of an IAAS Cloud Software (A CloudStack Use Case)

CiteSeer x in the Cloud

Building Your Big Data Team

MOVING THE ELEPHANT IN THE ROOM Data Migration at Scale

Secrets to Backups and Disaster Recovery for Architects. How to Prevent Losing Your Data And Losing Your Business

Microservices on AWS

Open source Google-style large scale data analysis with Hadoop

ebay Storage, From Good to Great

Hadoop IST 734 SS CHUNG

Big Data Web Analytics Platform on AWS for Yottaa

Savanna Hadoop on. OpenStack. Savanna Technical Lead

Planning an OpenStack PoC

Real Time Big Data Processing

How, What, and Where of Data Warehouses for MySQL

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

Building HTML5 and hybrid mobile apps using cloud services. Andrei Glazunov

YARN, the Apache Hadoop Platform for Streaming, Realtime and Batch Processing

Data Pipeline with Kafka

Web Application Hosting in the AWS Cloud Best Practices

Hadoop on OpenStack Cloud. Dmitry Mescheryakov Software

OPTIMIZING APPLICATION MONITORING

Embracing Open Source: Practice and Experience from Alibaba. Wensong Zhang The 11 th Northeast Asia OSS Promotion Forum

Power Tools for Pivotal Tracker

Scaling Pinterest. Yash Nelapati Ascii Artist. Pinterest Engineering. Saturday, August 31, 13

Time-to-live. Adrian Colyer CTO SpringSource

Search Big Data with MySQL and Sphinx. Mindaugas Žukas

Improve performance and availability of Banking Portal with HADOOP

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Database Virtualization and the Cloud

Are You Ready for the Holiday Rush?

From SOA to MOA: Transitioning from Service to Mobile Oriented Architectures

Time series IoT data ingestion into Cassandra using Kaa

Certified Cloud Computing Professional VS-1067

BUILDING THE IXONOS ELASTIC CLOUD WITH RED HAT. Ixonos Plc

bla bla OPEN-XCHANGE Open-Xchange Hardware Needs

How is a rational (big) data deployment approach like optimizing the generation mix of a power company?

TECHNOLOGY WHITE PAPER Jun 2012

Protect Data... in the Cloud

Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack

Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings

C a r l G o e t h a l s T e r r e m a r k E u r o p e. C a r l. g o e t h a l t e r r e m a r k. c o m

MySQL és Hadoop mint Big Data platform (SQL + NoSQL = MySQL Cluster?!)

Alexander Rubin Principle Architect, Percona April 18, Using Hadoop Together with MySQL for Data Analysis

Modern IT Operations Management. Why a New Approach is Required, and How Boundary Delivers

Flash Technology Update from Micron and Intel

Winning Against All Odds: Big Data for the Budget Travel Industry. Silviu Preoteasa Head of Marketing Technology

The Pros and Cons of Erasure Coding & Replication vs. RAID in Next-Gen Storage Platforms. Abhijith Shenoy Engineer, Hedvig Inc.

Middleware- Driven Mobile Applications

Transcription:

How SoundCloud scales Alexander Grosse @klangberater

Youtube for Audio

Fakten

Fakten Alexa Ranking (233)

Fakten Alexa Ranking (233) Employees (190) - 80 engineers

Fakten Alexa Ranking (233) Employees (190) - 80 engineers 4 office locations (Berlin, London, San Francisco, Sofia)

Fakten Alexa Ranking (233) Employees (190) - 80 engineers 4 office locations (Berlin, London, San Francisco, Sofia) >35 million registered users

Fakten Alexa Ranking (233) Employees (190) - 80 engineers 4 office locations (Berlin, London, San Francisco, Sofia) >35 million registered users partners (BBC, CNN)

Fakten Alexa Ranking (233) Employees (190) - 80 engineers 4 office locations (Berlin, London, San Francisco, Sofia) >35 million registered users partners (BBC, CNN) > 10.000 Apps

How SoundCloud scales

?

Organization Technology

What is the foundation?

Principles

Principles Top-Down vs. Bottom-Up

Principles Top-Down vs. Bottom-Up Trust

Principles Top-Down vs. Bottom-Up Trust How to react to mistakes

Principles Top-Down vs. Bottom-Up Trust How to react to mistakes Continuous Improvement

Principles Top-Down vs. Bottom-Up Trust How to react to mistakes Continuous Improvement Innovation

Top-Down vs. Bottom-Up Flat Hierarchies? Everybody has them... Engineering teams responsible for architecture (not architecture teams) Influence on the product

Trust Deployment Change Management Process Time for technical debt

Reaction to mistakes What will be changed if a mistake happens?

Innovation R&D Department? Only Product?

SoundCloud s basic principles

SoundCloud s basic principles No dedicated QA roles

SoundCloud s basic principles No dedicated QA roles You build it - you run it

SoundCloud s basic principles No dedicated QA roles You build it - you run it Team Building (Rotation, constant changes)

SoundCloud s basic principles No dedicated QA roles You build it - you run it Team Building (Rotation, constant changes) Hacker Time

SoundCloud s basic principles No dedicated QA roles You build it - you run it Team Building (Rotation, constant changes) Hacker Time Continuous Improvement

SoundCloud s basic principles No dedicated QA roles You build it - you run it Team Building (Rotation, constant changes) Hacker Time Continuous Improvement Move fast and break things

No QA Does that mean we don t test?

You build it - you run it

Flat Hierarchy

Innovation Hacker Time

Continuous Improvement Retrospectives Post Mortems (Projekte)

Mistakes...

Scrum? Kanban? Process?

What is the most important thing you have to do so that this works?

???

??? Initial Culture

??? Initial Culture Hiring!!!

SoundCloud Hiring Process

How does it look like in reality?

Organisation

HTML5 Mobile (native) other V1 API Search T&S Payment Partner Tools Data Activ. Delivery Platform System Engineering (Berlin / SF)

How do we structure teams? cross functional?

Do teams stay the same? Change!

Technology

Ruby/Scala/Go/Clojure/ Javascript

Scale Horizontally/ Vertically?

Rails->Mysql RabbitMQ->Cassandra http://backstage.soundcloud.com/2012/08/ evolution-of-soundclouds-architecture/

MySQL 7 Master, 9 slaves, 900GB data Additional slaves in EC2 Master Peak is 15K IOPS writes, 45k reads one slave lags intentionally 1 hour behind Je nach Konsistenzanforderung wird Slave oder Master gewählt (für reads - unterstützt read-only mode)

Cassandra 3 Cluster 88 Nodes 15 TB of data up to 70K reads/sec

Hadoop Elastic Search Proprietary Spam Detection

ElasticSearch 80 Mio Docs (gaining 1 Mio every 6 days) 175 QPS (gaining 15 QPS every month)

Hosted in S3/EC2 and physical DCs

Outages

Post Mortems

Summary Move fast and break things Don t get slow because you fear to make mistakes

backstage.soundcloud.com

Alexander Grosse alexandergrosse.com @klangberater We are hiring! www.soundcloud.com/jobs