Deploying Splunk on Amazon Web Services



Similar documents
How AWS Pricing Works

How AWS Pricing Works May 2015

Cloud Computing and Amazon Web Services

Amazon Elastic Compute Cloud Getting Started Guide. My experience

Best PracBces: Deploying Splunk on Physical, Virtual, and Cloud Infrastructure

Cloud Computing with Amazon Web Services and the DevOps Methodology.

Scalable Architecture on Amazon AWS Cloud

Using ArcGIS for Server in the Amazon Cloud

Introduction to Cloud Computing

Smartronix Inc. Cloud Assured Services Commercial Price List

Amazon EC2 Product Details Page 1 of 5

Description of Application

Copyright 2015 Splunk Inc. Go Big or Go Home. Sean Delaney Specialist SE Mustafa Ahamed Director, Product Management

Deep Dive: Maximizing EC2 & EBS Performance

AWS Performance Tuning

Zadara Storage Cloud A

Amazon Elastic Beanstalk

Alfresco Enterprise on AWS: Reference Architecture

Cloud Computing and E-Commerce

Hosting Requirements Smarter Balanced Assessment Consortium Contract 11 Test Delivery System. American Institutes for Research

EXECUTIVE SUMMARY CONTENTS. 1. Summary 2. Objectives 3. Methodology and Approach 4. Results 5. Next Steps 6. Glossary 7. Appendix. 1.

Hadoop & Spark Using Amazon EMR

The High-Performance Cloud Infrastructure Company! 2011 Joyent, Inc. Contains Joyent Restricted Secrets. Not for Public Disclosure. Patents Pending.!

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

Building an AWS-Compatible Hybrid Cloud with OpenStack

ArcGIS for Server in the Amazon Cloud. Michele Lundeen Esri

THE DEFINITIVE GUIDE FOR AWS CLOUD EC2 FAMILIES

PUBLIC CLOUD USAGE TRENDS

MySQL and Virtualization Guide

SQL Server Virtualization 101. David Klee, Group Principal and Practice Lead. SQL PASS Virtualization VC,

High Performance MySQL Choices in Amazon Web Services: Beyond RDS. Andrew Shieh, SmugMug Operations smugmug.

PostgreSQL Performance Characteristics on Joyent and Amazon EC2

Increased Security, Greater Agility, Lower Costs for AWS DELPHIX FOR AMAZON WEB SERVICES WHITE PAPER

IBM Platform Computing Cloud Service Ready to use Platform LSF & Symphony clusters in the SoftLayer cloud

DIABLO TECHNOLOGIES MEMORY CHANNEL STORAGE AND VMWARE VIRTUAL SAN : VDI ACCELERATION

SoftNAS Architecture on AWS

Gain Insight into Your Cloud Usage with the Splunk App for AWS

AWS Cloud for HPC and Big Data

MaxDeploy Ready. Hyper- Converged Virtualization Solution. With SanDisk Fusion iomemory products

Deep Security For Service Providers

Traditional v/s CONVRGD

Scaling in the Cloud with AWS. By: Eli White (CTO & mojolive) eliw.com - mojolive.com

How To Set Up Wiremock In Anhtml.Com On A Testnet On A Linux Server On A Microsoft Powerbook 2.5 (Powerbook) On A Powerbook 1.5 On A Macbook 2 (Powerbooks)

FortiGate Amazon Machine Image (AMI) Selection Guide for Amazon EC2

Using WebSphere Application Server on Amazon EC2. Speaker(s): Ed McCabe, Arthur Meloy

Deploying Database clusters in the Cloud

How To Use Arcgis For Free On A Gdb (For A Gis Server) For A Small Business

Efficiency Analysis of AWS Offering Vs. Private/Hybrid Implementations or Traditional Colo

OTM in the Cloud. Ryan Haney

Amazon Web Services Student Tutorial

Designing Apps for Amazon Web Services

A Shared File System on SAS Grid Manger in a Cloud Environment

Cloud Models and Platforms

How To Choose Between A Relational Database Service From Aws.Com

Providing Self-Service, Life-cycle Management for Databases with VMware vfabric Data Director

Microsoft Windows Server Failover Clustering (WSFC) and SQL Server AlwaysOn Availability Groups on the AWS Cloud: Quick Start Reference Deployment

Resource Sizing: Spotfire for AWS

ArcGIS 10.3 Server on Amazon Web Services

LARGE-SCALE DATA STORAGE APPLICATIONS

Overview and Deployment Guide. Sophos UTM on AWS

An Introduction to Cloud Computing Concepts

Software AG and the AWS cloud. Past, Present and Best Practices. Jonathan Madamba Director, Solution Cloud John Fitzgerald Director, Product Marketing

Microsoft Private Cloud Fast Track

Part 1: Price Comparison Among The 10 Top Iaas Providers

Deploying for Success on the Cloud: EBS on Amazon VPC. Phani Kottapalli Pavan Vallabhaneni AST Corporation August 17, 2012

Microsoft SharePoint Server 2013 on the AWS Cloud: Quick Start Reference Deployment

AWS Storage: Minimizing Costs While Retaining Functionality

Hunk & Elas=c MapReduce: Big Data Analy=cs on AWS

DISTRIBUTED SYSTEMS [COMP9243] Lecture 9a: Cloud Computing WHAT IS CLOUD COMPUTING? 2

Deploying for Success on the Cloud: EBS on Amazon VPC Session ID#11312

Drupal in the Cloud. Scaling with Drupal and Amazon Web Services. Northern Virginia Drupal Meetup

I/O PERFORMANCE COMPARISON OF VMWARE VCLOUD HYBRID SERVICE AND AMAZON WEB SERVICES

Oracle Applications and Cloud Computing - Future Direction

Managing MySQL Scale Through Consolidation

Developing Plugins for Cloud Scale

Using ArcGIS for Server in the Amazon Cloud

IntroducJon to Splunk Cloud & Case Study: MindTouch. Praveen Rangnath Splunk César López- Natarén MindTouch Aaron Fulkerson MindTouch

PostgreSQL on Amazon. Christophe Pettus PostgreSQL Experts, Inc.

Is Hyperconverged Cost-Competitive with the Cloud?

The Cost of the Cloud. Steve Saporta CTO, SwipeToSpin Mar 20, 2015

SUSE Cloud 2.0. Pete Chadwick. Douglas Jarvis. Senior Product Manager Product Marketing Manager

Amazon Web Services. Lawrence Berkeley LabTech Conference 9/10/15. Jamie Baker Federal Scientific Account Manager AWS WWPS

Simplified Forwarder Deployment and Deployment Server Techniques

Scalable Web Application

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

Financial Services Grid Computing on Amazon Web Services January 2013 Ian Meyers

Aneka Dynamic Provisioning

Running R from Amazon's Elastic Compute Cloud

Understanding ArcGIS Deployments in Public and Private Cloud. Marwa Mabrouk

The Secret World of Cloud IaaS Pricing: How to Compare Apples and Oranges Among Cloud Providers

Using Amazon EMR and Hunk to explore, analyze and visualize machine data

WebLogic on Oracle Database Appliance: Combining High Availability and Simplicity

Performance Benchmark for Cloud Databases

ArcGIS for Server: In the Cloud

Transcription:

Copyright 2014 Splunk Inc. Deploying Splunk on Amazon Web Services Simeon Yep Senior Manager, Business Development Technical Services Roy Arsan Senior SoHware Engineer

Disclaimer During the course of this presentalon, we may make forward- looking statements regarding future events or the expected performance of the company. We caulon you that such statements reflect our current expectalons and eslmates based on factors currently known to us and that actual events or results could differ materially. For important factors that may cause actual results to differ from those contained in our forward- looking statements, please review our filings with the SEC. The forward- looking statements made in the this presentalon are being made as of the Lme and date of its live presentalon. If reviewed aher its live presentalon, this presentalon may not contain current or accurate informalon. We do not assume any obligalon to update any forward- looking statements we may make. In addilon, any informalon about our roadmap outlines our general product direclon and is subject to change at any Lme without nolce. It is for informalonal purposes only, and shall not be incorporated into any contract or other commitment. Splunk undertakes no obligalon either to develop the features or funclonality described or to include any such feature or funclonality in a future release. 2

Amazon Web Services vs. Everyone Else 3

ObjecLve: Integrate your Splunk Enterprise deployment with Amazon Web Services (AWS) 4

Bios Simeon Yep! 6+ years @ Splunk! Roles in: Support ConsulLng Technical Sales! Currently focused on Partner Ecosystem (including AWS) Roy Arsan! 2+ years @ Splunk! Roles in: Product Engineering Cloud Architecture 5

Agenda! Infrastructure: AWS ElasLc Compute Cloud (EC2)! Deployment Examples! How to Deploy: AWS CloudFormaLon! Apps + Other 6

AWS EC2 Infrastructure

What is this Amazon stuff?! Amazon ElasLc Compute Cloud (EC2) is a web service that provides resizable compute capacity in the cloud! Pay only for capacity that you actually use! Splunk is easily deployed in Amazon 8

Splunk and Hardware! Splunk consumes high I/O due to indexing and searching! Load!= GB/day! Search drives a large porlon of the load Rare vs. Sparse vs. ReporLng Real- Lme vs. Historic! Reference servers can index up to 500 GB/day with no search load! Virtualized systems incur some overhead, but work well if tuned correctly 9

Typical User Scenario 1. Sign- up for an AWS account (use AWS IAM IdenLty and Access Management) 2. Launch an instance (via user chosen tool such as GUI, CLI, or external) 3. Use key credenlals to access the instance 4. Install SoHware/Splunk 10

Instances ü Availability Zones exist within Regions (8 Regions + Gov) ü Amazon Machine Image (AMI) Amazon Linux based Best Performance Cost EffecLve (extra $$ for Windows)

Instances! Instance type Pricing: Spot vs. On- demand vs. Reserved Family: Storage vs. Compute vs. GPU vs. Memory vs. General Purpose GeneraLon: Current vs. Previous! Instance size Workload size: compute units, memory, storage Micro, Small, Medium, Large, Extra Large (XL) ê MulLple XL sizes: xlarge, 2xlarge, 4xlarge, 8xlarge 4XL general purpose provides similar performance to a reference server ê 50-150 GB/day indexing and searching 12

Instance Storage! Instances have ephemeral storage (Current Gen has SSDs) General Purpose instances have GBs to TBs Storage OpLmized instances have up to 48 TB! Data is lost when the instance dies! EBS ElasLc Block Storage Persistent block level storage volumes for use with EC2 instances Cost associated 1 TB costs $50/month, 5 TB costs $250/month Data is not lost when instance dies can be remounted with new instance! S3 Simple Storage Service Online cloud storage service (files, data, snapshots, etc ) Need this for backup purposes Can also be used as a data feed for Splunk 13

Storage Summary! For single instances or non- replicated distributed deployments: Use EBS volumes in RAID 1+0 for indexes, RAID 0 for OS/soHware SoHware RAID will consume cpu Use snapshots to backup the instance (S3) IOPS oplmized can provide some benefits XFS preferred (customer feedback)! Warming Doesn t have to do with Datacenter temperature Improves first write performance hit NoLceable improvements in performance when performed on ephemeral storage EBS volumes created from snapshots also benefit from warming 14

Instance SelecLon! How can I make my deployment resilient? OpLon 1: RAID 1+0 at the storage layer + EBS (was the preferred setup) OpLon 2: Index ReplicaLon OpLon 3: Data Cloning (Index and Forward, HA license required)! Instance seleclon should factor in resiliency, use- case, and cost! Index ReplicaLon FTW (?) Factoring in most common retenlon needs, you may need large EBS volumes and/or double the instances to be resilient (maybe HA license as well) ReplicaLon requires more instances, but does not require EBS IR is driven per instance cost 15

! 1 TB/day deployment example EBS backed storage for availability No replicalon Instance SelecLon 16

Instance SelecLon! 1 TB/day deployment cost comparison! Overall Cost is equivalent when EBS retenlon is 211 days (vs. 960)! Index ReplicaLon offers immediate search capability with SF/RF 17

Instance SelecLon Distributed Deployments Using Index ReplicaLon (IR)! Local ephemeral storage (SSDs) may perform beuer than EBS! Search/ReplicaLon Factor determines availability of data for searching! IR adds load and requires more servers and storage Using EBS volumes, no IR! Typically fewer instances to manage vs. IR! Search Availability is driven by the capability to remount a volume to a new instance (automalcally or manually)! Cost can be largely driven by retenlon and daily volume 18

Best PracLces! Custom AMI crealon Create your own AMI using Linux based or Splunk provided Leverage current configuralon tooling with AMI (don t have to use deployment server, but can be very helpful)! AuthenLcaLon and AuthorizaLon Policies will dictate what you can or cannot use LDAP/AD will require an SSL tunnel Other oplons: scripted input or proxying (SSO) SAML (Okta)! Security SSL everywhere + private network Install your own cerlficates 19

! Search Head Pools Best PracLces! Deploy to the same Availability Zone ReplicaLon and searches across Regions and AZ can be a challenge! Monitor from outside of the Region/AZ Offers addilonal resiliency! Use a Virtual Private Cloud (VPC) 20

Best PracLces (Rewrite)! ConfiguraLon and SoHware Management Use the tools you are most familiar with Chef and Puppet content publicly available! Deployment server usage EffecLve for controlling Splunk configuralon (only)! Use Cloud FormaLon Allows for easy and quick deployment Great starlng point for large deployments (See Appendix A) 21

General Guidelines Follow Best PracLces for ArchitecLng and Sizing: Load=Searching+Indexing Indexers (50-150 GB/day)! m3.2xlarge 8vcpu, 30 GB RAM! i2.4xlarge 16vcpu, 122 GB RAM! hs1.8xlarge 16vcpu, 117 GB RAM *These are all starlng points! Splunk can index and search more OR less depending on overall load Search Heads (8+ users)! c3.2xlarge 8vcpu, 15 GB RAM! c3.4xlarge 32vcpu, 60 GB RAM Cluster Master or Deployment Server! m3.xlarge 4vcpu, 15 GB RAM! c3.2xlarge 8vcpu, 15 GB RAM License Master! m3.large 2vcpu, 7.5 GB RAM! m3.xlarge 4vcpu, 15 GB RAM 22

Architecture & Deployment Examples

Architecture Examples! Centralized! Decentralized! Hybrid! Centralized with Index ReplicaLon 24

Search Pooling Centralized Topology Indexers Forwarders Intermediate Forwarder Forwarders Syslog Devices 25

Decentralized Topology Search Pooling 26

Hybrid Topology 27

Index Replication with Search Pooling Cluster Master Search Pool Forwarders Peer Nodes 28

! Deployment A! Deployment B Deployment Examples 29

Deployment A! Use Case: Searching, ReporLng and AnalyLcs! Capable of 1-100+ GB/day indexing! m3.2xlarge instance High value for cpu (8 vcpu, 30 GB RAM) Previously were using c1.xlarge (8 vcpu, 7 GB RAM)! RAID 1+0 across 4 EBS volumes! 16 concurrent users 30

Deployment B! Use Case: ApplicaLon Management, Security Forensics! Capable of 500 GB/day indexing! Distributed deployment with Index ReplicaLon (2 SF, 3 RF)! 3 hs1.8xl instances with 49 TB ephemeral storage (indexers)! c1.xlarge instance (search head)! Leveraging AWS API for instance management 31

Deployment B Search Head(s) Cluster Master License Master Indexer Indexer Indexer 32 32

Example Architectures Use case and requirements influence final setup, but there is no right or wrong way Using EBS Backed Storage! 20 GB/day m3.2xlarge (single instance)! 100 GB/day m3.2xlarge (single instance)! 300 GB/day m3.2xlarge (3) c3.4xlarge! 500 GB/day m3.2xlarge as indexer (5) c3.4xlarge as search head (1) Using Index ReplicaJon! 100 GB/day m3.2xlarge as indexer (2) c3.2xlarge as search head (1) c3.xlarge as CM/LM! 500 GB/day hs1.8xlarge as indexer (3) c3.8xlarge as search head (1) m3.xlarge as CM/LM 33

How To Provision Deployments

Cloud Provisioning Tools A Primer Server Provisioning Deployment Provisioning AWS OpsWorks AWS CloudFormation Scalr Terraform! Flexible recipe- based configuralon Configure machine based on role! Fast template- based provisioning Provision & connect resources 35

Splunk AWS CloudFormaLon What used to take days to get all configured properly, now I can do in few minutes with Splunk [AWS] CloudFormaLon Abdallah Mohammed, Data Architect, Intuit 36

Splunk AWS CloudFormaLon! Open- source self- service tool (no cost associated)! Fast, automated, consistent Splunk deployments on AWS! Available on GitHub: Templates + Tutorial hups://github.com/splunk/splunk- aws- cloudformalon! Splunk Blog: Deploy your own Splunk cluster on AWS in minutes! hup://blogs.splunk.com/2014/05/20/deploy- your- own- splunk- cluster- on- aws- in- minutes/

Splunk AWS CloudFormaLon What can Splunk AWS CloudFormaLon do for you?! Accelerates deployment Lme down to minutes! Incorporates Splunk best pracjces for operalons and administralon! Abstracts away details of configuring distributed Splunk! Extensible and customizable templates to fit custom needs 38

Sample Architecture Search Head(s) Cluster Master License Master Indexer Indexer Indexer 39 39

Deploy Splunk Cluster in < 30 minutes 40

Simple User- Friendly Push- Buuon Form 41

Demo Time 42

QuesLons? 43

Contact Simeon Yep syep@splunk.com Business Development Roy Arsan rarsan@splunk.com Engineering 44

References! Splunk App for AWS: hup://apps.splunk.com/app/1274/! Hunk App for AWS ELB: hup://apps.splunk.com/app/1731/! Technical Brief: hup://www.splunk.com/web_assets/pdfs/secure/ Splunk_and_Amazon_Web_Services_Tech_Brief.pdf 45

References! Blogs: hup://blogs.splunk.com/2012/03/07/splunk- and- aws- sizing- revisited/ hup://blogs.splunk.com/2013/06/06/splunkit- v2-0- 2- results- ec2- storage- comparisons/ hup://blogs.splunk.com/2013/07/31/whats- going- on- with- aws- and- splunk/ hup://blogs.splunk.com/2014/05/20/deploy- your- own- splunk- cluster- on- aws- in- minutes/! AMIs Splunk: hups://aws.amazon.com/marketplace/pp/b00gizituo?sr=0-4 Hunk: hups://aws.amazon.com/marketplace/pp/b00gizk2qi?sr=0-2 46

THANK YOU