How to Deploy a Secure, Highly-Available Hadoop Platform
|
|
|
- Buck Lane
- 10 years ago
- Views:
Transcription
1 How to Deploy a Secure, Highly-Available Hadoop Platform Dr. Olaf Flebbe, Michael Weiser science + computing ag IT-Dienstleistungen und Software für anspruchsvolle Rechnernetze Tübingen München Berlin Düsseldorf
2 Agenda Environment Automation Kerberos Demo Seite 2
3 Target Scenario Build a secure, reliable, Debian based Hadoop system for the german BSI (Bundesamt für Sicherheit in der Informationstechnik) Cornerstones MIT Kerberos / OpenLDAP Debian Jessie (8) Able to rebuild from source Zookeeper/HA Hadoop/Hive/Hue Automation necessary Upstreaming into projects Offline Installation Seite 3
4 Code donated to Apache Bigtop We donate the automation of Hadoop with puppet to the Apache Bigtop project Advance the deployment recipes Demo of automation with puppet Demo of how to secure Apache Hadoop Ecosystem All of our code is under Apache 2 License Code is on Uses the Apache Bigtop convenience repositories Seite 4
5 MIT-Kerberos Master/Slave Setup Install and configure Kerberos-KDCs Kadmin Software kdc, kpropd Konfiguration of kdc.conf Configure replication: cronjob for kprop on Master und kpropd service on Slave Manage service- and host-principals kadmin, add_principal, modprinc, cpw, Keytab-Management: kadmin, ktadd Seite 5
6 Automation needed: NTP Nagios Client Config Postgresql Master/Slave Replication Hostname Resolution Kerberos KDC Master/Slave Replication OpenLDAP Multimaster Ldap client config Saslauthd SSSD Config Locales Firewall PAM Puppetdb manage ssh hostkeys Timezone Zookeeper Hadoop nn, jn, zkfc, rm, dn Hive Hue Tez Oozie Seite 6
7 Puppet in five Minutes: Declarative configuration of target state Stateless, without ordering but with dependencies puppet ecosystem: puppet (engine) facter (determine facts of the system, like os) hiera (hierarichal lookup of target properties) augeas (manage code sniplets in configuration files, in various formats) puppetdb (Database of generated properties, for instance ssh host keys) mcollective (massive remote commands) Seite 7
8 Puppet Concepts Manifest: Puppet code Class: logical group of Puppet code Smallest entity to call best practice: 1 manifest == 1 class Module: Group of classes for a feature to handle Many projects on GitHub most often with Apache 2 License Deployment on Puppet Forge Installation: puppet module install <author>-<module> Seite 8
9 Puppet Concepts hiera Configuration and instantiation of classes site-manifest: best practice: site-manifest almost empty, starts node classification via hiera catalog: Assembled collective of manifests and class for a particular host. Host determines deviation from target state and tries to reach target state Seite 9
10 Puppet Modes masterless/apply: All manifests are local to system Bigtop mode suitable for CI master/agent: Agent starts catalog generation on master and applies catalog on local system Creates a PKI SSL Connection with trust! Usually used for enterprise configurations Seite 10
11 Puppet Usage Typical practice: Search on Puppet Forge, try out Example: saz/locales Example: Apache Bigtop already has some configuration classes Seite 11
12 Automation done with puppet: NTP Nagios Client Config Postgresql Master/Slave Replication Hostname Resolution Kerberos KDC Master/Slave Replication OpenLDAP Multimaster Ldap client config Saslauthd SSSD Config Locales Firewall PAM Puppetdb manage ssh hostkeys Timezone Zookeeper Hadoop nn, jn, zkfc, rm, dn Hive Hue Tez Oozie Seite 12
13 puppet-modules for kerberos Only few modules available Problems: Some of the only implements Client-configuration Some of then uses hard-coded default passwords Michael Weiser has improved a promising one edgester/kerberos Unfortunately not available on PuppetForge, only on github: MIT License Seite 13
14 PKINIT - Kerberos with X.509 Certificates PKINIT ( replaces user passwords with X.509-Client certificates Substantial increase of cryptographical strength Side aspect: Allows use of smartcards for Kerberos authentication Automation possible if a suitable PKI already exists. Seite 14
15 PKINIT - Kerberos with X.509 Certificates Need for X.509-Certifikates for KDC and Kerberos Clients Notice the peculiarity: KDC-Server-Certificate needs extendedkeyusage with OID PKINIT-Client-Certificates needs extendedkeyusage and the attribute subjectaltname with the value of the principal name. Description of PKINIT within a MIT-Kerberos-Realm using OpenSSL: Seite 15
16 Supercharging: Use of Puppet-CA for PKINIT Puppet already uses SSL certificates why not use /var/lib/ puppet/certs/$fqdn.pem? Puppet-CA does not support extensions for extendedkeyusage and subjectaltname Developed patches for the Puppet-CA Unfortunately rejected by Upstream: We will look into an alternative implementation without patching puppet Seite 16
17 Hadoop with Puppet We enhanced the Bigtop templates: Supporting journaling, HA namenode HA Yarn resource manager Configuration of Hive on Tez Configuration of Hue Securing the Hadoop components and web interfaces Zookeeper Hadoop Hue We introduced a role concept, which is not the one which is implemented in upstream Bigtop Seite 17
18 Hadoop with Puppet The bigtop puppet kerberos support is not of production quality: It uses hardcoded passwords Upstream github edgester/kerberos module is now a drop-in replacement for the Bigtop kerberos class Seite 18
19 Kerberos with Hadoop Setup The principals are named: hdfs/fqdn, yarn/fqdn Users are olaf All the daemons support it for authentication Kerberos works mostly out of the box. Authentication: LGTM! Seite 19
20 Kerberos with Hadoop: Authorization: HDFS, a bit clumsy since user -> uid mapping is done decentralized on each node. Configuration of the NSS mapping is required e.g. a directory service: System users hive, yarn, mapred required Seite 20
21 Kerberos with Zookeeper Zookeeper: Supports ACL s, but there is no tool to set ACL s! The ZK Root is left unprotected! '/' '/zookeeper' '/zookeeper/quota' Everyone authenticated can damage HDFS journaling! Hadoop, Yarn sets ACL s (++) Hive does not set ACL s in ZK (--) '/hive_zookeeper_namespace' Workaround: we created a tool to set ACL s. Seite 21
22 Noteworthy things: Parallel installation of Hadoop on all nodes. Synchronisation with netcat on ports Formats ZK Formats Namenodes Starts standby HA Servers Trocla: Do not store passwords in manifests / configuration files, no plain passwords stored. Seite 22
23 Upstreaming Apache Bigtop: Fixed Debian build support: Made it in Bigtop 1.0! Automation and Configuration: only partly upstreamed. Debian: All our changes are in Debian git However, only one package made it into unstable puppet-module-asciiduck-sssd Puppet kerberos module Fixes are upstream except for the use of trocla IT WAS A GREAT EXPERIENCE! Seite 23
24 Upstream Fixes needed Hive: Must protect the hive root in the ZK with ACLS! /hive_zookeeper_namespace Zookeeper: Should secure the ZK Filesystem Hadoop/Bigtop: change daemon scripts to better support the systemd init replacement Some projects did not work with a HA Yarn RM setup: - Sqoop/Sqoop2 - Oozie Hue does not work with the tez jobmanager/timeline Seite 24
25 Wrapup None yet another deployment tool needed The generic system administration tools are far more advanced with respect to enterprise grade functionality: Master/Slave Kerberos Multimaster OpenLDAP AD Integration The concepts presented can be integrated in ansible, saltstack and many more. Be opinionated! Complexity can be reduced by reusing proven technology Kerberos Support in Hadoop is quite good Upstream! Seite 25
26 Demonstration Life Demo Seite 26
27 Thanks! Dr. Olaf Flebbe / Michael Weiser science + computing ag Telefon: [email protected]
How Bigtop Leveraged Docker for Build Automation and One-Click Hadoop Provisioning
How Bigtop Leveraged Docker for Build Automation and One-Click Hadoop Provisioning Evans Ye Apache Big Data 2015 Budapest Who am I Apache Bigtop PMC member Software Engineer at Trend Micro Develop Big
HADOOP MOCK TEST HADOOP MOCK TEST I
http://www.tutorialspoint.com HADOOP MOCK TEST Copyright tutorialspoint.com This section presents you various set of Mock Tests related to Hadoop Framework. You can download these sample mock tests at
developing sysadmin - sysadmining developers
developing sysadmin - sysadmining developers develop your platform and your application management GUUG Berlin 01.11.2012 Martin Alfke Agenda puppet environments puppet modules
Getting Hadoop, Hive and HBase up and running in less than 15 mins
Getting Hadoop, Hive and HBase up and running in less than 15 mins ApacheCon NA 2013 Mark Grover @mark_grover, Cloudera Inc. www.github.com/markgrover/ apachecon-bigtop About me Contributor to Apache Bigtop
Hadoop Elephant in Active Directory Forest. Marek Gawiński, Arkadiusz Osiński Allegro Group
Hadoop Elephant in Active Directory Forest Marek Gawiński, Arkadiusz Osiński Allegro Group Agenda Goals and motivations Technology stack Architecture evolution Automation integrating new servers Making
Integrating SAP BusinessObjects with Hadoop. Using a multi-node Hadoop Cluster
Integrating SAP BusinessObjects with Hadoop Using a multi-node Hadoop Cluster May 17, 2013 SAP BO HADOOP INTEGRATION Contents 1. Installing a Single Node Hadoop Server... 2 2. Configuring a Multi-Node
Univention Corporate Server. Extended domain services documentation
Univention Corporate Server Extended domain services documentation 2 Table of Contents 1. Integration of Ubuntu clients into a UCS domain... 4 1.1. Integration into the LDAP directory and the SSL certificate
CAC AND KERBEROS FROM VISION TO REALITY
CAC AND KERBEROS FROM VISION TO REALITY Mil OSS Conference 2011 Dmitri Pal Sr. Engineering Manager Red Hat Inc. Aug 31, 2011 Outline Setting up context... Card authentication now Open issues Pieces of
Pivotal HD Enterprise
PRODUCT DOCUMENTATION Pivotal HD Enterprise Version 1.1 Stack and Tool Reference Guide Rev: A01 2013 GoPivotal, Inc. Table of Contents 1 Pivotal HD 1.1 Stack - RPM Package 11 1.1 Overview 11 1.2 Accessing
Cloudera Manager Training: Hands-On Exercises
201408 Cloudera Manager Training: Hands-On Exercises General Notes... 2 In- Class Preparation: Accessing Your Cluster... 3 Self- Study Preparation: Creating Your Cluster... 4 Hands- On Exercise: Working
Pro Puppet. Jeffrey McCune. James TurnbuII. Apress* m in
Pro Puppet m in James TurnbuII Jeffrey McCune Apress* About the Authors About the Technical Reviewer Acknowledgments Introduction Chapter 1: Getting Started with Puppet What Is Puppet? Deployment Configuration
docs.hortonworks.com
docs.hortonworks.com : Security Administration Tools Guide Copyright 2012-2014 Hortonworks, Inc. Some rights reserved. The, powered by Apache Hadoop, is a massively scalable and 100% open source platform
Deploying Hadoop with Manager
Deploying Hadoop with Manager SUSE Big Data Made Easier Peter Linnell / Sales Engineer [email protected] Alejandro Bonilla / Sales Engineer [email protected] 2 Hadoop Core Components 3 Typical Hadoop Distribution
Identity Management based on FreeIPA
Identity Management based on FreeIPA SLAC 2014 Thorsten Scherf Red Hat EMEA What is an Identity Management System (IdM) An IdM system is a set of services and rules to manage the users of an organization
Install and Configure an Open Source Identity Server Lab
Install and Configure an Open Source Identity Server Lab SUS05/SUS06 Novell Training Services ATT LIVE 2012 LAS VEGAS www.novell.com Legal Notices Novell, Inc., makes no representations or warranties with
Deploying Foreman in Enterprise Environments 2.0. best practices and lessons learned. Nils Domrose Cologne, August, 4 2014
Deploying Foreman in Enterprise Environments 2.0 best practices and lessons learned Nils Domrose Cologne, August, 4 2014 About me senior linux systems engineer at inovex GmbH worked as a network engineer,
Spectrum Scale HDFS Transparency Guide
Spectrum Scale Guide Spectrum Scale BDA 2016-1-5 Contents 1. Overview... 3 2. Supported Spectrum Scale storage mode... 4 2.1. Local Storage mode... 4 2.2. Shared Storage Mode... 4 3. Hadoop cluster planning...
Document Type: Best Practice
Global Architecture and Technology Enablement Practice Hadoop with Kerberos Deployment Considerations Document Type: Best Practice Note: The content of this paper refers exclusively to the second maintenance
Chase Wu New Jersey Ins0tute of Technology
CS 698: Special Topics in Big Data Chapter 4. Big Data Analytics Platforms Chase Wu New Jersey Ins0tute of Technology Some of the slides have been provided through the courtesy of Dr. Ching-Yung Lin at
Ankush Cluster Manager - Hadoop2 Technology User Guide
Ankush Cluster Manager - Hadoop2 Technology User Guide Ankush User Manual 1.5 Ankush User s Guide for Hadoop2, Version 1.5 This manual, and the accompanying software and other documentation, is protected
Important Notice. (c) 2010-2015 Cloudera, Inc. All rights reserved.
Cloudera Security Important Notice (c) 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, and any other product or service names or slogans contained in this document
GL550 - Enterprise Linux Security Administration
GL550 - Enterprise Linux Security Administration This highly technical course focuses on properly securing machines running the Linux operating systems. A broad range of general security techniques such
ENTERPRISE LINUX SECURITY ADMINISTRATION
ENTERPRISE LINUX SECURITY ADMINISTRATION COURSE DESCRIPTION: This highly technical course focuses on properly securing machines running the Linux operating systems. A broad range of general security techniques
Apache Sentry. Prasad Mujumdar [email protected] [email protected]
Apache Sentry Prasad Mujumdar [email protected] [email protected] Agenda Various aspects of data security Apache Sentry for authorization Key concepts of Apache Sentry Sentry features Sentry architecture
Pivotal HD Enterprise 1.0 Stack and Tool Reference Guide. Rev: A03
Pivotal HD Enterprise 1.0 Stack and Tool Reference Guide Rev: A03 Use of Open Source This product may be distributed with open source code, licensed to you in accordance with the applicable open source
Secure Linux Administration Conference 2013. Bernd Strößenreuther
Puppet getting started Best practices on how to turn Your environment into a Puppet managed environment Secure Linux Administration Conference 2013 Berlin 2013 06 06 Bernd Strößenreuther mailto:[email protected]
docs.hortonworks.com
docs.hortonworks.com Hortonworks Data Platform: Configuring Kafka for Kerberos Over Ambari Copyright 2012-2015 Hortonworks, Inc. Some rights reserved. The Hortonworks Data Platform, powered by Apache Hadoop,
How to extend Puppet using Ruby
How to ext Puppet using Ruby Miguel Di Ciurcio Filho [email protected] http://localhost:9090/onepage 1/43 What is Puppet? Puppet Architecture Facts Functions Resource Types Hiera, Faces and Reports
Big Data Operations Guide for Cloudera Manager v5.x Hadoop
Big Data Operations Guide for Cloudera Manager v5.x Hadoop Logging into the Enterprise Cloudera Manager 1. On the server where you have installed 'Cloudera Manager', make sure that the server is running,
Ansible. Configuration management tool and ad hoc solution. Marcel Nijenhof <[email protected]>
Ansible Configuration management tool and ad hoc solution Marcel Nijenhof Index Introduction Installing & configuration Playbooks Variables Roles Ansible galaxy Configuration management
Apache Hadoop new way for the company to store and analyze big data
Apache Hadoop new way for the company to store and analyze big data Reyna Ulaque Software Engineer Agenda What is Big Data? What is Hadoop? Who uses Hadoop? Hadoop Architecture Hadoop Distributed File
Nevepoint Access Manager 1.2 BETA Documentation
Nevepoint Access Manager 1.2 BETA Documentation Table of Contents Installation...3 Locating the Installation Wizard URL...3 Step 1: Configure the Administrator...4 Step 2: Connecting to Primary Connector...4
Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data
Introduction to Hadoop HDFS and Ecosystems ANSHUL MITTAL Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Topics The goal of this presentation is to give
CDH 5 Quick Start Guide
CDH 5 Quick Start Guide Important Notice (c) 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, and any other product or service names or slogans contained in this
ENTERPRISE LINUX SECURITY ADMINISTRATION
ENTERPRISE LINUX SECURITY ADMINISTRATION This highly technical course focuses on properly securing machines running the Linux operating systems. A broad range of general security techniques such as packet
Configuring Hadoop Security with Cloudera Manager
Configuring Hadoop Security with Cloudera Manager Important Notice (c) 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, and any other product or service names
System Security Services Daemon
System Security Services Daemon System Security Services Daemon Manages communication with centralized identity and authentication stores Provides robust, predictable caching for network accounts Can cache
GL-550: Red Hat Linux Security Administration. Course Outline. Course Length: 5 days
GL-550: Red Hat Linux Security Administration Course Length: 5 days Course Description: This highly technical course focuses on properly securing machines running the Linux operating systems. A broad range
Certified Big Data and Apache Hadoop Developer VS-1221
Certified Big Data and Apache Hadoop Developer VS-1221 Certified Big Data and Apache Hadoop Developer Certification Code VS-1221 Vskills certification for Big Data and Apache Hadoop Developer Certification
Pivotal HD Enterprise
PRODUCT DOCUMENTATION Pivotal HD Enterprise Version 1.1.1 Release Notes Rev: A02 2014 GoPivotal, Inc. Table of Contents 1 Welcome to Pivotal HD Enterprise 4 2 PHD Components 5 2.1 Core Apache Stack 5 2.2
Red Hat Enterprise Identity (IPA) Centralized Management of Identities & Authentication
Red Hat Enterprise Identity (IPA) Centralized of Identities & Authentication Dmitri Pal Sr. Engineering Manager, Red Hat Inc. Robert Crittenden Sr. Engineer, Red Hat Inc. 05/06/11 Agenda What is IPA? Main
VMware vsphere Big Data Extensions Administrator's and User's Guide
VMware vsphere Big Data Extensions Administrator's and User's Guide vsphere Big Data Extensions 1.0 This document supports the version of each product listed and supports all subsequent versions until
Apache Hadoop 2.0 Installation and Single Node Cluster Configuration on Ubuntu A guide to install and setup Single-Node Apache Hadoop 2.
EDUREKA Apache Hadoop 2.0 Installation and Single Node Cluster Configuration on Ubuntu A guide to install and setup Single-Node Apache Hadoop 2.0 Cluster edureka! 11/12/2013 A guide to Install and Configure
PostgreSQL administration using Puppet. Miguel Di Ciurcio Filho
PostgreSQL administration using Puppet Miguel Di Ciurcio Filho [email protected] http://localhost:9090/onepage 1/25 What is Puppet? Puppet Architecture Managing PostgreSQL Installation Databases Roles
Continuous Integration using Docker & Jenkins
Jenkins LinuxCon Europe 2014 October 13-15, 2014 Mattias Giese Solutions Architect [email protected] - Linux/Open Source Consulting, Training, Support & Development Introducing B1 Systems founded in
ICINGA2 OPEN SOURCE MONITORING
www.icinga.org ICINGA2 OPEN SOURCE MONITORING backspace 08.12.2015 Blerim Sheqa Systems Engineer Open Source @bobapple ICINGA2 INTRODUCTION Icinga Nagios fork Icinga2 rewrite rethink Server Network Windows
Upcoming Announcements
Enterprise Hadoop Enterprise Hadoop Jeff Markham Technical Director, APAC [email protected] Page 1 Upcoming Announcements April 2 Hortonworks Platform 2.1 A continued focus on innovation within
Red Hat Identity Management
Red Hat Identity Management Overview Thorsten Scherf Senior Consultant Red Hat Global Professional Services Agenda What is Red Hat Identity Management? Main values Architecture Features Active Directory
LinuxCon North America
LinuxCon North America Enterprise Identity Management with Open Source Tools Dmitri Pal Sr. Engineering Manager Red Hat, Inc. 09.16.2013 Context What is identity management? 2 LinuxCon North America Context
HADOOP ADMINISTATION AND DEVELOPMENT TRAINING CURRICULUM
HADOOP ADMINISTATION AND DEVELOPMENT TRAINING CURRICULUM 1. Introduction 1.1 Big Data Introduction What is Big Data Data Analytics Bigdata Challenges Technologies supported by big data 1.2 Hadoop Introduction
Identity Management: The authentic & authoritative guide for the modern enterprise
Identity Management: The authentic & authoritative guide for the modern enterprise Ellen Newlands, Product Manager Dmitri Pal, Director, Engineering 06-26-15 Goals of the Presentation Introduce Identity
CHEF IN THE CLOUD AND ON THE GROUND
CHEF IN THE CLOUD AND ON THE GROUND Michael T. Nygard Relevance [email protected] @mtnygard Infrastructure As Code Infrastructure As Code Chef Infrastructure As Code Chef Development Models
docs.hortonworks.com
docs.hortonworks.com Hortonworks Data Platform : Ambari Security Guide Copyright 2012-2015 Hortonworks, Inc. All rights reserved. The Hortonworks Data Platform, powered by Apache Hadoop, is a massively
FreeIPA Cross Forest Trusts
Alexander Bokovoy Andreas Schneider May 10th, 2012 1 FreeIPA What is FreeIPA? Cross Forest Trusts 2 Samba 3 Demo Talloc Tutorial Pavel Brezina wrote Talloc tutorial! http://talloc.samba.org/
SolarWinds Log & Event Manager
Corona Technical Services SolarWinds Log & Event Manager Training Project/Implementation Outline James Kluza 14 Table of Contents Overview... 3 Example Project Schedule... 3 Pre-engagement Checklist...
How to build an Identity Management System on Linux. Simo Sorce Principal Software Engineer Red Hat, Inc.
How to build an Identity Management System on Linux Simo Sorce Principal Software Engineer Red Hat, Inc. What is an Identity Management System and why should I care? In a nutshell: an IdM system is a set
Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP
Big-Data and Hadoop Developer Training with Oracle WDP What is this course about? Big Data is a collection of large and complex data sets that cannot be processed using regular database management tools
Data Security in Hadoop
Data Security in Hadoop Eric Mizell Director, Solution Engineering Page 1 What is Data Security? Data Security for Hadoop allows you to administer a singular policy for authentication of users, authorize
Release Notes for Fuel and Fuel Web Version 3.0.1
Release Notes for Fuel and Fuel Web Version 3.0.1 June 21, 2013 1 Mirantis, Inc. is releasing version 3.0.1 of the Fuel Library and Fuel Web products. This is a cumulative maintenance release to the previously
Cloudera Manager Administration Guide
Cloudera Manager Administration Guide Important Notice (c) 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, and any other product or service names or slogans
Authentication in a Heterogeneous Environment
Authentication in a Heterogeneous Environment Integrating Linux (and UNIX and Mac) Identity Management in Microsoft Active Directory Mike Patnode VP of Technology Centrify Corporation [email protected]
Security Provider Integration Kerberos Authentication
Security Provider Integration Kerberos Authentication 2015 Bomgar Corporation. All rights reserved worldwide. BOMGAR and the BOMGAR logo are trademarks of Bomgar Corporation; other trademarks shown are
The Puppet Show Managing Servers with Puppet
The Puppet Show Managing Servers with Puppet A gentle introduction to Puppet; How it works, What its benefits are And how to use it Robert Harker [email protected] whoami UNIX/Linux sysadmin for over 25
OS Installation: CentOS 5.8
OS Installation: CentOS 5.8 OpenTUSK Training University of Nairobi Mike Prentice [email protected] Tufts University Technology for Learning in the Health Sciences July 2013 Outline 1 OS Install
HADOOP MOCK TEST HADOOP MOCK TEST II
http://www.tutorialspoint.com HADOOP MOCK TEST Copyright tutorialspoint.com This section presents you various set of Mock Tests related to Hadoop Framework. You can download these sample mock tests at
Insights to Hadoop Security Threats
Insights to Hadoop Security Threats Presenter: Anwesha Das Peipei Wang Outline Attacks DOS attack - Rate Limiting Impersonation Implementation Sandbox HDP version 2.1 Cluster Set-up Kerberos Security Setup
DevOps Course Content
DevOps Course Content INTRODUCTION TO DEVOPS What is DevOps? History of DevOps Dev and Ops DevOps definitions DevOps and Software Development Life Cycle DevOps main objectives Infrastructure As A Code
COURSE CONTENT Big Data and Hadoop Training
COURSE CONTENT Big Data and Hadoop Training 1. Meet Hadoop Data! Data Storage and Analysis Comparison with Other Systems RDBMS Grid Computing Volunteer Computing A Brief History of Hadoop Apache Hadoop
This handout describes how to start Hadoop in distributed mode, not the pseudo distributed mode which Hadoop comes preconfigured in as on download.
AWS Starting Hadoop in Distributed Mode This handout describes how to start Hadoop in distributed mode, not the pseudo distributed mode which Hadoop comes preconfigured in as on download. 1) Start up 3
Introduction to Hadoop. New York Oracle User Group Vikas Sawhney
Introduction to Hadoop New York Oracle User Group Vikas Sawhney GENERAL AGENDA Driving Factors behind BIG-DATA NOSQL Database 2014 Database Landscape Hadoop Architecture Map/Reduce Hadoop Eco-system Hadoop
Remote Authentication and Single Sign-on Support in Tk20
Remote Authentication and Single Sign-on Support in Tk20 1 Table of content Introduction:... 3 Architecture... 3 Single Sign-on... 5 Remote Authentication... 6 Request for Information... 8 Testing Procedure...
How to configure High Availability (HA) in AlienVault USM (for versions 4.14 and prior)
Complete. Simple. Affordable How to configure High Availability (HA) in AlienVault USM Copyright 2015 AlienVault. All rights reserved. AlienVault, AlienVault Unified Security Management, AlienVault USM,
docs.hortonworks.com
docs.hortonworks.com Hortonworks Data Platform : Automated Install with Ambari Copyright 2012-2015 Hortonworks, Inc. Some rights reserved. The Hortonworks Data Platform, powered by Apache Hadoop, is a
docs.hortonworks.com
docs.hortonworks.com Hortonworks Data Platform : Hadoop Security Guide Copyright 2012-2014 Hortonworks, Inc. Some rights reserved. The Hortonworks Data Platform, powered by Apache Hadoop, is a massively
Workshop on Hadoop with Big Data
Workshop on Hadoop with Big Data Hadoop? Apache Hadoop is an open source framework for distributed storage and processing of large sets of data on commodity hardware. Hadoop enables businesses to quickly
Integrating Linux systems with Active Directory
Integrating Linux systems with Active Directory Dmitri Pal Engineering Director, Red Hat, Inc. Security Camp at BU Agenda Problem statement Aspects of integration Integration options Recommendations Security
CDH 5 High Availability Guide
CDH 5 High Availability Guide Important Notice (c) 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, and any other product or service names or slogans contained
Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments
Cloudera Enterprise Reference Architecture for Google Cloud Platform Deployments Important Notice 2010-2015 Cloudera, Inc. All rights reserved. Cloudera, the Cloudera logo, Cloudera Impala, Impala, and
Overview. Big Data in Apache Hadoop. - HDFS - MapReduce in Hadoop - YARN. https://hadoop.apache.org. Big Data Management and Analytics
Overview Big Data in Apache Hadoop - HDFS - MapReduce in Hadoop - YARN https://hadoop.apache.org 138 Apache Hadoop - Historical Background - 2003: Google publishes its cluster architecture & DFS (GFS)
SSSD. Client side identity management. LinuxAlt 2012 Jakub Hrozek 3. listopadu 2012
Client side identity management LinuxAlt 2012 Jakub Hrozek 3. listopadu 2012 Section 1 Centralized user databases Centralized user databases User accounts in a large environment it is not practical to
F-Secure Messaging Security Gateway. Deployment Guide
F-Secure Messaging Security Gateway Deployment Guide TOC F-Secure Messaging Security Gateway Contents Chapter 1: Deploying F-Secure Messaging Security Gateway...3 1.1 The typical product deployment model...4
Oracle Linux Advanced Administration
Oracle University Contact Us: Local: 1800 103 4775 Intl: +91 80 40291196 Oracle Linux Advanced Administration Duration: 5 Days What you will learn This Oracle Linux Advanced Administration training is
Mac OS X Directory Services
Mac OS X Directory Services Agenda Open Directory Mac OS X client access Directory services in Mac OS X Server Redundancy and replication Mac OS X access to other directory services Active Directory support
Moving to Plesk Automation 11.5
Moving to Plesk Automation 11.5 Last updated: 2 June 2015 Contents About This Document 4 Introduction 5 Preparing for the Move 7 1. Install the PA Moving Tool... 8 2. Install Mail Sync Software (Windows
Handling POSIX attributes for trusted Active Directory users and groups in FreeIPA
Handling POSIX attributes for trusted Active Directory users and groups in FreeIPA Alexander Bokovoy May 21th, 2015 Samba Team / Red Hat 0 A crisis of identity (solved?) FreeIPA What is
ACE Management Server Deployment Guide VMware ACE 2.0
Technical Note ACE Management Server Deployment Guide VMware ACE 2.0 This technical note provides guidelines for the deployment of VMware ACE Management Servers, including capacity planning and best practices.
Hadoop Ecosystem B Y R A H I M A.
Hadoop Ecosystem B Y R A H I M A. History of Hadoop Hadoop was created by Doug Cutting, the creator of Apache Lucene, the widely used text search library. Hadoop has its origins in Apache Nutch, an open
YARN, the Apache Hadoop Platform for Streaming, Realtime and Batch Processing
YARN, the Apache Hadoop Platform for Streaming, Realtime and Batch Processing Eric Charles [http://echarles.net] @echarles Datalayer [http://datalayer.io] @datalayerio FOSDEM 02 Feb 2014 NoSQL DevRoom
http://cnmonitor.sourceforge.net CN=Monitor Installation and Configuration v2.0
1 Installation and Configuration v2.0 2 Installation...3 Prerequisites...3 RPM Installation...3 Manual *nix Installation...4 Setup monitoring...5 Upgrade...6 Backup configuration files...6 Disable Monitoring
Going in production Winbind in large AD domains today. Günther Deschner [email protected]. (Red Hat / Samba Team)
Going in production Winbind in large AD domains today Günther Deschner [email protected] (Red Hat / Samba Team) Agenda To go where no one has gone before Winbind scalability Find Domain Controllers Active Directory
VERALAB LDAP Configuration Guide
VERALAB LDAP Configuration Guide VeraLab Suite is a client-server application and has two main components: a web-based application and a client software agent. Web-based application provides access to
VMware Identity Manager Connector Installation and Configuration
VMware Identity Manager Connector Installation and Configuration VMware Identity Manager This document supports the version of each product listed and supports all subsequent versions until the document
docs.hortonworks.com
docs.hortonworks.com : Ambari Views Guide Copyright 2012-2015 Hortonworks, Inc. All rights reserved. The, powered by Apache Hadoop, is a massively scalable and 100% open source platform for storing, processing
docs.hortonworks.com
docs.hortonworks.com Hortonworks Data Platform : Hadoop Security Guide Copyright 2012-2015 Hortonworks, Inc. Some rights reserved. The Hortonworks Data Platform, powered by Apache Hadoop, is a massively
How to install Apache Hadoop 2.6.0 in Ubuntu (Multi node setup)
How to install Apache Hadoop 2.6.0 in Ubuntu (Multi node setup) Author : Vignesh Prajapati Categories : Hadoop Date : February 22, 2015 Since you have reached on this blogpost of Setting up Multinode Hadoop
Integrating Red Hat Enterprise Linux 6 with Microsoft Active Directory Presentation
Integrating Red Hat Enterprise Linux 6 with Microsoft Active Directory Presentation Agenda Overview Components Considerations Configurations Futures Summary What is needed? Thorough understanding components,
TIBCO Spotfire Platform IT Brief
Platform IT Brief This IT brief outlines features of the system: Communication security, load balancing and failover, authentication options, and recommended practices for licenses and access. It primarily
