Apache Zeppelin, the missing component for your BigData ecosystem




DuyHai DOAN, Cassandra Technical Advocate

Who Am I?
Duy Hai DOAN, Cassandra technical advocate
- talks, meetups, confs
- open-source devs (Achilles, ...)
- OSS Cassandra point of contact
- duy_hai.doan@datastax.com

Datastax
- Founded in April 2010
- We contribute a lot to Apache Cassandra
- 400+ customers (25 of the Fortune 100), 400+ employees
- Headquarters in the San Francisco Bay Area
- EU headquarters in London, offices in France and Germany
- Datastax Enterprise = OSS Cassandra + extra features

What is Apache Zeppelin?
- Presentation
- Architecture

Zeppelin Presentation

Zeppelin Architecture
[Diagram: the Zeppelin Server, reachable over REST and WebSocket, hosts the Zeppelin Engine and the Zeppelin Interpreter Factory. Interpreters run in separate JVMs: a Spark Interpreter Group (Spark, SparkSQL), a Tajo Interpreter, a Flink Interpreter and a Cassandra Interpreter.]

What does Zeppelin provide?
- Front-end & display system for free
- Generic back-end with REST APIs & WebSocket
- Pluggable interpreter system
- Task scheduler (à la CRON)

Zeppelin UI Layout
- Notebook
- Paragraph
- UI elements

Demo https://github.com/doanduyhai/incubator-zeppelin/tree/zeppelinpresentation

Zeppelin Display System
- Raw, Table, HTML
- Available graphs
- View modes
- Dynamic form
- Iframe export

Demo https://github.com/doanduyhai/incubator-zeppelin/tree/zeppelinpresentation

Interpreter system
- Core interpreters
- Third-party interpreters
- Interpreter configuration & usage

Interpreter processing lifecycle
1. Receive input commands/data as raw text from form data
2. Process the input commands/data with the external back-end
3. Format the response using the Zeppelin display system
4. Send the response back to the Zeppelin engine
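The four steps above can be sketched in plain Java. This is a stand-alone illustration, not Zeppelin's actual interpreter API: the `LifecycleSketch` class and the `Backend` interface are hypothetical names introduced here. The only Zeppelin behaviour assumed is that output starting with `%table` is shown as a table, with tab-separated columns and one row per line.

```java
import java.util.Arrays;
import java.util.List;

public class LifecycleSketch {

    /** Hypothetical stand-in for an external back-end (Spark, Cassandra, ...). */
    interface Backend {
        List<List<String>> execute(String command);
    }

    /** Step 3: format rows for the display system (tab-separated columns, one row per line). */
    static String toTable(List<String> header, List<List<String>> rows) {
        StringBuilder out = new StringBuilder("%table ");
        out.append(String.join("\t", header)).append('\n');
        for (List<String> row : rows) {
            out.append(String.join("\t", row)).append('\n');
        }
        return out.toString();
    }

    /** Steps 1 to 4 in order; the returned string is what goes back to the engine. */
    static String interpret(String rawText, Backend backend, List<String> header) {
        String command = rawText.trim();                    // 1. receive the raw text
        List<List<String>> rows = backend.execute(command); // 2. let the back-end process it
        return toTable(header, rows);                       // 3. format, 4. send back
    }

    public static void main(String[] args) {
        // A fake back-end that ignores the command and returns two fixed rows.
        Backend fake = cmd -> Arrays.asList(
                Arrays.asList("alice", "30"),
                Arrays.asList("bob", "25"));
        System.out.print(interpret("  SELECT name, age FROM users;  ",
                fake, Arrays.asList("name", "age")));
    }
}
```

Swapping the fake back-end for a real client is the only part that changes from one interpreter to another; the formatting step stays the same.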

Core interpreters
- Spark (Spark core, SparkSQL/DataFrame, PySpark)
  - Spark core = default (or %spark)
  - SparkSQL = %sql
- Shell (%sh)
- Markdown (%md)
- AngularJS (%angular)

Third-party interpreters
Hive, Phoenix, Tajo, Flink, Ignite, Lens, Cassandra, Geode, PostgreSQL, Kylin

Interpreter conf & usage https://github.com/doanduyhai/incubator-zeppelin/tree/zeppelinpresentation

Writing An Interpreter
- How To
- Simple interpreter example (AsciiDoc)
- Complex interpreter example (Cassandra)

Steps to write your own interpreter
- Create a class that extends the Interpreter base class
- Register it in a static block

    static {
        Interpreter.register("MyInterpreterName", MyClassName.class.getName());
    }

- Optionally define default config params

    static {
        Interpreter.register("MyInterpreterName", MyClassName.class.getName(),
            new InterpreterPropertyBuilder()
                .add("property1", "default value", "Description of property1")
                .build());
    }
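To show what registering from a static block amounts to, here is a self-contained sketch of the pattern. It does not use Zeppelin classes: `RegistrationSketch`, its `REGISTRY` map and the nested `MyInterpreter` class are hypothetical stand-ins. The point illustrated is a general Java one: the static block runs when the class is initialized, for example when it is loaded by name.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class RegistrationSketch {

    /** Hypothetical stand-in for the registry behind a register(name, className) call. */
    static final Map<String, String> REGISTRY = new ConcurrentHashMap<>();

    static void register(String name, String className) {
        REGISTRY.put(name, className);
    }

    /** Hypothetical interpreter: its static block runs once, when the class is initialized. */
    static class MyInterpreter {
        static {
            register("MyInterpreterName", MyInterpreter.class.getName());
        }
    }

    /** Loads a class by its fully qualified name, which triggers its static block. */
    static String loadAndLookup(String className, String name) {
        try {
            Class.forName(className);
        } catch (ClassNotFoundException e) {
            throw new IllegalStateException("Unknown interpreter class: " + className, e);
        }
        return REGISTRY.get(name);
    }

    public static void main(String[] args) {
        String fqcn = RegistrationSketch.class.getName() + "$MyInterpreter";
        // Nothing is registered until the class is actually loaded and initialized.
        System.out.println("before load: " + REGISTRY.containsKey("MyInterpreterName"));
        System.out.println("after load:  " + loadAndLookup(fqcn, "MyInterpreterName"));
    }
}
```

In this sketch the registry only learns about an interpreter once something loads its class by name, which is one reason a list of fully qualified class names is a natural way to configure such a system.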

To register your interpreter as default
- Edit the enum ZeppelinConfiguration.ConfVars
- Add your interpreter FQCN in the property ZEPPELIN_INTERPRETERS

To register your interpreter in config files
- Create conf/zeppelin-site.xml from conf/zeppelin-site.xml.template
- Add your interpreter FQCN in the property zeppelin.interpreters

    <property>
      <name>zeppelin.interpreters</name>
      <value>org.apache.zeppelin.spark.SparkInterpreter,org.apache.zeppelin.spark.PySparkInterpreter,org.apache.zeppelin.spark.SparkSqlInterpreter,org.apache.zeppelin.spark.DepInterpreter,org.apache.zeppelin.markdown.Markdown,org.apache.zeppelin.shell.ShellInterpreter,org.apache.zeppelin.hive.HiveInterpreter,com.me.MyNewInterpreter</value>
    </property>

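Simple AsciiDoc Interpreter
[Diagram: the Zeppelin Server (Zeppelin Engine) and the AsciiDoc Interpreter run in separate JVMs.]
1. Raw text block arrives at the Zeppelin Server
2. Raw text block is passed to the AsciiDoc Interpreter
3. Converted to HTML
4. HTML output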
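The flow in this diagram can be mimicked with a toy converter. A real interpreter would presumably delegate to an AsciiDoc library; the sketch below is only an assumption-laden illustration that handles `= Title`, `== Section` and plain paragraphs. `AsciiDocSketch` and its methods are hypothetical names, and the `%html` prefix assumes Zeppelin's display system renders output starting with `%html` as HTML.

```java
public class AsciiDocSketch {

    /** Escapes the characters that would otherwise be read as HTML markup. */
    static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    /** Toy conversion: "= Title" becomes h1, "== Section" becomes h2, other non-blank lines become p. */
    static String toHtml(String asciiDoc) {
        StringBuilder html = new StringBuilder();
        for (String line : asciiDoc.split("\n")) {
            String text = line.trim();
            if (text.isEmpty()) {
                continue; // blank lines only separate blocks
            }
            if (text.startsWith("== ")) {
                html.append("<h2>").append(escape(text.substring(3))).append("</h2>");
            } else if (text.startsWith("= ")) {
                html.append("<h1>").append(escape(text.substring(2))).append("</h1>");
            } else {
                html.append("<p>").append(escape(text)).append("</p>");
            }
        }
        return html.toString();
    }

    /** Raw text block in, display-system output out. */
    static String interpret(String rawText) {
        return "%html " + toHtml(rawText);
    }

    public static void main(String[] args) {
        System.out.println(interpret("= Hello\n\nZeppelin & AsciiDoc"));
    }
}
```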

Simple interpreter (AsciiDoc) https://github.com/doanduyhai/incubator-zeppelin/tree/zeppelinpresentation

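Cassandra Interpreter Architecture
[Diagram: the Zeppelin Server and the Cassandra Interpreter run in separate JVMs; the interpreter talks to Cassandra through the Cassandra Java Driver.]
1. Raw text block arrives at the Zeppelin Server
2. Raw text block is passed to the Cassandra Interpreter
3. Async CQL statements are sent to Cassandra
4. (step unlabeled in the diagram)
5. Display results as HTML
6. Render HTML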

Cassandra Interpreter Commands
- Native CQL statements: SELECT * FROM ; INSERT INTO ;
- Schema commands: DESCRIBE TABLE ; DESCRIBE KEYSPACE ;
- Options commands: @consistency ; @retrypolicy ; @fetchsize ;
- Prepared statements commands: @prepare ; @bind ; @remove_prepared ;
- Help command: HELP;

Complex interpreter (Cassandra) https://github.com/doanduyhai/incubator-zeppelin/tree/zeppelinpresentation

Zeppelin future
- Roadmap

Roadmap & future
- More graph options (Map viz, ZEPPELIN-157)
- Helium project: packaging Zeppelin view, logic (code) & resources into Applications
- Interpreter packaging re-design
  - ship & compile core interpreters only
  - third-party interpreters can be pulled from a repository
  - which interpreters are core? Who will maintain them? The community.
- Integrate security (Apache Shiro, ZEPPELIN-53)

Roadmap & future
- Graduate from incubation to become a first-class Apache project

Q & A

Thank You
duy_hai.doan@datastax.com
http://zeppelin.incubator.apache.org/