What s Cooking in KNIME



Similar documents
KNIME Server Workshop

Harnessing Big Data with KNIME

Predictive Analytics Powered by SAP HANA. Cary Bourgeois Principal Solution Advisor Platform and Analytics

SEIZE THE DATA SEIZE THE DATA. 2015

Geo-Localization of KNIME Downloads

Big Data Analytics with Spark and Oscar BAO. Tamas Jambor, Lead Data Scientist at Massive Analytic

An In-Depth Look at In-Memory Predictive Analytics for Developers

Data Domain Profiling and Data Masking for Hadoop

Hadoop Ecosystem B Y R A H I M A.

Hadoop MapReduce and Spark. Giorgio Pedrazzi, CINECA-SCAI School of Data Analytics and Visualisation Milan, 10/06/2015

KNIME opens the Doors to Big Data. A Practical example of Integrating any Big Data Platform into KNIME

Creating a universe on Hive with Hortonworks HDP 2.0

Ensembles and PMML in KNIME

NetBeans IDE Field Guide

Data processing goes big

Name: Srinivasan Govindaraj Title: Big Data Predictive Analytics

SQL Server Administrator Introduction - 3 Days Objectives

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Big Data and Data Science: Behind the Buzz Words

Data and Machine Architecture for the Data Science Lab Workflow Development, Testing, and Production for Model Training, Evaluation, and Deployment

Actian Vortex Express 3.0

Monitis Project Proposals for AUA. September 2014, Yerevan, Armenia

ANALYTICS CENTER LEARNING PROGRAM

Installation Guide of the Change Management API Reference Implementation

Configuring Nex-Gen Web Load Balancer

Technical Report. The KNIME Text Processing Feature:

Journée Thématique Big Data 13/03/2015

Set Up Hortonworks Hadoop with SQL Anywhere

Prerequisites. Course Outline

Big Data on Microsoft Platform

How to use. ankus v0.2.1 ankus community 작성자 : 이승복. This work is licensed under a Creative Commons Attribution 4.0 International License.

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Spark and the Big Data Library

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

DATA SCIENCE CURRICULUM WEEK 1 ONLINE PRE-WORK INSTALLING PACKAGES COMMAND LINE CODE EDITOR PYTHON STATISTICS PROJECT O5 PROJECT O3 PROJECT O2

Release System Administrator s Guide

Introduction to Big data. Why Big data? Case Studies. Introduction to Hadoop. Understanding Features of Hadoop. Hadoop Architecture.

Advanced Big Data Analytics with R and Hadoop

BIG DATA SOLUTION DATA SHEET

MEGA Web Application Architecture Overview MEGA 2009 SP4

Oracle Data Miner (Extension of SQL Developer 4.0)

Maximierung des Geschäftserfolgs durch SAP Predictive Analytics. Andreas Forster, May 2014

JMETER - MONITOR TEST PLAN

KNIME Enterprise server usage and global deployment at NIBR

Big Data Open Source Stack vs. Traditional Stack for BI and Analytics

RapidMiner Radoop Documentation

Cloudera Manager Training: Hands-On Exercises

Bentley CONNECT Dynamic Rights Management Service

Top 10 Oracle SQL Developer Tips and Tricks

! E6893 Big Data Analytics:! Demo Session II: Mahout working with Eclipse and Maven for Collaborative Filtering

Open Source Technologies on Microsoft Azure

HiBench Introduction. Carson Wang Software & Services Group

Sentimental Analysis using Hadoop Phase 2: Week 2

DEPLOYMENT GUIDE Version 1.2. Deploying the BIG-IP system v10 with Microsoft Exchange Outlook Web Access 2007

How To Create A Data Visualization With Apache Spark And Zeppelin

Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP

WhatsUp Gold v16.3 Installation and Configuration Guide

docs.hortonworks.com

AdminStudio Release Notes. 16 July Introduction New Features... 6

Enterprise Product Integration

Filr 2.0 Administration Guide. April 2016

Chapter 11 Map-Reduce, Hadoop, HDFS, Hbase, MongoDB, Apache HIVE, and Related

About Dell Statistica

KNIME UGM 2014 Partner Session

Chapter 1 - Web Server Management and Cluster Topology

Integrating Apache Spark with an Enterprise Data Warehouse

Lavastorm Analytic Library Predictive and Statistical Analytics Node Pack FAQs

Ekran System Help File

User Manual. Onsight Management Suite Version 5.1. Another Innovation by Librestream

OpenText Information Hub (ihub) 3.1 and 3.1.1

EMC Documentum Connector for Microsoft SharePoint

DEPLOYMENT GUIDE Version 2.1. Deploying F5 with Microsoft SharePoint 2010

The basic data mining algorithms introduced may be enhanced in a number of ways.

Building and Using Web Services With JDeveloper 11g

5 Mistakes to Avoid on Your Drupal Website

Actian Analytics Platform Express Hadoop SQL Edition 2.0

EVALUATION ONLY. WA2088 WebSphere Application Server 8.5 Administration on Windows. Student Labs. Web Age Solutions Inc.

Important Notice. (c) Cloudera, Inc. All rights reserved.

tibbr Now, the Information Finds You.

Management Center. Installation and Upgrade Guide. Version 8 FR4

SAP BusinessObjects Query as a Web Service Designer SAP BusinessObjects Business Intelligence platform 4.0

TIBCO Spotfire Automation Services 6.5. User s Manual

TIBCO Spotfire Statistics Services Installation and Administration Guide. Software Release 5.0 November 2012

Windchill Service Information Manager Curriculum Guide

Veeam Backup Enterprise Manager. Version 7.0

HADOOP. Revised 10/19/2015

Model Deployment. Dr. Saed Sayad. University of Toronto

CHAPTER 1 - JAVA EE OVERVIEW FOR ADMINISTRATORS

Getting Started with Hadoop. Raanan Dagan Paul Tibaldi

Talend Open Studio for Big Data. Release Notes 5.2.1

Deploying Hadoop with Manager

Like what you hear? Tweet it using: #Sec360

Using MySQL for Big Data Advantage Integrate for Insight Sastry Vedantam

SelectSurvey.NET Developers Manual

Hadoop s Advantages for! Machine! Learning and. Predictive! Analytics. Webinar will begin shortly. Presented by Hortonworks & Zementis

SharePoint Training. Yes-M Systems LLC. Length: Hours Course

Enterprise Service Bus

SSIS Training: Introduction to SQL Server Integration Services Duration: 3 days

Communiqué 4. Standardized Global Content Management. Designed for World s Leading Enterprises. Industry Leading Products & Platform

Transcription:

What s Cooking in KNIME Thomas Gabriel Copyright 2015 KNIME.com AG

Agenda Querying NoSQL Databases Database Improvements & Big Data Copyright 2015 KNIME.com AG 2

Querying NoSQL Databases MongoDB & CouchDB - Alexander Fillbrunn - Copyright 2015 KNIME.com AG 3

Database Improvements and Big Data - Tobias Koetter - Copyright 2015 KNIME.com AG 4

Database Improvements Copyright 2015 KNIME.com AG 5

Database New Nodes Database Create Table Define constraints e.g. primary and unique keys Indexing options Database Column Rename Database Pivoting More connectors SAP Hana... Copyright 2015 KNIME.com AG 6 6

Database Framework Improvements Connection handling Connection pool (more speed) Dedicated connections (more control) Improved sql editors Syntax highlighting Code completion Copyright 2015 KNIME.com AG 7

Big Data Copyright 2015 KNIME.com AG 8

Machine Learning on Hadoop Based on Spark MLlib Scalable machine learning library Runs on Hadoop Algorithms for Classification (decision tree, naïve bayes, ) Regression (logistic regression, linear regression, ) Clustering (k-means) Collaborative filtering (ALS) Dimensionality reduction (SVD, PCA) Copyright 2015 KNIME.com AG 9 9

MLlib Integration Learn node for each algorithm Hive tables as input format MLlib model ports for model transfer MLlib model port Copyright 2015 KNIME.com AG 10

MLlib Integration MLlib nodes start and manage Spark jobs Copyright 2015 KNIME.com AG 11

MLlib Integration Copyright 2015 KNIME.com AG 12

MLlib Integration One predictor for all MLlib models Usage model and dialogs similar to existing KNIME mining nodes Copyright 2015 KNIME.com AG 13

MLlib Integration Copyright 2015 KNIME.com AG 14

MLlib Integration Hive tables as input/output format Data stays within your HDFS file system No unnecessary data movements Copyright 2015 KNIME.com AG 15

MLlib to KNIME Converts supported MLlib models to PMML Learning at scale on Hadoop Prediction with speed based on compiled models Can be combined with the new REST API Copyright 2015 KNIME.com AG 16

KNIME to MLlib Prediction at scale on Hadoop Compatible with KNIME models and pre-processing steps Copyright 2015 KNIME.com AG 17

Mix and Match Use all the KNIME nodes on your big data samples Copyright 2015 KNIME.com AG 18

Closing the Loop Apply model Learn model PMML model MLlib model Learn model Apply model Copyright 2015 KNIME.com AG 19

Agenda Querying NoSQL Databases Database Improvements & Big Data New KNIME Server Wizard Execution Workflow Diff Copyright 2015 KNIME.com AG 20

New KNIME Server New Server REST Interface WebPortal Templates Copyright 2015 KNIME.com AG 21

New KNIME Server WebPortal Templates & REST Interface - Thorsten Meinl - Copyright 2015 KNIME.com AG 22

Glassfish Tomcat Copyright 2015 KNIME.com AG 23

Why TomEE? Apache TomEE is based on Apache Tomcat Much higher adoption than Glassfish Additional libraries to support EJB Communication solely via HTTP No more firewall problems Encryption via HTTPS Installation and deployment considerably easier Better user and group management Simultaneous connection to multiple servers KNIME Server 4.0 available after the UGM 24 Copyright 2015 KNIME.com AG 24

WebPortal Templates (I) Copyright 2015 KNIME.com AG 25

WebPortal Templates (II) Copyright 2015 KNIME.com AG 26

WebPortal Templates (III) Copyright 2015 KNIME.com AG 27

WebPortal Templates (IV) Copyright 2015 KNIME.com AG 28

WebPortal Templates (V) Layout can be configured by templates Footer & header Main panel Login page Custom stylesheet Custom JavaScript libraries Can be re-used in JS-based views Copyright 2015 KNIME.com AG 29

WebPortal Templates (VI) Templates are part of the configuration and are not overridden by updates Copyright 2015 KNIME.com AG 30

REST Interface Main addition to KNIME Server 4.1 REST = Representational State Transfer Communication based on HTTP Usually clear text Many possible clients Web browser Java applications (e.g. via JAX-RS) KREST nodes :-) Goal: complete server interface based on REST 31 Copyright 2015 KNIME.com AG 31

REST Example: List Workflows (I) Via browser http://localhost:8080/com.knime.enterprise.server/rest/ v4/repository/list Requires user authentication Copyright 2015 KNIME.com AG 32

REST Example: List Workflows (II) Via KNIME and KREST nodes Copyright 2015 KNIME.com AG 33

REST Example: Execute Workflow (I) Via browser Load workflow http://localhost:8080/com.knime.enterprise.server/rest/v4 /jobs/load/ugm 2015/REST Demo/Report Returns unique job ID Execute job http://localhost:8080/com.knime.enterprise.server/rest/v4 /jobs/syncexec/24a76fec-a74e-45ba-b03f-cabf528b6a69 Returns final status Render report http://localhost:8080/com.knime.enterprise.server/rest/v4 /jobs/renderreport/24a76fec-a74e-45ba-b03fcabf528b6a69/pdf Format can be specified in request Copyright 2015 KNIME.com AG 34

REST Example: Execute Workflow (II) Via KNIME and KREST nodes Copyright 2015 KNIME.com AG 35

REST Example: Live-Scoring on server (I) Get expected parameter format from workflow Set input parameters in input quickform nodes Execute workflow Get results from quickform output nodes Scoring workflow, called via REST Copyright 2015 KNIME.com AG 36

REST Example: Live-Scoring on server (II) Get expected parameter format from workflow Set input parameters in input quickform nodes Execute workflow Get results from quickform output nodes Copyright 2015 KNIME.com AG 37

REST Example: Live Scoring on server (III) Via Call Remote Workflow node Analyzes input parameters Prepare input data accordingly Executes job and gets back results Copyright 2015 KNIME.com AG 38

New KNIME Server Wizard Execution New Server Workflow Diff REST Interface WebPortal Templates Copyright 2015 KNIME.com AG 39

Workflow Diff Simple Example I Copyright 2015 KNIME.com AG 40

Workflow Diff Simple Example II Copyright 2015 KNIME.com AG 41

Workflow Diff Extended Example Copyright 2015 KNIME.com AG 42

Workflow Diff Filtering Copyright 2015 KNIME.com AG 43

Wizard Execution I New Set of JavaScript-based interactive Views and QuickForm Nodes Copyright 2015 KNIME.com AG 44

Wizard Execution II Copyright 2015 KNIME.com AG 45

Wizard Execution III Input Variables Output Variables Copyright 2015 KNIME.com AG 46

Wizard Execution IV Copyright 2015 KNIME.com AG 47

Wizard Execution IV Copyright 2015 KNIME.com AG 48

What s Cooking? 12:30-13:30 It s lunchtime Copyright 2015 KNIME.com AG 49