Building Blocks of Cortana Intelligence Suite

Size: px
Start display at page:

Download "Building Blocks of Cortana Intelligence Suite"

Transcription

1 Building Blocks of Cortana Intelligence Suite Carolina IT Pro Group 6/20/2016 Melissa Coates Solution Architect, BlueGranite blue-granite.com Blog: sqlchick.com Slide content last updated: 6/19/2016

2 Building Blocks of Cortana Intelligence Suite Agenda 1. Azure overview 2. Introduce individual components -Purpose -Use cases -Building blocks for solutions 3. Getting started -Skills & expertise

3 Cortana Intelligence Suite Azure Overview

4 Microsoft Azure Platform Microsoft Azure is a cloud computing platform and infrastructure for building, deploying, and managing applications and services through a global network of Microsoft-managed datacenters. Azure provides services and supports many different programming languages, tools and frameworks, including both Microsoft-specific and third-party software and systems.

5 Azure Objectives Extend data center infrastructure Scalability Separate compute from storage Shared infrastructure = reduced cost of ownership Self-service provisioning Simplified management = reduced cost of ownership Open source interoperability Simplified development = faster time to value Integration of services Built-in high availability & disaster recovery Shared code base with (some) on-premises resources

6 Types of Cloud Deployments Shared Infrastructure (Lower Cost) On-Premises Cloud High Scalability Dedicated Infrastructure (Higher Cost of Ownership) More Control (Higher Administration Effort) Less Control (Lower Administration Effort) Limits to Scalability

7 Types of Cloud Deployments IaaS Azure Virtual Machines PaaS Azure SQL Data Warehouse Azure SQL Database Azure HDInsight Azure Data Lake Store Azure Stream Analytics SaaS Power BI SharePoint Online Office 365 Exchange Online Azure Data Catalog Azure Data Factory Azure Machine Learning IaaS Infrastructure-as-a-Service PaaS Platform-as-a- Service SaaS Software

8 Cortana Intelligence Suite Formerly known as Cortana Analytics Suite

9 Cortana Intelligence Suite Objectives Big Data and Advanced Analytics with less cost and effort Intelligent action from people or automated systems Enable opportunities for automation and innovation: Templates Preconfigured solutions Interoperability Easier to operationalize solutions Open standards Cortana Intelligence Suite = a marketing term for a bundle of integrated services

10 Cortana Intelligence Suite Sample Scenarios

11 Cortana Intelligence Suite Gallery Also some samples built into the Azure Portal + Azure documentation area + Visual Studio projects

12 Cortana Intelligence Components Information Management Big Data Stores Machine Learning & Analytics Visualization Power BI Data Factory Data Catalog SQL Data Warehouse HDInsight Machine Learning Intelligence Event Hub Data Lake Store Data Lake Analytics Stream Analytics Cortana Assistant Bot Framework Cognitive Services Plus other solution components such as: IoT Hub Blob Storage Document DB Azure SQL Database Azure Virtual Machine R Analytics Excel Applications Azure Automation Azure Express Active Route Directory Virtual Network

13 Cortana Intelligence Suite Current State Young Set of Services Some services have become generally available in last several months Some services still in public preview Pre-Built Solutions Still Evolving Objective is to have a lot of solution templates to accelerate development time Pricing Still Evolving Still focused on individual services vs bundle Integration Still Evolving The ultimate goal is deep integration between many Azure services

14 Cortana Intelligence Suite Information Management Big Data Stores Machine Learning & Analytics Visualization Power BI Data Factory Data Catalog SQL Data Warehouse HDInsight Machine Learning Intelligence Event Hub Data Lake Store Data Lake Analytics Stream Analytics Cortana Assistant Bot Framework Cognitive Services

15 Cortana Intelligence Suite Azure Data Catalog Generally Available as of April 2016

16 Azure Data Catalog Register, manage, search, and explore organizational data sources Data Preview (first x rows) Search by: Tag, Object Type, Source Type, Expert Name Data Profiling Data Profile Info

17 Azure Data Catalog Register, manage, search, and explore organizational data sources Descriptions for each column Tags for each column How to connect & how to request access

18 Azure Data Catalog Alternative to ADC interface: APIs in conjunction with custom portal

19 Azure Data Catalog Purpose Data Documentation & Discovery Enterprise-wide metadata catalog for data assets Simplified data source discovery via search Enrich & understand assets with tags & annotations Collaboration between data producers & data consumers Important to Know One Data Catalog per organization (not one per subscription) Authentication only accepts an organizational account (cannot use a Microsoft account)

20 Azure Data Catalog Common Use Cases Documentation for Centralized Data Sources Line of business systems Data warehouse, marts Analytic systems Reporting Services File system Data lake Facilitate Self-Service BI Data dictionary Assist combining data from multiple sources Reduce duplication of effort Capture Tribal Knowledge Documentation about the data is maintained by subject matter experts Enhance understanding Data Discovery & Provisioning Users search to discover data assets Who to contact to request access

21 Azure Data Catalog Building Blocks 1 Register metadata for on-premises data sources SQL Server: Data Warehouse SQL Server: CRM SQL Server: Inventory Azure SQL DB: Marketing 2 Register cloud & 3 rd party data sources 1 2 Azure Data Catalog 3 View database schema in SSDT 3 SQL Server Data Tools Connect & view a table of data in client tools Power BI Desktop Excel

22 Cortana Intelligence Suite Azure Data Factory Generally Available as of August 2015

23 Azure Data Factory A service for building and operating data pipelines Factory analogy: Raw Materials Acquire Raw Materials Integration and Preparation Finished Goods Deliver Finished Goods

24 Azure Data Factory Purpose Data Orchestration Automation of data ingestion, orchestration, and data processing Serves as the glue for stitching together services Sources Activities grouped in a Pipeline Destination (Sink)

25 Azure Data Factory How Does ADF compare to Integration Services? Integration Services Data source connection Azure Data Factory Linked service Control Flow Data Flow Source / destination Built-in transformations Custom transformations SQL Server Agent Scheduling Dataset Pipeline Activities Monitoring Scheduling Hive, Pig, C# scripts Stored proc Copy

26 Azure Data Factory Key Differences from SSIS ADF does *not* have built-in transformation capabilities (Hive, Pig, C#, or SQL DB stored procs, etc) Pipeline intent and scheduling are all combined together in ADF (not modular) JSON Most ADF elements are hand-written JSON scripts One Data Factory Frequently just one Data Factory per subscription (a resource can store just one key for use by ADF) ADF diagram can get large (no concept of one package per destination, though pipelines to group activities help) Data lineage can be tracked across the ADF diagram

27 Azure Data Factory JSON JavaScript Object Notation Data-interchange format Alternative to XML JSON used significantly in Azure

28 Azure Data Factory ADF Monitoring & Management App

29 Azure Data Factory Common Use Cases Big Data Processing HDInsight & big data stores are its strength Provide the script you want to run (Hive, Pig, etc) & it will spin up/tear down an HDInsight cluster on-demand Operationalize Solutions Scheduling of data processing operations Stage Data In Supported Data Source Certain Azure services only support cloud data sources Multiple Cloud Sources Integrate or relocate disparate data Combine or copy multiple sources

30 Azure Data Factory Building Blocks Data Movement with ADF 3 1 ADF extracts a subset of data into a cloud source supported by the Azure service to be utilized Data Warehouse 1 Azure SQL Database 2 Azure Machine Learning 2 ADF invokes API to execute Machine Learning model = Azure Data Factory (ADF) 3 ADF outputs scored results back to database for further analysis and integration with other data

31 Cortana Intelligence Suite Information Management Big Data Stores Machine Learning & Analytics Visualization Power BI Data Factory Data Catalog SQL Data Warehouse HDInsight Machine Learning Intelligence Event Hub Data Lake Store Data Lake Analytics Stream Analytics Cortana Assistant Bot Framework Cognitive Services

32 Cortana Intelligence Suite Azure SQL Data Warehouse In Public Preview

33 Relational Data Warehousing in Azure* SQL Server in a Virtual Machine (IaaS) Run SQL Server workloads (including SSAS, SSRS, SSIS, etc) in an Azure Virtual Machine. Best for: Migrating/extending existing database Administer all aspects Bring your own license Dev/test scenarios *Excluding: Other technologies in a VM such as Oracle, and Big Data technologies like Hive, etc Azure SQL Database (PaaS) A relational database-as-a-service (DBaaS). Close feature parity with SQL Server. Best for: < 1TB data volume (sharding across DBs is not suitable for DW workloads) OLTP with scaling & pooling needs (unpredictable workloads) Reduced administration of DB, O/S, and hardware Azure SQL Data Warehouse (PaaS) A DW-as-a-service (DWaaS) optimized for performance and large scale, distributed workloads. Best for: Larger data volumes on MPP architecture Ability to scale up/down/ pause on-demand Combining relational + nonrelational data

34 Azure SQL DW SQL DW is suitable for large-scale data warehousing workloads Massively parallel processing (MPP) architecture across distributions (SQL DBs) SQL DW data stored in blob storage (*not* SQL DB) Part of the SQL Server family (with differences)

35 Azure SQL DW Purpose MPP Scale-Out Query Engine Cloud-based, multi-tenant, platform-as-a-service (PaaS) Massively parallel processing (MPP) Built on SQL Server Clustered columnstore indexes used by default Elastic Scale Scale up/down ondemand or on schedule PolyBase T-SQL for Hadoop queries & data loads Storage + Compute Storage and compute is decoupled Separate billing & scaling for storage vs. compute Data Warehouse Units (DWUs) controls compute billing Increase/decrease/pause compute ability independently of data storage

36 Azure SQL DW Common Use Cases Analytical and Ad Hoc Workloads Batch inserts and updates OLTP workload are not suitable for SQL DW Varying Workloads Workloads which suit ability to scale compute up/down (ex: during data load processing or intensive analytical operations) Large Scale Data Warehouse Easier to provision large-scale environments in cloud than on-premises Data Variety Integration with various data source types and data structures (i.e., takes advantage of PolyBase)

37 Azure SQL DW Two Ways of Using PolyBase 1 Querying of relational + semi/unstructured data in a single consolidated query. Objective: avoid data movement from where the data currently resides. Azure SQL Data Warehouse Azure Blob Storage T-SQL query results Driver Car State Avg Miles Name Model Speed Driven Amanda Civic WA Green Joe Brown Escort WA Driver Name Car Model State Amanda Green Civic WA Joe Brown Escort WA Telemetry data

38 Azure SQL DW Two Ways of Using PolyBase 2 Data movement from source to target. Currently PolyBase is the recommended method for loading SQL DW due to its parallel processing behavior. Control Node Compute Node Compute Node Compute Node Azure Blob Storage

39 Azure SQL DW Building Blocks Modern Data Warehouse 1 Ingest data into Azure (copy of raw data) 2 Process and transform data Line of Business Systems Web Data 1 Blob Storage 2 Blob Storage 3 4 SQL Data Warehouse 3 Load processed data to SQL DW in parallel with PolyBase 4 Load relational data to SQL DW (via BCP, SSIS, ADF, etc) = Azure Data Factory (ADF) 5 Data Catalog 5 Register data source(s) which users are permitted to access

40 Cortana Intelligence Suite Information Management Big Data Stores Machine Learning & Analytics Visualization Power BI Data Factory Data Catalog SQL Data Warehouse HDInsight Machine Learning Intelligence Event Hub Data Lake Store Data Lake Analytics Stream Analytics Cortana Assistant Bot Framework Cognitive Services

41 Azure Data Lake A collection of 3 services: Data Lake Analytics Big Data queries-as-a-service HDInsight Big Data cluster-as-a-service Data Lake Store Big Data storage as-a-service

42 Cortana Intelligence Suite Azure Data Lake Store In Public Preview

43 Azure Data Lake Store Big Data storage-as-a-service

44 Azure Data Lake Store Purpose Big Data Storage Storage to support Hadoop applications HDFS (Hadoop Distributed File System) for the cloud, with no size limitations Stores data in its native format: objective is to not reformat File System Optimized for Analytics An alternative to general purpose Azure Blob Storage Parallel read scans Scaled out over multiple machines Low latency writes Large file sizes WebHDFS-Compatibility Accessible to all HDFScompliant projects (if integrated with HDInsight)

45 Azure Data Lake Store Common Use Cases Big Data Analytic Workloads Optimized to work with ADL Analytics Also supports HDInsight Influx of Data Persist large variety of data formats Retain original data in native format before it s processed Large File Sizes No limits on account or file sizes Can handle PB size files Low Latency Read files as they are being written Active Archive Availability of archival data Analytics Sandbox Working area for data scientists and analytsts to store files

46 Cortana Intelligence Suite Azure Data Lake Analytics In Public Preview

47 Azure Data Lake Analytics Big Data queries-as-a-service

48 Azure Data Lake Analytics Purpose Big Data Processing Ability to process any data, regardless of size or structure YARN application built on open standards Query scalability: resources allocated for each query Optimized to work with Azure Data Lake Store Simplification Abstracts away the cluster nodes focuses on convenience, efficiency, and scalability U-SQL = familiar SQL and C# to reduce learning curve Separation of ADL Analytics from ADL Store: easier to manage, debug, and optimize

49 Azure Data Lake Analytics U-SQL SQL + C# New big data query processing language Applies schema on read logic Mix of multiple SQL dialects (T-SQL and ANSI SQL) Native extensibility of user code written in C# Full C# expressions Reuse in assemblies Define custom types, functions, etc. Automatically scales and parallelizes across nodes

50 Azure Data Lake Analytics Common Use Cases Focus on Business Logic Focus on jobs rather than on infrastructure for a cluster Abstracts away the cluster nodes and focuses on convenience, efficiency, and scalability Proving Value of Data Initial experimentation or infrequent analysis Can be a precursor to integrating data into relational store Cloud Data Data resides in the cloud U-SQL is a Fit Skillsets and preferences are a fit for U-SQL (SQL + C#) Various Size Workloads Scalability on an individual job basis Objective is to not reserve capacity that s not needed

51 Azure Data Lake Analytics Building Blocks Augment a Data Warehouse with a Data Lake 1 Ingest new data into ADL Store (via ADF, copy, streaming, etc) Line of Business Systems Web Data Device Data 1 Blob Storage 2 Data Lake Store PolyBase 5 U-SQL 4 3 SQL Data Warehouse 3 U-SQL Data Lake Analytics Relocate existing data into ADL Store (optional) U-SQL: read data from SQL DW & ADL Store to analyze data U-SQL: write results back to ADL Store 5 Integrate results back into SQL DW = Azure Data Factory (ADF)

52 Cortana Intelligence Suite Azure HDInsight Generally Available as of October 2013

53 Azure HDInsight Hadoop-based distribution for Big Data solutions

54 Azure HDInsight Purpose Big Data Processing A Big Data cluster-as-a-service for distributed data processing, scaling, and querying capabilities Supports the Apache Hadoop open source ecosystem: Hive, Spark, R, Solr, Storm, etc Considered a compute service rather than as a big data store Linux or Windows Hortonworks Partnership Based on Hortonworks Data Platform (HDP) distribution (Microsoft + Hortonworks joint engineering team with joint roadmap)

55 Azure HDInsight Common Use Cases Big Data Scenarios Volume Variety Velocity Leveraging the Hadoop ecosystem You want to manage a cluster & go beyond what U- SQL can easily do with ADL Analytics Integration with other open source projects Development/POC Inexpensive way to test out proof of concept before investing in a big data cluster Data Processing Engine Computations and aggregations for refined data sent to DW and analytics systems Component of ETL operations Data Exploration Part of data scientist s toolbox Event Processing Storm

56 Azure HDInsight Important to Know A running cluster will charge for compute hours on a per-node basis whether it s being actively used or not. This is why clusters are frequently created and deleted (there s no pause/shutdown). It can take close to 30 minutes to create a cluster on-demand though. Because the data is stored separately from the HDInsight compute service, data is preserved when a cluster is deleted. Azure Blob Storage is currently the default storage of HDInsight, though it will change to Azure Data Lake Store in the future.

57 Azure HDInsight Building Blocks On-Demand HDInsight Cluster 1 Ingest new data into Blob Storage (via ADF, copy, streaming, import/export, etc) Web Data 1 Blob Storage 2 3 HDInsight 4 SQL Data Warehouse 2 ADF creates an HDInsight cluster on demand, then executes Hive script to perform data aggregations & calculations HDInsight 3 Output processed data to database for further analysis = Azure Data Factory (ADF) 4 HDInsight cluster is deleted by ADF (to save cost) once its time to live expires

58 Cortana Intelligence Suite Information Management Big Data Stores Machine Learning & Analytics Visualization Power BI Data Factory Data Catalog SQL Data Warehouse HDInsight Machine Learning Intelligence Event Hub Data Lake Store Data Lake Analytics Stream Analytics Cortana Assistant Bot Framework Cognitive Services

59 Cortana Intelligence Suite Azure Machine Learning Generally Available as of February 2015

60 Azure Machine Learning A service for building predictive analytics solutions Data preprocessing modules Algorithms Execution of algorithms Custom R or Python

61 Azure Machine Learning Purpose Predictive Analytics Build predictive models using statistical techniques Learn from existing data to forecast future behaviors, outcomes & trends Minimize learning curve with predefined algorithms and drag & drop authoring environment Extensible with R and Python

62 Azure Machine Learning Common Use Cases Finding Anomalies Examining patterns for detection of fraud Locating unusual or abnormal equipment readings Descriptive Analytics Analysis of returns Customer segmentation (ex: by buying habits or age group) to improve customer service Personalized offer recommendations Predictive Analytics Credit risk Product demand & revenue predictions Customer retention Weather predictions Machine maintenance & smart buildings Hospital readmissions Student dropouts

63 Azure Machine Learning Building Blocks Operationalizing an ML Model 1 Relocate onpremises data to a supported online data source Line of Business Systems 1 Azure SQL Database 2 Azure Machine Learning API 3 2 Execute Machine Learning model by invoking the deployed machine learning API SQL Data Warehouse 3 Integrate scored results to SQL DW for further analysis = Azure Data Factory (ADF)

64 Cortana Intelligence Suite Azure Stream Analytics + Azure Event Hub Generally Available as of April 2015

65 Azure Stream Analytics Real-time analytics for high velocity streaming events

66 Azure Stream Analytics Purpose Stream Analytics Analytic processing engine for streaming events Internet of Things (IoT) solutions for data in motion from devices, sensors, social media, etc Event Hub A publish-subscribe service that handles high volume & high velocity data streams Allows events to be ingested into Azure from many platforms & devices (another option: IoT Hub) The preferred method of event ingestion for Stream Analytics Simplification Alternative to batch loading processes Lower bar to entry for developers by using SQLlike language More straightforward than an HDInsight Storm cluster

67 Azure Stream Analytics Common Use Cases Device Monitoring & Telemetry Level beyond acceptable threshold Business continuity Traffic Accident & traffic conditions Web Logs Clickstream analysis A/B testing Errors or degraded experience Identity Protection Real-time fraud alerts Identity theft scenarios Social Media Real-time sentiment analysis Demand-Based Pricing Bookings in past x minutes Inventory Levels Shelf volume vs. register checkouts

68 Azure Stream Analytics Building Blocks Equipment Maintenance Predictions 1 2 Events ingested to the raw event data queue Consume & process data for a window of time 3 Obtain reference data Machine Sensor Output 1 Event Hub HDInsight = Azure Data Factory (ADF) 6 2 Stream Analytics 3 5 Azure SQL Database Azure Machine Learning API Power BI SQL Data Warehouse Real-time reporting Persist streams of data for historical reporting & analysis Processing and aggregations of sensor data Predictions for maintenance and remaining useful life Integrate results for further analysis

69 Cortana Intelligence Suite Information Management Big Data Stores Machine Learning & Analytics Visualization Power BI Data Factory Data Catalog SQL Data Warehouse HDInsight Machine Learning Intelligence Event Hub Data Lake Store Data Lake Analytics Stream Analytics Cortana Assistant Bot Framework Cognitive Services

70 Cortana Intelligence Suite Power BI Generally Available as of July 2015

71 Power BI Data analysis and visualization tools

72 Power BI Purpose Data Preparation, Data Modeling, Data Visualization Set of desktop, web, and mobile tools

73 Power BI Common Use Cases Self-Service BI Mashup of data into a small data model which is imported & refreshed in Power BI Service Data preparation & cleansing Data visualizations Front-End Reporting Reports and dashboards from Corporate BI sources via direct connect Third Party Reporting 3 rd party content packs quick start for isolated reporting scenarios Prototyping Test out ideas for data structure, calculations, and reports Embed in Custom App Power BI Embedded (separate Azure Service)

74 Power BI Building Blocks Bimodal Business Intelligence On-Prem Source Database Enterprise Gateway 2 Power BI Service Power BI Dataset Connection Power BI Report A A B User runs Report A Data is requested via gateway to present in report Data mashup prepared Data model and reports published SQL Data Warehouse Flat File A Power BI Desktop B C C Power BI Imported Dataset Power BI Report B D C D Data refresh schedule created for imported dataset User runs Report B & results are returned from the imported dataset

75 Cortana Intelligence Suite Information Management Big Data Stores Machine Learning & Analytics Visualization Power BI Data Factory Data Catalog SQL Data Warehouse HDInsight Machine Learning Intelligence Event Hub Data Lake Store Data Lake Analytics Stream Analytics Cortana Assistant Bot Framework Cognitive Services

76 Cortana Intelligence Suite Cognitive Services In Public Preview

77 Cognitive Services Purpose Give Applications a Human Side Designed to make applications more personalized, intelligent and engaging Set of APIs, SDKs, and services to see, hear, and interpret Vision Emotion Speech Face Recognition Language etc Some APIs supported by Azure Machine Learning or Bing services

78 Cognitive Services Common Use Cases Sentiment Analysis Detect key phrases & topics being discussed Identify feedback posted to website Convert speech to text Emotion recognition Security Systems Facial detection and recognition Auto-Suggestions Completion of partial search queries Personalized Shopping Experience Recommend items likely to be purchased together

79 Cortana Intelligence Suite Bot Framework In Public Preview

80 Bot Framework Purpose Conversation Agent Automated interactions with users Enable applications and services to have a conversational user interface (CUI)

81 Bot Framework Common Use Cases Messaging Bots Web chats Text/SMS conversation Content Bots Share relevant content, such as news or weather, with you Watcher Bots for Alerting Flight is delayed Dental appointment reminder E-Commerce Bots Order food Book a flight Check inventory for a product

82 Cortana Intelligence Suite Cortana Assistant

83 Cortana Assistant Personal digital assistant

84 Cortana Assistant Purpose Apply Language More Pervasively Virtual personal assistant for: Asking questions Finding things on PC Managing calendar Tracking packages Integration Reminders are integrated between Windows devices Search within other apps, such as a calendar Interact with bots to make requests

85 Cortana Assistant Common Use Cases Reminders Help with remembering appointments Power BI Natural Language Type or talk question: Show me sales for this quarter in the East region and a Power BI chart will render Work With Bots Interact with a bot to place an order or schedule a meeting

86 Cortana Assistant Important To Know Three requirements to render Power BI reports via the Cortana Assistant 1. Permission in the Power BI Dataset for Cortana to access the data 2. Settings for PC are associated to the organizational account (Settings > Accounts > Work Access) 3. Cortana on PC is connected to Office 365 and the organizational account (Cortana > Notebook > Connected Accounts > Office 365)

87 Cortana Assistant Why Is the Suite Named After Cortana? As for Cortana, which is the Microsoft voice-driven personal assistant tool in Windows 10, it s a small part of the solution, but Sirosh says Microsoft named the suite after it because it symbolizes the contextualized intelligence that the company hopes to deliver across the entire suite.

88 Cortana Intelligence Suite Getting Started: Skills & Expertise Needed

89 Azure Portal Azure Portal Different services use the Azure Portal to varying degrees Other Related Portals Some services have their own specific portal for specific functionality: AzureDataCatalog.com DataFactory.Azure.com Studio.AzureML.net

90 Azure Portal Azure Resource Manager (ARM) The new portal is based on the ARM deployment model ARM allows you to: Logically organize related resources which have the same lifecycle Manage related resources as a group Secure related resources as a group Deploy resources via declarative JSON templates (Dev Test Prod) Source control Azure resources with JSON templates ( infrastructure as code ) Simplify deployment & rollback Track cost of an overall solution Tags Categorize resources for billing or management Specify owner and/or support for a resource

91 Azure Data Factory Skills & Expertise Needed Tools Authoring and development: Azure Portal ADF Tools for Visual Studio (VS Extension) Monitoring & management: ADF Monitoring & Management App (DataFactory.Azure.com) Data Management Gateway (access of on-prem data) Management of source & destination files in Azure Storage: Azure Storage Explorer AzCopy Languages & Scripting Options JSON Appropriate source system language (HiveQL, Pig Latin, C#, U-SQL, T-SQL) PowerShell REST API

92 Azure SQL DW Skills & Expertise Needed Tools Authoring, development, management: Azure Portal SQL Server Data Tools for Visual Studio SQL Server Management Studio coming soon Data movement options: PolyBase Azure Data Factory SQL Server Integration Services (SSIS) BCP Management of source & destination files in Azure Storage: Azure Storage Explorer AzCopy Languages & Scripting Options T-SQL; PolyBase REST APIs PowerShell

93 Azure Data Lake Store Skills & Expertise Needed Tools Data movement options: Azure Portal U-SQL (Azure Data Lake Analytics) Azure Data Factory ADLCopy (command line tool) DistCp (distributed copy-if integrated with HDInsight) Sqoop (if integrated with HDInsight) Languages & Scripting Options U-SQL PowerShell Various SDKs (.NET, Node.js, Java, Python) REST APIs (for unsupported languages & platforms) HDFS Projects (ex: Sqoop, Storm-if integrated with HDInsight)

94 Azure Data Lake Analytics Skills & Expertise Needed Tools Authoring and development: Azure Portal Azure Data Lake Tools for Visual Studio (VS Extension) Languages & Scripting Options U-SQL Various SDKs (Java,.NET, Node.js, Python) PowerShell

95 Azure HDInsight Skills & Expertise Needed Tools Authoring and development: Azure Portal HDInsight Tools for Visual Studio (VS Extension) Monitoring & management: Ambari Data movement options: SQL Server Integration Services (SSIS) DistCp (Distributed Copy) Sqoop, Storm, Flume, etc Management of source & destination files in Azure Storage: Azure Storage Explorer AzCopy Languages & Scripting Options Default programming language support: Java, Python (Additional languages can be installed) HiveQL, Pig Latin, Sqoop, etc PowerShell

96 Azure Machine Learning Skills & Expertise Needed Tools Authoring & Development: Azure Portal Azure Machine Learning Studio (Studio.AzureML.net) Jupyter Notebooks Deployment: Azure ML API Service Data Management Gateway (access of on-prem data) Library of Machine Learning Algorithms Pre-defined algorithms Sample experiments Cortana Intelligence Gallery Languages & Scripting Options R (for creation of custom modules) Python (for execution of scripts)

97 Azure Stream Analytics Skills & Expertise Needed Tools Azure Portal Languages & Scripting Options Stream Analytics Query Language (closely related to SQL) PowerShell.NET SDK

98 Power BI Skills & Expertise Needed Tools Power BI Desktop (authoring) Excel (authoring) Power BI Service (consumption, collaboration & some authoring) Power BI Mobile Apps (consumption) Power BI Visuals Gallery (published custom visuals) Enterprise Gateway + Personal Gateway (access on-prem data) Languages & Scripting Options M or Power Query Formula Language (data preparation) DAX - Data Analysis expressions (data model calculations & expressions) D3.js + related technologies (authoring of custom visuals) Power BI Embedded REST APIs and SDK (integration with custom application) Power BI APIs (integration with custom application)

99 Cognitive Services Skills & Expertise Needed Tools Azure Portal Visual Studio, or developer IDE of choice GitHub Languages & Scripting Options Source language applicable to the API (Python, C#, etc)

100 Bot Framework Skills & Expertise Needed Tools Azure Portal Visual Studio, or developer IDE of choice GitHub Bot Framework Emulator Languages & Scripting Options Bot Builder SDKs in C# and Node.js Bot Connector API

101 Cortana Intelligence Suite Conclusion

102 Cortana Intelligence Components Information Management Big Data Stores Machine Learning & Analytics Visualization Power BI Data Factory Data Catalog SQL Data Warehouse HDInsight Machine Learning Intelligence Event Hub Data Lake Store Data Lake Analytics Stream Analytics Cortana Assistant Bot Framework Cognitive Services Plus other solution components such as: IoT Hub Blob Storage Document DB Azure SQL Database Azure Virtual Machine R Analytics Excel Applications Azure Automation Azure Express Active Route Directory Virtual Network

103 Additional Information Setting up a PC for Azure Cortana Intelligence Suite Development What is the Cortana Intelligence Suite? Should You Use a SQL Server Marketplace Image for an Azure Virtual Machine? How to Build a Demo/Test Environment for Azure Data Catalog Overview of Azure Data Catalog in the Cortana Intelligence Suite

104 Thank You for Attending! To download a copy of this presentation: SQLChick.com Presentations & Downloads page Melissa Coates Solution Architect, BlueGranite blue-granite.com Blog: sqlchick.com Creative Commons License: Attribution-NonCommercial-NoDerivative Works 3.0

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent

More information

Azure Data Lake Analytics

Azure Data Lake Analytics Azure Data Lake Analytics Compose and orchestrate data services at scale Fully managed service to support orchestration of data movement and processing Connect to relational or non-relational data

More information

Building a BI Solution in the Cloud

Building a BI Solution in the Cloud Building a BI Solution in the Cloud Stacia Varga, Principal Consultant Email: stacia@datainspirations.com Twitter: @_StaciaV_ 2 SQLSaturday #467 Sponsors Stacia (Misner) Varga Over 30 years of IT experience,

More information

Microsoft Big Data. Solution Brief

Microsoft Big Data. Solution Brief Microsoft Big Data Solution Brief Contents Introduction... 2 The Microsoft Big Data Solution... 3 Key Benefits... 3 Immersive Insight, Wherever You Are... 3 Connecting with the World s Data... 3 Any Data,

More information

HDP Hadoop From concept to deployment.

HDP Hadoop From concept to deployment. HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some

More information

Modernizing Your Data Warehouse for Hadoop

Modernizing Your Data Warehouse for Hadoop Modernizing Your Data Warehouse for Hadoop Big data. Small data. All data. Audie Wright, DW & Big Data Specialist Audie.Wright@Microsoft.com O 425-538-0044, C 303-324-2860 Unlock Insights on Any Data Taking

More information

SQL Server 2014. What s New? Christopher Speer. Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.

SQL Server 2014. What s New? Christopher Speer. Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft. SQL Server 2014 What s New? Christopher Speer Technology Solution Specialist (SQL Server, BizTalk Server, Power BI, Azure) v-cspeer@microsoft.com The evolution of the Microsoft data platform What s New

More information

Please give me your feedback

Please give me your feedback Please give me your feedback Session BB4089 Speaker Claude Lorenson, Ph. D and Wendy Harms Use the mobile app to complete a session survey 1. Access My schedule 2. Click on this session 3. Go to Rate &

More information

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Highly competitive enterprises are increasingly finding ways to maximize and accelerate

More information

Hadoop in the Hybrid Cloud

Hadoop in the Hybrid Cloud Presented by Hortonworks and Microsoft Introduction An increasing number of enterprises are either currently using or are planning to use cloud deployment models to expand their IT infrastructure. Big

More information

Bringing Big Data to People

Bringing Big Data to People Bringing Big Data to People Microsoft s modern data platform SQL Server 2014 Analytics Platform System Microsoft Azure HDInsight Data Platform Everyone should have access to the data they need. Process

More information

Microsoft Big Data Solutions. Anar Taghiyev P-TSP E-mail: b-anarta@microsoft.com;

Microsoft Big Data Solutions. Anar Taghiyev P-TSP E-mail: b-anarta@microsoft.com; Microsoft Big Data Solutions Anar Taghiyev P-TSP E-mail: b-anarta@microsoft.com; Why/What is Big Data and Why Microsoft? Options of storage and big data processing in Microsoft Azure. Real Impact of Big

More information

Power BI as a Self-Service BI Platform:

Power BI as a Self-Service BI Platform: Power BI as a Self-Service BI Platform: Architecture, Use Cases, Strengths & Shortcomings SQL Saturday Atlanta 5/16/2015 Melissa Coates Solution Architect, BlueGranite Blog: sqlchick.com Twitter: @sqlchick

More information

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics

Harnessing the Power of the Microsoft Cloud for Deep Data Analytics 1 Harnessing the Power of the Microsoft Cloud for Deep Data Analytics Today's Focus How you can operate your business more efficiently and effectively by tapping into Cloud based data analytics solutions

More information

Updating Your SQL Server Skills to Microsoft SQL Server 2014

Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course Details Course Outline Module 1: Introduction to SQL Server 2014 This module introduces key features of SQL Server 2014.

More information

Melissa Coates. Tools & Techniques for Implementing Corporate and Self-Service BI. Triad SQL BI User Group 6/25/2013. BI Architect, Intellinet

Melissa Coates. Tools & Techniques for Implementing Corporate and Self-Service BI. Triad SQL BI User Group 6/25/2013. BI Architect, Intellinet Tools & Techniques for Implementing Corporate and Self-Service BI Triad SQL BI User Group 6/25/2013 Melissa Coates BI Architect, Intellinet Blog: sqlchick.com Twitter: @sqlchick About Melissa Business

More information

BIG DATA TRENDS AND TECHNOLOGIES

BIG DATA TRENDS AND TECHNOLOGIES BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.

More information

Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014 www.etidaho.com (208) 327-0768 Course 10977A: Updating Your SQL Server Skills to Microsoft SQL Server 2014 5 Days About this Course This five day instructor led course teaches students how to use the enhancements

More information

Assignment # 1 (Cloud Computing Security)

Assignment # 1 (Cloud Computing Security) Assignment # 1 (Cloud Computing Security) Group Members: Abdullah Abid Zeeshan Qaiser M. Umar Hayat Table of Contents Windows Azure Introduction... 4 Windows Azure Services... 4 1. Compute... 4 a) Virtual

More information

Updating Your SQL Server Skills to Microsoft SQL Server 2014

Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course 10977B: Updating Your SQL Server Skills to Microsoft SQL Server 2014 Page 1 of 8 Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course 10977B: 4 days; Instructor-Led Introduction This

More information

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM

SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM David Chappell SELLING PROJECTS ON THE MICROSOFT BUSINESS ANALYTICS PLATFORM A PERSPECTIVE FOR SYSTEMS INTEGRATORS Sponsored by Microsoft Corporation Copyright 2014 Chappell & Associates Contents Business

More information

Microsoft Research Microsoft Azure for Research Training

Microsoft Research Microsoft Azure for Research Training Copyright 2014 Microsoft Corporation. All rights reserved. Except where otherwise noted, these materials are licensed under the terms of the Apache License, Version 2.0. You may use it according to the

More information

Microsoft Research Windows Azure for Research Training

Microsoft Research Windows Azure for Research Training Copyright 2013 Microsoft Corporation. All rights reserved. Except where otherwise noted, these materials are licensed under the terms of the Apache License, Version 2.0. You may use it according to the

More information

Ganzheitliches Datenmanagement

Ganzheitliches Datenmanagement Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist

More information

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,

More information

HDP Enabling the Modern Data Architecture

HDP Enabling the Modern Data Architecture HDP Enabling the Modern Data Architecture Herb Cunitz President, Hortonworks Page 1 Hortonworks enables adoption of Apache Hadoop through HDP (Hortonworks Data Platform) Founded in 2011 Original 24 architects,

More information

10977B: Updating Your SQL Server Skills to Microsoft SQL Server 2014

10977B: Updating Your SQL Server Skills to Microsoft SQL Server 2014 10977B: Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course Details Course Code: Duration: Notes: 10977B 5 days This course syllabus should be used to determine whether the course is appropriate

More information

Course 10977: Updating Your SQL Server Skills to Microsoft SQL Server 2014

Course 10977: Updating Your SQL Server Skills to Microsoft SQL Server 2014 Course 10977: Updating Your SQL Server Skills to Microsoft SQL Server 2014 Type:Course Audience(s):IT Professionals Technology:Microsoft SQL Server Level:300 This Revision:B Delivery method: Instructor-led

More information

Oracle Database 12c Plug In. Switch On. Get SMART.

Oracle Database 12c Plug In. Switch On. Get SMART. Oracle Database 12c Plug In. Switch On. Get SMART. Duncan Harvey Head of Core Technology, Oracle EMEA March 2015 Safe Harbor Statement The following is intended to outline our general product direction.

More information

Open Source Technologies on Microsoft Azure

Open Source Technologies on Microsoft Azure Open Source Technologies on Microsoft Azure A Survey @DChappellAssoc Copyright 2014 Chappell & Associates The Main Idea i Open source technologies are a fundamental part of Microsoft Azure The Big Questions

More information

Integrating a Big Data Platform into Government:

Integrating a Big Data Platform into Government: Integrating a Big Data Platform into Government: Drive Better Decisions for Policy and Program Outcomes John Haddad, Senior Director Product Marketing, Informatica Digital Government Institute s Government

More information

Beyond Lambda - how to get from logical to physical. Artur Borycki, Director International Technology & Innovations

Beyond Lambda - how to get from logical to physical. Artur Borycki, Director International Technology & Innovations Beyond Lambda - how to get from logical to physical Artur Borycki, Director International Technology & Innovations Simplification & Efficiency Teradata believe in the principles of self-service, automation

More information

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data

More information

Azure Day Application Development

Azure Day Application Development Azure Day Application Development Randy Pagels Developer Technology Specialist Tim Adams Developer Solutions Specialist Azure App Service.NET, Java, Node.js, PHP, Python Auto patching Auto scale Integration

More information

Introducing the Reimagined Power BI Platform. Jen Underwood, Microsoft

Introducing the Reimagined Power BI Platform. Jen Underwood, Microsoft Introducing the Reimagined Power BI Platform Jen Underwood, Microsoft Thank You Sponsors Empower users with new insights through familiar tools while balancing the need for IT to monitor and manage user

More information

WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS

WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS WINDOWS AZURE DATA MANAGEMENT AND BUSINESS ANALYTICS Managing and analyzing data in the cloud is just as important as it is anywhere else. To let you do this, Windows Azure provides a range of technologies

More information

Comprehensive Analytics on the Hortonworks Data Platform

Comprehensive Analytics on the Hortonworks Data Platform Comprehensive Analytics on the Hortonworks Data Platform We do Hadoop. Page 1 Page 2 Back to 2005 Page 3 Vertical Scaling Page 4 Vertical Scaling Page 5 Vertical Scaling Page 6 Horizontal Scaling Page

More information

Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer

Automated Data Ingestion. Bernhard Disselhoff Enterprise Sales Engineer Automated Data Ingestion Bernhard Disselhoff Enterprise Sales Engineer Agenda Pentaho Overview Templated dynamic ETL workflows Pentaho Data Integration (PDI) Use Cases Pentaho Overview Overview What we

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

Advanced Analytics & IoT Architectures

Advanced Analytics & IoT Architectures Advanced Analytics & IoT Architectures Presented by: Tom Marek and Orion Gebremedhin Use Case: ETL Offloading Have you outgrown your data delivery SLAs? Get the right data at the right time 2 ETL Processing

More information

Whitepaper: Solution Overview - Breakthrough Insight. Published: March 7, 2012. Applies to: Microsoft SQL Server 2012. Summary:

Whitepaper: Solution Overview - Breakthrough Insight. Published: March 7, 2012. Applies to: Microsoft SQL Server 2012. Summary: Whitepaper: Solution Overview - Breakthrough Insight Published: March 7, 2012 Applies to: Microsoft SQL Server 2012 Summary: Today s Business Intelligence (BI) platform must adapt to a whole new scope,

More information

Introduction to the Cloud OS Windows Azure Overview Visual Studio Tooling for Windows Azure Scenarios: Dev/Test Web Mobile Hybrid

Introduction to the Cloud OS Windows Azure Overview Visual Studio Tooling for Windows Azure Scenarios: Dev/Test Web Mobile Hybrid Introduction to the Cloud OS Windows Azure Overview Visual Studio Tooling for Windows Azure Scenarios: Dev/Test Web Mobile Hybrid Development Management Identity Data Virtualization All services

More information

ORACLE DATA INTEGRATOR ENTERPRISE EDITION

ORACLE DATA INTEGRATOR ENTERPRISE EDITION ORACLE DATA INTEGRATOR ENTERPRISE EDITION Oracle Data Integrator Enterprise Edition 12c delivers high-performance data movement and transformation among enterprise platforms with its open and integrated

More information

Managing the PowerPivot for SharePoint Environment

Managing the PowerPivot for SharePoint Environment Managing the PowerPivot for SharePoint Environment Melissa Coates Blog: sqlchick.com Twitter: @sqlchick SharePoint Saturday 3/16/2013 About Melissa Business Intelligence & Data Warehousing Developer Architect

More information

Roadmap Talend : découvrez les futures fonctionnalités de Talend

Roadmap Talend : découvrez les futures fonctionnalités de Talend Roadmap Talend : découvrez les futures fonctionnalités de Talend Cédric Carbone Talend Connect 9 octobre 2014 Talend 2014 1 Connecting the Data-Driven Enterprise Talend 2014 2 Agenda Agenda Why a Unified

More information

The Enterprise Data Hub and The Modern Information Architecture

The Enterprise Data Hub and The Modern Information Architecture The Enterprise Data Hub and The Modern Information Architecture Dr. Amr Awadallah CTO & Co-Founder, Cloudera Twitter: @awadallah 1 2013 Cloudera, Inc. All rights reserved. Cloudera Overview The Leader

More information

The Inside Scoop on Hadoop

The Inside Scoop on Hadoop The Inside Scoop on Hadoop Orion Gebremedhin National Solutions Director BI & Big Data, Neudesic LLC. VTSP Microsoft Corp. Orion.Gebremedhin@Neudesic.COM B-orgebr@Microsoft.com @OrionGM The Inside Scoop

More information

Upgrading Your SQL Server Skills to Microsoft SQL Server 2014

Upgrading Your SQL Server Skills to Microsoft SQL Server 2014 CÔNG TY CỔ PHẦN TRƯỜNG CNTT TÂN ĐỨC TAN DUC INFORMATION TECHNOLOGY SCHOOL JSC LEARN MORE WITH LESS! Course 10977 Upgrading Your SQL Server Skills to Microsoft SQL Server 2014 Length: 5 Days Audience: IT

More information

SQL Server 2012 Business Intelligence Boot Camp

SQL Server 2012 Business Intelligence Boot Camp SQL Server 2012 Business Intelligence Boot Camp Length: 5 Days Technology: Microsoft SQL Server 2012 Delivery Method: Instructor-led (classroom) About this Course Data warehousing is a solution organizations

More information

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata

BIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING

More information

Reference Architecture

Reference Architecture Reference Architecture Real-Time Event Processing with Microsoft Azure Abstract: The Reference Architecture for real-time event processing with Microsoft Azure provides a framework for designing and deploying

More information

Updating Your SQL Server Skills from Microsoft SQL Server 2008 to Microsoft SQL Server 2014

Updating Your SQL Server Skills from Microsoft SQL Server 2008 to Microsoft SQL Server 2014 Course Code: M10977 Vendor: Microsoft Course Overview Duration: 5 RRP: 2,025 Updating Your SQL Server Skills from Microsoft SQL Server 2008 to Microsoft SQL Server 2014 Overview This five-day instructor-led

More information

BIG DATA AND MICROSOFT. Susie Adams CTO Microsoft Federal

BIG DATA AND MICROSOFT. Susie Adams CTO Microsoft Federal BIG DATA AND MICROSOFT Susie Adams CTO Microsoft Federal THE WORLD OF DATA IS CHANGING Cloud What s making this possible? Electrical efficiency of computers doubles every year and ½. Laptops and mobile

More information

Big Data Analytics Platform @ Nokia

Big Data Analytics Platform @ Nokia Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform

More information

Hortonworks CISC Innovation day

Hortonworks CISC Innovation day Hortonworks CISC Innovation day Simon gregory sgregory@hortonworks.com Here was the ask Hortonworks' data reposition - how this works and the types of data you work with. 1: Data Types & Value. What have

More information

Big Analytics in the Cloud. Matt Winkler PM, Big Data @ Microsoft @mwinkle

Big Analytics in the Cloud. Matt Winkler PM, Big Data @ Microsoft @mwinkle Big Analytics in the Cloud Matt Winkler PM, Big Data @ Microsoft @mwinkle Part 3: Single Slide JustGiving is a global online social platform for giving that lets you raise money for a cause you care about

More information

Designing Self-Service Business Intelligence and Big Data Solutions

Designing Self-Service Business Intelligence and Big Data Solutions This five-day instructor-led course teaches students how to implement self-service Business Intelligence (BI) and Big Data analysis solutions using the Microsoft data platform. The course discusses the

More information

Course MS20467C Designing Self-Service Business Intelligence and Big Data Solutions

Course MS20467C Designing Self-Service Business Intelligence and Big Data Solutions 3 Riverchase Office Plaza Hoover, Alabama 35244 Phone: 205.989.4944 Fax: 855.317.2187 E-Mail: rwhitney@discoveritt.com Web: www.discoveritt.com Course MS20467C Designing Self-Service Business Intelligence

More information

Architecting Open source solutions on Azure. Nicholas Dritsas Senior Director, Microsoft Singapore

Architecting Open source solutions on Azure. Nicholas Dritsas Senior Director, Microsoft Singapore Learn. Connect. Explore. Architecting Open source solutions on Azure Nicholas Dritsas Senior Director, Microsoft Singapore Agenda Developing OSS Apps on Azure Customer case with OSS Apps Hadoop on Azure

More information

Sisense. Product Highlights. www.sisense.com

Sisense. Product Highlights. www.sisense.com Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze

More information

SQL Server 2016 New Features!

SQL Server 2016 New Features! SQL Server 2016 New Features! Improvements on Always On Availability Groups: Standard Edition will come with AGs support with one db per group synchronous or asynchronous, not readable (HA/DR only). Improved

More information

Parallel Data Warehouse

Parallel Data Warehouse MICROSOFT S ANALYTICS SOLUTIONS WITH PARALLEL DATA WAREHOUSE Parallel Data Warehouse Stefan Cronjaeger Microsoft May 2013 AGENDA PDW overview Columnstore and Big Data Business Intellignece Project Ability

More information

Using Tableau Software with Hortonworks Data Platform

Using Tableau Software with Hortonworks Data Platform Using Tableau Software with Hortonworks Data Platform September 2013 2013 Hortonworks Inc. http:// Modern businesses need to manage vast amounts of data, and in many cases they have accumulated this data

More information

Modern Data Warehousing

Modern Data Warehousing Modern Data Warehousing Cem Kubilay Microsoft CEE, Turkey & Israel Time is FY15 Gartner Survey April 2014 Piloting on premise 15% 10% 4% 14% 57% 2014 5% think Hadoop will replace existing DW solution (2013:

More information

How To Extend An Enterprise Bio Solution

How To Extend An Enterprise Bio Solution Course 20467C: Designing Self-Service Business Intelligence and Big Data Solutions Module 1: Introduction to Self-Service Business Intelligence This module introduces self-service BI. Extending Enterprise

More information

www.informatik-aktuell.de

www.informatik-aktuell.de www.informatik-aktuell.de The Microsoft data platform capabilities Transform + analyze Data Visualize + decide Capture + manage The Microsoft data platform capabilities Transform + analyze Data Visualize

More information

Migrating SaaS Applications to Windows Azure

Migrating SaaS Applications to Windows Azure Migrating SaaS Applications to Windows Azure Lessons Learned 04.04.2012 Speaker Introduction Deepthi Raju Marketing Technology Services Deepthi joined Smartbridge in 2005 and has over twenty years of technology

More information

Safe Harbor Statement

Safe Harbor Statement Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment

More information

MicroStrategy Course Catalog

MicroStrategy Course Catalog MicroStrategy Course Catalog 1 microstrategy.com/education 3 MicroStrategy course matrix 4 MicroStrategy 9 8 MicroStrategy 10 table of contents MicroStrategy course matrix MICROSTRATEGY 9 MICROSTRATEGY

More information

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION

GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION GAIN BETTER INSIGHT FROM BIG DATA USING JBOSS DATA VIRTUALIZATION Syed Rasheed Solution Manager Red Hat Corp. Kenny Peeples Technical Manager Red Hat Corp. Kimberly Palko Product Manager Red Hat Corp.

More information

Upcoming Announcements

Upcoming Announcements Enterprise Hadoop Enterprise Hadoop Jeff Markham Technical Director, APAC jmarkham@hortonworks.com Page 1 Upcoming Announcements April 2 Hortonworks Platform 2.1 A continued focus on innovation within

More information

Data Governance in the Hadoop Data Lake. Michael Lang May 2015

Data Governance in the Hadoop Data Lake. Michael Lang May 2015 Data Governance in the Hadoop Data Lake Michael Lang May 2015 Introduction Product Manager for Teradata Loom Joined Teradata as part of acquisition of Revelytix, original developer of Loom VP of Sales

More information

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco Decoding the Big Data Deluge a Virtual Approach Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco High-volume, velocity and variety information assets that demand

More information

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this

More information

Oracle s Cloud Computing Strategy

Oracle s Cloud Computing Strategy Oracle s Cloud Computing Strategy Your Strategy, Your Cloud, Your Choice Sandra Cheevers Senior Principal Product Director Cloud Product Marketing Steve Lemme Director, Cloud Builder Specialization Oracle

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Hybrid Cloud Architectures for Operational Performance Management

Hybrid Cloud Architectures for Operational Performance Management Hybrid Cloud Architectures for Operational Performance Management Delbert Murphy Solution Architect / Data Scientist Microsoft Corporation GPDIS_2014.ppt 1 Delbert Murphy and Microsoft s Data Insights

More information

Hadoop & Spark Using Amazon EMR

Hadoop & Spark Using Amazon EMR Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache Spark, a powerful

More information

BIG DATA-AS-A-SERVICE

BIG DATA-AS-A-SERVICE White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers

More information

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!! Simplifying Big Data Analytics: Unifying Batch and Stream Processing John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!! Streaming Analy.cs S S S Scale- up Database Data And Compute Grid

More information

SAP and Hortonworks Reference Architecture

SAP and Hortonworks Reference Architecture SAP and Hortonworks Reference Architecture Hortonworks. We Do Hadoop. June Page 1 2014 Hortonworks Inc. 2011 2014. All Rights Reserved A Modern Data Architecture With SAP DATA SYSTEMS APPLICATIO NS Statistical

More information

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84 Index A Amazon Web Services (AWS), 50, 58 Analytics engine, 21 22 Apache Kafka, 38, 131 Apache S4, 38, 131 Apache Sqoop, 37, 131 Appliance pattern, 104 105 Application architecture, big data analytics

More information

Managed Self-Service BI & Data As A Service

Managed Self-Service BI & Data As A Service Managed Self-Service BI & Data As A Service Triangle SQL Server User Group 1/15/2013 Melissa Coates About Melissa Business Intelligence developer based in Charlotte, NC Architect with Intellinet Specialist

More information

Developing Microsoft Azure Solutions

Developing Microsoft Azure Solutions Course 20532A: Developing Microsoft Azure Solutions Page 1 of 7 Developing Microsoft Azure Solutions Course 20532A: 4 days; Instructor-Led Introduction This course is intended for students who have experience

More information

Upgrading Your SQL Server Skills to Microsoft SQL Server 2014

Upgrading Your SQL Server Skills to Microsoft SQL Server 2014 Upgrading Your SQL Server Skills to Microsoft SQL Server 2014 Varighed: 5 Days Kursus Kode: M10977 Beskrivelse: This five-day instructor-led course teaches students how to use the enhancements and new

More information

Developing Microsoft Azure Solutions 20532A; 5 days

Developing Microsoft Azure Solutions 20532A; 5 days Lincoln Land Community College Capital City Training Center 130 West Mason Springfield, IL 62702 217-782-7436 www.llcc.edu/cctc Developing Microsoft Azure Solutions 20532A; 5 days Course Description This

More information

Informatica and the Vibe Virtual Data Machine

Informatica and the Vibe Virtual Data Machine White Paper Informatica and the Vibe Virtual Data Machine Preparing for the Integrated Information Age This document contains Confidential, Proprietary and Trade Secret Information ( Confidential Information

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically

More information

Microsoft Analytics Platform System. Solution Brief

Microsoft Analytics Platform System. Solution Brief Microsoft Analytics Platform System Solution Brief Contents 4 Introduction 4 Microsoft Analytics Platform System 5 Enterprise-ready Big Data 7 Next-generation performance at scale 10 Engineered for optimal

More information

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08

More information

Apache Hadoop: The Big Data Refinery

Apache Hadoop: The Big Data Refinery Architecting the Future of Big Data Whitepaper Apache Hadoop: The Big Data Refinery Introduction Big data has become an extremely popular term, due to the well-documented explosion in the amount of data

More information

A Modern Data Architecture with Apache Hadoop

A Modern Data Architecture with Apache Hadoop Modern Data Architecture with Apache Hadoop Talend Big Data Presented by Hortonworks and Talend Executive Summary Apache Hadoop didn t disrupt the datacenter, the data did. Shortly after Corporate IT functions

More information

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Datenverwaltung im Wandel - Building an Enterprise Data Hub with Datenverwaltung im Wandel - Building an Enterprise Data Hub with Cloudera Bernard Doering Regional Director, Central EMEA, Cloudera Cloudera Your Hadoop Experts Founded 2008, by former employees of Employees

More information

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC,

Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC, Session 0202: Big Data in action with SAP HANA and Hadoop Platforms Prasad Illapani Product Management & Strategy (SAP HANA & Big Data) SAP Labs LLC, Bellevue, WA Legal disclaimer The information in this

More information

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP Eva Andreasson Cloudera Most FAQ: Super-Quick Overview! The Apache Hadoop Ecosystem a Zoo! Oozie ZooKeeper Hue Impala Solr Hive Pig Mahout HBase MapReduce

More information