Complex Event Processing (CEP) - A Primer

Size: px
Start display at page:

Download "Complex Event Processing (CEP) - A Primer"

Transcription

1 Complex Event Processing (CEP) - A Primer In a nutshell, CEP systems query events on the fly (with no to very light storage requirements). In a CEP setup, the focus is on event streams. An event stream reflects a logical sequence of events that are available for processing within a certain time-window. To illustrate, a stock event steam may consist of events that describe the price change over time. Simplified, the users execute queries against the CEP engine (that contains the CEP logic) while the CEP engine matches the queries against the events that are flowing through via the event streams. It has to be pointed out that a CEP system differs from event processing or filtering environments as a CEP setup supports temporal queries that focus on temporal concepts (time windows) or a before and after relationship. So a simple CEP query would be one that states if the Chevron (CVX) stock value increased by more than 5% over the last 15 minutes, submit a notification. In general, CEP queries can be characterized as: They are perpetual and emit events when the events match the condition that is given via the query. They operate on the fly and only store a minimal amount of events (sometimes no storage is needed). They normally respond to the changing conditions/events with millisecond granularity (an actual performance target has to be designed/build into the solution). As outlined in Figure 3 and Table 3, many CEP components are available (some for free) today. Some solutions are focused on larger SMP systems (as the HW setup) and hence have some degree of vertical scalability designed into the product. The Big Data movement though is all about horizontal scalability and hence some of today's CEP solutions scale rather well horizontally as well. To really address CEP scalability, a few different workload scenarios have to be discussed. In this primer, the following CEP workload scenarios are elaborated on: a CEP workload that executes a very large number of queries a CEP workload that constitutes a very large memory footprint a CEP workload where the complex queries cannot be processed by a single SMP server a CEP workload that has to process a very large number of events Workload Scenario 1: A CEP workload that executes a very large number of queries With this workload scenario, a shared nothing architecture may be appropriate. In other words, n CEP engine instances can be deployed on n nodes and each node executes 1/nth of the queries (the goal is load balancing among the n nodes). From an actual event distribution perspective, depending on the tangible event (streams) workload, some environments are deployed in a brute-force manner where all the events are submitted to all the nodes. While such an approach may be feasible with smaller even sets, most larger production environments utilize a message broker (such as Kafka or RabbitMQ) as the buffer entity that is situated in front of the CEP engine nodes (from a dataflow perspective). Hence, each CEP engine node only analyzes the outcome of the queries based on the event streams that the note subscribes to. From an implementation perspective, the CEP environment matches each event stream to a topic that is available in the broker tier (such as in a Kafka message broker environment). It is very common these days to delegate the event distribution to a streams processing system such as Apache Storm, S4, or Spark. In other words, for scalability purposes it is rather common to have (as an example) a Kafka cluster (the message broker tier) that as a backend utilizes a streams cluster setup (such as Storm) where a CEP engine (such as Esper) is deployed/integrated.

2 Workload Scenario 2: A CEP workload that constitutes a very large memory footprint This case assumes a workload that triggers a long-running, complex query that operates on a rather large time-window where the events require a substantial amount of memory to operate. There is no clear cut answer to how to setup a CEP solution for a workload like this, but some form of distributed caching seems like a potential option. The best approach is to expand on the potential workload scenario and highlight a few more distinct cases: 1. Read and write to and from a large cache subsystem. That reflects the easiest setup that can be used if the memory footprint can be handled by a node (aka design the CEP nodes accordingly). 2. With really large streaming windows, the CEP engine may require some local disk space (aka the application will not run at 100% memory speed and hence, the IO latency behavior has to be clearly understood to quantify the performance potential of such a CEP solution). 3. Very large streaming windows where the CEP solution is distributed (aka the events are shared among n CEP nodes). Such a setup may also provide some HA capabilities. 4. Workload scenarios that integrate streaming data with (cached) reference data to perform continuous join operations among the streamed and cached reference data, respectively. In this case, the CEP engine has to provide an abstraction layer that properly supports this architecture via the EPL - event processing language (such as join keys). From a CEP perspective, solutions such as Esper or Drools cover scenarios 1, 2, and 4 while for case 3 some combination of Storm/Esper or Spark/Drools may be considered (maybe even with some HA spin designed into the solution). A CEP workload that cannot be processed by a single server system If the workload cannot be processed by a single node, actual workload distribution across n cluster nodes is required. Such a scenario may be due to either not being able to process a given event rate on a single node or where the required memory footprint exceeds the capabilities of a single server solution. To solve workload scenarios like this, the queries have to be decomposed into k steps (form a pipeline) where the events are matched against some conditions and based on the outcome, the matching events get processed again down the pipeline. The various steps (k) of the query that constitute the CEP solution can be executed on separate cluster nodes. To illustrate, the stock-price query Chevron (CVX) depicted below is used as the discussion blueprint. The query reports a match if within 60 seconds, 2 events report a stock price > $110 and where the actual delta between these 2 events is > 5%. select evn1.symbol, evn1.prize, evn2.prize from every evn1=stockstream[price > 110 symbol = CVX ] -> evn2=stockstream[price > 110 symbol = CVX ] [evn1.price < 1.05*evn2.prize][within.time=60] As depicted in Figure 1, the query can easily be distributed across 3 nodes where each node is processing a subpart of the query and forwards the events as defined by the application. Figure 1: CEP Simple Pipeline

3 It has to be pointed out that many queries contain other properties that allow further optimization scenarios. To illustrate, while the last step in the pipeline (matching the stock-price increase) is stateful, the other 2 steps outlined in Figure 1 are stateless. While stateful operations memorize information after processing an event (in a way so that earlier events - on the timeline - can affect the processing of later events), stateless operations only depend on the current event (on the timeline) that is being processed. Hence, for all these stateless operations, multiple instances arranged in a shared-nothing cluster setup can be utilized (see Figure 2). Figure 2: CEP Partitioned Setup As depicted in Figure 2, the stateless operations are distributed across 6 nodes while the stateful operation is processed after the events are consolidated from node 2, 4, and 6, respectively. Figure 2 basically depicts a directed acyclic graph (DAC) that describes the data (event) flow. As CEP processing normally occurs via some form of filtering, the number of events being processed further down in the pipeline diminishes (aka filtering adds value at the source and reduces the number of events that have to be further processed). Hence, from a design perspective, it is paramount to identify the filter operations that have the biggest impact on the workload reduction and execute them as early as possible. In general, with any CEP project (or Big Data project in general), it is paramount that the actual data flow is entirely understood so that an appropriate systems design is feasible. Performance, scalability, capacity, security, or reliability requirements have to be designed (not re-engineered) into the solution as well. As illustrated in Figure 1, a filter operation such as symbol=cvx may significantly reduce the working set of what is firehosed into the CEP environment down the pipeline. As depicted in Figure 2, partitioning the firehose data in a shared-nothing setup has the potential to significantly increase the scalability potential of the CEP environment and so may achieve rather high throughput results. To illustrate, assuming that the stock spouts operate on an aggregate event stream of 500,000 events per second, but that only 1% of the data (event) set matches the filter CVX, only 5,000 events will be forwarded to the next stage in the pipeline. It has to be pointed out that the here discussed scenario only works if there are 1 or more stateless operations to be processed. If a query contains a single stateful operation, a (distributed) pipelined approach will obviously not be applicable.

4 CEP Products- Status Quo The word complex in CEP mainly refers to the complexity of state management over time while processing the events. Some examples are: Calculations over sliding windows Correlation of events along a timeline (such as to determine that event x occurs prior to event y within an exact timeframe or to identify a non-occurrence of an event within a timeframe). Most CEP implementations also provide advanced pattern detection, such as a non-deterministic finite state automaton (similar to a regular expression search over a flow of events - with a time influence in the search). Another key influence of time is referred to as timeliness. Timeliness refers to the ability to handle events and produce an output in a constrained time scenario. The target can be described as the end-to-end latency and can reach the millisecond or microsecond scale. CEP tools also provide the ability to arbitrate between guaranteed time and correctness of output (waiting or not on late or unordered events). CEP tools normally handle rather large event volumes (thousands of events per second). Other complexity factors that may motivate a company to consider a move towards CEP technologies are: Number and type of event sources If the application is expected to significantly change over time (such as new event sources, new interactions and responses are introduced) Richness of information in output events (such as counts, averages, composition of events from different sources) Context dependant situations (such as detection of events occurring within a defined spatial distance or within a defined group of customers - possibly querying external systems to determine the context. Correlation of real-time data with historical data Intelligence in event processing (such as inference or machine learning models) Table 1: Fitting Event Processing scenarios: Event rates Application Complexity Timeliness High High High High High Low High Low High Low High High Application Complexity refers to time, state, and context. In other workload scenarios, more traditional messaging and/or transactional systems may be more appropriate than a CEP solution. Various CEP products that support different approaches/paradigms in their event processing cycle are available (See Figure 3 and Table 3). Some of the more common paradigms are:

5 Table 2: Some CEP Paradigms Paradigm Stream oriented, query based workload scenarios. The workload calls for perpetual queries that are operating on an infinite data flow. ECA (event/condition/action) rule based systems. Works similar to triggers in some database solutions ( trigger is a special kind of stored procedure that automatically executes when an event occurs in the database server). Inference rule systems (similar to business rule management systems - BRMS). A BRMS represents a SW system that is used to define, deploy, execute, monitor, and maintain the variety and complexity of decision logic used by operational systems within an organization or enterprise. Time-state based systems Possible applications Manipulation & aggregation of event data (may use SQL-like join logic). May operate among events and/or some external datastore Users define event patterns by composing simple rules System has to take action when certain states are reached. Or business activity monitoring with real-time decision support Monitoring systems with a well defined finite state space Some CEP Functional Features Data reduction (filtering), projection (discarding some attributes), and/or aggregation based on a certain time-window Modeling capabilities for event shape and payload (aka query logic) Transformation (enrichment or shape change, pattern detection- including the detection that an event is absent) Time focused, event timestamps, intervals of occurrence (in respect to time windows and pattern detection, sliding time windows) Context awareness, the context in which the event occurs is being taken into account, the capability to query external systems (other datastores or historical data repository) Logging and analysis for audit purposes or retrospective event processing (understanding precursor events that led to a particular output event) Prediction, learning, and adaptation, pattern discovery, scoring against a data-mining model or some other form of machine-learning capabilities Support capabilities similar to an integrated development environment (IDE) Some CEP Non-Functional Features Input/output connectivity to event sources and event sinks Routing (statically or dynamically) and partitioning for workload distribution Performance enhancements (optimized end-to-end latency - has to be designed into the solution) Predictability, low latency variance (such as a 95th-percentile with a guaranteed latency behavior) Scalability and elasticity (mostly via encapsulation with some distributed streams solution) Availability and recoverability, fault tolerance, continuous operation Consistency and integrity via a distributed system setup, management of temporal granularity Security and privacy, segregation of event streams Simple usability, maintainability, manageability (good design is required though)

6 Figure 3: Some of the major CEP products Figure courtesy of P. Vincent

7 As depicted in Figure 3, a rather comprehensive set of commercial and open sourced event processing solutions are available to design, architect, and develop event processing applications. In the literature, the solutions are sometimes labeled event processing platforms, complex-event processing (CEP) systems, event stream processing (ESP) systems, or distributed stream computing platforms (DSCPs). While Figure 3 is not entirely comprehensive, most of the major solutions are listed. The 3 main exceptions are: Amazon Kinesis (Cloud Based) Google DataFlow (Cloud Based) Stratio Streaming (Uses Spark Streaming & Siddhi CEP Engine) To complement Figure 3, some of the major solutions are listed again below (in a more readable form). Table 3: Some Major CEP components Apache Samza - DSCP Apache Spark - DSCP Apache Storm - DSCP Apache S4 - DSCP Codehaus/EsperTech s Esper, Nesper - Native CEP RedHat Drools Fusion/JBoss Enterprise BRMS - Native CEP, Comes with a Rule Engine DataTorrent RTS (Real-Time Streaming) - a DSCP/CEP Hybrid FeedZai Pulse - Operational Intelligence SQLStream s-server - Operational Intelligence Vitria Technology Operational Intelligence Analytic Server - Operational Intelligence IBM InfoSphere Streams - Commercial, DSCP/CEP Hybrid IBM Operational Decision Manager (ODM) - Commercial, Comes with a Rule Engine SAP Event Stream Processor (ESP) - Commercial Software AG Apama Event Processing Platform - Commercial Tibco BusinessEvents - Commercial, Comes with a Rule Engine Tibco StreamBase - Commercial Note: Operational intelligence (OI) refers to a type of real-time dynamic business analytics solutions that provide visibility/insight into the data, streaming events, and business operations. OI solutions execute queries against streaming data feeds and event data to deliver real-time analytic results that are referred to as operational instructions. The SW systems depicted in Figure 3 and Table 3 do provide a wide range of different features and techniques, but do all perform CEP in a technical sense. To reiterate, CEP (or ESP) describes a practice where incoming data (event data) is processed almost instantly to generate higher-level, useful/valuable, summarized information (labeled the complex events). Event processing platforms provide embedded capabilities for filtering the incoming data, storing windows of event data, compute aggregates, and detecting patterns. In a more formal context, CEP SW reflects any application that is capable of generating, reading, discarding, or performing calculations on complex events. A complex event is an abstraction of 1 or more base (input) events. Complex events may signify threats or opportunities that require a response. One complex event may be the result of performing some form of calculation on either a few or on millions of base events that have 1 or more input. Some of the more popular commercial products are IBM s ODM and InfoSphere Streams, SAP s ESP, Software AG s Apama, or TIBCO s Business Events and StreamBase. These products reflect comprehensive

8 development and runtime solutions that include adapters to integrate the CEP environment with various event sources, dashboards, and alert systems or administration tools. Other event processing platforms are combined with features such as query capabilities, reporting, interactive analytics, alerts, or key performance indicator tools and are specifically directed at operational intelligence. Examples include FeedZai Pulse, SQLStream s-server, or Vitria Technology s Operational Intelligence. On the DSCP side, Apache products such as Samza, Spark, Storm, or S4 represent general-purpose streaming platforms that do not provide a native CEP (analytic function) engine. These solutions though are highly scalable (horizontally) and developers have the opportunity to add the necessary (CEP) logic to address various types of streams processing related problems. The DSCPs can be merged with native CEP solutions such as Esper/Nesper or RedHat s Drools Fusion/JBoss Enterprise BRMS (both of which are open-sourced native CEP systems) to deploy comprehensive, distributed, saleable CEP solutions. Depending on the source, DataTorrent s RTS and IBM s InfoSphere Streams are classified as either a DSCP or a CEP solution (aka a hybrid). Further, some event processing platforms are bundled with an actual rule engine (such as IBM s ODM, RedHat Drools Fusion/JBoss Enterprise BRMS, or TIBCO Business Events). It has to be stated that the increasing popularity of CEP solutions is mainly due to the fact that this approach basically reflects the only way to absorb and process information from various event streams in real or near-real time. Any CEP target solution has to process the event data as the data arrives so that appropriate actions can swiftly be taken. When it comes to CEP, there really is no one-size-fits-all approach. All the solutions discussed in this article reflect general purpose development and runtime tools that can be used by the developers to design and implement customized event processing applications. What is provided is basically the core algorithms to process event streams. Some companies use CEP as part of a larger application solution. Some companies acquire a packaged application or subscribe to SaaS services that have CEP embedded under the hood. So a company may purchase a solution that happens to require event processing and the company may not even be aware that CEP is being used. For example, supply chain visibility products, security information and even management (SIEM) solutions, fraud detection and governance applications, risk and compliance (GRC) products, systems and network monitoring solutions or business activity monitoring (BAM) tools do implement CEP to a certain degree. But, an ever growing number of companies do need the kind of event processing subsystems discussed in this article to support their high-throughput and low-latency applications where the customized event processing logic is paramount to their unique business problems. In any case, a comprehensive understanding of the actual data flow is a necessity to even consider commencing a CEP project. Further, the business and systems (including performance, security, and salability) goals and objectives have to be fully understood and have to be designed into the solution. References 1. J. Krumeich, B. Weis, D. Werth and P. Loos: "Event-Driven Business Process Management: where are we now?: A comprehensive synthesis and analysis of literature", Business Process Management Journal, Luckham, David C. Event Processing for Business: Organizing the Real-Time Enterprise. Hoboken, New Jersey: John Wiley & Sons, P. Vincent, CEP Blog, Mulesoft, Twitter Complex Event Processing (CEP) with Esper and Drools, Thomas Vial, The Esper CEP Ecosystem, Run Esper with Storm, Storm & Esper, Distributed Cache to scale CEP, Srinath Perera, CEP Blog, Mathieu Despriee, CEP, Paul Vincent, CEP Market Players, CEP Wikis, www, World Wide Web CEP resources in general, 2015

A Brief Introduction to Apache Tez

A Brief Introduction to Apache Tez A Brief Introduction to Apache Tez Introduction It is a fact that data is basically the new currency of the modern business world. Companies that effectively maximize the value of their data (extract value

More information

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January 2015. Email: [email protected] Website: www.qburst.com

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January 2015. Email: bdg@qburst.com Website: www.qburst.com Lambda Architecture Near Real-Time Big Data Analytics Using Hadoop January 2015 Contents Overview... 3 Lambda Architecture: A Quick Introduction... 4 Batch Layer... 4 Serving Layer... 4 Speed Layer...

More information

The Synergy of SOA, Event-Driven Architecture (EDA), and Complex Event Processing (CEP)

The Synergy of SOA, Event-Driven Architecture (EDA), and Complex Event Processing (CEP) The Synergy of SOA, Event-Driven Architecture (EDA), and Complex Event Processing (CEP) Gerhard Bayer Senior Consultant International Systems Group, Inc. [email protected] http://www.isg-inc.com Table

More information

WHITE PAPER. Enabling predictive analysis in service oriented BPM solutions.

WHITE PAPER. Enabling predictive analysis in service oriented BPM solutions. WHITE PAPER Enabling predictive analysis in service oriented BPM solutions. Summary Complex Event Processing (CEP) is a real time event analysis, correlation and processing mechanism that fits in seamlessly

More information

BIG DATA ANALYTICS For REAL TIME SYSTEM

BIG DATA ANALYTICS For REAL TIME SYSTEM BIG DATA ANALYTICS For REAL TIME SYSTEM Where does big data come from? Big Data is often boiled down to three main varieties: Transactional data these include data from invoices, payment orders, storage

More information

White Paper. How Streaming Data Analytics Enables Real-Time Decisions

White Paper. How Streaming Data Analytics Enables Real-Time Decisions White Paper How Streaming Data Analytics Enables Real-Time Decisions Contents Introduction... 1 What Is Streaming Analytics?... 1 How Does SAS Event Stream Processing Work?... 2 Overview...2 Event Stream

More information

Unified Batch & Stream Processing Platform

Unified Batch & Stream Processing Platform Unified Batch & Stream Processing Platform Himanshu Bari Director Product Management Most Big Data Use Cases Are About Improving/Re-write EXISTING solutions To KNOWN problems Current Solutions Were Built

More information

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015 Pulsar Realtime Analytics At Scale Tony Ng April 14, 2015 Big Data Trends Bigger data volumes More data sources DBs, logs, behavioral & business event streams, sensors Faster analysis Next day to hours

More information

FINANCIAL SERVICES: FRAUD MANAGEMENT A solution showcase

FINANCIAL SERVICES: FRAUD MANAGEMENT A solution showcase FINANCIAL SERVICES: FRAUD MANAGEMENT A solution showcase TECHNOLOGY OVERVIEW FRAUD MANAGE- MENT REFERENCE ARCHITECTURE This technology overview describes a complete infrastructure and application re-architecture

More information

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control EP/K006487/1 UK PI: Prof Gareth Taylor (BU) China PI: Prof Yong-Hua Song (THU) Consortium UK Members: Brunel University

More information

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM

A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, [email protected] Assistant Professor, Information

More information

Getting Real Real Time Data Integration Patterns and Architectures

Getting Real Real Time Data Integration Patterns and Architectures Getting Real Real Time Data Integration Patterns and Architectures Nelson Petracek Senior Director, Enterprise Technology Architecture Informatica Digital Government Institute s Enterprise Architecture

More information

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase Architectural patterns for building real time applications with Apache HBase Andrew Purtell Committer and PMC, Apache HBase Who am I? Distributed systems engineer Principal Architect in the Big Data Platform

More information

Data Stream Ingestion & Complex Event Processing Systems for Data Driven Decisions. White Paper. www.htcinc.com

Data Stream Ingestion & Complex Event Processing Systems for Data Driven Decisions. White Paper. www.htcinc.com 01 0110 0001 01101 Data Stream Ingestion & Complex Event Processing Systems for Data Driven Decisions White Paper www.htcinc.com Contents 1. Introduction... 2 1.1 What are Event Patterns?... 3 2. Stream

More information

Intelligent Business Operations and Big Data. 2014 Software AG. All rights reserved.

Intelligent Business Operations and Big Data. 2014 Software AG. All rights reserved. Intelligent Business Operations and Big Data 1 What is Big Data? Big data is a popular term used to acknowledge the exponential growth, availability and use of information in the data-rich landscape of

More information

Conjugating data mood and tenses: Simple past, infinite present, fast continuous, simpler imperative, conditional future perfect

Conjugating data mood and tenses: Simple past, infinite present, fast continuous, simpler imperative, conditional future perfect Matteo Migliavacca (mm53@kent) School of Computing Conjugating data mood and tenses: Simple past, infinite present, fast continuous, simpler imperative, conditional future perfect Simple past - Traditional

More information

MASSIF: A Highly Scalable SIEM

MASSIF: A Highly Scalable SIEM MASSIF: A Highly Scalable SIEM Ricardo Jimenez-Peris Univ. Politecnica de Madrid (UPM) [email protected] DEMONS Workshop Berlin, April 25 th 2012 MASSIF in a Nutshell MASSIF aims at developing the next

More information

Apache Kafka Your Event Stream Processing Solution

Apache Kafka Your Event Stream Processing Solution 01 0110 0001 01101 Apache Kafka Your Event Stream Processing Solution White Paper www.htcinc.com Contents 1. Introduction... 2 1.1 What are Business Events?... 2 1.2 What is a Business Data Feed?... 2

More information

CAPTURING & PROCESSING REAL-TIME DATA ON AWS

CAPTURING & PROCESSING REAL-TIME DATA ON AWS CAPTURING & PROCESSING REAL-TIME DATA ON AWS @ 2015 Amazon.com, Inc. and Its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent

More information

SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON

SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON SQLstream Blaze and Apache Storm A BENCHMARK COMPARISON 2 The V of Big Data Velocity means both how fast data is being produced and how fast the data must be processed to meet demand. Gartner The emergence

More information

Streaming Big Data Performance Benchmark. for

Streaming Big Data Performance Benchmark. for Streaming Big Data Performance Benchmark for 2 The V of Big Data Velocity means both how fast data is being produced and how fast the data must be processed to meet demand. Gartner Static Big Data is a

More information

Kafka & Redis for Big Data Solutions

Kafka & Redis for Big Data Solutions Kafka & Redis for Big Data Solutions Christopher Curtin Head of Technical Research @ChrisCurtin About Me 25+ years in technology Head of Technical Research at Silverpop, an IBM Company (14 + years at Silverpop)

More information

BASHO DATA PLATFORM SIMPLIFIES BIG DATA, IOT, AND HYBRID CLOUD APPS

BASHO DATA PLATFORM SIMPLIFIES BIG DATA, IOT, AND HYBRID CLOUD APPS WHITEPAPER BASHO DATA PLATFORM BASHO DATA PLATFORM SIMPLIFIES BIG DATA, IOT, AND HYBRID CLOUD APPS INTRODUCTION Big Data applications and the Internet of Things (IoT) are changing and often improving our

More information

How To Make Data Streaming A Real Time Intelligence

How To Make Data Streaming A Real Time Intelligence REAL-TIME OPERATIONAL INTELLIGENCE Competitive advantage from unstructured, high-velocity log and machine Big Data 2 SQLstream: Our s-streaming products unlock the value of high-velocity unstructured log

More information

NEEDLE STACKS & BIG DATA: USING EVENT STREAM PROCESSING FOR RISK, SURVEILLANCE & SECURITY ANALYTICS IN CAPITAL MARKETS

NEEDLE STACKS & BIG DATA: USING EVENT STREAM PROCESSING FOR RISK, SURVEILLANCE & SECURITY ANALYTICS IN CAPITAL MARKETS NEEDLE STACKS & BIG DATA: USING PROCESSING FOR RISK, SURVEILLANCE & SECURITY ANALYTICS IN CAPITAL MARKETS JERRY BAULIER, DIRECTOR, PROCESSING DAVID M. WALLACE, GLOBAL FINANCIAL SERVICES MARKETING MANAGER

More information

A Guide Through the BPM Maze

A Guide Through the BPM Maze A Guide Through the BPM Maze WHAT TO LOOK FOR IN A COMPLETE BPM SOLUTION With multiple vendors, evolving standards, and ever-changing requirements, it becomes difficult to recognize what meets your BPM

More information

Streaming Big Data Performance Benchmark for Real-time Log Analytics in an Industry Environment

Streaming Big Data Performance Benchmark for Real-time Log Analytics in an Industry Environment Streaming Big Data Performance Benchmark for Real-time Log Analytics in an Industry Environment SQLstream s-server The Streaming Big Data Engine for Machine Data Intelligence 2 SQLstream proves 15x faster

More information

BIG DATA-AS-A-SERVICE

BIG DATA-AS-A-SERVICE White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers

More information

Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings

Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings Solution Brief Enabling Database-as-a-Service (DBaaS) within Enterprises or Cloud Offerings Introduction Accelerating time to market, increasing IT agility to enable business strategies, and improving

More information

STREAM ANALYTIX. Industry s only Multi-Engine Streaming Analytics Platform

STREAM ANALYTIX. Industry s only Multi-Engine Streaming Analytics Platform STREAM ANALYTIX Industry s only Multi-Engine Streaming Analytics Platform One Platform for All Create real-time streaming data analytics applications in minutes with a powerful visual editor Get a wide

More information

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time

How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time SCALEOUT SOFTWARE How In-Memory Data Grids Can Analyze Fast-Changing Data in Real Time by Dr. William Bain and Dr. Mikhail Sobolev, ScaleOut Software, Inc. 2012 ScaleOut Software, Inc. 12/27/2012 T wenty-first

More information

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015 Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL May 2015 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document

More information

Information Technology Policy

Information Technology Policy Information Technology Policy Security Information and Event Management Policy ITP Number Effective Date ITP-SEC021 October 10, 2006 Category Supersedes Recommended Policy Contact Scheduled Review [email protected]

More information

Streaming Analytics A Framework for Innovation

Streaming Analytics A Framework for Innovation Streaming Analytics A Framework for Innovation Jan Humble Solutions Architect 1 Volume and Scale of Sensing Data Can you TURN IT ON? Can you Identify Insights in REAL-TIME? Can you REACT and ENGAGE in

More information

Big Data Analytics - Accelerated. stream-horizon.com

Big Data Analytics - Accelerated. stream-horizon.com Big Data Analytics - Accelerated stream-horizon.com StreamHorizon & Big Data Integrates into your Data Processing Pipeline Seamlessly integrates at any point of your your data processing pipeline Implements

More information

From Spark to Ignition:

From Spark to Ignition: From Spark to Ignition: Fueling Your Business on Real-Time Analytics Eric Frenkiel, MemSQL CEO June 29, 2015 San Francisco, CA What s in Store For This Presentation? 1. MemSQL: A real-time database for

More information

3 Reasons Enterprises Struggle with Storm & Spark Streaming and Adopt DataTorrent RTS

3 Reasons Enterprises Struggle with Storm & Spark Streaming and Adopt DataTorrent RTS . 3 Reasons Enterprises Struggle with Storm & Spark Streaming and Adopt DataTorrent RTS Deliver fast actionable business insights for data scientists, rapid application creation for developers and enterprise-grade

More information

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84 Index A Amazon Web Services (AWS), 50, 58 Analytics engine, 21 22 Apache Kafka, 38, 131 Apache S4, 38, 131 Apache Sqoop, 37, 131 Appliance pattern, 104 105 Application architecture, big data analytics

More information

Azure Scalability Prescriptive Architecture using the Enzo Multitenant Framework

Azure Scalability Prescriptive Architecture using the Enzo Multitenant Framework Azure Scalability Prescriptive Architecture using the Enzo Multitenant Framework Many corporations and Independent Software Vendors considering cloud computing adoption face a similar challenge: how should

More information

Syslog Analyzer ABOUT US. Member of the TeleManagement Forum. [email protected] +1-916-290-9300 http://www.ossera.com

Syslog Analyzer ABOUT US. Member of the TeleManagement Forum. info@ossera.com +1-916-290-9300 http://www.ossera.com Syslog Analyzer ABOUT US OSSera, Inc. is a global provider of Operational Support System (OSS) solutions for IT organizations, service planning, service operations, and network operations. OSSera's multithreaded

More information

Find the Information That Matters. Visualize Your Data, Your Way. Scalable, Flexible, Global Enterprise Ready

Find the Information That Matters. Visualize Your Data, Your Way. Scalable, Flexible, Global Enterprise Ready Real-Time IoT Platform Solutions for Wireless Sensor Networks Find the Information That Matters ViZix is a scalable, secure, high-capacity platform for Internet of Things (IoT) business solutions that

More information

GigaSpaces Real-Time Analytics for Big Data

GigaSpaces Real-Time Analytics for Big Data GigaSpaces Real-Time Analytics for Big Data GigaSpaces makes it easy to build and deploy large-scale real-time analytics systems Rapidly increasing use of large-scale and location-aware social media and

More information

Streaming items through a cluster with Spark Streaming

Streaming items through a cluster with Spark Streaming Streaming items through a cluster with Spark Streaming Tathagata TD Das @tathadas CME 323: Distributed Algorithms and Optimization Stanford, May 6, 2015 Who am I? > Project Management Committee (PMC) member

More information

Event based Enterprise Service Bus (ESB)

Event based Enterprise Service Bus (ESB) Event based Enterprise Service Bus (ESB) By: Kasun Indrasiri 128213m Supervised By: Dr. Srinath Perera Dr. Sanjiva Weerawarna Abstract With the increasing adaptation of Service Oriented Architecture for

More information

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data?

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? December, 2014 What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? Glenn Anderson IBM Lab Services and Training Today s mainframe is a hybrid system z/os Linux on Sys z DB2 Analytics Accelerator

More information

Architectures for massive data management

Architectures for massive data management Architectures for massive data management Apache Kafka, Samza, Storm Albert Bifet [email protected] October 20, 2015 Stream Engine Motivation Digital Universe EMC Digital Universe with

More information

Parallel Databases. Parallel Architectures. Parallelism Terminology 1/4/2015. Increase performance by performing operations in parallel

Parallel Databases. Parallel Architectures. Parallelism Terminology 1/4/2015. Increase performance by performing operations in parallel Parallel Databases Increase performance by performing operations in parallel Parallel Architectures Shared memory Shared disk Shared nothing closely coupled loosely coupled Parallelism Terminology Speedup:

More information

Real Time Fraud Detection With Sequence Mining on Big Data Platform. Pranab Ghosh Big Data Consultant IEEE CNSV meeting, May 6 2014 Santa Clara, CA

Real Time Fraud Detection With Sequence Mining on Big Data Platform. Pranab Ghosh Big Data Consultant IEEE CNSV meeting, May 6 2014 Santa Clara, CA Real Time Fraud Detection With Sequence Mining on Big Data Platform Pranab Ghosh Big Data Consultant IEEE CNSV meeting, May 6 2014 Santa Clara, CA Open Source Big Data Eco System Query (NOSQL) : Cassandra,

More information

THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS

THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS THE DEVELOPER GUIDE TO BUILDING STREAMING DATA APPLICATIONS WHITE PAPER Successfully writing Fast Data applications to manage data generated from mobile, smart devices and social interactions, and the

More information

Putting Apache Kafka to Use!

Putting Apache Kafka to Use! Putting Apache Kafka to Use! Building a Real-time Data Platform for Event Streams! JAY KREPS, CONFLUENT! A Couple of Themes! Theme 1: Rise of Events! Theme 2: Immutability Everywhere! Level! Example! Immutable

More information

TIBCO Live Datamart: Push-Based Real-Time Analytics

TIBCO Live Datamart: Push-Based Real-Time Analytics TIBCO Live Datamart: Push-Based Real-Time Analytics ABSTRACT TIBCO Live Datamart is a new approach to real-time analytics and data warehousing for environments where large volumes of data require a management

More information

The Big Data Ecosystem at LinkedIn Roshan Sumbaly, Jay Kreps, and Sam Shah LinkedIn

The Big Data Ecosystem at LinkedIn Roshan Sumbaly, Jay Kreps, and Sam Shah LinkedIn The Big Data Ecosystem at LinkedIn Roshan Sumbaly, Jay Kreps, and Sam Shah LinkedIn Presented by :- Ishank Kumar Aakash Patel Vishnu Dev Yadav CONTENT Abstract Introduction Related work The Ecosystem Ingress

More information

Real-time Big Data Analytics with Storm

Real-time Big Data Analytics with Storm Ron Bodkin Founder & CEO, Think Big June 2013 Real-time Big Data Analytics with Storm Leading Provider of Data Science and Engineering Services Accelerating Your Time to Value IMAGINE Strategy and Roadmap

More information

Scalable Architecture on Amazon AWS Cloud

Scalable Architecture on Amazon AWS Cloud Scalable Architecture on Amazon AWS Cloud Kalpak Shah Founder & CEO, Clogeny Technologies [email protected] 1 * http://www.rightscale.com/products/cloud-computing-uses/scalable-website.php 2 Architect

More information

Energy Efficient MapReduce

Energy Efficient MapReduce Energy Efficient MapReduce Motivation: Energy consumption is an important aspect of datacenters efficiency, the total power consumption in the united states has doubled from 2000 to 2005, representing

More information

Intelligent Business Operations

Intelligent Business Operations Intelligent Business Operations Echtzeit-Datenanalyse und Aktionen im Zusammenspiel Dr. Jürgen Krämer VP Product Strategy IBO & Product Management Apama 23.06.2014 Helping Organizations Transform into

More information

Monitoring the Real End User Experience

Monitoring the Real End User Experience An AppDynamics Business White Paper HOW MUCH REVENUE DOES IT GENERATE? Monitoring the Real End User Experience Web application performance is fundamentally associated in the mind of the end user; with

More information

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data?

What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? Glenn Anderson, IBM Lab Services and Training What s Happening to the Mainframe? Mobile? Social? Cloud? Big Data? Summer SHARE August 2014 Session 15595 (c) Copyright 2014 IBM Corporation 1 Today s mainframe

More information

Open Source Business Rules Management System Enables Active Decisions

Open Source Business Rules Management System Enables Active Decisions JBoss Enterprise BRMS Open Source Business Rules Management System Enables Active Decisions What is it? JBoss Enterprise BRMS provides an open source business rules management system that enables active

More information

A Novel Cloud Based Elastic Framework for Big Data Preprocessing

A Novel Cloud Based Elastic Framework for Big Data Preprocessing School of Systems Engineering A Novel Cloud Based Elastic Framework for Big Data Preprocessing Omer Dawelbeit and Rachel McCrindle October 21, 2014 University of Reading 2008 www.reading.ac.uk Overview

More information

Introduction to Apache Kafka And Real-Time ETL. for Oracle DBAs and Data Analysts

Introduction to Apache Kafka And Real-Time ETL. for Oracle DBAs and Data Analysts Introduction to Apache Kafka And Real-Time ETL for Oracle DBAs and Data Analysts 1 About Myself Gwen Shapira System Architect @Confluent Committer @ Apache Kafka, Apache Sqoop Author of Hadoop Application

More information

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time?

Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time? Hadoop and Data Warehouse Friends, Enemies or Profiteers? What about Real Time? Kai Wähner [email protected] @KaiWaehner www.kai-waehner.de Disclaimer! These opinions are my own and do not necessarily

More information

CASE STUDY: Oracle TimesTen In-Memory Database and Shared Disk HA Implementation at Instance level. -ORACLE TIMESTEN 11gR1

CASE STUDY: Oracle TimesTen In-Memory Database and Shared Disk HA Implementation at Instance level. -ORACLE TIMESTEN 11gR1 CASE STUDY: Oracle TimesTen In-Memory Database and Shared Disk HA Implementation at Instance level -ORACLE TIMESTEN 11gR1 CASE STUDY Oracle TimesTen In-Memory Database and Shared Disk HA Implementation

More information

Application Performance Management for Enterprise Applications

Application Performance Management for Enterprise Applications Application Performance Management for Enterprise Applications White Paper from ManageEngine Web: Email: [email protected] Table of Contents 1. Introduction 2. Types of applications used

More information

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013

SAP HANA SAP s In-Memory Database. Dr. Martin Kittel, SAP HANA Development January 16, 2013 SAP HANA SAP s In-Memory Database Dr. Martin Kittel, SAP HANA Development January 16, 2013 Disclaimer This presentation outlines our general product direction and should not be relied on in making a purchase

More information

Mitra Innovation Leverages WSO2's Open Source Middleware to Build BIM Exchange Platform

Mitra Innovation Leverages WSO2's Open Source Middleware to Build BIM Exchange Platform Mitra Innovation Leverages WSO2's Open Source Middleware to Build BIM Exchange Platform May 2015 Contents 1. Introduction... 3 2. What is BIM... 3 2.1. History of BIM... 3 2.2. Why Implement BIM... 4 2.3.

More information

Luncheon Webinar Series May 13, 2013

Luncheon Webinar Series May 13, 2013 Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration

More information

BigData. An Overview of Several Approaches. David Mera 16/12/2013. Masaryk University Brno, Czech Republic

BigData. An Overview of Several Approaches. David Mera 16/12/2013. Masaryk University Brno, Czech Republic BigData An Overview of Several Approaches David Mera Masaryk University Brno, Czech Republic 16/12/2013 Table of Contents 1 Introduction 2 Terminology 3 Approaches focused on batch data processing MapReduce-Hadoop

More information

The IBM Cognos Platform for Enterprise Business Intelligence

The IBM Cognos Platform for Enterprise Business Intelligence The IBM Cognos Platform for Enterprise Business Intelligence Highlights Optimize performance with in-memory processing and architecture enhancements Maximize the benefits of deploying business analytics

More information

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed

More information

A Vision for Operational Analytics as the Enabler for Business Focused Hybrid Cloud Operations

A Vision for Operational Analytics as the Enabler for Business Focused Hybrid Cloud Operations A Vision for Operational Analytics as the Enabler for Focused Hybrid Cloud Operations As infrastructure and applications have evolved from legacy to modern technologies with the evolution of Hybrid Cloud

More information

Oracle SOA Suite: The Evaluation from 10g to 11g

Oracle SOA Suite: The Evaluation from 10g to 11g KATTA Durga Reddy TATA Consultancy Services. Oracle SOA Suite: The Evaluation from 10g to 11g Introduction Oracle SOA Suite is an essential middleware layer of Oracle Fusion Middleware. It provides a complete

More information

Hybrid Software Architectures for Big Data. [email protected] @hurence http://www.hurence.com

Hybrid Software Architectures for Big Data. Laurence.Hubert@hurence.com @hurence http://www.hurence.com Hybrid Software Architectures for Big Data [email protected] @hurence http://www.hurence.com Headquarters : Grenoble Pure player Expert level consulting Training R&D Big Data X-data hot-line

More information

Apache Flink Next-gen data analysis. Kostas Tzoumas [email protected] @kostas_tzoumas

Apache Flink Next-gen data analysis. Kostas Tzoumas ktzoumas@apache.org @kostas_tzoumas Apache Flink Next-gen data analysis Kostas Tzoumas [email protected] @kostas_tzoumas What is Flink Project undergoing incubation in the Apache Software Foundation Originating from the Stratosphere research

More information

SAP HANA SPS 09 - What s New? Smart Data Streaming

SAP HANA SPS 09 - What s New? Smart Data Streaming SAP HANA SPS 09 - What s New? Smart Data Streaming (Delta from SPS08 to SPS09) SAP HANA Product Management November, 2014 2014 SAP AG or an SAP affiliate company. All rights reserved. 1 Agenda Introduction

More information

SAP HANA PLATFORM Top Ten Questions for Choosing In-Memory Databases. Start Here

SAP HANA PLATFORM Top Ten Questions for Choosing In-Memory Databases. Start Here PLATFORM Top Ten Questions for Choosing In-Memory Databases Start Here PLATFORM Top Ten Questions for Choosing In-Memory Databases. Are my applications accelerated without manual intervention and tuning?.

More information

Performance & Scalability Characterization. By Richard Tibbetts Co-Founder and Chief Architect, StreamBase Systems, Inc.

Performance & Scalability Characterization. By Richard Tibbetts Co-Founder and Chief Architect, StreamBase Systems, Inc. Performance & Scalability Characterization By Richard Tibbetts Co-Founder and Chief Architect, StreamBase Systems, Inc. Motivation for Performance and Scalability in a CEP Engine CEP engines can be applied

More information

The Big Data Ecosystem at LinkedIn. Presented by Zhongfang Zhuang

The Big Data Ecosystem at LinkedIn. Presented by Zhongfang Zhuang The Big Data Ecosystem at LinkedIn Presented by Zhongfang Zhuang Based on the paper The Big Data Ecosystem at LinkedIn, written by Roshan Sumbaly, Jay Kreps, and Sam Shah. The Ecosystems Hadoop Ecosystem

More information

Understanding Data Locality in VMware Virtual SAN

Understanding Data Locality in VMware Virtual SAN Understanding Data Locality in VMware Virtual SAN July 2014 Edition T E C H N I C A L M A R K E T I N G D O C U M E N T A T I O N Table of Contents Introduction... 2 Virtual SAN Design Goals... 3 Data

More information

Implement Hadoop jobs to extract business value from large and varied data sets

Implement Hadoop jobs to extract business value from large and varied data sets Hadoop Development for Big Data Solutions: Hands-On You Will Learn How To: Implement Hadoop jobs to extract business value from large and varied data sets Write, customize and deploy MapReduce jobs to

More information

Developing an Application Tracing Utility for Mule ESB Application on EL (Elastic Search, Log stash) Stack Using AOP

Developing an Application Tracing Utility for Mule ESB Application on EL (Elastic Search, Log stash) Stack Using AOP Developing an Application Tracing Utility for Mule ESB Application on EL (Elastic Search, Log stash) Stack Using AOP Mohan Bandaru, Amarendra Kothalanka, Vikram Uppala Student, Department of Computer Science

More information

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica

More information

Reference Architecture, Requirements, Gaps, Roles

Reference Architecture, Requirements, Gaps, Roles Reference Architecture, Requirements, Gaps, Roles The contents of this document are an excerpt from the brainstorming document M0014. The purpose is to show how a detailed Big Data Reference Architecture

More information

TIBCO EVENT PROCESSING IN THE FAST DATA ARCHITECTURE OPERATIONAL INTELLIGENCE PLATFORM. TIBCO Live Datamart Continuous Query Processor

TIBCO EVENT PROCESSING IN THE FAST DATA ARCHITECTURE OPERATIONAL INTELLIGENCE PLATFORM. TIBCO Live Datamart Continuous Query Processor TIBCO BusinessEvents BENEFITS Breadth of capabilities that help organizations quickly build solutions for difficult, complex event processing Improved productivity and effective problemsolving as a result

More information

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!!

Simplifying Big Data Analytics: Unifying Batch and Stream Processing. John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!! Simplifying Big Data Analytics: Unifying Batch and Stream Processing John Fanelli,! VP Product! In-Memory Compute Summit! June 30, 2015!! Streaming Analy.cs S S S Scale- up Database Data And Compute Grid

More information

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence

Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence Augmented Search for Web Applications New frontier in big log data analysis and application intelligence Business white paper May 2015 Web applications are the most common business applications today.

More information

Architectures for Big Data Analytics A database perspective

Architectures for Big Data Analytics A database perspective Architectures for Big Data Analytics A database perspective Fernando Velez Director of Product Management Enterprise Information Management, SAP June 2013 Outline Big Data Analytics Requirements Spectrum

More information

HiBench Introduction. Carson Wang ([email protected]) Software & Services Group

HiBench Introduction. Carson Wang (carson.wang@intel.com) Software & Services Group HiBench Introduction Carson Wang ([email protected]) Agenda Background Workloads Configurations Benchmark Report Tuning Guide Background WHY Why we need big data benchmarking systems? WHAT What is

More information

From the Monolith to Microservices: Evolving Your Architecture to Scale. Randy Shoup @randyshoup linkedin.com/in/randyshoup

From the Monolith to Microservices: Evolving Your Architecture to Scale. Randy Shoup @randyshoup linkedin.com/in/randyshoup From the Monolith to Microservices: Evolving Your Architecture to Scale Randy Shoup @randyshoup linkedin.com/in/randyshoup Background Consulting CTO at Randy Shoup Consulting o o Helping companies from

More information

secure intelligence collection and assessment system Your business technologists. Powering progress

secure intelligence collection and assessment system Your business technologists. Powering progress secure intelligence collection and assessment system Your business technologists. Powering progress The decisive advantage for intelligence services The rising mass of data items from multiple sources

More information

DataStax Enterprise, powered by Apache Cassandra (TM)

DataStax Enterprise, powered by Apache Cassandra (TM) PerfAccel (TM) Performance Benchmark on Amazon: DataStax Enterprise, powered by Apache Cassandra (TM) Disclaimer: All of the documentation provided in this document, is copyright Datagres Technologies

More information

How To Write A Trusted Analytics Platform (Tap)

How To Write A Trusted Analytics Platform (Tap) Trusted Analytics Platform (TAP) TAP Technical Brief October 2015 TAP Technical Brief Overview Trusted Analytics Platform (TAP) is open source software, optimized for performance and security, that accelerates

More information

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum

Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Siva Ravada Senior Director of Development Oracle Spatial and MapViewer 2 Evolving Technology Platforms

More information

Big Data With Hadoop

Big Data With Hadoop With Saurabh Singh [email protected] The Ohio State University February 11, 2016 Overview 1 2 3 Requirements Ecosystem Resilient Distributed Datasets (RDDs) Example Code vs Mapreduce 4 5 Source: [Tutorials

More information

Azure Data Lake Analytics

Azure Data Lake Analytics Azure Data Lake Analytics Compose and orchestrate data services at scale Fully managed service to support orchestration of data movement and processing Connect to relational or non-relational data

More information