Data Challenges in Telecommunications Networks and a Big Data Solution

Size: px
Start display at page:

Download "Data Challenges in Telecommunications Networks and a Big Data Solution"

Transcription

1 Data Challenges in Telecommunications Networks and a Big Data Solution Abstract The telecom networks generate multitudes and large sets of data related to networks, applications, users, network operations and call processing. This large data set has the capability to give valuable business insights - for example, real-time user quality of service (QoS), network issues, customer satisfaction index, customer churn, network capacity forecast and many more revenue impacting insights. The traditional telecom application architectures and solutions utilize central network management & monitoring servers, RDBM databases in operator s Network Operations Center (NOC). These systems are not designed for high scale and processing large sets of data for deeper insights and analytics, due to which such data intelligence is either lost or not fully utilized. Insight from this data can be utilized to improve user QoS, ensure conformance to SLAs, launch new services and products based on subscriber preferences, and their location and social preferences and based on their associations to other subscribers. This white paper will present Incedo s solution architecture using modern Big Data systems and frameworks for mining, reporting and presenting insights into ever increasing very large sets of data using open source and commercial tools that employ massively parallel processing engines and distributed databases, with focus to retain operator s Capex and Opex lower while providing deeper insights into their own data.

2 Introduction In this paper, we present various sources of data that s present within a telecom network data that s generated from end devices, network elements and management systems and describe the traditional data analytics solutions. We will show how and why these traditional systems lack in providing deeper insights into the data, due to both the inefficient data storage and processing architectures and lack of rich sets of analytical tools mainly due to inefficient data processing systems. We also present how these architecture and system level issues are addressed by modern Big Data architectures and frameworks. We then present Incedo s Big Data solution for telecom data, with information of different frameworks and tools that are available currently and the various choices we made in our solution. We present Incedo s solution that addresses both the high demands of data storage and analytics of very large scale data using a mix of open source and commercial based massively parallel processing frameworks and distributed data storage systems, to provide rich and realtime insights into Operator s own network data with lower Capex and Opex. Data in Telecom Networks and its Uses network failures, real-time measurements of subscriber call QoS, and monitoring and ensuring SLA conformance. The data from call data records in wireless networks, and session data records in VoIP networks can be analyzed for insights into real- time fraud detection, for example based on signaling message redirections, sim swap detection. The same data, along with external data sources like geographical information, social networks can be analyzed to prepare tailored marketing campaigns targeted to specific subscribers, in real-time, based on their location, preferences, and based on their social network associations. In addition, the data analysis can give insights for the operator and help in innovation of new products based on the stories the data is telling about users, applications, preferences. Telecommunications networks, both wireline and wireless networks, ranging from TCP/IP networks, VoIP networks overlaid over TCP/IP networks to 3G, 4G wireless access networks generate lot of data data from end devices to network elements. This data has a lot of information embedded into it - data related to network operations, individual users' application preferences, and interests to data that can be inferred about customer satisfaction with the network and operator's service to real-time market forecasts. Operators have this rich set of data which is very powerful and the data is the new gold that can give information and insights into their networks, subscribers, business, and insights into future products that will make the operator ever more successful. The data from telecom networks can be analyzed for network operational issues and o p t i m i z a t i o n s r a n g i n g f r o m r o u t e optimization, automatic re-routing on

3 The data comes from many sources like logs, SDRs/CDRs, events/alarms from network elements; signaling messages and application data streams from subscriber end devices; transactions, billing records, call durations, call pattern data from operator's own OSS/BSS systems; subscriber social preferences, associations data from social media, s, SMS messages. Although the data has rich information and can give deep insights for the operators, the data can run into terabytes to petabytes of data per day based on the network size and the based on different types of data that's gathered. To process such extremely large data, not only the right set of data gathering, processing and analysis tools are required, but also highly scalable and massively parallel processing systems are required. Figure 1: below shows an example of traditional data analytics tools and systems in a typical operator's network operations environment. Figure 1: Traditional data analytics system in telecom networks In the next section, we describe why these traditional data systems as shown in the figure below, cannot do such complex data analytics, and in later sections, we describe how the new big data architectures are the right tools and systems for this need.

4 Data Challenges in Traditional Telecom World As the number of end devices (desktops, laptops, notepads and smartphones) and rich media applications are ever increasing, the wired and wireless networks and network nodes not only have enough challenge in handling the signaling and data from these devices to support the standard services, the amount data that needs to be stored & analyzed by network operators is growing exponentially and the complexity of such analysis is also becoming extremely challenging. Figure 2 below shows some of the challenges in analysis large sets of complex data in traditional data analytics systems. Telecom network elements (for example, enodeb or MME or SGW or PGW in a 4G LTE network) in a typical deployment of few hundred to many hundred cells and few thousands of subscribers will generate log data for recording the ongoing signaling and data activities, to help with troubleshooting the system in case of service or software issues. And these telecom network elements, as a total, can generate many 100s of MBs to GBs of data per hour. Many of the times, all this data will be streamed to servers in Network operations center for live and post processing. Another example is, where these network elements generate logs related to call events (for example, SIP messages related to a call flow) and all intermediate events that occur during the call, including account start/stop events which are written to CDR or SDR records or files. With a telecom system as whole, that can serve millions of calls per hour, the amount of call record data can run into many terabytes per day. Figure 2: Challenges in traditional data analytics systems

5 Retention of such large data, ability to efficiently search and mine such extremely huge data, and generate real-time business reports on such data has been a huge challenge (and in many cases still a challenge in many telecom operator environments), as the data can't be retained beyond few hours to few days, searches and reports take linearly long time, few hours to many hours all discoursing as it impedes real-time operations and business analysis and operators will be unable to take timely recovery actions. beyond which the performance can't be improved. In addition, from software release to release, the diagnostic log format or call record formats keep changing and this information of record format changes are propagated from vendor to operator to adjust their tools, in many cases requiring new software releases in both the places and causing the engineering and deployment overheads associated with new releases. Traditionally, these extreme volumes of data is Also, the tools that parse these logs and call analyzed in a central data center in operator's records are either custom developed by NOC, using stacked server blades and RDBMSs operators or by 3rd parties outsourced to by the for metadata and table data storage and SAN or operator increasing Opex, time to market and RAID arrays for raw data storage. These unavoidable dependencies all causing systems can only provide linear performance increased costs and business risks all till a point, where the CPU and network amounting to Big Data challenges. latencies of SAN become the bottlenecks, How Telecom Traditional Tools Won't Work Traditional data analytics systems in a typical telecom operator deployment, use RDBMs for storing data related to configuration, network discovery, faults and alarms, network diagnostic logs, CDR logs, CDRs, operational metrics, etc. all in the RDBMS in tables. And all this data is destined to operators' EMS/NMS systems. First issue is significant part of the data is nonnetwork management data. For example, although configuration, faults and alarms are data related to network management, data related to operational metrics, CDRs are operations data and give insights into historical operations and helps with troubleshooting networks. The diagnostic logs and CDR logs, media data give real-time streaming information about call quality, real- time network and system issues. In typical telecom network deployments, above 90% of data is non-network management related, but it's being stored at the EMS/NMS systems. As different types of mixed data sets are destined to same system and same RDBMs, with each data set being stored in a separate table, the non-network management tables continuously grow to millions and billions of rows causing the Big Table issue. As the RDBM's table size increases, the storage and search becomes very suboptimal; reports and analytics tools take longer and longer to process the data, essentially incapacitating operators to build deep analytic tools and reports to mine the business and real-time insights.

6 Second issue is, due to sheer volume, data cannot be retained beyond few days on these systems as it starts to hit disk capacity of central NMS/EMS systems, essentially losing insights into historical patterns and long period analytics. Third issue is traditional data analytics systems in most cases are centralized servers and centralized databases in operator's Network Operation Center (NOC) and the system can't be easily scale up. The scale up is typically limited to a front end load balancer with multiple servers but still destined to a single cluster of disk raids and RDBMS instances causing bottlenecks at the data storage layer. Fourth issue is cost associated with RDBMs systems. As the data size and storage needs increase, more instances of RDBMs are required and the associated license costs and server costs increase Capex and Opex. How The New Big Data Systems Can Address the Telecom Data Challenges Effectively Big data challenges faced by telecom operators need the new age Big Data solutions. The first issue described in previous section, the Big Table issue is addressed by Hbase, where a big table with large number of rows is split into regions that are served by the region servers. Regions are vertically divided by column families into Stores, which are internally stored as files in HDFS. Each Region Server is hosted on a clusters, for example multi-node HBase and HDFS clusters in a Big Data architecture. With ever increasing data, the cluster size can be increased to retain large amounts of data for very long periods giving sufficient data for the operators for insights into historical patterns and long period analytics. physically different machine. The third issue of centralized servers and databases which become performance With the new data architectures of HDFS, bottlenecks is addressed by massively parallel Hadoop and Spark that enable massive processing systems that run on multi-node distributed storage, data splitting and parallel clusters. For example, Spark framework, with processing, the challenges of telecom big data cluster of nodes that orchestrate parallel processing are aptly addressed. The Hadoop processing and execution, provides job and task and Spark based systems will scale to provide schedulers that can execute queries in parallel consistent performance even when the on the database region servers and collect the number of data sources and the amount of results. The system also comes with a purpose generated data ever increase all due to built SQL query engine, SparkSQL, which distributed and parallel processing provides traditional SQL syntax but in the architecture. background executes the query in parallel on all The second issue of retention of data is addressed by employing multi-node database region servers in parallel, collects the results and responds to the query.

7 The fourth issue of high cost analytics systems is addressed by open source and cost free databases like HBase and HDFS, which also provide high scalability. Many of the new Big Data systems are open source, and further reduce the Capex and Opex costs for operators, while at the same time addressing the issues present in the traditional data a n a l y t i c s s y s t e m s. T h e s e B i g D a t a architectures also support redundancy, load balancing options further enhancing the system's stability and high availability. In addition, in Big Data systems like in the Spark architecture, the streaming data processing capabilities, iterative nature of map-reduce machine learning capabilities built into its framework, in-memory caching of data leads to many folds improvement in performance, and making it a reality for telecom operators to analyze their networks in near-real-time to real-time. For the rich user interface and dynamic reporting, many tools and platforms like JasperSoft, Pentaho are available for rich set of business intelligence analysis and reporting, which in the backend use SQL and SparkSQL queries to access data and run analytics algorithms to mine the data for intelligence and present in intuitive manner for operational and business insights. An Example Telecom Big Data Solution from Incedo The figure below shows an example solution from Incedo to address the telecom Big Data challenges; and also shows rich operational and business intelligent system for analyzing rich and deep insights into the data. Our solution uses multi-tier Big Data architecture as shown below. In this solution, we address a specific problem of telecom operators, where the issue is all of the data from the data sources is structured data, but the traditional data analytics systems had challenges in analyzing this data due to the Big Table issue and performance bottleneck issues due to central server(s) and database(s), both the issues that were described in previous sections. In this multi-tier Big Data solution, the data sources are the typical historical and streaming structured data in telecom networks like, CDRs, SDRs, logs, events, alarms, CDR logs and user data.

8 Figure 3: Incedo Big Data solution architecture for telecom network data For the Data Ingestion and ETL (extract, transform and load) tier, where the data is extracted, parsed transformed as needed and stored into appropriate data storage and databases for further analysis. For ETL function, there are many tools, both commercial and open source are available. For example, some of the commercial and proprietary ETL tools are Informatica, ODI and Datastage. Some of the ETL tools are open source but incur costs when productized, like the Pentaho ETL, Talend ETL, and then they are fully open source tools like Flume, Kafka. Or one can develop their own custom scripts in Python, Java and other scripting languages. In our solution, we have used custom Spark python scripts, as these scripts were used not only to parse, extract the structured data but also to generate aggregation reports on the fly as they parse the data and store into the databases i.e. one set of reports (aggregate reports) are prepared in-line with the extract and transform function. For Big Data System tier, where the data is stored and processed, there are many choices for the tools and frameworks. For the RDBMS based storage and processing, some of the available options are MariaDB, Inforbright, InfiniDB, Vertica, Amazon Redshift, Hana and Taradata. For the NoSQL based storage and processing some of the available choices are HBase, MongoDB and Cassandra. For simple file system systems, one can use HDFS. For the data processing engines, both open source and commercial tools are available. Apache Spark, for example, is an open source data processing engine while Amazon Elastic Search is a commercial option. Both are equally capable processing engines for Big Data analytics needs.

9 In our solution, we have used Spark clusters architecture components that provide with HBase database clusters on HDFS. For massively parallel processing capabilities with streaming data processing, we have used Spark dynamic scaling, redundancy, load balancing; Stream engine cluster. Main reason for this and one that provides dynamic and real-time selection is to keep costs low and attain high reports that give deep insights into the data processing performance as well as near- operational and business aspects of telecom real-time analysis of streaming data. networks ranging from user QoS, SLA conformance, KPIs, customer retention, real- The Data Access tier options depend on the time revenue forecasts. The solution can be data processing engine option that's selected. easily adapted to unstructured data or a mix of Most data processing engines provide SQL structured and unstructured data and can be syntax for queries, although systems like Spark customized to address other related Big Data provide SparkSQL library which provides Analytics challenges. The solution uses latest standard SQL syntax to an end user but components of open source big data systems to underlying uses Spark multi-node cluster and contain the Capex and Opex of operators, while massive parallel processing architecture of providing rich insights into their data and thus Spark to execute the SQL query on multiple helping Operator success. nodes in parallel, collect the results and report back to user. In our solution, we have used both SparkSQL for the spark subsystem that processes historical structured data, and SQL for the streaming structured data. For Reporting and Visualization tier, some of the available tools and frameworks options are JasperSoft, Pentaho, Kibana, Tableau, MicroStrategy, and Qlikview all of commercial tools. We found these tools are better in presenting and visualizing dynamic and sophisticated reports. Or one can develop a custom web application rather than using a third party Reporting and Visualization tool, although the cost and development time can become disadvantage. In our solution, we have used JasperSoft, as we found it provided the desired dynamic reporting capability with manageable costs. Incedo's Big Data solution for telecom networks provides an end to end solution using highly scalable new and latest data

10 Conclusions The telecom data challenges are real and data is growing larger and larger. With traditional data processing and analytics solutions, operators are unable to effectively use the data and information available to them and incapacitated to find deeper operational and business insights into their own data. Telecom operators need new solutions for the Big Data that their networks generate and the valuable information they have from their own OSS/BSS systems. Incedo solution provides a comprehensive solution that gives insights into operational and business intelligence; with latest and modern open source big data architecture involving massively parallel processing and distributed database frameworks and dynamic reporting and visualization. Thus our solution provides a comprehensive view and gives deep insights into telecom data with lower Capex and Opex costs. References: New-Approach-for-the-Telecom-Industry.pdf t _ html t _ html t _ html

11 NAGESH DEVISETTI Director - Wireless, Communication Engineering nagesh.devisetti@incedoinc.com MANISH GUPTA Vice President - Communication Engineering manishg@incedoinc.com About us Incedo Inc (formerly a part of $4Bn Indiabulls Group) is a technology solutions and servicing organization headquartered in the Bay Area, USA with workforce across North America, South Africa and India (Gurgaon, Bangalore). We specialize in Data & Analytics and Product Engineering Services, with deep expertise in Financial Services, Life Science and Communication Engineering. Our key focus is on Emerging Technologies and Innovation. Our end-to-end capabilities span across Application Services, Infrastructure and Operations. What really differentiates us is: Strong engineering talent Focus and passion for innovation Flat organization structure responsive engagement models Agile and flexible delivery and commercial models Focus on long term partnership with clients USA: 2350 Mission College Boulevard, Suite 246 Santa Clara, California Tel: INDIA: 248, Udyog Vihar Phase-IV, Gurgaon Tel: /01/02

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84 Index A Amazon Web Services (AWS), 50, 58 Analytics engine, 21 22 Apache Kafka, 38, 131 Apache S4, 38, 131 Apache Sqoop, 37, 131 Appliance pattern, 104 105 Application architecture, big data analytics

More information

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Managing Big Data with Hadoop & Vertica A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database Copyright Vertica Systems, Inc. October 2009 Cloudera and Vertica

More information

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract

W H I T E P A P E R. Deriving Intelligence from Large Data Using Hadoop and Applying Analytics. Abstract W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the

More information

The Future of Data Management

The Future of Data Management The Future of Data Management with Hadoop and the Enterprise Data Hub Amr Awadallah (@awadallah) Cofounder and CTO Cloudera Snapshot Founded 2008, by former employees of Employees Today ~ 800 World Class

More information

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014

BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 BIG DATA CAN DRIVE THE BUSINESS AND IT TO EVOLVE AND ADAPT RALPH KIMBALL BUSSUM 2014 Ralph Kimball Associates 2014 The Data Warehouse Mission Identify all possible enterprise data assets Select those assets

More information

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015

Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015 Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL May 2015 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document

More information

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014

5 Keys to Unlocking the Big Data Analytics Puzzle. Anurag Tandon Director, Product Marketing March 26, 2014 5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for

More information

Native Connectivity to Big Data Sources in MSTR 10

Native Connectivity to Big Data Sources in MSTR 10 Native Connectivity to Big Data Sources in MSTR 10 Bring All Relevant Data to Decision Makers Support for More Big Data Sources Optimized Access to Your Entire Big Data Ecosystem as If It Were a Single

More information

The 4 Pillars of Technosoft s Big Data Practice

The 4 Pillars of Technosoft s Big Data Practice beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed

More information

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani

A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani Technical Architect - Big Data Syntel Agenda Welcome to the Zoo! Evolution Timeline Traditional BI/DW Architecture Where Hadoop Fits In 2 Welcome to

More information

Big Data Analytics - Accelerated. stream-horizon.com

Big Data Analytics - Accelerated. stream-horizon.com Big Data Analytics - Accelerated stream-horizon.com Legacy ETL platforms & conventional Data Integration approach Unable to meet latency & data throughput demands of Big Data integration challenges Based

More information

Big Data at Cloud Scale

Big Data at Cloud Scale Big Data at Cloud Scale Pushing the limits of flexible & powerful analytics Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For

More information

Big Data Success Step 1: Get the Technology Right

Big Data Success Step 1: Get the Technology Right Big Data Success Step 1: Get the Technology Right TOM MATIJEVIC Director, Business Development ANDY MCNALIS Director, Data Management & Integration MetaScale is a subsidiary of Sears Holdings Corporation

More information

Transforming the Telecoms Business using Big Data and Analytics

Transforming the Telecoms Business using Big Data and Analytics Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe

More information

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth

Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth MAKING BIG DATA COME ALIVE Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth Steve Gonzales, Principal Manager steve.gonzales@thinkbiganalytics.com

More information

GigaSpaces Real-Time Analytics for Big Data

GigaSpaces Real-Time Analytics for Big Data GigaSpaces Real-Time Analytics for Big Data GigaSpaces makes it easy to build and deploy large-scale real-time analytics systems Rapidly increasing use of large-scale and location-aware social media and

More information

Testing Big data is one of the biggest

Testing Big data is one of the biggest Infosys Labs Briefings VOL 11 NO 1 2013 Big Data: Testing Approach to Overcome Quality Challenges By Mahesh Gudipati, Shanthi Rao, Naju D. Mohan and Naveen Kumar Gajja Validate data quality by employing

More information

Architectures for Big Data Analytics A database perspective

Architectures for Big Data Analytics A database perspective Architectures for Big Data Analytics A database perspective Fernando Velez Director of Product Management Enterprise Information Management, SAP June 2013 Outline Big Data Analytics Requirements Spectrum

More information

In-Memory Analytics for Big Data

In-Memory Analytics for Big Data In-Memory Analytics for Big Data Game-changing technology for faster, better insights WHITE PAPER SAS White Paper Table of Contents Introduction: A New Breed of Analytics... 1 SAS In-Memory Overview...

More information

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume

More information

Data processing goes big

Data processing goes big Test report: Integration Big Data Edition Data processing goes big Dr. Götz Güttich Integration is a powerful set of tools to access, transform, move and synchronize data. With more than 450 connectors,

More information

Cloudera Enterprise Data Hub in Telecom:

Cloudera Enterprise Data Hub in Telecom: Cloudera Enterprise Data Hub in Telecom: Three Customer Case Studies Version: 103 Table of Contents Introduction 3 Cloudera Enterprise Data Hub for Telcos 4 Cloudera Enterprise Data Hub in Telecom: Customer

More information

From Spark to Ignition:

From Spark to Ignition: From Spark to Ignition: Fueling Your Business on Real-Time Analytics Eric Frenkiel, MemSQL CEO June 29, 2015 San Francisco, CA What s in Store For This Presentation? 1. MemSQL: A real-time database for

More information

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase

Architectural patterns for building real time applications with Apache HBase. Andrew Purtell Committer and PMC, Apache HBase Architectural patterns for building real time applications with Apache HBase Andrew Purtell Committer and PMC, Apache HBase Who am I? Distributed systems engineer Principal Architect in the Big Data Platform

More information

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics

Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics In Organizations Mark Vervuurt Cluster Data Science & Analytics AGENDA 1. Yellow Elephant 2. Data Ingestion & Complex Event Processing 3. SQL on Hadoop 4. NoSQL 5. InMemory 6. Data Science & Machine Learning

More information

Big data blue print for cloud architecture

Big data blue print for cloud architecture Big data blue print for cloud architecture -COGNIZANT Image Area Prabhu Inbarajan Srinivasan Thiruvengadathan Muralicharan Gurumoorthy Praveen Codur 2012, Cognizant Next 30 minutes Big Data / Cloud challenges

More information

HDP Hadoop From concept to deployment.

HDP Hadoop From concept to deployment. HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some

More information

INTRODUCTION TO CASSANDRA

INTRODUCTION TO CASSANDRA INTRODUCTION TO CASSANDRA This ebook provides a high level overview of Cassandra and describes some of its key strengths and applications. WHAT IS CASSANDRA? Apache Cassandra is a high performance, open

More information

Building Big with Big Data Now companies are in the middle of a renovation that forces them to be analytics-driven to continue being competitive.

Building Big with Big Data Now companies are in the middle of a renovation that forces them to be analytics-driven to continue being competitive. Unlocking Big Data Building Big with Big Data Now companies are in the middle of a renovation that forces them to be analytics-driven to continue being competitive. Data analysis provides a complete insight

More information

Hadoop. http://hadoop.apache.org/ Sunday, November 25, 12

Hadoop. http://hadoop.apache.org/ Sunday, November 25, 12 Hadoop http://hadoop.apache.org/ What Is Apache Hadoop? The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using

More information

Workshop on Hadoop with Big Data

Workshop on Hadoop with Big Data Workshop on Hadoop with Big Data Hadoop? Apache Hadoop is an open source framework for distributed storage and processing of large sets of data on commodity hardware. Hadoop enables businesses to quickly

More information

How To Make Data Streaming A Real Time Intelligence

How To Make Data Streaming A Real Time Intelligence REAL-TIME OPERATIONAL INTELLIGENCE Competitive advantage from unstructured, high-velocity log and machine Big Data 2 SQLstream: Our s-streaming products unlock the value of high-velocity unstructured log

More information

Navigating the Big Data infrastructure layer Helena Schwenk

Navigating the Big Data infrastructure layer Helena Schwenk mwd a d v i s o r s Navigating the Big Data infrastructure layer Helena Schwenk A special report prepared for Actuate May 2013 This report is the second in a series of four and focuses principally on explaining

More information

BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP

BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP BIG DATA AND THE ENTERPRISE DATA WAREHOUSE WORKSHOP Business Analytics for All Amsterdam - 2015 Value of Big Data is Being Recognized Executives beginning to see the path from data insights to revenue

More information

Oracle Big Data SQL Technical Update

Oracle Big Data SQL Technical Update Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical

More information

Hadoop Submitted in partial fulfillment of the requirement for the award of degree of Bachelor of Technology in Computer Science

Hadoop Submitted in partial fulfillment of the requirement for the award of degree of Bachelor of Technology in Computer Science A Seminar report On Hadoop Submitted in partial fulfillment of the requirement for the award of degree of Bachelor of Technology in Computer Science SUBMITTED TO: www.studymafia.org SUBMITTED BY: www.studymafia.org

More information

Advanced In-Database Analytics

Advanced In-Database Analytics Advanced In-Database Analytics Tallinn, Sept. 25th, 2012 Mikko-Pekka Bertling, BDM Greenplum EMEA 1 That sounds complicated? 2 Who can tell me how best to solve this 3 What are the main mathematical functions??

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks was founded by the team behind Apache Spark, the most active open source project in the big data ecosystem today. Our mission at Databricks is to dramatically

More information

Three Open Blueprints For Big Data Success

Three Open Blueprints For Big Data Success White Paper: Three Open Blueprints For Big Data Success Featuring Pentaho s Open Data Integration Platform Inside: Leverage open framework and open source Kickstart your efforts with repeatable blueprints

More information

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January 2015. Email: bdg@qburst.com Website: www.qburst.com

Lambda Architecture. Near Real-Time Big Data Analytics Using Hadoop. January 2015. Email: bdg@qburst.com Website: www.qburst.com Lambda Architecture Near Real-Time Big Data Analytics Using Hadoop January 2015 Contents Overview... 3 Lambda Architecture: A Quick Introduction... 4 Batch Layer... 4 Serving Layer... 4 Speed Layer...

More information

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ

End to End Solution to Accelerate Data Warehouse Optimization. Franco Flore Alliance Sales Director - APJ End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,

More information

Trafodion Operational SQL-on-Hadoop

Trafodion Operational SQL-on-Hadoop Trafodion Operational SQL-on-Hadoop SophiaConf 2015 Pierre Baudelle, HP EMEA TSC July 6 th, 2015 Hadoop workload profiles Operational Interactive Non-interactive Batch Real-time analytics Operational SQL

More information

Cost-Effective Business Intelligence with Red Hat and Open Source

Cost-Effective Business Intelligence with Red Hat and Open Source Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,

More information

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data

Introduction to Hadoop HDFS and Ecosystems. Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Introduction to Hadoop HDFS and Ecosystems ANSHUL MITTAL Slides credits: Cloudera Academic Partners Program & Prof. De Liu, MSBA 6330 Harvesting Big Data Topics The goal of this presentation is to give

More information

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM

QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QLIKVIEW DEPLOYMENT FOR BIG DATA ANALYTICS AT KING.COM QlikView Technical Case Study Series Big Data June 2012 qlikview.com Introduction This QlikView technical case study focuses on the QlikView deployment

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Big Data Open Source Stack vs. Traditional Stack for BI and Analytics

Big Data Open Source Stack vs. Traditional Stack for BI and Analytics Big Data Open Source Stack vs. Traditional Stack for BI and Analytics Part I By Sam Poozhikala, Vice President Customer Solutions at StratApps Inc. 4/4/2014 You may contact Sam Poozhikala at spoozhikala@stratapps.com.

More information

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES Relational vs. Non-Relational Architecture Relational Non-Relational Rational Predictable Traditional Agile Flexible Modern 2 Agenda Big Data

More information

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data

CA Technologies Big Data Infrastructure Management Unified Management and Visibility of Big Data Research Report CA Technologies Big Data Infrastructure Management Executive Summary CA Technologies recently exhibited new technology innovations, marking its entry into the Big Data marketplace with

More information

Consulting and Systems Integration (1) Networks & Cloud Integration Engineer

Consulting and Systems Integration (1) Networks & Cloud Integration Engineer Ericsson is a world-leading provider of telecommunications equipment & services to mobile & fixed network operators. Over 1,000 networks in more than 180 countries use Ericsson equipment, & more than 40

More information

BIG DATA. Using the Lambda Architecture on a Big Data Platform to Improve Mobile Campaign Management. Author: Sandesh Deshmane

BIG DATA. Using the Lambda Architecture on a Big Data Platform to Improve Mobile Campaign Management. Author: Sandesh Deshmane BIG DATA Using the Lambda Architecture on a Big Data Platform to Improve Mobile Campaign Management Author: Sandesh Deshmane Executive Summary Growing data volumes and real time decision making requirements

More information

Processing and Analyzing Streams. CDRs in Real Time

Processing and Analyzing Streams. CDRs in Real Time Processing and Analyzing Streams of CDRs in Real Time Streaming Analytics for CDRs 2 The V of Big Data Velocity means both how fast data is being produced and how fast the data must be processed to meet

More information

STREAM ANALYTIX. Industry s only Multi-Engine Streaming Analytics Platform

STREAM ANALYTIX. Industry s only Multi-Engine Streaming Analytics Platform STREAM ANALYTIX Industry s only Multi-Engine Streaming Analytics Platform One Platform for All Create real-time streaming data analytics applications in minutes with a powerful visual editor Get a wide

More information

Big Data Defined Introducing DataStack 3.0

Big Data Defined Introducing DataStack 3.0 Big Data Big Data Defined Introducing DataStack 3.0 Inside: Executive Summary... 1 Introduction... 2 Emergence of DataStack 3.0... 3 DataStack 1.0 to 2.0... 4 DataStack 2.0 Refined for Large Data & Analytics...

More information

Hadoop IST 734 SS CHUNG

Hadoop IST 734 SS CHUNG Hadoop IST 734 SS CHUNG Introduction What is Big Data?? Bulk Amount Unstructured Lots of Applications which need to handle huge amount of data (in terms of 500+ TB per day) If a regular machine need to

More information

Traditional BI vs. Business Data Lake A comparison

Traditional BI vs. Business Data Lake A comparison Traditional BI vs. Business Data Lake A comparison The need for new thinking around data storage and analysis Traditional Business Intelligence (BI) systems provide various levels and kinds of analyses

More information

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce

Analytics in the Cloud. Peter Sirota, GM Elastic MapReduce Analytics in the Cloud Peter Sirota, GM Elastic MapReduce Data-Driven Decision Making Data is the new raw material for any business on par with capital, people, and labor. What is Big Data? Terabytes of

More information

How To Create A Data Visualization With Apache Spark And Zeppelin 2.5.3.5

How To Create A Data Visualization With Apache Spark And Zeppelin 2.5.3.5 Big Data Visualization using Apache Spark and Zeppelin Prajod Vettiyattil, Software Architect, Wipro Agenda Big Data and Ecosystem tools Apache Spark Apache Zeppelin Data Visualization Combining Spark

More information

A Scalable Data Transformation Framework using the Hadoop Ecosystem

A Scalable Data Transformation Framework using the Hadoop Ecosystem A Scalable Data Transformation Framework using the Hadoop Ecosystem Raj Nair Director Data Platform Kiru Pakkirisamy CTO AGENDA About Penton and Serendio Inc Data Processing at Penton PoC Use Case Functional

More information

Performance and Scalability Overview

Performance and Scalability Overview Performance and Scalability Overview This guide provides an overview of some of the performance and scalability capabilities of the Pentaho Business Analytics Platform. Contents Pentaho Scalability and

More information

JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service

JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service JDSU Partners with Infobright to Help the World s Largest Communications Service Providers Ensure the Highest Quality of Service Overview JDSU (NASDAQ: JDSU; and TSX: JDU) innovates and markets diverse

More information

HadoopTM Analytics DDN

HadoopTM Analytics DDN DDN Solution Brief Accelerate> HadoopTM Analytics with the SFA Big Data Platform Organizations that need to extract value from all data can leverage the award winning SFA platform to really accelerate

More information

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect

Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate

More information

Databricks. A Primer

Databricks. A Primer Databricks A Primer Who is Databricks? Databricks vision is to empower anyone to easily build and deploy advanced analytics solutions. The company was founded by the team who created Apache Spark, a powerful

More information

Ganzheitliches Datenmanagement

Ganzheitliches Datenmanagement Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist

More information

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances

Well packaged sets of preinstalled, integrated, and optimized software on select hardware in the form of engineered systems and appliances INSIGHT Oracle's All- Out Assault on the Big Data Market: Offering Hadoop, R, Cubes, and Scalable IMDB in Familiar Packages Carl W. Olofson IDC OPINION Global Headquarters: 5 Speen Street Framingham, MA

More information

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing Evaluating NoSQL for Enterprise Applications Dirk Bartels VP Strategy & Marketing Agenda The Real Time Enterprise The Data Gold Rush Managing The Data Tsunami Analytics and Data Case Studies Where to go

More information

How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns

How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns Table of Contents Abstract... 3 Introduction... 3 Definition... 3 The Expanding Digitization

More information

Descriptive to Predictive to Prescriptive Analytics: Move Up the Value Chain. Suren Nathan CTO

Descriptive to Predictive to Prescriptive Analytics: Move Up the Value Chain. Suren Nathan CTO Descriptive to Predictive to Prescriptive Analytics: Move Up the Value Chain Suren Nathan CTO What We Do Deliver cloud based predictive analytics solutions to the communications industry to help streamline

More information

G-Cloud Big Data Suite Powered by Pivotal. December 2014. G-Cloud. service definitions

G-Cloud Big Data Suite Powered by Pivotal. December 2014. G-Cloud. service definitions G-Cloud Big Data Suite Powered by Pivotal December 2014 G-Cloud service definitions TABLE OF CONTENTS Service Overview... 3 Business Need... 6 Our Approach... 7 Service Management... 7 Vendor Accreditations/Awards...

More information

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru

Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy. Presented by: Jeffrey Zhang and Trishla Maru Tapping Into Hadoop and NoSQL Data Sources with MicroStrategy Presented by: Jeffrey Zhang and Trishla Maru Agenda Big Data Overview All About Hadoop What is Hadoop? How does MicroStrategy connects to Hadoop?

More information

Understanding the Value of In-Memory in the IT Landscape

Understanding the Value of In-Memory in the IT Landscape February 2012 Understing the Value of In-Memory in Sponsored by QlikView Contents The Many Faces of In-Memory 1 The Meaning of In-Memory 2 The Data Analysis Value Chain Your Goals 3 Mapping Vendors to

More information

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap

Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap Aligning Your Strategic Initiatives with a Realistic Big Data Analytics Roadmap 3 key strategic advantages, and a realistic roadmap for what you really need, and when 2012, Cognizant Topics to be discussed

More information

The Power of Pentaho and Hadoop in Action. Demonstrating MapReduce Performance at Scale

The Power of Pentaho and Hadoop in Action. Demonstrating MapReduce Performance at Scale The Power of Pentaho and Hadoop in Action Demonstrating MapReduce Performance at Scale Introduction Over the last few years, Big Data has gone from a tech buzzword to a value generator for many organizations.

More information

BIG DATA What it is and how to use?

BIG DATA What it is and how to use? BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14

More information

Talend Real-Time Big Data Sandbox. Big Data Insights Cookbook

Talend Real-Time Big Data Sandbox. Big Data Insights Cookbook Talend Real-Time Big Data Talend Real-Time Big Data Overview of Real-time Big Data Pre-requisites to run Setup & Talend License Talend Real-Time Big Data Big Data Setup & About this cookbook What is the

More information

BIG DATA IS MESSY PARTNER WITH SCALABLE

BIG DATA IS MESSY PARTNER WITH SCALABLE BIG DATA IS MESSY PARTNER WITH SCALABLE SCALABLE SYSTEMS HADOOP SOLUTION WHAT IS BIG DATA? Each day human beings create 2.5 quintillion bytes of data. In the last two years alone over 90% of the data on

More information

BIG DATA & DATA SCIENCE

BIG DATA & DATA SCIENCE BIG DATA & DATA SCIENCE ACADEMY PROGRAMS IN-COMPANY TRAINING PORTFOLIO 2 TRAINING PORTFOLIO 2016 Synergic Academy Solutions BIG DATA FOR LEADING BUSINESS Big data promises a significant shift in the way

More information

CitusDB Architecture for Real-Time Big Data

CitusDB Architecture for Real-Time Big Data CitusDB Architecture for Real-Time Big Data CitusDB Highlights Empowers real-time Big Data using PostgreSQL Scales out PostgreSQL to support up to hundreds of terabytes of data Fast parallel processing

More information

Business Intelligence for Big Data

Business Intelligence for Big Data Business Intelligence for Big Data Will Gorman, Vice President, Engineering May, 2011 2010, Pentaho. All Rights Reserved. www.pentaho.com. What is BI? Business Intelligence = reports, dashboards, analysis,

More information

Big Data & the Cloud: The Sum Is Greater Than the Parts

Big Data & the Cloud: The Sum Is Greater Than the Parts E-PAPER March 2014 Big Data & the Cloud: The Sum Is Greater Than the Parts Learn how to accelerate your move to the cloud and use big data to discover new hidden value for your business and your users.

More information

Data Refinery with Big Data Aspects

Data Refinery with Big Data Aspects International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data

More information

Harnessing the Power of Big Data for Real-Time IT: Sumo Logic Log Management and Analytics Service

Harnessing the Power of Big Data for Real-Time IT: Sumo Logic Log Management and Analytics Service Harnessing the Power of Big Data for Real-Time IT: Sumo Logic Log Management and Analytics Service A Sumo Logic White Paper Introduction Managing and analyzing today s huge volume of machine data has never

More information

Hadoop & Spark Using Amazon EMR

Hadoop & Spark Using Amazon EMR Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?

More information

Driving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA

Driving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA WHITE PAPER April 2014 Driving IBM BigInsights Performance Over GPFS Using InfiniBand+RDMA Executive Summary...1 Background...2 File Systems Architecture...2 Network Architecture...3 IBM BigInsights...5

More information

Hadoop and Map-Reduce. Swati Gore

Hadoop and Map-Reduce. Swati Gore Hadoop and Map-Reduce Swati Gore Contents Why Hadoop? Hadoop Overview Hadoop Architecture Working Description Fault Tolerance Limitations Why Map-Reduce not MPI Distributed sort Why Hadoop? Existing Data

More information

Next-Generation Cloud Analytics with Amazon Redshift

Next-Generation Cloud Analytics with Amazon Redshift Next-Generation Cloud Analytics with Amazon Redshift What s inside Introduction Why Amazon Redshift is Great for Analytics Cloud Data Warehousing Strategies for Relational Databases Analyzing Fast, Transactional

More information

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage

Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage White Paper Scaling Objectivity Database Performance with Panasas Scale-Out NAS Storage A Benchmark Report August 211 Background Objectivity/DB uses a powerful distributed processing architecture to manage

More information

2015 Analyst and Advisor Summit. Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist

2015 Analyst and Advisor Summit. Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist 2015 Analyst and Advisor Summit Advanced Data Analytics Dr. Rod Fontecilla Vice President, Application Services, Chief Data Scientist Agenda Key Facts Offerings and Capabilities Case Studies When to Engage

More information

Performance Testing of Big Data Applications

Performance Testing of Big Data Applications Paper submitted for STC 2013 Performance Testing of Big Data Applications Author: Mustafa Batterywala: Performance Architect Impetus Technologies mbatterywala@impetus.co.in Shirish Bhale: Director of Engineering

More information

Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges

Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges Prerita Gupta Research Scholar, DAV College, Chandigarh Dr. Harmunish Taneja Department of Computer Science and

More information

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney Introduction to Hadoop New York Oracle User Group Vikas Sawhney GENERAL AGENDA Driving Factors behind BIG-DATA NOSQL Database 2014 Database Landscape Hadoop Architecture Map/Reduce Hadoop Eco-system Hadoop

More information

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012

Big Data Buzzwords From A to Z. By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012 Big Data Buzzwords From A to Z By Rick Whiting, CRN 4:00 PM ET Wed. Nov. 28, 2012 Big Data Buzzwords Big data is one of the, well, biggest trends in IT today, and it has spawned a whole new generation

More information

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study

Oracle Big Data Spatial & Graph Social Network Analysis - Case Study Oracle Big Data Spatial & Graph Social Network Analysis - Case Study Mark Rittman, CTO, Rittman Mead OTN EMEA Tour, May 2016 info@rittmanmead.com www.rittmanmead.com @rittmanmead About the Speaker Mark

More information

CRITEO INTERNSHIP PROGRAM 2015/2016

CRITEO INTERNSHIP PROGRAM 2015/2016 CRITEO INTERNSHIP PROGRAM 2015/2016 A. List of topics PLATFORM Topic 1: Build an API and a web interface on top of it to manage the back-end of our third party demand component. Challenge(s): Working with

More information

Eliminating Complexity to Ensure Fastest Time to Big Data Value

Eliminating Complexity to Ensure Fastest Time to Big Data Value Eliminating Complexity to Ensure Fastest Time to Big Data Value Copyright 2015 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For the latest

More information

marlabs driving digital agility WHITEPAPER Big Data and Hadoop

marlabs driving digital agility WHITEPAPER Big Data and Hadoop marlabs driving digital agility WHITEPAPER Big Data and Hadoop Abstract This paper explains the significance of Hadoop, an emerging yet rapidly growing technology. The prime goal of this paper is to unveil

More information

Moving From Hadoop to Spark

Moving From Hadoop to Spark + Moving From Hadoop to Spark Sujee Maniyam Founder / Principal @ www.elephantscale.com sujee@elephantscale.com Bay Area ACM meetup (2015-02-23) + HI, Featured in Hadoop Weekly #109 + About Me : Sujee

More information

Big Data and Telecom Analytics Market: Business Case, Market Analysis & Forecasts 2014-2019

Big Data and Telecom Analytics Market: Business Case, Market Analysis & Forecasts 2014-2019 MARKET RESEARCH STORE Big Data and Telecom Analytics Market: Business Case, Market Analysis & Forecasts 2014-2019 Market Research Store included latest deep and professional market research report on Big

More information

Solace s Solutions for Communications Services Providers

Solace s Solutions for Communications Services Providers Solace s Solutions for Communications Services Providers Providers of communications services are facing new competitive pressures to increase the rate of innovation around both enterprise and consumer

More information