Transforming the Telecoms Business using Big Data and Analytics



Similar documents
Applications for Big Data Analytics

Big Data Explained. An introduction to Big Data Science.

Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014

Big Data. Introducción. Santiago González

How To Make Data Streaming A Real Time Intelligence

Are You Ready for Big Data?

Are You Ready for Big Data?

How To Handle Big Data With A Data Scientist

How To Scale Out Of A Nosql Database

Big Data Analytics for Space Exploration, Entrepreneurship and Policy Opportunities. Tiffani Crawford, PhD

Hadoop. Sunday, November 25, 12

BIG DATA IN THE CLOUD : CHALLENGES AND OPPORTUNITIES MARY- JANE SULE & PROF. MAOZHEN LI BRUNEL UNIVERSITY, LONDON

The Internet of Things and Big Data: Intro

Hortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc All Rights Reserved

SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera

BIG DATA What it is and how to use?

Hadoop implementation of MapReduce computational model. Ján Vaňo

Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop

The 4 Pillars of Technosoft s Big Data Practice

INTRODUCTION TO APACHE HADOOP MATTHIAS BRÄGER CERN GS-ASE

BIG DATA TECHNOLOGY. Hadoop Ecosystem

Big Data and Analytics: Challenges and Opportunities

The Future of Data Management with Hadoop and the Enterprise Data Hub

AGENDA. What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story. Our BIG DATA Roadmap. Hadoop PDW

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: Vol. 1, Issue 6, October Big Data and Hadoop

Improving Data Processing Speed in Big Data Analytics Using. HDFS Method

Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities

BIG DATA & ANALYTICS. Transforming the business and driving revenue through big data and analytics

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

ANALYTICS CENTER LEARNING PROGRAM

Making Sense of Big Data in Insurance

Testing Big data is one of the biggest

Managing Big Data with Hadoop & Vertica. A look at integration between the Cloudera distribution for Hadoop and the Vertica Analytic Database

Cloudera Enterprise Data Hub in Telecom:

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

locuz.com Big Data Services

Chase Wu New Jersey Ins0tute of Technology

Beyond Watson: The Business Implications of Big Data

Real Time Big Data Processing

Big Data: Tools and Technologies in Big Data

Big Data. Lyle Ungar, University of Pennsylvania

ANALYTICS BUILT FOR INTERNET OF THINGS

COMP9321 Web Application Engineering

Chapter 6. Foundations of Business Intelligence: Databases and Information Management

The Next Wave of Data Management. Is Big Data The New Normal?

Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Intro to Big Data and Business Intelligence

Microsoft Big Data Solutions. Anar Taghiyev P-TSP

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Big data for the Masses The Unique Challenge of Big Data Integration

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Keywords Big Data, NoSQL, Relational Databases, Decision Making using Big Data, Hadoop

Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

BIG DATA ANALYTICS REFERENCE ARCHITECTURES AND CASE STUDIES

Next presentation starting soon Business Analytics using Big Data to gain competitive advantage

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Big Data and Data Science: Behind the Buzz Words

BIG DATA CHALLENGES AND PERSPECTIVES

Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.

The evolution of database technology (II) Huibert Aalbers Senior Certified Executive IT Architect

Integrating a Big Data Platform into Government:

Big Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA

Using Data Mining and Machine Learning in Retail

SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford

Managing Cloud Server with Big Data for Small, Medium Enterprises: Issues and Challenges

WHITE PAPER ON. Operational Analytics. HTC Global Services Inc. Do not copy or distribute.

Session 1: IT Infrastructure Security Vertica / Hadoop Integration and Analytic Capabilities for Federal Big Data Challenges

You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.

The Big Data Paradigm Shift. Insight Through Automation

Implement Hadoop jobs to extract business value from large and varied data sets

Big Data Are You Ready? Jorge Plascencia Solution Architect Manager

Database Marketing, Business Intelligence and Knowledge Discovery

Foundations of Business Intelligence: Databases and Information Management

Datenverwaltung im Wandel - Building an Enterprise Data Hub with

Chukwa, Hadoop subproject, 37, 131 Cloud enabled big data, 4 Codd s 12 rules, 1 Column-oriented databases, 18, 52 Compression pattern, 83 84

How To Use Big Data For Telco (For A Telco)

Internals of Hadoop Application Framework and Distributed File System

How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning

Evolution from Big Data to Smart Data

WHITE PAPER. Four Key Pillars To A Big Data Management Solution

Addressing Open Source Big Data, Hadoop, and MapReduce limitations

The Future of Data Management

The? Data: Introduction and Future

Comprehensive Analytics on the Hortonworks Data Platform

HDP Hadoop From concept to deployment.

Exploiting Data at Rest and Data in Motion with a Big Data Platform

Customized Report- Big Data

Big Data Are You Ready? Thomas Kyte

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

Application and practice of parallel cloud computing in ISP. Guangzhou Institute of China Telecom Zhilan Huang

BIG DATA TRENDS AND TECHNOLOGIES

Transcription:

Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1

Objectives Describe the concept of big data Importance of big data Understand the three dimensions of big data 2

Big data A definition Big data broadly refers to the new methods and technologies for collecting, managing, and analyzing in real time the vast amount of both structured and unstructured data. 3

Big Data - Applications Organizations collect and analyze vast stores of data for insights that can help identify trends, predict behavior, and empower decision makers. It considers: How much data is generated How this data is identified and managed as an asset to the organization How this data is turned into usable information How organizations use this data to make decisions 4

Growth of Data Within the last decade, the volume of data that was produced in a year is now produced in a week. That amounts to over 20 exabytes of data produced a week. Data continues to grow exponentially as more of the unconnected become connected. 5

Dimensions of Big Data There are three primary dimensions of Big Data that must be accounted for: Volume - Volume describes the amount of data being transported and stored. Variety - Variety describes the type of data it is Velocity - Velocity describes the rate at which this data is moving dependent infrastructure (input/output, bandwidth, and latency) 6

Volume Data Volume 44x increase from 2009 2020 From 0.8 zettabytes to 35zb Data volume is increasing exponentially Exponential increase in collected/generated data 7 7

Velocity Various formats, types, and structures Text, numerical, images, audio, video, sequences, time series, social media data, multi-dim arrays, etc Static data vs. streaming data A single application can be generating/collecting many types of data To extract knowledge all these types of data need to linked together 8

Velocity Data is begin generated fast and need to be processed fast Online Data Analytics Late decisions missing opportunities Examples E-Promotions: Based on your current location, your purchase history, what you like send promotions right now for store next to you Healthcare monitoring: sensors monitoring your activities and body any abnormal measurements require immediate reaction 9

Sources of Data The vast amount of Data come from many quarters. Social media sites Sensors Digital photos/videos Business transactions Location-based data 10

Sources of Data Mobile devices (tracking all objects all the time) Social media and networks (all of us are generating data) The progress and innovation is no longer hindered by the ability to collect data But, by the ability to manage, analyze, Scientific instruments (collecting all sorts of data) summarize, visualize, and discover knowledge from the collected data in a timely manner and in a scalable fashion Sensor technology and networks (measuring all kinds of data) 11

Big Data Source: Cisco VNI Mobile, 2015 12

Growth of Data Mobile data traffic is expected to grow by approximately 60 percent year over year through 2016, driven mainly by video. 13

The big data in telecommunication business Unstructured data Network performance data Network traffic data Call detail record Internet related data See next slides 14

What is Data Mining? Discovery of useful, possibly unexpected, patterns in data Non-trivial extraction of implicit, previously unknown and potentially useful information from data Exploration & analysis, by automatic or semi-automatic means, of large quantities of data in order to discover meaningful patterns 15

Data Mining Tasks Classification [Predictive] Clustering [Descriptive] Association Rule Discovery [Descriptive] Sequential Pattern Discovery [Descriptive] Regression [Predictive] Deviation Detection [Predictive] Collaborative Filter [Predictive] 16

In a Nutshell Big Data is about better analytics! 17

Big Data and Business Intelligence (BI) 18

Big Data Technology NoSQL (Not Only SQL): Databases that move beyond relational data models (i.e., no tables, limited or no use of SQL) Focus on retrieval of data and appending new data (not necessarily tables) Focus on key-value data stores that can be used to locate data objects Focus on supporting storage of large quantities of unstructured data SQL is not used for storage or retrieval of data No ACID (atomicity, consistency, isolation, 19

NoSQL NoSQL focuses on a schema-less architecture (i.e., the data structure is not predefined) In contrast, traditional relation DBs require the schema to be defined before the database is built and populated. Data are structured Limited in scope Designed around ACID principles. 20

Hadoop Hadoop is a distributed file system and data processing engine that is designed to handle extremely high volumes of data in any structure. Hadoop has two components: The Hadoop distributed file system (HDFS), which supports data in structured relational form, in unstructured form, and in any form in between The MapReduce programing paradigm for managing applications on multiple distributed servers The focus is on supporting redundancy, distributed architectures, and parallel processing 21

Hadoop Related Names Apache Avro: designed for communication between Hadoop nodes through data serialization Cassandra and Hbase: a non-relational database designed for use with Hadoop Hive: a query language similar to SQL (HiveQL) but compatible with Hadoop Mahout: an AI tool designed for machine learning; that is, to assist with filtering data for analysis and exploration Pig Latin: A data-flow language and execution framework for parallel computation ZooKeeper: Keeps all the parts coordinated and working together 22

What to do with the data 23

The big data in telecommunication business Unstructured data like the emails, text-based documentation, contracts, and social media sources, which contains critical information about customer engagement and customer satisfaction. Network performance data including: gigabytes of events logs about services, network, transmission - this provides insights into root cause of alarms and affected services and customers. 24

The big data in telecommunication business Cont d Network traffic data This includes identification of the type of data passing over the network, bandwidth usage and the type of user which is needed to enable better traffic engineering and capacity management and hence informed investment decisions. Call detail record which can provide useful call patterns by correlating data about, frequency, date and time, duration, called parties, subscriber location etc. 25

The big data in telecommunication business Cont d Internet related data showing IP addresses used, GPS location, frequency, type of data, social networking websites accessed, other sites like new and entertainment, bandwidth requirements etc 26

Application of Telecoms Big Data All these new, vast, and complex type of data is what constitutes the Big data and can provide significant, material and actionable insights into customers, products, and operations. The new data sources from Web activities, mobile data, user location combined with traditional data enable businesses gain more understanding of the customer requirements. 27

Big data technology Big data tools and software are required to process the extremely large volumes of data that a business has collected to determine which data is relevant and can be analyzed to drive better business decisions in the future. Telcos require technology that gathers the vast amounts of data generated by 4G networks, CDRs, clickstreams, IPv6 devices, location sensors, and machine-to-machine monitors in a single format information platform. 28

Big data technology The technology must have the capability to integrate data in near real time, scale costeffectively and integrate with legacy systems and technologies, and shrink batch windows for high performance. Some of the current big data languages capable of efficiently processing massive amounts both structured and unstructured data include the Hadoop and the R language. 29

Benefits of Big data Optimizing routing and quality of service by analyzing network traffic in real time Analyzing call data records in real time to identify fraudulent behavior immediately Allowing call center reps to flexibly and profitably modify subscriber calling plans immediately Tailoring marketing campaigns to individual customers using location-based and social networking technologies Using insights into customer behavior and usage to develop new products and services 30

Benefits of Big data Used in finance and HR to provide invaluable strategic asset with access to actionable business insights that improve talent acquisition, retention, development and organizational performance. Improved fraud management by correlating internal location, usage, and account data with external sources such as credit reports, operators could significantly increase the detection of fraudulent activity such as looping or call forwarding on hacked PBXs. Monetizing the data itself and selling insights about customers to third parties. 31

Discussion How can big data has been used to improve the human relation function in an organization. What are the challenges faced by telecommunications business in the implementation of big data What are the security considerations should be put in place to protect privacy of client information Most telecommunications organizations in Africa are also engaged in other services like mobile money, how can the use of big data impact on improvement of such services 32

Summary Big Data creates new ways of approaching Telecommunications: business: discovery-led rather than theory-led. Big data offers telecom business a real opportunity to gain a much more complete picture of their operations and their customers, and to further their innovation efforts. Telecommunications organizations that can incorporate these new strategies of learning consumer needs into their organizational processes will gain a more competitive advantage than their counterparts who stick to the traditional methods of learning the market requirements. 33

Thank you. 34