Questionnaire about the skills necessary for people. working with Big Data in the Statistical Organisations
|
|
|
- Nathan Garrison
- 10 years ago
- Views:
Transcription
1 Questionnaire about the skills necessary for people working with Big Data in the Statistical Organisations Preliminary results of the survey ( ) More detailed analysis will be prepared by October 2014 Total of 137 were received, because some of the responses were not complete 107 responses were used for the analysis. Detailed results by question: Question 2: Does your organisation work with Big Data? Answer Options Percent Count Yes 36.8% 39 No 19.8% 21 Planned in the near future 43.4% 46 Comments 32 answered question 106 skipped question 1 1
2 Comments: 1 We are just in the beginning 2 We will organize trainings on big data in statistics 3 Currently we are working in some experiments and probe of concept 4 We are a few people following the topic, but has no specific projects or programmes at the moment. 5 With the announcement of a national data sharing and accessibility policy, our government has defined the objective to facilitate access to the Government owned shareable data generated using public funds in machine readable format across the country in a pro-active and periodically updatable manner. A large quantum of the Government data which is currently generated by various government organizations and institutions in the country remains inaccessible to civil society, although most of such data may be non-sensitive in nature and could be used by public for scientific, economic and social developmental purposes. Thus, started toward collating Big Data. The sources of official statistics in our country: 1. Statistical surveys conducted by the national statistical organisation 2. Statistical surveys conducted by other producers of statistics (National Bank, 6 Ministry of Finance, Ministry of Education, Ministry of Health and other) 3. Administrative data 7 A first working group has been constituted within the Institute to analyse this possibility 8 We don't have yet enough experience and capacity to work with big data 9 As in pilots, not yet for publication/production purposes. 10 Except perhaps with scanned data for prices 11 Trying to get access to electricity consumption data (hourly data per household) We would propose to simplify the classification and only make a distinction between methodological 12 and IT skills. 13 for pilot projects, yet 14 For pilot projects, yet. 15 Early phase 16 RnD-phase 17 Hard to say - define big data 18 Some experimental work taking place to my knowledge 19 Developing area 20 We have plans to create a data warehouse under IPA project 21 There are some intentions to learn more about Big Data but no current projects. 22 We are, however participating in the UNECE Big Data Project 23 I am interested in statistical methodology, mainly with data quality We work with some large administrative data sources but I would expect that some of the new sources 24 will require quite different approaches from a technology angle. We have established a 12 month project to understand the potential but also the challenges of using 26 big data within official statistics - still in a research phase 27 The Production Department that I represent, does not work with Big Data 28 we have just started a Big Data project 29 We intend to develop a pilot for using Big Data as a source of statistical information in Supermarket scanner data is in use for quite a while 31 I work in my own research which is not strictly connected to what my office does 32 Evaluation of technology and possibilities 33 Started experiments 34 Analysis and POC in progress 35 We did already some pilots, but I'm not exactly informed. 36 Just commenced work - at very early stage 2
3 Question 3: How important do you think are those skills for working with Big Data? Please rate them from 1 (not important) to 5 (very important) 3.1 IT skills, ability to use the following programs, languages, technologies and software such as... Answer Options Rating Average Count nosql databases SQL databases Hadoop Map/Reduce R SAS Machine learning Java Hive QL Python Pig Latin Mahout Tableau Julia IPython Ruby Qlikview Other (please specify) 18 answered question 100 skipped question 7 Other answers: 1 IT depends on the kind of source and its usage 2 Scala (5), Spark (5), D3 (or another visualization tool, 5) 4 Scala - 3, Java Script Hadoop administration, cluster management 6 STATISTICA 7 Other in memory tools like qlickview like sas visual analytic 8 SPSS Don t work directly on this side and it is a developing area, so identified what I have heard from staff in 9 the Statistical side of the House Note the above responses are preliminary. We have licensed Splunk and will look at wider 10 applicability than initial business case in Security monitoring. 11 Not able to specify, as we are not working with Big Data 12 SPSS, Statistica Linux (5), HDFS (4), Business Intelligence, Enterprise Architecture, Data management, Systems 13 administrators with skills in administering, monitoring and supporting specific big data platforms 14 I'm not an IT worker and therefore it is difficult to answer to this question. 15 Scala, Linux, NB score of 1 above if not open source 16 Bash, awk and C/C++ (low-level and very fast!) 17 ETL software 18 No knowledge on IT skills required 3
4 Average rating 4
5 3.2 Statistical skills, such as... Answer Options Rating Average Count Methodology for processing Big Data Data mining Standards for processing Big Data Data management skills including documentation, registration, access control Strong/power user of software such as Excel, SAS, SPSS Ability to work with text analytics Other (please specify) 7 answered question 106 skipped question 1 Other answers: 1 Assessment from a more general point of view and might vary individually 2 Mathematical statistics skills (5), multivariate data analysis (5) 3 Statistical thinking, TQM, data quality 4 Not able to specify, as we are not working with Big Data Analytic modelling techniques such as predictive modelling; subject matter 5 knowledge Since there is no standard yet, the ability to think outside the box is very important 6 (there are no standard recipes yet) 7 Data integration skills 5
6 Average rating 3.3 Other skills, such as... Answer Options Rating Average Count Creative problem solving Data governance Ethics Initiative Privacy Teamwork Communication Other (please specify) 8 answered question 105 skipped question 2 6
7 Average rating 7
8 Question 4: Which of the following skills you already have in your organisation and at what level? 4.1 IT skills, such as... Answer Options Not available Basic level Intermediate level Advanced level Planned in the near future Count Hadoop Mahout Python Java Ruby SAS Pig Latin Hive QL SQL databases nosql databases R Tableau Qlikview Julia IPython Map/Reduce Machine learning Other (please specify) 15 answered question 99 skipped question 8 Other answers: 1 Several tools might exist but relevance must be checked for individual application 2 Scala (basic, planned), Spark (basic, planned), D3 (basic, planned) 3 SPSS - advanced 4 Scala - 3 Java Script STATISTICA - maps and analyzes (advanced level) 6 SPSS level 4 SAS is considered very important in our organisation. We have just completed 3 courses in R for staff interested. Again this is not my side of the house, and not 7 aware of some of those headings. 8 SPSS 9 Linux 10 Rhadoop is also available on the basic level. 11 I miss GPGPU programming abilities (CUDA C or Open-CL for example) 12 R is used for specific purposes and not widely. 8
9 9
10 4.2 Statistical skills, such as... Answer Options Methodology for processing Big Data Standards for processing Big Data Not available Basic level Intermediate level Advanced level Planned in the near future Count Strong/power user of software such as Excel, SAS, SPSS Ability to work with text analytics Data management skills including documentation, registration, access control Data mining Other (please specify) 8 answered question 101 skipped question 6 Other answers: Mathematical Statistics (Advanced, Planned), Multivariate Data Analysis (Intermediate, 1 planned), 2 SAS is not available 3 Advanced user of Excel 4 Not too sure about this as its not my domain, hence middle markings 5 We have a big experience to work with administrative registers 6 Data Scientist One may wonder if methodology for processing Big Data is already at an advanced level. I think 7 our organization is approaching that. 10
11 4.3 Other skills, such as... Answer Options Not available Basic level Intermediate level Advanced level Planned in the near future Count Communication Creative problem solving Initiative Privacy Data governance Ethics Teamwork Other (please specify) 5 answered question 102 skipped question 5 11
12 Other answers: 1 All skills are available at high level for the traditional system of generating and releasing statistics; adoption and modifications for big data purposes seem possible Except the listed skills, our staff periodically participate in training courses in such areas as: 1. The acquisition of knowledge in economics, law, improvement of management skills 2. Improving professional knowledge in statistics 3. Improvement of knowledge on modern information 2 technologies 4. The acquisition and improvement of foreign language skills Difference between levels is not clear when applied to listed skills. How does intermediate level 3 communication differ from basic level communication in practice? 4 Of course we have skilled in these areas but not when it comes to Big Data... The use of visualisation methods for Big Data is missing here. Because of the amount of data, 5 visualisation methods are essential in getting insight in the effect of the various analyses steps. 12
13 Question 5: Please indicate in which areas you have training in your statistical organisation and indicate if you have training materials that you can share or recommend? (Training materials include: books, internet resources, training materials developed in the Statistical Organisations, etc). 5.1 IT skills, such as... Answer Options Training Training materials that you can share or recommend Count SAS SQL databases Java R Hadoop Python nosql databases Map/Reduce Pig Latin Hive QL Machine learning Ruby Qlikview IPython Mahout Tableau Julia If you have any training material, please provide us the 14 title or the link to the website answered question 80 skipped question 27 If you have any training material, please provide us the title or the link to the website: 1 There are no extended big data projects running until now 2 I need to follow up and see if our training partners are willing to share training materials. 3 Material is in the form of PPT slides, just note that it's in our national language To R: various readers and best practices developed by our employees. Mainly in our 4 national language. 5 Training material is not specific to the context of big data (standard SAS training). 6 No training! 1. Beginning T-SQL with MS SQL Server 2005 and 2008, Paul Turley, Dan Wood 2. ACCESS 2010 Bible, Michael R. Groh 3. MS SQL Server 2008 Bible, Paul Nielsen, Uttam 7 Parui 8 Internal training material available in our national language for SAS and SQL. 9 SPSS We don't currently have training material for big data in any of these areas. We would have 10 some material for Java and potentially R but not specific to big data processing\analytics. 13
14 part of our Big Data project is to compile such a list from internet resources. Not currently 11 available but will be at a later date For many IT-skills on-line training sources are available (see the Coursera website). We are 12 very experienced in using R to analyse Big Data
15 5.2 Statistical skills, such as... Answer Options Training Training materials that you can share or recommend Count Strong/power user of software such as Excel, SAS, SPSS Data management skills including documentation, registration, access control Data mining Ability to work with text analytics Methodology for processing Big Data Standards for processing Big Data If you have any training material, please provide us the title or the link to the website 9 answered question 65 skipped question 42 If you have any training material, please provide us the title or the link to the website: Cooperation with the ESS is planned; We take part in a Eurostat Working Group on 1 Big data 2 No training, except Excel 1. EXCEL 2010 Bible 2. EXCEL 2010 PowePivot for the Data Analyst 3. The Excel 3 Analyst's Guide to Access, Michael Alexander 15
16 5.3 Other skills, such as... Answer Options Training Training materials that you can share or recommend Count Communication Privacy Teamwork Data governance Ethics Creative problem solving Initiative If you have any training material, please provide us the title or the link to the website 8 answered question 57 skipped question 50 16
17 If you have any training material, please provide us the title or the link to the website: All skills are provided at high level for traditional statistical system, adoptions 1 and specifications to big data seem possible 2 The Argus documentation. 3 Lean SixSigma by UNC Plus Delta 4 We could share material on the ESTP course on Big Data 5 We have office notices and also courses developed on Confidentiality. For communication we have improved the user satisfaction survey and developed documents: how to write a press release, how to organize a press conference. For Ethics, we have translated the ISI s Declaration on Professional Ethics into our national language and posted on its website, approved by the Resolution of the State Council on Statistics to apply it in the activity of our organisation and to suggest to others dealing with the statistical activity to follow 6 the Declaration. 7 I wonder if the two topics that are not checked can be trained with a program. 8 See also ESTP Training programme 17
18 Question 6: Please indicate top 5 priorities for training for your statistical organisation across all areas: IT, Statistics and other (by marking them 1-5, where 1 is the highest) 6.1 IT skills Training priority total score SAS Hadoop SQL databases R nosql Java Map/Reduce Machine Hadoop Pig Latin Hive QL Python Mahout Mahout Qlikview Tableau IPython Julia Ruby Weighted score 18
19 6.2 Statistical skills Statistical skills training priorities Methodology for processing Big Data Standards for processing Big Data Strong/power user of software such as Excel, SAS, SPSS Data mining Data management skills including documentation, registration, access Ability to work with text analytics Weighted score 6.3 Other skills Other skills training priorities Creative problem solving Communication Data governance Privacy Initiative Teamwork Ethics Weighted score 19
20 Question 7: Other comments/suggestions It is really not easy to answer this questionnaire. Very different things are put at the same level. The answer represent my idea, and in no case those of my organisation as a whole. We consider acute the study of Big Data problems and are interested in acquiring new knowledge in this field I answer as an individual with limited background in big data but experience of large datasets of survey/census data Encourage the use of a monthly or quarterly newsletter giving the latest developments and innovations in the area. This could also give examples and which tools were used in these cases. 6 7 We would recommend different trainings for different levels or purposes. There should be courses for Big Data for managers to give an overview on methodologies and tools and mainly to raise awareness about organisational and management issues induced by the use of Big Data sources. There should be training on methodology and IT tools for more technically oriented staff to be able to use these methodologies. Our fifth priority for training would be methodology on statistical learning. As a HR coordinator I could not interpret certain questions of the survey, because I do not have enough information on the IT field. In my opinion, people who will work with big data should have some Java skills. Because Hadoop environment is based on Java and so for example all exception messages are in 8 Java.To better understand the hadoop environment, java is the essential skill. 9 Good luck! Big data is not the issue. Administrative data governance and a willingness to accept/adopt 10 machine learning are the issues. 11 It's too early to priorities the technologies and training we will adopt. We are expecting to enter a phase of experimentation\exploration and learning in the next 12 months or so with the aim of having a better understanding of what tools\approaches we should invest in. It is important not to underestimate the methodological component associated with Big Data. Once the IT component is more well understood, the relevance of the data as well as the accuracy and representativity issues need to be explored. -the computational aspect of Big Data (algorithmic approaches, computer optimizations, visualization) needs to be considered, as well as efficient techniques to visualize, pattern match, analyze, manage. There is an Information Architecture-related component as well (structural, semantic, ontological) -There are other software tools not mentioned in the questionnaire (Web Crawlers, Big Data platforms such as those from 1010data, Amazon, Cloudera, HP, Hortonworks, Actian, Teradata, SAP, 12 Pivotal, MapR, kognitio, infobright, InfiniDB and IBM. Big Data teams should be made up of people having skills from both IT and Methodology. 13 People with good skills in both areas are very rare, hence expensive! 14 It would be desirable to have a training on work with big data. Some of the areas on which questions were asked are just emerging (Statistical methodology for Big Data). Which makes it very difficult to answer these questions. We are learning by doing, 15 which forms the basis of a training program that will be set up in the near future. 16 Keep us updated about the survey's results! 17 In general, I'm very interested in the subject of Big Data. However, so far I'm not really experienced, there are other colleagues in my organisation who are more the experts. I would appreciate it to be informed on further developments in this project. 20
Big Data and Data Science: Behind the Buzz Words
Big Data and Data Science: Behind the Buzz Words Peggy Brinkmann, FCAS, MAAA Actuary Milliman, Inc. April 1, 2014 Contents Big data: from hype to value Deconstructing data science Managing big data Analyzing
ANALYTICS CENTER LEARNING PROGRAM
Overview of Curriculum ANALYTICS CENTER LEARNING PROGRAM The following courses are offered by Analytics Center as part of its learning program: Course Duration Prerequisites 1- Math and Theory 101 - Fundamentals
Integrating a Big Data Platform into Government:
Integrating a Big Data Platform into Government: Drive Better Decisions for Policy and Program Outcomes John Haddad, Senior Director Product Marketing, Informatica Digital Government Institute s Government
ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat
ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web
Building Your Big Data Team
Building Your Big Data Team With all the buzz around Big Data, many companies have decided they need some sort of Big Data initiative in place to stay current with modern data management requirements.
Big Data. Lyle Ungar, University of Pennsylvania
Big Data Big data will become a key basis of competition, underpinning new waves of productivity growth, innovation, and consumer surplus. McKinsey Data Scientist: The Sexiest Job of the 21st Century -
Consulting and Systems Integration (1) Networks & Cloud Integration Engineer
Ericsson is a world-leading provider of telecommunications equipment & services to mobile & fixed network operators. Over 1,000 networks in more than 180 countries use Ericsson equipment, & more than 40
The? Data: Introduction and Future
The? Data: Introduction and Future Husnu Sensoy Global Maksimum Data & Information Technologies Global Maksimum Data & Information Technologies The Data Company Massive Data Unstructured Data Insight Information
Bringing Big Data to People
Bringing Big Data to People Microsoft s modern data platform SQL Server 2014 Analytics Platform System Microsoft Azure HDInsight Data Platform Everyone should have access to the data they need. Process
Big Data Explained. An introduction to Big Data Science.
Big Data Explained An introduction to Big Data Science. 1 Presentation Agenda What is Big Data Why learn Big Data Who is it for How to start learning Big Data When to learn it Objective and Benefits of
Big Data Multi-Platform Analytics (Hadoop, NoSQL, Graph, Analytical Database)
Multi-Platform Analytics (Hadoop, NoSQL, Graph, Analytical Database) Presented By: Mike Ferguson Intelligent Business Strategies Limited 2 Day Workshop : 25-26 September 2014 : 29-30 September 2014 www.unicom.co.uk/bigdata
Introduction to Big Data! with Apache Spark" UC#BERKELEY#
Introduction to Big Data! with Apache Spark" UC#BERKELEY# So What is Data Science?" Doing Data Science" Data Preparation" Roles" This Lecture" What is Data Science?" Data Science aims to derive knowledge!
Monitis Project Proposals for AUA. September 2014, Yerevan, Armenia
Monitis Project Proposals for AUA September 2014, Yerevan, Armenia Distributed Log Collecting and Analysing Platform Project Specifications Category: Big Data and NoSQL Software Requirements: Apache Hadoop
Oracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
BIG DATA & DATA SCIENCE
BIG DATA & DATA SCIENCE ACADEMY PROGRAMS IN-COMPANY TRAINING PORTFOLIO 2 TRAINING PORTFOLIO 2016 Synergic Academy Solutions BIG DATA FOR LEADING BUSINESS Big data promises a significant shift in the way
You should have a working knowledge of the Microsoft Windows platform. A basic knowledge of programming is helpful but not required.
What is this course about? This course is an overview of Big Data tools and technologies. It establishes a strong working knowledge of the concepts, techniques, and products associated with Big Data. Attendees
BIG DATA. Value 8/14/2014 WHAT IS BIG DATA? THE 5 V'S OF BIG DATA WHAT IS BIG DATA?
WHAT IS BIG DATA? BIG DATA DR. KLARA NELSON THE UNIVERSITY OF TAMPA "Volumes of data that are unusually large, or types of data that are unstructured" Thomas Davenport, Keeping Up with the Quants, 2013,
This survey addresses individual projects, partnerships, data sources and tools. Please submit it multiple times - once for each project.
Introduction This survey has been developed jointly by the United Nations Statistics Division (UNSD) and the United Nations Economic Commission for Europe (UNECE). Our goal is to provide an overview of
Hadoop Ecosystem Overview. CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook
Hadoop Ecosystem Overview CMSC 491 Hadoop-Based Distributed Computing Spring 2015 Adam Shook Agenda Introduce Hadoop projects to prepare you for your group work Intimate detail will be provided in future
Infomatics. Big-Data and Hadoop Developer Training with Oracle WDP
Big-Data and Hadoop Developer Training with Oracle WDP What is this course about? Big Data is a collection of large and complex data sets that cannot be processed using regular database management tools
Sunnie Chung. Cleveland State University
Sunnie Chung Cleveland State University Data Scientist Big Data Processing Data Mining 2 INTERSECT of Computer Scientists and Statisticians with Knowledge of Data Mining AND Big data Processing Skills:
ESS event: Big Data in Official Statistics
ESS event: Big Data in Official Statistics v erbi v is 1 Parallel sessions 2A and 2B LEARNING AND DEVELOPMENT: CAPACITY BUILDING AND TRAINING FOR ESS HUMAN RESOURCES FACILITATOR: JOSÉ CERVERA- FERRI 2
Sentimental Analysis using Hadoop Phase 2: Week 2
Sentimental Analysis using Hadoop Phase 2: Week 2 MARKET / INDUSTRY, FUTURE SCOPE BY ANKUR UPRIT The key value type basically, uses a hash table in which there exists a unique key and a pointer to a particular
Data processing goes big
Test report: Integration Big Data Edition Data processing goes big Dr. Götz Güttich Integration is a powerful set of tools to access, transform, move and synchronize data. With more than 450 connectors,
A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani
A Tour of the Zoo the Hadoop Ecosystem Prafulla Wani Technical Architect - Big Data Syntel Agenda Welcome to the Zoo! Evolution Timeline Traditional BI/DW Architecture Where Hadoop Fits In 2 Welcome to
COMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 11 (Part II) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411
INVESTOR PRESENTATION. First Quarter 2014
INVESTOR PRESENTATION First Quarter 2014 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences
An interdisciplinary model for analytics education
An interdisciplinary model for analytics education Raffaella Settimi, PhD School of Computing, DePaul University Drew Conway s Data Science Venn Diagram http://drewconway.com/zia/2013/3/26/the-data-science-venn-diagram
HDP Enabling the Modern Data Architecture
HDP Enabling the Modern Data Architecture Herb Cunitz President, Hortonworks Page 1 Hortonworks enables adoption of Apache Hadoop through HDP (Hortonworks Data Platform) Founded in 2011 Original 24 architects,
Transforming the Telecoms Business using Big Data and Analytics
Transforming the Telecoms Business using Big Data and Analytics Event: ICT Forum for HR Professionals Venue: Meikles Hotel, Harare, Zimbabwe Date: 19 th 21 st August 2015 AFRALTI 1 Objectives Describe
Advanced Big Data Analytics with R and Hadoop
REVOLUTION ANALYTICS WHITE PAPER Advanced Big Data Analytics with R and Hadoop 'Big Data' Analytics as a Competitive Advantage Big Analytics delivers competitive advantage in two ways compared to the traditional
THE MCKINSEY GLOBAL INSTITUTE has predicted that by 2018, the US alone could face a shortage of between 140,000 to 190,000 people with deep
THE MCKINSEY GLOBAL INSTITUTE has predicted that by 2018, the US alone could face a shortage of between 140,000 to 190,000 people with deep analytical skills, and a shortage of 1.5 million managers and
http://glennengstrand.info/analytics/fp
Functional Programming and Big Data by Glenn Engstrand (September 2014) http://glennengstrand.info/analytics/fp What is Functional Programming? It is a style of programming that emphasizes immutable state,
The Inside Scoop on Hadoop
The Inside Scoop on Hadoop Orion Gebremedhin National Solutions Director BI & Big Data, Neudesic LLC. VTSP Microsoft Corp. [email protected] [email protected] @OrionGM The Inside Scoop
Data Analytics Infrastructure
Data Analytics Infrastructure Data Science SG Nov 2015 Meetup Le Nguyen The Dat @lenguyenthedat Backgrounds ZALORA Group (2013 2014) o Biggest online fashion retails in South East Asia o Data Infrastructure
Big Data and Analytics: Challenges and Opportunities
Big Data and Analytics: Challenges and Opportunities Dr. Amin Beheshti Lecturer and Senior Research Associate University of New South Wales, Australia (Service Oriented Computing Group, CSE) Talk: Sharif
Customized Report- Big Data
GINeVRA Digital Research Hub Customized Report- Big Data 1 2014. All Rights Reserved. Agenda Context Challenges and opportunities Solutions Market Case studies Recommendations 2 2014. All Rights Reserved.
2015 Ironside Group, Inc. 2
2015 Ironside Group, Inc. 2 Introduction to Ironside What is Cloud, Really? Why Cloud for Data Warehousing? Intro to IBM PureData for Analytics (IPDA) IBM PureData for Analytics on Cloud Intro to IBM dashdb
Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth
MAKING BIG DATA COME ALIVE Big Data Architecture & Analytics A comprehensive approach to harness big data architecture and analytics for growth Steve Gonzales, Principal Manager [email protected]
BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand?
BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand? The Big Data Buzz big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database
Safe Harbor Statement
Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment
Big Data & QlikView. Democratizing Big Data Analytics. David Freriks Principal Solution Architect
Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate
DBMS / Business Intelligence, Business Intelligence / Big Data
DBMS / Business Intelligence, Business Intelligence / Big Data Orsys, with 30 years of experience, is providing high quality, independant State of the Art seminars and hands-on courses corresponding to
P4.1 Reference Architectures for Enterprise Big Data Use Cases Romeo Kienzler, Data Scientist, Advisory Architect, IBM Germany, Austria, Switzerland
P4.1 Reference Architectures for Enterprise Big Data Use Cases Romeo Kienzler, Data Scientist, Advisory Architect, IBM Germany, Austria, Switzerland IBM Center of Excellence for Data Science, Cognitive
The 4 Pillars of Technosoft s Big Data Practice
beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed
Forecast of Big Data Trends. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014
Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/
Workshop on Hadoop with Big Data
Workshop on Hadoop with Big Data Hadoop? Apache Hadoop is an open source framework for distributed storage and processing of large sets of data on commodity hardware. Hadoop enables businesses to quickly
Platfora Big Data Analytics
Platfora Big Data Analytics ISV Partner Solution Case Study and Cisco Unified Computing System Platfora, the leading enterprise big data analytics platform built natively on Hadoop and Spark, delivers
Hadoop Evolution In Organizations. Mark Vervuurt Cluster Data Science & Analytics
In Organizations Mark Vervuurt Cluster Data Science & Analytics AGENDA 1. Yellow Elephant 2. Data Ingestion & Complex Event Processing 3. SQL on Hadoop 4. NoSQL 5. InMemory 6. Data Science & Machine Learning
Outline. What is Big data and where they come from? How we deal with Big data?
What is Big Data Outline What is Big data and where they come from? How we deal with Big data? Big Data Everywhere! As a human, we generate a lot of data during our everyday activity. When you buy something,
How To Work For Amdocs
1. Financial Analyst (Requisition number 29449) Amdocs Raanana is looking for Financial Analyst who will be responsible for: Managing the Business Unit s financial measurements, and monthly/quarterly close
How To Learn To Use Big Data
Information Technologies Programs Big Data Specialized Studies Accelerate Your Career extension.uci.edu/bigdata Offered in partnership with University of California, Irvine Extension s professional certificate
Presenters: Luke Dougherty & Steve Crabb
Presenters: Luke Dougherty & Steve Crabb About Keylink Keylink Technology is Syncsort s partner for Australia & New Zealand. Our Customers: www.keylink.net.au 2 ETL is THE best use case for Hadoop. ShanH
HDP Hadoop From concept to deployment.
HDP Hadoop From concept to deployment. Ankur Gupta Senior Solutions Engineer Rackspace: Page 41 27 th Jan 2015 Where are you in your Hadoop Journey? A. Researching our options B. Currently evaluating some
IBM BigInsights Has Potential If It Lives Up To Its Promise. InfoSphere BigInsights A Closer Look
IBM BigInsights Has Potential If It Lives Up To Its Promise By Prakash Sukumar, Principal Consultant at iolap, Inc. IBM released Hadoop-based InfoSphere BigInsights in May 2013. There are already Hadoop-based
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
DATA MINING WITH HADOOP AND HIVE Introduction to Architecture
DATA MINING WITH HADOOP AND HIVE Introduction to Architecture Dr. Wlodek Zadrozny (Most slides come from Prof. Akella s class in 2014) 2015-2025. Reproduction or usage prohibited without permission of
Big Data and Data Science. The globally recognised training program
Big Data and Data Science The globally recognised training program Certificate in Big Data Analytics Duration 5 days Big Data and Data Science enables value creation from data, through the use of calculative
Predictive Analytics. Noam Zeigerson, CTO
Predictive Analytics Noam Zeigerson, CTO Agenda The Predictive Analytics Need Innovative Technologies Business Solutions The problem: Inconsistent stream of revenue Available Data Sources ERP data Web
Cisco Data Preparation
Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and
Microsoft Research Windows Azure for Research Training
Copyright 2013 Microsoft Corporation. All rights reserved. Except where otherwise noted, these materials are licensed under the terms of the Apache License, Version 2.0. You may use it according to the
Hadoop 只 支 援 用 Java 開 發 嘛? Is Hadoop only support Java? 總 不 能 全 部 都 重 新 設 計 吧? 如 何 與 舊 系 統 相 容? Can Hadoop work with existing software?
Hadoop 只 支 援 用 Java 開 發 嘛? Is Hadoop only support Java? 總 不 能 全 部 都 重 新 設 計 吧? 如 何 與 舊 系 統 相 容? Can Hadoop work with existing software? 可 以 跟 資 料 庫 結 合 嘛? Can Hadoop work with Databases? 開 發 者 們 有 聽 到
Big Data Too Big To Ignore
Big Data Too Big To Ignore Geert! Big Data Consultant and Manager! Currently finishing a 3 rd Big Data project! IBM & Cloudera Certified! IBM & Microsoft Big Data Partner 2 Agenda! Defining Big Data! Introduction
Cisco IT Hadoop Journey
Cisco IT Hadoop Journey Alex Garbarini, IT Engineer, Cisco 2015 MapR Technologies 1 Agenda Hadoop Platform Timeline Key Decisions / Lessons Learnt Data Lake Hadoop s place in IT Data Platforms Use Cases
TE's Analytics on Hadoop and SAP HANA Using SAP Vora
TE's Analytics on Hadoop and SAP HANA Using SAP Vora Naveen Narra Senior Manager TE Connectivity Santha Kumar Rajendran Enterprise Data Architect TE Balaji Krishna - Director, SAP HANA Product Mgmt. -
INVESTOR PRESENTATION. Third Quarter 2014
INVESTOR PRESENTATION Third Quarter 2014 Note to Investors Certain non-gaap financial information regarding operating results may be discussed during this presentation. Reconciliations of the differences
BIG DATA What it is and how to use?
BIG DATA What it is and how to use? Lauri Ilison, PhD Data Scientist 21.11.2014 Big Data definition? There is no clear definition for BIG DATA BIG DATA is more of a concept than precise term 1 21.11.14
Analytics 2013. A survey on analytic usage, trends, and future initiatives. Research conducted and written by:
Analytics 2013 A survey on analytic usage, trends, and future initiatives Research conducted and written by: Lavastorm Analytics A global analytics software company that enables a new, agile way to analyze,
Microsoft Research Microsoft Azure for Research Training
Copyright 2014 Microsoft Corporation. All rights reserved. Except where otherwise noted, these materials are licensed under the terms of the Apache License, Version 2.0. You may use it according to the
Programme Specification Postgraduate Programmes
Programme Specification Postgraduate Programmes Awarding Body/Institution Teaching Institution University of London Goldsmiths, University of London Name of Final Award and Programme Title MSc Data Science
How To Handle Big Data With A Data Scientist
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
BUDT 758B-0501: Big Data Analytics (Fall 2015) Decisions, Operations & Information Technologies Robert H. Smith School of Business
BUDT 758B-0501: Big Data Analytics (Fall 2015) Decisions, Operations & Information Technologies Robert H. Smith School of Business Instructor: Kunpeng Zhang ([email protected]) Lecture-Discussions:
The BIg Picture. Dinsdag 17 september 2013
The BIg Picture Dinsdag 17 september 2013 2 Agenda A short historical overview on BI Current Issues Current trends Future architecture First steps to this architecture 3 MIS/EIS Data Warehouse BI Multidimensional
SAP and Hortonworks Reference Architecture
SAP and Hortonworks Reference Architecture Hortonworks. We Do Hadoop. June Page 1 2014 Hortonworks Inc. 2011 2014. All Rights Reserved A Modern Data Architecture With SAP DATA SYSTEMS APPLICATIO NS Statistical
Report of the 2015 Big Data Survey. Prepared by United Nations Statistics Division
Statistical Commission Forty-seventh session 8 11 March 2016 Item 3(c) of the provisional agenda Big Data for official statistics Background document Available in English only Report of the 2015 Big Data
Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015
Hortonworks and ODP: Realizing the Future of Big Data, Now Manila, May 13, 2015 We Do Hadoop Fall 2014 Page 1 HDP delivers a comprehensive data management platform GOVERNANCE Hortonworks Data Platform
Big Data Analytics. Copyright 2011 EMC Corporation. All rights reserved.
Big Data Analytics 1 Priority Discussion Topics What are the most compelling business drivers behind big data analytics? Do you have or expect to have data scientists on your staff, and what will be their
How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns
How Transactional Analytics is Changing the Future of Business A look at the options, use cases, and anti-patterns Table of Contents Abstract... 3 Introduction... 3 Definition... 3 The Expanding Digitization
CS555: Distributed Systems [Fall 2015] Dept. Of Computer Science, Colorado State University
CS 555: DISTRIBUTED SYSTEMS [SPARK] Shrideep Pallickara Computer Science Colorado State University Frequently asked questions from the previous class survey Streaming Significance of minimum delays? Interleaving
Predictive Analytics
Predictive Analytics How many of you used predictive today? 2015 SAP SE. All rights reserved. 2 2015 SAP SE. All rights reserved. 3 How can you apply predictive to your business? Predictive Analytics is
SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP. Eva Andreasson Cloudera
SOLVING REAL AND BIG (DATA) PROBLEMS USING HADOOP Eva Andreasson Cloudera Most FAQ: Super-Quick Overview! The Apache Hadoop Ecosystem a Zoo! Oozie ZooKeeper Hue Impala Solr Hive Pig Mahout HBase MapReduce
TABLE OF CONTENTS 1 Chapter 1: Introduction 2 Chapter 2: Big Data Technology & Business Case 3 Chapter 3: Key Investment Sectors for Big Data
TABLE OF CONTENTS 1 Chapter 1: Introduction 1.1 Executive Summary 1.2 Topics Covered 1.3 Key Findings 1.4 Target Audience 1.5 Companies Mentioned 2 Chapter 2: Big Data Technology & Business Case 2.1 Defining
Big Data & Analytics @ Netflix. Paul Ellwood February 9th, 2015
Big Data & Analytics @ Netflix Paul Ellwood February 9th, 2015 Who Am I? Director, Data Science & Engineering Also Leader, DataKind San Francisco chapter Formerly: Director, Product Analytics @ Netflix
Programming Hadoop 5-day, instructor-led BD-106. MapReduce Overview. Hadoop Overview
Programming Hadoop 5-day, instructor-led BD-106 MapReduce Overview The Client Server Processing Pattern Distributed Computing Challenges MapReduce Defined Google's MapReduce The Map Phase of MapReduce
Moving From Hadoop to Spark
+ Moving From Hadoop to Spark Sujee Maniyam Founder / Principal @ www.elephantscale.com [email protected] Bay Area ACM meetup (2015-02-23) + HI, Featured in Hadoop Weekly #109 + About Me : Sujee
This Symposium brought to you by www.ttcus.com
This Symposium brought to you by www.ttcus.com Linkedin/Group: Technology Training Corporation @Techtrain Technology Training Corporation www.ttcus.com Big Data Analytics as a Service (BDAaaS) Big Data
