Big data in official statistics Insights about world heritage from the analysis of Wikipedia use
|
|
- Elmer Moore
- 8 years ago
- Views:
Transcription
1 Big data in official statistics Insights about world heritage from the analysis of Wikipedia use Fernando Reis, European Commission - Eurostat International Symposium on the Measurement of Digital Cultural Products Montreal, 9-11 May 2016
2 Defining big data in 1 minute Data deluge Exhaust data + sensors High detail, massive size (large p, large n) Data-driven analytical applications Statistical modelling, machine learning Visualisation Data-driven economy Official statistics does not have a nearly statistical monopoly anymore Eurostat
3 Scheveningen Memorandum on Big Data Examine the potential of Big Data sources for official statistics Official Statistics Big Data strategy as part of wider government strategy Address privacy and data protection Collaboration at European and global level Address need for skills Partnerships between different stakeholders (government, academics, private sector) Developments in Methodology, quality assessment and IT Adopt action plan and roadmap for the European Statistical System Eurostat
4 Big data strategy Start with concrete pilots 3 time-frames Short-term Medium-term Long-term Review the roadmap Eurostat
5 Big Data Action Plan and a glance Governance Policy Quality Skills Experience sharing Legislation IT Infrastructures Methods Ethics / Communication Big data sources Pilots Eurostat
6 Policy Quality Skills Experience sharing Methods Governance Legislation Ethics / Communication IT Infrastructures Big data sources Challenges cooperation, sharing of know-how development of a sound methodology ("from designbased to model-based approach") exploration & tentative implementation Looking for partners Pilots Action (example) Pilot projects, carried out by the Member States (ESSnet) (European Statistical System network) Exploring different big data sources (but also IT architecture, partnerships), developing generic guidelines and frameworks Establish Parternships with data providers and research and international organisations Cooperation with UN on Metodological Framework 6 Eurostat
7 Contracts Eurostat big data pilots Feasibility study on the use of mobile phone data for tourism statistics Internet as a data source for information society statistics Accreditation of big data sources Internal projects Wikipedia use Mobile phone for urban statistics Web evidence for nowcasting Eurostat
8 Wikipedia as a big data source Insights about world heritage from the analysis of Wikipedia use
9 World Heritage Sites Convention Concerning the Protection of the World Cultural and Natural Heritage List of World Heritage Sites maintained by UNESCO
10 Data sources: UNESCO 1. List of World Heritage Sites from UNESCO Public source Official information
11 Data sources: Wikipedia 2. Wikipedia Public source Digital traces left by people Widely used In 2013, 44% of individuals 16 to 74 years old living in EU consulted wikis to obtain knowledge (e.g. Wikipedia) This was 69% for individuals between 16 and 24 years old Community Survey on ICT Usage by Individuals
12 Data sources: Wikipedia 2.1 Content (text and links) Selection of articles related to World Heritage sites 2.2 Page views Wikistats: hourly number of page views for all articles of all wiki projects of the Wikimedia foundation
13 Wikipedia page views raw data Example: zu.z Ulimi 8 AE1,LN2O1Q1,AX1,FB2, [wiki code][article title][monthly total][hourly counts] Monthly files From Jan 2012 to Oct 2015 Total: 935 GB
14 Data processing Sandbox computer cluster 4 nodes, each: 2 x Intel Xeon E v3 10 cores 128GB RAM 4 x 4TB disk FDR Infiniband (56Gbit) 3 stages: Pre-processing Extraction Analytics
15 Pre-processing Scripts in Unix shell and Pig Filtering of raw data to needed project and language Change of format: en.z Banc_d'Arguin_National_Park [0, 0, 0, 5, 6, 16, 5, 20, 25, 21, 48, 29, 43, 40, 46, 0, 30, 55, 36, 39, 28, 28, 204, 218] Processing time: 8 Hours
16 Extraction Map-reduce jobs Scripts in Unix shell and python Filtering to list of articles supplied Time aggregation from hourly to daily, weekly and monthly Processing time: 2 hours
17 Data analysis R and RStudio Querying APIs (CatScan, Wikipedia Miner, Wikimedia) Web scrapping of Wikipedia for selection of articles (geocoordinates, categorisation, information boxes, article redirects, articles links) Statistics, maps, graphics
18 Number of page views of related Wikipedia articles per country of location of the WHS Reference: Jan.2012 Oct languages
19 Average number of page views according to the date of inscription Reference: Jan.2012 Oct languages
20 Top 20 World Heritage Sites in number of page views of related Wikipedia articles Reference: Jan.2012 Oct languages
21 Distribution of page views of articles related to World Heritage Sites by language of Wikipedia en de fr ru it pt nl tr es pl Reference: Jan.2012 Oct languages
22 Top 5 WHS in number of page views of related Wikipedia articles by language English Spanish German French Reference: Jan.2012 Oct languages
23 45M Page views of Wikipedia articles related to World Heritage Sites 40M 35M 30M 25M 20M 15M 10M 5M 0M Reference: Jan.2012 Oct languages Mar 2012 Jul 2012 Nov 2012 Mar 2013 Jul 2013 Nov 2013 Mar 2014 Jul 2014 Nov 2014 Mar 2015 Jul 2015
24 Page views of Wikipedia articles related to World Heritage Sites (English Wikipedia) Vatican City What happened in March 2013?!
25 Distribution of WHS by number of page views (log)
26 Distribution of WHS by number of page views (NOT log) The percentage of page views going to the top 20 WHS is 32%
27 Thank you for your attention Fernando Reis Eurostat Task Force on Big Data Eurostat
big data in the European Statistical System
Conference by STATEC and EUROSTAT Savoir pour agir: la statistique publique au service des citoyens big data in the European Statistical System Michail SKALIOTIS EUROSTAT, Head of Task Force 'Big Data'
More informationNew Frontiers for Official Statistics
European Data Forum 2015 November 16-17, 2015, Luxembourg New Frontiers for Official Statistics Mariana KOTZEVA EUROSTAT, Deputy Director General Key issues 1. A dynamically changing data ecosystem 2.
More informationThe Sandbox 2015 Report
United Nations Economic Commission for Europe Statistical Division Workshop on the Modernisation of Official Statistics November 24-25, 2015 The Sandbox project The Sandbox 2015 Report Antonino Virgillito
More informationThe use of Big Data for statistics
Workshop on the use of mobile positioning data for tourism statistics Prague (CZ), 14 May 2014 The use of Big Data for statistics EUROSTAT, Unit G-3 "Short-term statistics; tourism" What is the role of
More informationKeywords: big data, official statistics, quality, Wikipedia page views, AIS.
Comparative assessment of three quality frameworks for statistics derived from big data: the cases of Wikipedia page views and Automatic Identification Systems Fernando Reis 1, Loredana di Consiglio 1,
More informationAnalysis of Big Data Survey 2015 on Skills, Training and Capacity Building
Analysis of Big Data Survey 2015 on Skills, Training and Capacity Building D R A F T Version 1.0 12 Oct 2015 By UN Global Working Group on Big Data for Official Statistics Task Team on Skills, Training
More informationHortonworks & SAS. Analytics everywhere. Page 1. Hortonworks Inc. 2011 2014. All Rights Reserved
Hortonworks & SAS Analytics everywhere. Page 1 A change in focus. A shift in Advertising From mass branding A shift in Financial Services From Educated Investing A shift in Healthcare From mass treatment
More informationONS Big Data Project Progress report: Qtr 1 Jan to Mar 2014
Official ONS Big Data Project Qtr 1 Report May 2014 ONS Big Data Project Progress report: Qtr 1 Jan to Mar 2014 Jane Naylor, Nigel Swier, Susan Williams Office for National Statistics Background The amount
More informationUNECE HLG-MOS: Achievements
UNECE HLG-MOS: Achievements Lidia Bratanova, Director UNECE Statistical Division Tokyo, December 2015 Introducing UNECE Statistics Regional Commission of the UN 56 Member countries Europe, North America,
More informationHU CZ FI PL SI PT IT ES NO NL FR DK SE IE GB AT DE CH LU 0 10 20 30 40 Foreigners' share Source: Eurostat More trust 3 4 5 6 7 PL HU CZ SI PT GR ES DK FI SE
More informationAn Oracle White Paper June 2012. High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database
An Oracle White Paper June 2012 High Performance Connectors for Load and Access of Data from Hadoop to Oracle Database Executive Overview... 1 Introduction... 1 Oracle Loader for Hadoop... 2 Oracle Direct
More informationA Novel Cloud Based Elastic Framework for Big Data Preprocessing
School of Systems Engineering A Novel Cloud Based Elastic Framework for Big Data Preprocessing Omer Dawelbeit and Rachel McCrindle October 21, 2014 University of Reading 2008 www.reading.ac.uk Overview
More informationData and Machine Architecture for the Data Science Lab Workflow Development, Testing, and Production for Model Training, Evaluation, and Deployment
Data and Machine Architecture for the Data Science Lab Workflow Development, Testing, and Production for Model Training, Evaluation, and Deployment Rosaria Silipo Marco A. Zimmer Rosaria.Silipo@knime.com
More informationHadoop & SAS Data Loader for Hadoop
Turning Data into Value Hadoop & SAS Data Loader for Hadoop Sebastiaan Schaap Frederik Vandenberghe Agenda What s Hadoop SAS Data management: Traditional In-Database In-Memory The Hadoop analytics lifecycle
More informationTowards Smart and Intelligent SDN Controller
Towards Smart and Intelligent SDN Controller - Through the Generic, Extensible, and Elastic Time Series Data Repository (TSDR) YuLing Chen, Dell Inc. Rajesh Narayanan, Dell Inc. Sharon Aicler, Cisco Systems
More informationOracle Big Data Building A Big Data Management System
Oracle Big Building A Big Management System Copyright 2015, Oracle and/or its affiliates. All rights reserved. Effi Psychogiou ECEMEA Big Product Director May, 2015 Safe Harbor Statement The following
More informationJoined up Government needs Joined up Data. John.Dunne@cso.ie IPA/ICS 10 th Annual Public Sector IT Conference Dublin Castle 31 st October 2014
Joined up Government needs Joined up Data John.Dunne@cso.ie IPA/ICS 10 th Annual Public Sector IT Conference Dublin Castle 31 st October 2014 Joined up data - context Irish Statistical System: The Way
More informationHadoop Distributed File System. Dhruba Borthakur Apache Hadoop Project Management Committee dhruba@apache.org dhruba@facebook.com
Hadoop Distributed File System Dhruba Borthakur Apache Hadoop Project Management Committee dhruba@apache.org dhruba@facebook.com Hadoop, Why? Need to process huge datasets on large clusters of computers
More informationInternational collaboration to understand the relevance of Big Data for official statistics
Statistical Journal of the IAOS 31 (2015) 159 163 159 DOI 10.3233/SJI-150889 IOS Press International collaboration to understand the relevance of Big Data for official statistics Steven Vale United Nations
More informationA Study of Data Management Technology for Handling Big Data
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 9, September 2014,
More informationCost-Effective Business Intelligence with Red Hat and Open Source
Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,
More informationDisco: Beyond MapReduce
Disco: Beyond MapReduce Prashanth Mundkur Nokia Mar 22, 2013 Outline BigData/MapReduce Disco Disco Pipeline Model Disco Roadmap BigData/MapReduce Data too big to fit in RAM/disk of any single machine Analyze
More informationBig Data Challenges in Bioinformatics
Big Data Challenges in Bioinformatics BARCELONA SUPERCOMPUTING CENTER COMPUTER SCIENCE DEPARTMENT Autonomic Systems and ebusiness Pla?orms Jordi Torres Jordi.Torres@bsc.es Talk outline! We talk about Petabyte?
More informationBig Data and Analytics: Challenges and Opportunities
Big Data and Analytics: Challenges and Opportunities Dr. Amin Beheshti Lecturer and Senior Research Associate University of New South Wales, Australia (Service Oriented Computing Group, CSE) Talk: Sharif
More informationScaling Out With Apache Spark. DTL Meeting 17-04-2015 Slides based on https://www.sics.se/~amir/files/download/dic/spark.pdf
Scaling Out With Apache Spark DTL Meeting 17-04-2015 Slides based on https://www.sics.se/~amir/files/download/dic/spark.pdf Your hosts Mathijs Kattenberg Technical consultant Jeroen Schot Technical consultant
More informationMilk Market Situation. Brussels, 27 August 2015
Milk Market Situation Brussels, 27 August EU Productions EU Productions (Jan-Jun compared to Jan-Jun 214) +,8% + 1,2% + 3,2% +,2% +,7% - 2,% - 3,1% -,1% - 11,1%!!! Data from some Member States are confidential
More informationHPC technology and future architecture
HPC technology and future architecture Visual Analysis for Extremely Large-Scale Scientific Computing KGT2 Internal Meeting INRIA France Benoit Lange benoit.lange@inria.fr Toàn Nguyên toan.nguyen@inria.fr
More informationAn Introduction to High Performance Computing in the Department
An Introduction to High Performance Computing in the Department Ashley Ford & Chris Jewell Department of Statistics University of Warwick October 30, 2012 1 Some Background 2 How is Buster used? 3 Software
More informationOpen source Google-style large scale data analysis with Hadoop
Open source Google-style large scale data analysis with Hadoop Ioannis Konstantinou Email: ikons@cslab.ece.ntua.gr Web: http://www.cslab.ntua.gr/~ikons Computing Systems Laboratory School of Electrical
More informationCOMP9321 Web Application Engineering
COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 11 (Part II) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411
More informationCSPA. Common Statistical Production Architecture International activities on Big Data in Official Statistics. Carlo Vaccari Istat (vaccari@istat.
CSPA Common Statistical Production Architecture International activities on Big Data in Official Statistics Carlo Vaccari Istat (vaccari@istat.it) Data deluge Big Data definitions Data Characteristics:
More informationData Processing in the Cloud at Yahoo!
Data Processing in the Cloud at Sanjay Radia Chief Architect, Hadoop/Grid Sradia@yahoo-inc.com Cloud Computing & Data Infrastructure Yahoo Inc. 1 Outline What are clouds and their benefits Clouds at Yahoo
More informationA Concept Model for the UK Public Sector
A Concept Model for the UK Public Sector January 2012, Version 0.2 January 2012, Version 0.2 Introduction This paper is produced by the CTO Council Information Domain to scope and propose a concept model
More informationItem 5.2. 3 rd International Transport Forum. Big Data to monitor air and maritime transport. Paris, 17-18 March 2016
3 rd International Transport Forum Paris, 17-18 March 2016 Item 5.2 Big Data to monitor air and maritime transport DG EUROSTAT - Anna Białas-Motyl, Transport statistics & TF Big Data Content Big Data at
More informationThematic Unit of Excellence on Computational Materials Science Solid State and Structural Chemistry Unit, Indian Institute of Science
Thematic Unit of Excellence on Computational Materials Science Solid State and Structural Chemistry Unit, Indian Institute of Science Call for Expression of Interest (EOI) for the Supply, Installation
More informationThe UNECE Big Data Sandbox: What Means to What Ends?
Distr. GENERAL UNITED NATIONS ECONOMIC COMMISSION FOR EUROPE (UNECE) CONFERENCE OF EUROPEAN STATISTICIANS Working Paper No. 17 th April 2015 ENGLISH ONLY Workshop on the Modernisation of Statistical Production
More informationOfficial Statistics in the Age. of Big Data. SAS Forum BeLux 2014. Michail.Skaliotis@ec.europa.eu Albrecht.Wirthmann@ec.europa.eu.
Official Statistics in the Age SAS Forum BeLux 2014 of Big Data Michail.Skaliotis@ec.europa.eu Albrecht.Wirthmann@ec.europa.eu Table of Contents / Storyboard What is "Official Statistics"? Drivers of Big
More information22 nd Meeting of the European Statistical System Committee
22 nd Meeting of the European Statistical System Committee Riga (Latvia), 26 September 2014 Item 8 of the agenda ESS Big Data Action Plan and Roadmap 1.0 Work Programme Objective 11.1 Eurostat Big Data
More informationThe Greenplum Analytics Workbench
The Greenplum Analytics Workbench External Overview 1 The Greenplum Analytics Workbench Definition Is a 1000-node Hadoop Cluster. Pre-configured with publicly available data sets. Contains the entire Hadoop
More informationTHE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES
THE ATLAS DISTRIBUTED DATA MANAGEMENT SYSTEM & DATABASES Vincent Garonne, Mario Lassnig, Martin Barisits, Thomas Beermann, Ralph Vigne, Cedric Serfon Vincent.Garonne@cern.ch ph-adp-ddm-lab@cern.ch XLDB
More informationAT&T Global Network Client for Windows Product Support Matrix January 29, 2015
AT&T Global Network Client for Windows Product Support Matrix January 29, 2015 Product Support Matrix Following is the Product Support Matrix for the AT&T Global Network Client. See the AT&T Global Network
More informationHadoop IST 734 SS CHUNG
Hadoop IST 734 SS CHUNG Introduction What is Big Data?? Bulk Amount Unstructured Lots of Applications which need to handle huge amount of data (in terms of 500+ TB per day) If a regular machine need to
More informationOutline. High Performance Computing (HPC) Big Data meets HPC. Case Studies: Some facts about Big Data Technologies HPC and Big Data converging
Outline High Performance Computing (HPC) Towards exascale computing: a brief history Challenges in the exascale era Big Data meets HPC Some facts about Big Data Technologies HPC and Big Data converging
More informationBig Data: What You Should Know. Mark Child Research Manager - Software IDC CEMA
Big Data: What You Should Know Mark Child Research Manager - Software IDC CEMA Agenda Market Dynamics Defining Big Data Technology Trends Information and Intelligence Market Realities Future Applications
More informationOBSERVEIT DEPLOYMENT SIZING GUIDE
OBSERVEIT DEPLOYMENT SIZING GUIDE The most important number that drives the sizing of an ObserveIT deployment is the number of Concurrent Connected Users (CCUs) you plan to monitor. This document provides
More informationElasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack
Elasticsearch on Cisco Unified Computing System: Optimizing your UCS infrastructure for Elasticsearch s analytics software stack HIGHLIGHTS Real-Time Results Elasticsearch on Cisco UCS enables a deeper
More informationData Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
More informationAn Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics
An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,
More informationScalable Cloud Computing Solutions for Next Generation Sequencing Data
Scalable Cloud Computing Solutions for Next Generation Sequencing Data Matti Niemenmaa 1, Aleksi Kallio 2, André Schumacher 1, Petri Klemelä 2, Eija Korpelainen 2, and Keijo Heljanko 1 1 Department of
More informationHadoop Submitted in partial fulfillment of the requirement for the award of degree of Bachelor of Technology in Computer Science
A Seminar report On Hadoop Submitted in partial fulfillment of the requirement for the award of degree of Bachelor of Technology in Computer Science SUBMITTED TO: www.studymafia.org SUBMITTED BY: www.studymafia.org
More informationSchema and Release Support. Job Spooler / Job Server Compatibility. PTC Creo Elements/Direct Model Manager
Page 1 of 12 PT reo Elements/Direct Manager Server 19.0 supports:.1.1 (64 bit).1 Pro.1 Pro (64 bit).1 Enterprise.1 Enterprise (64 bit) (64 bit) Pro Pro (64 bit) (64 bit) PT reo Elements/Direct Manager
More informationDealing with Data Especially Big Data
Dealing with Data Especially Big Data INFO-GB-2346.30 Spring 2016 Very Rough Draft Subject to Change Professor Norman White Background: Most courses spend their time on the concepts and techniques of analyzing
More informationBig Data Challenges. technology basics for data scientists. Spring - 2014. Jordi Torres, UPC - BSC www.jorditorres.
Big Data Challenges technology basics for data scientists Spring - 2014 Jordi Torres, UPC - BSC www.jorditorres.eu @JordiTorresBCN Data Deluge: Due to the changes in big data generation Example: Biomedicine
More informationBig Data for everyone Democratizing big data with the cloud. Steffen Krause Technical Evangelist @AWS_Aktuell skrause@amazon.de
Big Data for everyone Democratizing big data with the cloud Steffen Krause Technical Evangelist @AWS_Aktuell skrause@amazon.de Does this Data make me look big? Overview Designing big data solutions in
More informationUsing the HP Vertica Analytics Platform to Manage Massive Volumes of Smart Meter Data
Technical white paper Using the HP Vertica Analytics Platform to Manage Massive Volumes of Smart Meter Data The Internet of Things is expected to connect billions of sensors that continuously gather data
More informationOracle Big Data SQL Technical Update
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
More informationSavanna Hadoop on. OpenStack. Savanna Technical Lead
Savanna Hadoop on OpenStack Sergey Lukjanov Savanna Technical Lead Mirantis, 2013 Agenda Savanna Overview Savanna Use Cases Roadmap & Current Status Architecture & Features Overview Hadoop vs. Virtualization
More informationLarge scale processing using Hadoop. Ján Vaňo
Large scale processing using Hadoop Ján Vaňo What is Hadoop? Software platform that lets one easily write and run applications that process vast amounts of data Includes: MapReduce offline computing engine
More informationStrategies For Setting Up Your Organisation For Success With Big Data. Kevin Long Business Development Director Teradata
Strategies For Setting Up Your Organisation For Success With Big Data Kevin Long Business Development Director Teradata Agenda Developing a big data strategy and plan that is aligned with your organisation
More informationOracle Database - Engineered for Innovation. Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya
Oracle Database - Engineered for Innovation Sedat Zencirci Teknoloji Satış Danışmanlığı Direktörü Türkiye ve Orta Asya Oracle Database 11g Release 2 Shipping since September 2009 11.2.0.3 Patch Set now
More informationTowards an Optimized Big Data Processing System
Towards an Optimized Big Data Processing System The Doctoral Symposium of the IEEE/ACM CCGrid 2013 Delft, The Netherlands Bogdan Ghiţ, Alexandru Iosup, and Dick Epema Parallel and Distributed Systems Group
More informationNavigating the Big Data infrastructure layer Helena Schwenk
mwd a d v i s o r s Navigating the Big Data infrastructure layer Helena Schwenk A special report prepared for Actuate May 2013 This report is the second in a series of four and focuses principally on explaining
More informationData Mining with Hadoop at TACC
Data Mining with Hadoop at TACC Weijia Xu Data Mining & Statistics Data Mining & Statistics Group Main activities Research and Development Developing new data mining and analysis solutions for practical
More informationData Analytics at NERSC. Joaquin Correa JoaquinCorrea@lbl.gov NERSC Data and Analytics Services
Data Analytics at NERSC Joaquin Correa JoaquinCorrea@lbl.gov NERSC Data and Analytics Services NERSC User Meeting August, 2015 Data analytics at NERSC Science Applications Climate, Cosmology, Kbase, Materials,
More informationOverview Motivation MapReduce/Hadoop in a nutshell Experimental cluster hardware example Application areas at the Austrian National Library
Overview Motivation MapReduce/Hadoop in a nutshell Experimental cluster hardware example Application areas at the Austrian National Library Web Archiving Austrian Books Online SCAPE at the Austrian National
More informationESS event: Big Data in Official Statistics
ESS event: Big Data in Official Statistics v erbi v is 1 Parallel sessions 2A and 2B LEARNING AND DEVELOPMENT: CAPACITY BUILDING AND TRAINING FOR ESS HUMAN RESOURCES FACILITATOR: JOSÉ CERVERA- FERRI 2
More informationVery Large Enterprise Network, Deployment, 25000+ Users
Very Large Enterprise Network, Deployment, 25000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering
More informationCouncil of the European Union Brussels, 13 February 2015 (OR. en)
Council of the European Union Brussels, 13 February 2015 (OR. en) 6022/15 NOTE From: To: Presidency RECH 19 TELECOM 29 COMPET 30 IND 16 Permanent Representatives Committee/Council No. Cion doc.: 11603/14
More informationDevelopment of Monitoring and Analysis Tools for the Huawei Cloud Storage
Development of Monitoring and Analysis Tools for the Huawei Cloud Storage September 2014 Author: Veronia Bahaa Supervisors: Maria Arsuaga-Rios Seppo S. Heikkila CERN openlab Summer Student Report 2014
More informationCS 493N: Big Data Engineering - Overview
CS 493N: Big Data Engineering - Overview Introduction Data Structures for Big Data Data structures Basic Search Algorithms for Big Data Machine Learning and Predictive Analytics Intro to machine learning
More informationBIG DATA SOLUTION DATA SHEET
BIG DATA SOLUTION DATA SHEET Highlight. DATA SHEET HGrid247 BIG DATA SOLUTION Exploring your BIG DATA, get some deeper insight. It is possible! Another approach to access your BIG DATA with the latest
More informationArchitecture & Experience
Architecture & Experience Data Mining - Combination from SAP HANA, R & Hadoop Markus Severin, Solution Principal Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein
More informationCOMPARISON OF FIXED & VARIABLE RATES (25 YEARS) CHARTERED BANK ADMINISTERED INTEREST RATES - PRIME BUSINESS*
COMPARISON OF FIXED & VARIABLE RATES (25 YEARS) 2 Fixed Rates Variable Rates FIXED RATES OF THE PAST 25 YEARS AVERAGE RESIDENTIAL MORTGAGE LENDING RATE - 5 YEAR* (Per cent) Year Jan Feb Mar Apr May Jun
More informationCOMPARISON OF FIXED & VARIABLE RATES (25 YEARS) CHARTERED BANK ADMINISTERED INTEREST RATES - PRIME BUSINESS*
COMPARISON OF FIXED & VARIABLE RATES (25 YEARS) 2 Fixed Rates Variable Rates FIXED RATES OF THE PAST 25 YEARS AVERAGE RESIDENTIAL MORTGAGE LENDING RATE - 5 YEAR* (Per cent) Year Jan Feb Mar Apr May Jun
More informationHadoop in Social Network Analysis - overview on tools and some best practices - Headline Goes Here
Hadoop in Social Network Analysis - overview on tools and some best practices - Headline Goes Here Speaker Name or Subhead Goes Here GridKa School 2013, Karlsruhe 2013-08-27 Mirko Kämpf mirko@cloudera.com
More informationAnalysis One Code Desc. Transaction Amount. Fiscal Period
Analysis One Code Desc Transaction Amount Fiscal Period 57.63 Oct-12 12.13 Oct-12-38.90 Oct-12-773.00 Oct-12-800.00 Oct-12-187.00 Oct-12-82.00 Oct-12-82.00 Oct-12-110.00 Oct-12-1115.25 Oct-12-71.00 Oct-12-41.00
More informationModernization of European Official Statistics through Big Data methodologies and best practices: ESS Big Data Event Roma 2014
Modernization of European Official Statistics through Big Data methodologies and best practices: ESS Big Data Event Roma 2014 CONCEPT PAPER (DRAFT VERSION v0.3) Big Data for Official Statistics: recognition
More informationBIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research &
BIG DATA Alignment of Supply & Demand Nuria de Lama Representative of Atos Research & Innovation 04-08-2011 to the EC 8 th February, Luxembourg Your Atos business Research technologists. and Innovation
More informationHDFS. Hadoop Distributed File System
HDFS Kevin Swingler Hadoop Distributed File System File system designed to store VERY large files Streaming data access Running across clusters of commodity hardware Resilient to node failure 1 Large files
More informationBig Data Keep it Simple
RSM Leadership Summit Big Data Keep it Simple Rotterdam, October 3 rd 2014 Jens-Peter Seick, VP Head of Product Management and Development Fujitsu in Europe 0 Agenda Big Data phenomena Big Data technologies
More informationRedmine: A project management software tool. January, 2013
Redmine: A project management software tool January, 2013 Outline Introduction to Redmine. Important concepts of Redmine. How to use Redmine. 1 Introduction: What is Redmine? Redmine is a project management
More informationVirtual Machine (VM) These VMs are to be used for teaching: they are not workstations for calculation.
Computer and Software Infrastructure Available to Teachers and Students of the MSc in Big Data Virtual Machine (VM) All students have at their disposal a VM Windows 7 64-bit, 3 GB RAM, 1 vcpu. This VM
More informationReal-time Big Data Analytics with Storm
Ron Bodkin Founder & CEO, Think Big June 2013 Real-time Big Data Analytics with Storm Leading Provider of Data Science and Engineering Services Accelerating Your Time to Value IMAGINE Strategy and Roadmap
More informationQi Liu Rutgers Business School ISACA New York 2013
Qi Liu Rutgers Business School ISACA New York 2013 1 What is Audit Analytics The use of data analysis technology in Auditing. Audit analytics is the process of identifying, gathering, validating, analyzing,
More informationBig Data: What Can Official Statistics Expect?
Big Data: What Can Official Statistics Expect? Peter Hackl Österreichische Statistiktage 2015 Outline Data Needs in Official Statistics Alternative Data Sources Historical Facts Some Initiatives in Detail
More informationTesting Big data is one of the biggest
Infosys Labs Briefings VOL 11 NO 1 2013 Big Data: Testing Approach to Overcome Quality Challenges By Mahesh Gudipati, Shanthi Rao, Naju D. Mohan and Naveen Kumar Gajja Validate data quality by employing
More informationAMD SEAMICRO OPENSTACK BLUEPRINTS CLOUD- IN- A- BOX OCTOBER 2013
AMD SEAMICRO OPENSTACK BLUEPRINTS CLOUD- IN- A- BOX OCTOBER 2013 OpenStack What is OpenStack? OpenStack is a cloud operaeng system that controls large pools of compute, storage, and networking resources
More informationlocuz.com Big Data Services
locuz.com Big Data Services Big Data At Locuz, we help the enterprise move from being a data-limited to a data-driven one, thereby enabling smarter, faster decisions that result in better business outcome.
More informationBig Data andofficial Statistics Experiences at Statistics Netherlands
Big Data andofficial Statistics Experiences at Statistics Netherlands Peter Struijs Poznań, Poland, 10 September 2015 Outline Big Data and official statistics Experiences at Statistics Netherlands with:
More informationBuilding 1000 node cluster on EMR Manjeet Chayel
Building 1000 node cluster on EMR Manjeet Chayel What is EMR? Amazon Elas+c MapReduce Hadoop- as- a- service Map- Reduce engine What is EMR? Integrated with tools Massively parallel Integrated to AWS services
More informationbwgrid Treff MA/HD Sabine Richling, Heinz Kredel Universitätsrechenzentrum Heidelberg Rechenzentrum Universität Mannheim 24.
bwgrid Treff MA/HD Sabine Richling, Heinz Kredel Universitätsrechenzentrum Heidelberg Rechenzentrum Universität Mannheim 24. November 2010 Richling/Kredel (URZ/RUM) bwgrid Treff WS 2010/2011 1 / 17 Course
More informationBIG DATA: FROM HYPE TO REALITY. Leandro Ruiz Presales Partner for C&LA Teradata
BIG DATA: FROM HYPE TO REALITY Leandro Ruiz Presales Partner for C&LA Teradata Evolution in The Use of Information Action s ACTIVATING MAKE it happen! Insights OPERATIONALIZING WHAT IS happening now? PREDICTING
More informationBig Data and Data Science. The globally recognised training program
Big Data and Data Science The globally recognised training program Certificate in Big Data Analytics Duration 5 days Big Data and Data Science enables value creation from data, through the use of calculative
More informationNews and trends in Data Warehouse Automation, Big Data and BI. Johan Hendrickx & Dirk Vermeiren
News and trends in Data Warehouse Automation, Big Data and BI Johan Hendrickx & Dirk Vermeiren Extreme Agility from Source to Analysis DWH Appliances & DWH Automation Typical Architecture 3 What Business
More informationA Performance Analysis of Distributed Indexing using Terrier
A Performance Analysis of Distributed Indexing using Terrier Amaury Couste Jakub Kozłowski William Martin Indexing Indexing Used by search
More informationAn introduction to Fyrkat
Cluster Computing May 25, 2011 How to get an account https://fyrkat.grid.aau.dk/useraccount How to get help https://fyrkat.grid.aau.dk/wiki What is a Cluster Anyway It is NOT something that does any of
More informationBig Data Mining Services and Knowledge Discovery Applications on Clouds
Big Data Mining Services and Knowledge Discovery Applications on Clouds Domenico Talia DIMES, Università della Calabria & DtoK Lab Italy talia@dimes.unical.it Data Availability or Data Deluge? Some decades
More informationBiDAl: Big Data Analyzer for Cluster Traces
BiDAl: Big Data Analyzer for Cluster Traces Alkida Balliu, Dennis Olivetti, Ozalp Babaoglu, Moreno Marzolla, Alina Sirbu Department of Computer Science and Engineering University of Bologna, Italy BigSys
More informationThe Rise of Industrial Big Data. Brian Courtney General Manager Industrial Data Intelligence
The Rise of Industrial Big Data Brian Courtney General Manager Industrial Data Intelligence Agenda Introduction Big Data for the industrial sector Case in point: Big data saves millions at GE Energy Seeking
More informationBITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand?
BITKOM& NIK - Big Data Wo liegen die Chancen für den Mittelstand? The Big Data Buzz big data is a collection of data sets so large and complex that it becomes difficult to process using on-hand database
More information