Optimists are calling it a

Size: px
Start display at page:

Download "Optimists are calling it a"

Transcription

1 What is big data, and could it transform development policy? by Emmanuel Letouzé, University of California In just a few years big data have affected industries and activities from marketing and advertising to intelligence gathering and law enforcement, stirring much excitement and scepticism. With policymaking increasingly looking like big data s next frontier, is this phenomenon what one expert, Andreas Weigend, is calling the new oil that needs to be refined poised to be a blessing or a curse for human development and social progress [1, 2]? Optimists are calling it a revolution that will change, mostly for the better, how we live, think and work. Some World Bank officials have even expressed the hope that Africa s statistical tragedy that is, the dearth of reliable official statistics in some of the world s poorest places may be partly fixed by big data [3, 4]. But sceptics and critics have been more circumspect, and some plainly antagonistic referring to big data as a big ruse, a big hype, a big risk as well as, of course, big brother, in the wake of the revelations by former US National Security Agency contractor Edward Snowden. Gaining ground Big data, especially as applied to development and public policy issues, is in its intellectual and operational infancy. Joe Hellerstein, computer scientist at the University of California, Berkeley, United States, made an early mention of an upcoming Industrial Revolution of data in November 2008; while The Economist talked about a data deluge in early 2010 [5, 6]. "Big data" itself became a mainstream term only a couple of years ago. Google searches are one metric that shows this: the number of searches that include the term did not take off until In those two years, four major reports were published: by the UN Global Pulse, the World Economic Forum, the McKinsey Global Institute, and Danah Boyd and Kate Crawford, researchers at Microsoft and academic institutions [7-10]. Over the past three years, publications and initiatives about big data for development or data science for social good have become a source of big data themselves. Of course, the big data buzz could just be a bubble, or just hype: as some observers point out, automated analysis of large datasets is not new. So what is? What is big data? There is no single agreed definition of big data. For one, it is data generated through our increasing use of digital devices and web-supported tools and platforms in our daily lives. In any given minute, hundreds of millions of individuals across the globe use some of the world s 7- to 8-billion mobile phones to make a call, send a text Fig. 1: Global Mobile Data 2014 traffic growth and forecast. message or an . Or they may wire money, buy a book, search online, pay for a meal with a credit card, update their Facebook status, send tweets, upload videos to YouTube, publish a blog post and so on. Each of these actions leaves a digital trace. Added up, this digital information makes up the bulk of big data. Each year since 2012, well over 1,2 ZB of data has been produced 1021 bytes, enough to fill 80-billion 16 GB iphones (which would circle the earth more than 100 times) (see Table 1). And the volume of these data is growing fast [11]. So volume, velocity and variety are the three Vs that characterise big data, with the value that could be extracted from them often added as a fourth V. 40 PositionIT April/May 2014

2 Unit Size What it means Bit (b) 1 or 0 Short for binary digit, after the binary code (1 or 0) computers use to store and process data including text, numbers, images, videos, etc. Byte (B) 8 bits Enough information to create a number or an English letter in computer code. It is the basic unit of computing. Kilobyte (kb) 1000, or 2 10, bytes From thousand in Greek. One page of typed text is 2 kb. Megabyte (MB) 1000 kb, or 2 20, bytes From large in Greek. The MP3 file of a typical song is about 4 MB. Gigabytes (GB) Terabyte (TB) Petabyte (PB) Exabyte (EB) Zettabyte (ZB) Yottabyte (YB) 1000 MB, or 2 30, bytes 1000 GB, or 2 40, bytes 1000 TB, or 2 50, bytes 1000 PB, or 2 60, bytes 1000 EB, or 2 70, bytes 1000 ZB, or 2 80, bytes From giant in Greek. A two-hour film can be compressed into 1-2 GB. A 1 GB text file contains over 1-billion characters, or roughly 290 copies of Shakespeare s complete works. From monster in Greek. All the catalogued books in America s Library of Congress total 15 TB. All the tweets sent before the end of 2013 would approximately fill an 18,5 TB text file. Printing such a file (at a rate of 15 A4-sized pages per minute) would take over 1200 years. The NSA is reportedly analysing 1,6 % of global internet traffic, or about 30 PB, per day. Continuously playing 30 PB of music would take over years, which corresponds to the time that has elapsed since the first Homo Sapiens left Africa. 1 EB of data corresponds to the storage capacity of iphone 5 devices with a 32 GB memory. By 2018, the total volume of monthly mobile data traffic is forecast to be about half of an EB. If this volume of data were stored on 32 GB iphone 5 devices stacked one on top of the other, the pile would be over 283 times the height of the Empire State Building. It is estimated that in 2013, humanity generated 4-5 ZB of data, which exceeds the quantity of data in 46-trillion print issues of The Economist. If that many magazines were laid out sheet by sheet on the ground, they would cover the total land surface area of the Earth. The contents of one human s genetic code can be stored in less than 1,5 GB, meaning that 1 YB of storage could contain the genome of over 800-trillion people, or roughly that of times the entire world population. The prefixes are set by the International Bureau of Weights and Measures. Source: Adapted and updated from The Economist by Emmanuel Letouzé and Gabriel Pestre, using data from Cisco, the Daily Mail, Twitter (via SEC Archives (via and the book Uncharted: Big Data as a Lens on Human Culture (2013) by Erez Aiden and Jean-Baptiste Michel. Table 1: Data inflation. And much as a population with a sudden outburst of fertility gets both larger and younger, the proportion of digital data produced recently is growing ever faster up to 90% of the world s data was created over just two years ( ), according to one much cited account [12]. Data types Big data come in different types. One kind is small pieces of hard data numbers or facts, for example described by Alex Sandy Pentland, a professor at the Massachusetts Institute of Technology, United States, as digital breadcrumbs [13]. They are said to be structured because they make up datasets of variables that can be easily tagged, categorised, and organised (in columns and rows for instance) for systematic analysis. One example is Call Detail Records (CDRs) collected by mobile phone operators. CDRs are metadata (data about data) that capture subscribers use of their cell-phones including an identification code and, at a minimum, the location of the phone tower that routed the call for both caller and receiver and the time and duration of call. Large operators collect over CDRs per day [14] (see Fig. 1). A second kind of big data are videos, documents, blog posts and other social media content. Most of these data are unstructured and so harder to analyse. They differ from breadcrumbs in that they are subject to their authors editorial choices and, being subjective, may paint a deceiving picture. For example, you might blog that you are boycotting a certain product, but your credit card statement may reveal a different preference based on actual purchases. A third kind of big data is gathered remotely by digital sensors and reflects human actions. These might be smart meters installed in homes to record electricity consumption, or satellite imagery that can pick up physical information such as vegetation cover as an indicator of deforestation [15]. Some consider the universe of big data to be much wider including administrative records, price or weather data, for instance, or books that have been previously digitised which, taken collectively, may constitute a fourth kind. Defining features But the bulk of big data is machinereadable, generated about and by people some combination of the types mentioned above. These data were unavailable 10 years ago, before the age of Facebook or the explosion of mobile phone use and they stem from powerful technological and societal changes. Big data s main novelty is that they come from electronic sources and end up in databases whose primary purpose is not statistical inference [16]. In other words, they were not collected or sampled with the explicit intention of drawing conclusions from them. This also makes putting big data to use challenging. So the term big data may be a misleading misnomer: size isn t their defining feature. For example, an Excel spreadsheet with CDRs may not be a big file; the entire World Bank Development Indicators database is a big file but the latter results from fully controlled processes, including surveys and statistical imputations undertaken by official bodies. The difference is primarily qualitative it s in the kinds of information contained in the data and the way these are generated. To add an extra layer of complexity, Big Data is not about the data, as Harvard University professor Gary King puts PositionIT April/May

3 Applications Explanation Examples Comments and caveats UN Global Pulse Taxonomy Descriptive Big data can document and convey what is happening. This application is quite similar to the "real-time awareness" application although it is less ambitious in its objectives. Any infographic, including maps, that renders vast amounts of data legible to the reader is an example of a descriptive application. Describing data always implies making choices and assumptions about what and how data are displayed that need to be made explicit and understood; it is well known that even bar graphs and maps can be misleading. Predictive Big data could give a sense of what is likely to happen, regardless of why. One kind of "prediction" refers to what may happen next the predictive policing mentioned above is one example. Another kind refers to predicting prevailing conditions through big data as in the cases of socioeconomic levels using CDRs in Latin America and the Ivory Coast. Similar comments as those made for the "early-warning" and "real-time awareness" applications apply. Prescriptive, or diagnostic Big data might shed light on why things may happen and what could be done about it. So far there have been next to no clear-cut examples of this application in development contexts. The example of CDR data used to show that bus routes in Abidjan could be "optimised" falls closest to a case where the analysis identifies causal links and can shape policy. Most comments about "real-time feedback" application apply. Strictly speaking, an example of the diagnostics application would require being able to assign causality. The prescriptive application works best in theory when supported by feedback systems and loops on the effect of policy actions. Alternative Taxonomy Early warning Early detection of anomalies in how populations use digital devices and services can enable faster response in times of crisis. Predictive policing, based upon the notion that analysis of historical data can reveal certain combinations of factors associated with greater likelihood of increased criminality in a given area; it can be used to allocate police resources. Google Flu trends is another example, where searches for particular terms ( runny nose, itchy eyes ) are analysed to detect the onset of the flu season although its accuracy is debated. This application assumes that certain regularities in human behaviours can be observed and modeled. Key challenges for policy include the tendency of most malfunction-detection systems and forecasting models to over-predict i.e. to have a higher prevalence of "false positives". Real-time awareness Big data can paint a fine-grained and current representation of reality which can inform the design and targeting of programs and policies. Using data released by Orange,researchers found a high degree of association between social networks and language distribution in Ivory Coast suggesting that such data may provide information about language communities in countries where it is unavailable. The appeal and underlying argument for this application is the notion that big data may be a substitute for bad or scarce data; however models that show high correlations between "big data-based" and "traditional" indicators often require the availability of the latter to be trained and built. "Real-time" here means using high frequency digital data to get a picture of reality at any given time. Real-time feedback The ability to monitor a population in real-time makes it possible to understand where policies and programs are failing, and make the necessary adjustments. Private corporations already use big data analytics. For development, this might include analysing the impact of a policy action e.g. the introduction of new traffic regulations in real-time. Although appealing, few (if any) actual examples of this application exist; a challenge is making sure that any observed change can be attributed to the intervention or "treatment". However high-frequency data can also contain "natural experiments" such as a sudden drop in online prices of a given good that can be leveraged to infer causality. Sources: [7, 42] Table 2: Actual and potential uses of big data for development. 42 PositionIT April/May 2014

4 it [17]. It s about big data analytics, which broadly refers to improvements in computing power and analytical capacities such as statistical machinelearning and algorithms that are able to look for and unveil patterns and trends in vast amounts of complex data. This is the second feature of big data: the tools and methods, hardware and software now available to analyse digital data. A third, less discussed but important property of big data is that it has become is a movement [18]. And that movement is increasingly attracting multidisciplinary teams of social and computer scientists with a mindset to turn mess into meaning, as data scientist Andreas Weigend puts it in essence, defining big data as a movement to turn data into decision making [2]. Statements such as this have renewed interest in the prospects and promise of data-driven or evidence-based policymaking although there are, technological, commercial and political implications that are far from trivial. How exactly can big data new kinds of data, new capacities to analyse it, with new intentions affect societies? And what explains the buzz it has created? The promise stems from two aspects: supply of ever-more data, and demand for better, faster and cheaper information in other words there is both a push for and a pull towards big data. Data demand People are frustrated with the current tools and systems available for decision-making. For instance, a good indicator of a region s poverty or underdevelopment is a lack of poverty or development data [19]. Some countries (most of them with a recent history of conflict) haven t had a census in four decades or more. Their population size, structure and distribution is essentially anyone s guess. Even though official figures exist, they are often based on incomplete data [20]. Poor data also mean that some countries official GDP figures get an overnight boost of 40% for Ghana in 2010 or 60% for Nigeria in 2014 when changes in the structure of their economies, such as the rise of the technology sector, are finally taken into account [21-22]. This lack of reliable data has presided over the recent UN call for a data revolution. The basic rationale is that, in the age of big data, economies should be steered by policymakers relying on better navigation instruments and indicators that let them design and implement more agile and better targeted policies and programmes. Big data has even been said to hold the potential for national statistical systems in data-poor areas to leapfrog ahead, much as many poor countries skipped the landline phase to jump straight into the mobile phone era [4]. Supplying new knowledge The appeal of potentially leaping ahead is also shaped by the supply side of big data. There is early practical evidence and a growing body of work on big data s novel potential to understand and affect human populations and processes. For example, big data has been used to track inflation online, estimate and predict changes in GDP in near real-time, monitor traffic or even a dengue outbreak [23-26]. Monitoring social media data to analyse people s sentiments is opening new ways to measure welfare, while and Twitter data could be used to study internal and international migration [25,27]. And an especially rich and growing academic literature is using CDRs to study migration patterns, socioeconomic levels and malaria spread, among others. Guidance for analysing big data, published by UN Global Pulse, has focused on four fields: disaster response, public health, poverty and socioeconomic levels, and human mobility and transportation [28]. Mobile phone data analysis examples, based on UN Global Pulse s primer and the 2013 World Disaster Report Data on mobile money transfers in the aftermath of the 2008 earthquake in Rwanda was used to analyse the timing, magnitude, and motivation of donations to affected communities revealing notably that transfers were more likely to benefit wealthier individuals [29]. CDR analysis has been used to study infectious disease spread and control in an urban slum in Kibera, Kenya. An especially promising avenue is using CDRs to predict socio-economic levels. This is done by overlaying and matching CDR-based indicators (such as average call volumes in an area) with known socioeconomic variables (such as income levels) to build statistical models able to predict patterns and trends. 44 PositionIT April/May 2014

5 For human mobility and transportation, CDRs from Côte d Ivoire, made available by Orange under the umbrella of a D4D (Data for Development) challenge, helped model bus routes in Abidjan and show that travel time could be reduced by 10%. Real-time traffic information For instance, Google Traffic Alerts provide information to consumers on their daily commute using a mix of data sources some public (such as construction schedules), some private (such as telecom companies tracking individual user devices to calculate time to work) and some passively-generated (for example, a cluster of calls made from a similar location might indicate a traffic jam). Enhanced understanding of travel behaviour This requires matching together travel data derived from mobile phone use with other socio-economic data to reveal a pattern of preferences in travel behaviour (as opposed to stated preferences, derived from surveys). As an example, in the UK, East Coast Trains used data from Telefonica to better understand customer behaviour on the London to Edinburgh route [28,42]. Meanwhile, various other authors have proposed how big data could benefit development. The UN Global Pulse distinguished the early warning uses from real-time awareness, or from real-time monitoring of the impact of a policy. Others contrast its descriptive function (such as a real-time map) from predictive and diagnostic functions [7,30] (see Table 2). Risks and challenges Of course, big data s promise has been met with warnings about its perils. The risks, challenges and more generally the hard questions were articulated as early as 2011 [10]. Perhaps the most severe risks and most urgent avenues for research and debate are to individual rights, privacy, identity, and security. In addition to the obvious intrusion of surveillance activities and issues around their legality and legitimacy, there are important questions about data anonymisation : what it means and its limits. A study of movie rentals showed that even anonymised data could be de-anonymised linked to a known individual by correlating rental dates of as few as three movies with the dates of posts on an online movie platform [31]. Other research has found that CDRs that record location and time, even when free of any individual identifier could be re-individualised. In that case, four data points were theoretically sufficient to uniquely single out individuals out of the whole dataset with 95% accuracy [32]. Critics also point to the risks associated with basing decisions on biased data or dubious analyses (sometimes called threats to both external and internal validity). If policymakers come to believe that the data don t lie, such risks could be especially worrisome. Some examples are outlined below: Big data risks to drawing valid conclusions A key challenge in big data is that the people generating it have selected themselves as data generators through their activity. In terms this is a "selection bias" and it means that analysis of big data is likely to yield a different result from a traditional survey (or poll), which would seek out a representative cross section of the population. For example, trying to answer the question do people in country A prefer rice or chips? by mining data on Twitter would be biased in favour of young people s preferences as Manufactured for premium performance CONNECTING THE DOTS INTELLIGENT SOLUTIONS Océ South Africa aims to be the top supplier of innovative and high quality products and services for the printing and management of documents in professional environments. The company s solutions for market segments include print on demand, transactional printing and wide format. Supplied and serviced by Océ SA Contact: Charné Oosthuizen Tel: +27 (0) info@oce.co.za PositionIT April/May

6 they make up more of Twitter s users. So analyses based on big data may lack "external validity", although it is possible that individuals that differ in almost all respects may have similar preferences and display identical behaviors (young people may have the same preferences as older people). Another risk comes from analyses that are flawed because they lack "internal validity". For instance, a sharp drop in the volume of CDRs from an area might be interpreted, based on past events, as heralding a looming conflict. But it could actually be caused by something different, such as a mobile phone tower having gone down in the area. Another risk is that analyses based on big data will focus too much on correlation and prediction at the expense of cause, diagnostics or inference, without which policy is essentially blind. A good example is predictive policing. Since about 2010, police and law enforcement forces in some US and UK cities have crunched data to assess the likelihood of increased crime in certain areas, predicting rises based on historical patterns. Forces dispatch their resources accordingly, and this has reduced crime in most cases [33]. However, unless there is knowledge of why crime is rising it s not possible to put in place preventive policy that tackles the root causes or contributing factors [34]. Yet another big risk that has not received the attention it merits is big data s potential to create a new digital divide that may widen rather than close existing gaps in income and power worldwide [35]. One of the "three paradoxes" of big data is that because it requires analytical capacities and access to data that only a fraction of institutions, corporations and individuals have, the data revolution may disempower the very communities and countries it promises to serve [36]. People with the most data and capacities would be in the best position to exploit big data for economic advantage, even as they claim to use them to benefit others. A related and basic challenge is that of putting the data to use. All discussions about the data revolution assume that data matter ; that poor data are partly to blame for poor policies. But history has shown that lack of data or information has historically played only a marginal role in the decisions leading to bad policies and poor outcomes. And a blind algorithmic future may undercut the very processes that are meant to ensure that the way data Fig. 2. How open data relates to other types of data. Credit: James Manyika and others [40]. are turned into decisions is subject to democratic oversight. Big future But since the growth in data production is highly unlikely to abate, the big data bubble is similarly unlikely to burst in the near future. The world can expect more papers and controversies about big data s potential and perils for development. The future of big data will likely be shaped by three main strands: academic research, legal and frameworks for ethical use of data, and larger societal demands for greater accountability. Research will continue to examine whether and how methodological and scientific frontiers can be pushed, especially in two areas: drawing stronger inferences, and measuring and correcting sample biases. Policy debate will develop frameworks and standards normative, legal and for collecting, storing and sharing big data. These developments fall under the umbrella term ethics of big data [37, 38]. Technical advances will help, for example by injecting noise in datasets to make re-identification of the individuals represented in them more difficult. But a comprehensive approach to the ethics of big data would ideally encompass other humanistic considerations such as privacy and equality, and champion data literacy [39]. A third influence on the future of big data will be how it engages and evolves alongside the open data movement and its underlying social drivers where open data refers to data that is easily accessible, machine-readable, accessible for free or at negligible cost, and with minimal limitations on its use, transformation, and distribution [40]. (See Fig. 2.) For the foreseeable future, the big data and open data movements will be the two main pillars of a larger data revolution. Both rise against a background of increased public demand for more openness, agility, transparency and accountability for public data and actions. The political overtones so easily forgotten are clear. And so a true big data revolution should be one where data can be leveraged to change power structures and decision-making processes, not just create insights [41]. Acknowledgements This article was originally published in the SciDev.Net newsletter of 16 April 2014 and has been republished with permission of the author and SciDev. Net ( References [1] Andreas Weigend, get link [2] The new data refineries: transforming big data into decisions. (Technology Services Industry Association blog, covering a talk by Andreas Weigend. 6 January 2014). [3] Shanta Devarajan: Africa s statistical PositionIT April/May 2014

7 tragedy. (World Bank blog, 6 October 2011) [4] Marcelo Giugale: Fix Africa s statistics. (The World Post 18 December 2012) [5] Joseph Hellerstein: The commoditization of massive data analysis. (Blog on O Reilly.com 19 November 2008) [6] Data data everywhere. Kenneth Cukier interviewed for The Economist (25 February 2010) [7] Emmanuel Letouzé: Big data for development: opportunities and challenges. (UN Global Pulse, May 2012) [8] Big data, big impact: new possibilities for international development. (World Economic Forum, 2012) [9] James Manyika and others: Big data: the next frontier for innovation, competition and productivity. (McKinsey Global Institute May 2011) [10] Danah Boyd and Kate Crawford: Six provocations for Big Data. (A Decade in Internet Time: Symposium on the Dynamics of the Internet and Society, September 2011) [11] The physical size of big data. Infographic by Domo. (14 May 2013) [12] Christopher Frank: Improving decision making in the world of Big Data. (Forbes, 25 March 2012) [13] Reinventing society in the wake of Big Data. A Conversation with Alex (Sandy) Pentland (Edge, 30 August 2012) [14] Eric Bouillet, and others: Processing 6-billion CDRs/day: from research to production (experience report) pp in Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems (2012) [15] Social impact through satellite remote sensing: visualising acute and chronic crises beyond the visible spectrum. (UN Global Pulse, 28 November 2011) [16] Michael Horrigan: Big Data: a perspective from the BLS. Column written for AMSTATNEWS, the magazine of the American Statistical Association. (1 January 2013) [17] Gary King: Big Data is not about the data! Presentation (Harvard University USA, 19 November 2013) [18] Sanjeev Sardana: Big Data: it's not a buzzword, it s a movement (Forbesblog, 20 November 2013) [19] C Melamed: Development data: how accurate are the figures? (The Guardian, 31 January 2014) [20] 2010 World population and housing census programme. United Nations Statistics Division. [21] Laura Gray: How to boost GDP stats by 60% (BBC News Magazine, 9 December 2012) [22] Nigeria's economy will soon overtake South Africa's (The Economist, 21 January 2014) [23] The billion prices project. Massachusetts Institute of Technology [24] Measuring economic sentiment (The Economist, 18 July 2012) [25] Piet Daas and Mark van der Loo: Big Data (and official statistics)working paper prepared for the Meeting on the Management of Statistical Information Systems. (23-25 April 2013) [26] Rebecca Tave Gluskin and others. Evaluation of Internet-Based Dengue Query Data: Google Dengue Trends. (PLOS Neglected Tropical Diseases, 27 February 2014) [27] Emilio Zagheni and others: Inferring international and internal migration patterns from Twitter data. (World Wide Web Conference, April 7-11, 2014, Seoul, Korea) [28] New primer on mobile phone network data for development. (UN Global Pulse, 5 November 2013) [29] Joshua Blumenstock and others: Motives for mobile phone-based giving: evidence in the aftermath of natural disasters (30 December, 2013) [30] Michael Wu: Big Data Reduction 3: from descriptive to prescriptive. (Science of Social blog, Lithium 10 April 2013) [31] Arvind Narayanan and Vitaly Shmatikov: Robust de-anonymization of large sparse datasets. Pages in Proceedings of the 2008 IEEE Symposium on Security and Privacy (IEEE Computer Society Washington, DC, USA 2008) [32] Yves-Alexandre de Montjoye and others: Unique in the Crowd: The privacy bounds of human mobility (Nature scientific reports 25 March 2013) [33] Erica Goode: Sending the police before there s a crime. (The New York Times, 15 August 2011) [34] It is getting easier to foresee wrongdoing and spot likely wrongdoers (The Economist, 18 July 2013) [35] Kate Crawford: Think again: Big Data. Why the rise of machines isn t all it s cracked up to be. (Foreign Policy, 9 May 2013) [36] M Neil Richards and H Jonathan King: Three paradoxes of Big Data. (Stanford Law Review, 3 September 2013) [37] M Neil Richards and H Jonathan King: Big Data ethics. (Wake Forest Law Review, 23 January 2014) [38] M Neil Richards and H Jonathan King: Gigabytes gone wild. (Aljazeera America, 2 March 2014) [39] Rahul Bhargava: Toward a concept of popular data. (MIT Center for Civic Media, 18 November 2013) [40] James Manyika and others. Open data: unlocking innovation and performance with liquid information (McKinsey Global Institute, October 2013) [41] Emmanuel Letouzé: The Big Data revolution should be about knowledge security (Post-2015.org, 1 April 2014) [42] Robert Kirkpatrick. Digital smoke signals. (UN Global Pulse, 21 April 2011) [43] "Focus on technology and the future of humanitarian action". (International Federation of Red Cross and Red Crescent Societies, 2013). Contact Emmanuel Letouzé, University of California, eletouze@berkeley.edu PositionIT April/May

DATA-POP ALLIANCE PRIMERS SERIES Big Data and Development: An Overview

DATA-POP ALLIANCE PRIMERS SERIES Big Data and Development: An Overview DATA-POP ALLIANCE PRIMERS SERIES Big Data and Development: An Overview Emmanuel Letouzé Draft for discussion * This version: November 17 th, 2014 About this document This is the inaugural Primer in Data-Pop

More information

DATA-POP ALLIANCE PRIMERS SERIES BIG DATA & DEVELOPMENT AN OVERVIEW. March 2015. Written by. Emmanuel Letouzé

DATA-POP ALLIANCE PRIMERS SERIES BIG DATA & DEVELOPMENT AN OVERVIEW. March 2015. Written by. Emmanuel Letouzé DATA-POP ALLIANCE PRIMERS SERIES BIG DATA & DEVELOPMENT AN OVERVIEW March 2015 Written by Emmanuel Letouzé About Data-Pop Alliance Data-Pop Alliance is a research, policy and capacity building think-tank

More information

BIG DATA & DEVELOPMENT: AN OVERVIEW

BIG DATA & DEVELOPMENT: AN OVERVIEW PRIMERS SERIES BIG DATA & DEVELOPMENT: AN OVERVIEW Written by Emmanuel Letouzé May 2015 IN PARTNERSHIP AND WITH FUNDING FROM ABOUT DATA-POP ALLIANCE Data-Pop Alliance is a research, policy and capacity

More information

Big Data for Development: What May Determine Success or failure?

Big Data for Development: What May Determine Success or failure? Big Data for Development: What May Determine Success or failure? Emmanuel Letouzé letouze@unglobalpulse.org OECD Technology Foresight 2012 Paris, October 22 Swimming in Ocean of data Data deluge Algorithms

More information

The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP

The Big Deal about Big Data. Mike Skinner, CPA CISA CITP HORNE LLP The Big Deal about Big Data Mike Skinner, CPA CISA CITP HORNE LLP Mike Skinner, CPA CISA CITP Senior Manager, IT Assurance & Risk Services HORNE LLP Focus areas: IT security & risk assessment IT governance,

More information

Introduction to the Mathematics of Big Data. Philippe B. Laval

Introduction to the Mathematics of Big Data. Philippe B. Laval Introduction to the Mathematics of Big Data Philippe B. Laval Fall 2015 Introduction In recent years, Big Data has become more than just a buzz word. Every major field of science, engineering, business,

More information

Collaborations between Official Statistics and Academia in the Era of Big Data

Collaborations between Official Statistics and Academia in the Era of Big Data Collaborations between Official Statistics and Academia in the Era of Big Data World Statistics Day October 20-21, 2015 Budapest Vijay Nair University of Michigan Past-President of ISI vnn@umich.edu What

More information

BIG DATA FUNDAMENTALS

BIG DATA FUNDAMENTALS BIG DATA FUNDAMENTALS Timeframe Minimum of 30 hours Use the concepts of volume, velocity, variety, veracity and value to define big data Learning outcomes Critically evaluate the need for big data management

More information

Congrats to Game Winners. How can computation use data to solve problems? What topics have we covered in CS 202? Part 1: Completed!

Congrats to Game Winners. How can computation use data to solve problems? What topics have we covered in CS 202? Part 1: Completed! CS 202: Introduction to Computation " UNIVERSITY of WISCONSIN-MADISON Computer Sciences Department Professor Andrea Arpaci-Dusseau How can computation use data to solve problems? Congrats to Game Winners

More information

Doing Multidisciplinary Research in Data Science

Doing Multidisciplinary Research in Data Science Doing Multidisciplinary Research in Data Science Assoc.Prof. Abzetdin ADAMOV CeDAWI - Center for Data Analytics and Web Insights Qafqaz University aadamov@qu.edu.az http://ce.qu.edu.az/~aadamov 16 May

More information

Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013

Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013 Big Data & Analytics: Your concise guide (note the irony) Wednesday 27th November 2013 Housekeeping 1. Any questions coming out of today s presentation can be discussed in the bar this evening 2. OCF is

More information

Is big data the new oil fuelling development?

Is big data the new oil fuelling development? Is big data the new oil fuelling development? 12th National Convention on Statistics Manila, Philippines 2 October, 2013 Johannes Jütting PARIS21 Big data (2 The future? Linked data: Is this the future?..

More information

The big data revolution

The big data revolution The big data revolution Friso van Vollenhoven (Xebia) Enterprise NoSQL Recently, there has been a lot of buzz about the NoSQL movement, a collection of related technologies mostly concerned with storing

More information

Big Data, Official Statistics and Social Science Research: Emerging Data Challenges

Big Data, Official Statistics and Social Science Research: Emerging Data Challenges Big Data, Official Statistics and Social Science Research: Emerging Data Challenges Professor Paul Cheung Director, United Nations Statistics Division Building the Global Information System Elements of

More information

Big Data for Development

Big Data for Development Big Data for Development Malarvizhi Veerappan Senior Data Scientist @worldbankdata @malarv Why is Big Data relevant for Development? In developing countries there are large data gaps, both in quantity

More information

BIG DATA, BIOBANKS AND PREDICTIVE ANALYTICS FOR A BETTER CLINICAL OUTCOME

BIG DATA, BIOBANKS AND PREDICTIVE ANALYTICS FOR A BETTER CLINICAL OUTCOME BIG DATA, BIOBANKS AND PREDICTIVE ANALYTICS FOR A BETTER CLINICAL OUTCOME Π. Ε. Βάρδας MD, PhD(London) DISCLOSURES My great love to innovative ideas BIG DATA It is a broad term for data sets, so large

More information

Big Data a threat or a chance?

Big Data a threat or a chance? Big Data a threat or a chance? Helwig Hauser University of Bergen, Dept. of Informatics Big Data What is Big Data? well, lots of data, right? we come back to this in a moment. certainly, a buzz-word but

More information

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI

So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI So Just What Is Big Data? James E. Tcheng, MD, FACC, FSCAI Disclosures James E. Tcheng, MD, FACC, FSCAI Affiliations / Financial Relationships / Other RWI ACC Chair, Informatics and Health IT Task Force

More information

A Future Without Secrets. A NetPay Whitepaper. www.netpay.co.uk www.netpay.ie. more for your money

A Future Without Secrets. A NetPay Whitepaper. www.netpay.co.uk www.netpay.ie. more for your money A Future Without Secrets A NetPay Whitepaper A Future Without Secrets The new business buzz word is Big Data - everyone who is anyone in business is talking about it, but is this terminology just another

More information

BIG DATA : Big Opportunity or Big Threat for Official Statistics?* Jose Ramon G. Albert, Ph.D. Secretary General, NSCB Email: jrg.albert@nscb.gov.

BIG DATA : Big Opportunity or Big Threat for Official Statistics?* Jose Ramon G. Albert, Ph.D. Secretary General, NSCB Email: jrg.albert@nscb.gov. BIG DATA : Big Opportunity or Big Threat for Official Statistics?* Jose Ramon G. Albert, Ph.D. Secretary General, NSCB Email: jrg.albert@nscb.gov.ph 1 *Views expressed do not reflect those at NSCB Outline

More information

Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014

Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014 What is Big Data? Of all the data in recorded human history, 90 percent has been created in the last two years. - Mark van Rijmenam, Think Bigger, 2014 Data in the Twentieth Century and before In 1663,

More information

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016

Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016 Big Data! Majed Al-Ghandour, PhD, PE, CPM Division of Planning and Programming NCDOT 2016 NCAMPO Conference- Greensboro, NC May 12, 2016 Big Data: Data Analytical Tools for Decision Support 2 Outline Introduce

More information

The Real Questions about. Social Media Monitoring/Web Listening

The Real Questions about. Social Media Monitoring/Web Listening The Real Questions about Social Media Monitoring/Web Listening Should this new marketing discipline be called social media monitoring or web listening? Or any of the other 10 terms identified in this paper?

More information

Statistical Challenges with Big Data in Management Science

Statistical Challenges with Big Data in Management Science Statistical Challenges with Big Data in Management Science Arnab Kumar Laha Indian Institute of Management Ahmedabad Analytics vs Reporting Competitive Advantage Reporting Prescriptive Analytics (Decision

More information

Data Centric Computing Revisited

Data Centric Computing Revisited Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data

More information

How To Use Big Data In Healthcare

How To Use Big Data In Healthcare Big data challenges hll and opportunities in healthcare: h application to detecting faint signals Dr. Greg Slabaugh City University London School of Informatics Data, data, and more data According to IBM,

More information

CORRALLING THE WILD, WILD WEST OF SOCIAL MEDIA INTELLIGENCE

CORRALLING THE WILD, WILD WEST OF SOCIAL MEDIA INTELLIGENCE CORRALLING THE WILD, WILD WEST OF SOCIAL MEDIA INTELLIGENCE Michael Diederich, Microsoft CMG Research & Insights Introduction The rise of social media platforms like Facebook and Twitter has created new

More information

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics

Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,

More information

What s New About Big Data? Implications for the Accountability Community

What s New About Big Data? Implications for the Accountability Community What s New About Big Data? Implications for the Accountability Community Timothy M. Persons, Ph.D. Chief Scientist U.S. Government Accountability Office personst@gao.gov / www.gao.gov Presentation to the

More information

US-China Internet Industry Forum

US-China Internet Industry Forum US-China Internet Industry Forum Remarks by Victoria Espinel, President and CEO, BSA The Software Alliance Subject: Promoting Responsible Data Innovation Location: Washington, DC Date: Tuesday, December

More information

Analytical Tools: What Auditors Need to Know About Big Data

Analytical Tools: What Auditors Need to Know About Big Data Analytical Tools: What Auditors Need to Know About Big Data Timothy M. Persons, Ph.D. Chief Scientist U.S. Government Accountability Office personst@gao.gov / www.gao.gov / @GAOChfScientist Presentation

More information

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY

INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A SURVEY ON BIG DATA ISSUES AMRINDER KAUR Assistant Professor, Department of Computer

More information

Applications for Business Intelligence, Predictive Analytics and Big Data

Applications for Business Intelligence, Predictive Analytics and Big Data Finance, Management, & Operations Applications for Business Intelligence, Predictive Analytics and Big Data Patrick Bogan, Chief Information Officer, Fuzion Analytics Kyle Korzenowski, Chief Information

More information

One thing everyone seems to agree with is that Big Data reflects the geometric growth of captured data and our intent to take advantage of it.

One thing everyone seems to agree with is that Big Data reflects the geometric growth of captured data and our intent to take advantage of it. 1. CONTEXT Everywhere you turn these days, you will hear three buzzwords: SaaS (Software as a Service), cloud computing and Big Data. The first is a business model, the second a capacity model. The last

More information

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing

Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics

More information

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA

Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data

More information

We are Big Data A Sonian Whitepaper

We are Big Data A Sonian Whitepaper EXECUTIVE SUMMARY Big Data is not an uncommon term in the technology industry anymore. It s of big interest to many leading IT providers and archiving companies. But what is Big Data? While many have formed

More information

Matt Erickson Economist American Farm Bureau Federation March 5, 2014

Matt Erickson Economist American Farm Bureau Federation March 5, 2014 Matt Erickson Economist American Farm Bureau Federation March 5, 2014 What is Big Data? Discuss the intricacies of Big Data that exist today and what to expect ahead. Discuss the strengths, challenges

More information

Introduction to Predictive Analytics. Dr. Ronen Meiri ronen@dmway.com

Introduction to Predictive Analytics. Dr. Ronen Meiri ronen@dmway.com Introduction to Predictive Analytics Dr. Ronen Meiri Outline From big data to predictive analytics Predictive Analytics vs. BI Intelligent platforms What can we do with it. The modeling process. Example

More information

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank

Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Agenda» Overview» What is Big Data?» Accelerates advances in computer & technologies» Revolutionizes data measurement»

More information

Big Data. What is Big Data? Over the past years. Big Data. Big Data: Introduction and Applications

Big Data. What is Big Data? Over the past years. Big Data. Big Data: Introduction and Applications Big Data Big Data: Introduction and Applications August 20, 2015 HKU-HKJC ExCEL3 Seminar Michael Chau, Associate Professor School of Business, The University of Hong Kong Ample opportunities for business

More information

What happens when Big Data and Master Data come together?

What happens when Big Data and Master Data come together? What happens when Big Data and Master Data come together? Jeremy Pritchard Master Data Management fgdd 1 What is Master Data? Master data is data that is shared by multiple computer systems. The Information

More information

Big Data: A Guided Tour

Big Data: A Guided Tour #3 March 2016 IPSOS By Rich Timpone, Ph.D. Senior Vice President, Ipsos Science Centre FOREWORD Look at its power. Imagine the potential. It often feels like Big Data is omnipresent. Not least in the

More information

CivicScience Insight Report

CivicScience Insight Report CivicScience Insight Report Consumer Sentiment Toward Data Privacy Part 1 of 2: Internet-Based Data Sharing and Collection Our lives as consumers in a modern society have become characterized by digitized

More information

Britepaper. How to grow your business through events 10 easy steps

Britepaper. How to grow your business through events 10 easy steps Britepaper How to grow your business through events 10 easy steps 1 How to grow your business through events 10 easy steps As a small and growing business, hosting events on a regular basis is a great

More information

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop

International Journal of Advanced Engineering Research and Applications (IJAERA) ISSN: 2454-2377 Vol. 1, Issue 6, October 2015. Big Data and Hadoop ISSN: 2454-2377, October 2015 Big Data and Hadoop Simmi Bagga 1 Satinder Kaur 2 1 Assistant Professor, Sant Hira Dass Kanya MahaVidyalaya, Kala Sanghian, Distt Kpt. INDIA E-mail: simmibagga12@gmail.com

More information

Analyzing Big Data: The Path to Competitive Advantage

Analyzing Big Data: The Path to Competitive Advantage White Paper Analyzing Big Data: The Path to Competitive Advantage by Marcia Kaplan Contents Introduction....2 How Big is Big Data?................................................................................

More information

Big Data. Fast Forward. Putting data to productive use

Big Data. Fast Forward. Putting data to productive use Big Data Putting data to productive use Fast Forward What is big data, and why should you care? Get familiar with big data terminology, technologies, and techniques. Getting started with big data to realize

More information

DATA MINING AND WAREHOUSING CONCEPTS

DATA MINING AND WAREHOUSING CONCEPTS CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation

More information

BIG DATA TECHNOLOGY. Hadoop Ecosystem

BIG DATA TECHNOLOGY. Hadoop Ecosystem BIG DATA TECHNOLOGY Hadoop Ecosystem Agenda Background What is Big Data Solution Objective Introduction to Hadoop Hadoop Ecosystem Hybrid EDW Model Predictive Analysis using Hadoop Conclusion What is Big

More information

Big Data and Social Analytics certificate course

Big Data and Social Analytics certificate course MASSACHUSETTS INSTITUTE OF TECHNOLOGY Big Data and Social Analytics certificate course Online and part-time PAGE 1 Massachusetts Institute of Technology School of Architecture + Planning Ninety percent

More information

Grabbing Value from Big Data: The New Game Changer for Financial Services

Grabbing Value from Big Data: The New Game Changer for Financial Services Financial Services Grabbing Value from Big Data: The New Game Changer for Financial Services How financial services companies can harness the innovative power of big data 2 Grabbing Value from Big Data:

More information

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica

HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica HP Vertica at MIT Sloan Sports Analytics Conference March 1, 2013 Will Cairns, Senior Data Scientist, HP Vertica So What s the market s definition of Big Data? Datasets whose volume, velocity, variety

More information

Driving more business from your website

Driving more business from your website For financial intermediaries only. Not approved for use with customers. Driving more business from your website Why have a website? Most businesses recognise the importance of having a website to give

More information

BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013

BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013 BIG DATA: ARE YOU READY? Andy Kyiet Demand Flow Intelligence May, 2013 PERSONAL BACKGROUND Founder of the first specialist Service Management & Helpdesk System provider in Europe Past President of AFSMI

More information

Science: what is possible. Engineering: turn science into an everyday commodity (cheap, safe, reliable, resilient, )

Science: what is possible. Engineering: turn science into an everyday commodity (cheap, safe, reliable, resilient, ) : Big Data Analytics for Renewable Energy Mark J. Embrechts Dept. Industrial and Systems Engineering Rensselaer Polytechnic Institute, Troy, NY, USA What is Data Mining? Data Mining Big Data Analytics

More information

A New Era Of Analytic

A New Era Of Analytic Penang egovernment Seminar 2014 A New Era Of Analytic Megat Anuar Idris Head, Project Delivery, Business Analytics & Big Data Agenda Overview of Big Data Case Studies on Big Data Big Data Technology Readiness

More information

Research Note What is Big Data?

Research Note What is Big Data? Research Note What is Big Data? By: Devin Luco Copyright 2012, ASA Institute for Risk & Innovation Keywords: Big Data, Database Management, Data Variety, Data Velocity, Data Volume, Structured Data, Unstructured

More information

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme

Big Data Analytics. Prof. Dr. Lars Schmidt-Thieme Big Data Analytics Prof. Dr. Lars Schmidt-Thieme Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany 33. Sitzung des Arbeitskreises Informationstechnologie,

More information

Big Data Introduction, Importance and Current Perspective of Challenges

Big Data Introduction, Importance and Current Perspective of Challenges International Journal of Advances in Engineering Science and Technology 221 Available online at www.ijaestonline.com ISSN: 2319-1120 Big Data Introduction, Importance and Current Perspective of Challenges

More information

Now, Next and the Future: IT, Big Data and other Implications for RIM. Presented by Michael S. Smith / http://about.me/mikessmith

Now, Next and the Future: IT, Big Data and other Implications for RIM. Presented by Michael S. Smith / http://about.me/mikessmith Now, Next and the Future: IT, Big Data and other Implications for RIM Agenda for This Afternoon Now: What trends are creating implications within the profession? Next: Why is IT now concerned about RIM?

More information

Cloud beyond the obvious, an approach for innovation

Cloud beyond the obvious, an approach for innovation Cloud beyond the obvious, an approach for innovation Christian Verstraete Chief Technologist Cloud Strategy Our World is Changing Living in the age of tectonic shifts, and welcome to the new style of IT

More information

WHITE PAPER. Social media analytics in the insurance industry

WHITE PAPER. Social media analytics in the insurance industry WHITE PAPER Social media analytics in the insurance industry Introduction Insurance is a high involvement product, as it is an expense. Consumers obtain information about insurance from advertisements,

More information

Kingdom Big Data & Analytics Summit 28 FEB 1 March 2016 Agenda MASTERCLASS A 28 Feb 2016

Kingdom Big Data & Analytics Summit 28 FEB 1 March 2016 Agenda MASTERCLASS A 28 Feb 2016 Kingdom Big Data & Analytics Summit 28 FEB 1 March 2016 Agenda MASTERCLASS A 28 Feb 2016 9.00am To 12.00pm Big Data Technology and Analytics Workshop MASTERCLASS LEADERS Venkata P. Alla A highly respected

More information

Are You Ready for Big Data?

Are You Ready for Big Data? Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?

More information

General overview, and sources and uses of Big Data for urban and regional analysis

General overview, and sources and uses of Big Data for urban and regional analysis General overview, and sources and uses of Big Data for urban and regional analysis Carson Farmer! @carsonfarmer " carsonfarmer.com # carson.farmer@hunter.cuny.edu TRB Executive Committee Policy Session,

More information

BIG DATA FOR DEVELOPMENT: A PRIMER

BIG DATA FOR DEVELOPMENT: A PRIMER June 2013 BIG DATA FOR DEVELOPMENT: A PRIMER Harnessing Big Data For Real-Time Awareness WHAT IS BIG DATA? Big Data is an umbrella term referring to the large amounts of digital data continually generated

More information

How to select the right Marketing Cloud Edition

How to select the right Marketing Cloud Edition How to select the right Marketing Cloud Edition Email, Mobile & Web Studios ith Salesforce Marketing Cloud, marketers have one platform to manage 1-to-1 customer journeys through the entire customer lifecycle

More information

Machine Data Analytics with Sumo Logic

Machine Data Analytics with Sumo Logic Machine Data Analytics with Sumo Logic A Sumo Logic White Paper Introduction Today, organizations generate more data in ten minutes than they did during the entire year in 2003. This exponential growth

More information

Big Data: Public Sector Opportunities, Challenges, and Implications

Big Data: Public Sector Opportunities, Challenges, and Implications Big Data: Public Sector Opportunities, Challenges, and Implications Timothy M. Persons, Ph.D. Chief Scientist U.S. Government Accountability Office personst@gao.gov / www.gao.gov / @GAOChfScientist Presentation

More information

Extreme Computing. Big Data. Stratis Viglas. School of Informatics University of Edinburgh sviglas@inf.ed.ac.uk. Stratis Viglas Extreme Computing 1

Extreme Computing. Big Data. Stratis Viglas. School of Informatics University of Edinburgh sviglas@inf.ed.ac.uk. Stratis Viglas Extreme Computing 1 Extreme Computing Big Data Stratis Viglas School of Informatics University of Edinburgh sviglas@inf.ed.ac.uk Stratis Viglas Extreme Computing 1 Petabyte Age Big Data Challenges Stratis Viglas Extreme Computing

More information

Concept and Project Objectives

Concept and Project Objectives 3.1 Publishable summary Concept and Project Objectives Proactive and dynamic QoS management, network intrusion detection and early detection of network congestion problems among other applications in the

More information

THE POWER OF INFLUENCE TAKING THE LUCK OUT OF WORD OF MOUTH

THE POWER OF INFLUENCE TAKING THE LUCK OUT OF WORD OF MOUTH THE POWER OF INFLUENCE INTRODUCTION Word-of-mouth marketing has always been a powerful driver of consumer behavior. Every experienced marketer knows that customers are more likely to base purchasing decisions

More information

Social Return on Investment

Social Return on Investment Social Return on Investment Valuing what you do Guidance on understanding and completing the Social Return on Investment toolkit for your organisation 60838 SROI v2.indd 1 07/03/2013 16:50 60838 SROI v2.indd

More information

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V

ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V ATA DRIVEN GLOBAL VISION CLOUD PLATFORM STRATEG N POWERFUL RELEVANT PERFORMANCE SOLUTION CLO IRTUAL BIG DATA SOLUTION ROI FLEXIBLE DATA DRIVEN V WHITE PAPER Create the Data Center of the Future Accelerate

More information

ENHANCING INTELLIGENCE SUCCESS: DATA CHARACTERIZATION Francine Forney, Senior Management Consultant, Fuel Consulting, LLC May 2013

ENHANCING INTELLIGENCE SUCCESS: DATA CHARACTERIZATION Francine Forney, Senior Management Consultant, Fuel Consulting, LLC May 2013 ENHANCING INTELLIGENCE SUCCESS: DATA CHARACTERIZATION, Fuel Consulting, LLC May 2013 DATA AND ANALYSIS INTERACTION Understanding the content, accuracy, source, and completeness of data is critical to the

More information

Big Data and Due Process: Toward a Framework to Redress Predictive Privacy Harms

Big Data and Due Process: Toward a Framework to Redress Predictive Privacy Harms NELLCO NELLCO Legal Scholarship Repository New York University Public Law and Legal Theory Working Papers New York University School of Law 10-1-2013 Big Data and Due Process: Toward a Framework to Redress

More information

Term Paper. Emerging Technology Trends. Big Data Analytics

Term Paper. Emerging Technology Trends. Big Data Analytics Term Paper Emerging Technology Trends Big Data Analytics By Purnima Dubey Srishti - ADP First year Term Paper on Emerging Technology Trends Big Data Page 1 Table of Contents 1. Introduction... 3 2. What

More information

THE INFLUENCE OF SOCIAL MEDIA ON INFORMATION FLOW. Presented By Eubert Espira Contacts: eubert.espira@infotrakresearch.com Tel: 0733-777879

THE INFLUENCE OF SOCIAL MEDIA ON INFORMATION FLOW. Presented By Eubert Espira Contacts: eubert.espira@infotrakresearch.com Tel: 0733-777879 THE INFLUENCE OF SOCIAL MEDIA ON INFORMATION FLOW Presented By Eubert Espira Contacts: eubert.espira@infotrakresearch.com Tel: 0733-777879 INTRODUCTION Today people don t trust companies. One of the things

More information

Cloud Computing and Big Data That s Why! Ray Walshe 14 th March 2013

Cloud Computing and Big Data That s Why! Ray Walshe 14 th March 2013 Cloud Computing and Big Data That s Why! Ray Walshe 14 th March 2013 Data Centre Growth We need more data centres??? (c) Ray Walshe 2013 2 By 2015 By 2015, about 24% of all new business software purchases

More information

The Impact of Big Data on Social Research David Rhind Sharon Witherspoon

The Impact of Big Data on Social Research David Rhind Sharon Witherspoon The Impact of Big Data on Social Research David Rhind Sharon Witherspoon 1 www.nuffieldfoundation.org The landscape to be covered What is Big Data? Just consultants hype? Key questions for SRA Technology

More information

Digital Earth: Big Data, Heritage and Social Science

Digital Earth: Big Data, Heritage and Social Science Digital Earth: Big Data, Heritage and Social Science The impact on geographic information and GIS Geographic Information Systems Analysis for Decision Support Impact of Big Data Digital Earth Citizen Engagement

More information

When to Leverage Video as a Platform A Guide to Optimizing the Retail Environment

When to Leverage Video as a Platform A Guide to Optimizing the Retail Environment When to Leverage Video as a Platform A Guide to Optimizing the Retail Environment Contents S1 An Industry in Transition Over the past few years, retail has seen seismic changes in how the customer shops.

More information

Multichannel Customer Listening and Social Media Analytics

Multichannel Customer Listening and Social Media Analytics ( Multichannel Customer Listening and Social Media Analytics KANA Experience Analytics Lite is a multichannel customer listening and social media analytics solution that delivers sentiment, meaning and

More information

INTRUSION PREVENTION AND EXPERT SYSTEMS

INTRUSION PREVENTION AND EXPERT SYSTEMS INTRUSION PREVENTION AND EXPERT SYSTEMS By Avi Chesla avic@v-secure.com Introduction Over the past few years, the market has developed new expectations from the security industry, especially from the intrusion

More information

WHAT IS ECONOMICS. MODULE - 1 Understanding Economics OBJECTIVES 1.1 MEANING OF ECONOMICS. Notes

WHAT IS ECONOMICS. MODULE - 1 Understanding Economics OBJECTIVES 1.1 MEANING OF ECONOMICS. Notes 1 WHAT IS Economics as a subject has assumed great importance in the field of social science. In our day to day life we use a lot of economic concepts such as goods, market, demand, supply, price, inflation,

More information

Human mobility and displacement tracking

Human mobility and displacement tracking Human mobility and displacement tracking The importance of collective efforts to efficiently and ethically collect, analyse and disseminate information on the dynamics of human mobility in crises Mobility

More information

Mining Big Data. Pang-Ning Tan. Associate Professor Dept of Computer Science & Engineering Michigan State University

Mining Big Data. Pang-Ning Tan. Associate Professor Dept of Computer Science & Engineering Michigan State University Mining Big Data Pang-Ning Tan Associate Professor Dept of Computer Science & Engineering Michigan State University Website: http://www.cse.msu.edu/~ptan Google Trends Big Data Smart Cities Big Data and

More information

Visualizing the Future of MS Research: ACP s Repository Holds the Map

Visualizing the Future of MS Research: ACP s Repository Holds the Map Dear Friends, This issue is all about Big Data to Knowledge, otherwise known as BD2K. This refers to society s growing ability to gather a wealth of information about people in our case, people with MS

More information

How To Handle Big Data With A Data Scientist

How To Handle Big Data With A Data Scientist III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution

More information

Big Data: Key Concepts The three Vs

Big Data: Key Concepts The three Vs Big Data: Key Concepts The three Vs Big data in general has context in three Vs: Sheer quantity of data Speed with which data is produced, processed, and digested Diversity of sources inside and outside.

More information

Mobile Youth Around the World

Mobile Youth Around the World Mobile Youth Around the World December 2010 Overview From texting to video to social networking, mobile phones are taking an ever-expanding role in our daily lives. And young people around the world are

More information

ANALYTICS BUILT FOR INTERNET OF THINGS

ANALYTICS BUILT FOR INTERNET OF THINGS ANALYTICS BUILT FOR INTERNET OF THINGS Big Data Reporting is Out, Actionable Insights are In In recent years, it has become clear that data in itself has little relevance, it is the analysis of it that

More information

cprax Internet Marketing

cprax Internet Marketing cprax Internet Marketing cprax Internet Marketing (800) 937-2059 www.cprax.com Table of Contents Introduction... 3 What is Digital Marketing Exactly?... 3 7 Digital Marketing Success Strategies... 4 Top

More information

Annex 8. Market Failure in Broadcasting

Annex 8. Market Failure in Broadcasting Annex 8 Market Failure in Broadcasting 202 Review of the Future Funding of the BBC Market Failure in the Broadcasting Industry An efficient broadcasting market? Economic efficiency is a situation in which

More information

Bringing European values to the Internet of Things

Bringing European values to the Internet of Things SPEECH/10/279 Neelie Kroes European Commissioner for Digital agenda Bringing European values to the Internet of Things 2 nd Annual Internet of Things Conference Brussels, 1 st June 2010 Ladies and gentlemen,

More information

Algorithms and optimization for search engine marketing

Algorithms and optimization for search engine marketing Algorithms and optimization for search engine marketing Using portfolio optimization to achieve optimal performance of a search campaign and better forecast ROI Contents 1: The portfolio approach 3: Why

More information

Statistics for BIG data

Statistics for BIG data Statistics for BIG data Statistics for Big Data: Are Statisticians Ready? Dennis Lin Department of Statistics The Pennsylvania State University John Jordan and Dennis K.J. Lin (ICSA-Bulletine 2014) Before

More information

Two Recent LE Use Cases

Two Recent LE Use Cases Two Recent LE Use Cases Case Study I Have A Bomb On This Plane (Miami Airport) In January 2012, an airline passenger tweeted she had a bomb on a Jet Blue commercial aircraft at the Miami International

More information