1 Sensing as a Service and Big Data Arkady Zaslavsky #1, Charith Perera #*2, Dimitrios Georgakopoulos #3 # ICT Centre, CSIRO, Canberra, ACT, 2601, Australia 2 * Research School of Computer Science, The Australian National University, Canberra, ACT 0200, Australia Abstract Internet of Things (IoT) will comprise billions of devices that can sense, communicate, compute and potentially actuate. Data streams coming from these devices will challenge the traditional approaches to data management and contribute to the emerging paradigm of big data. This paper discusses emerging Internet of Things (IoT) architecture, large scale sensor network applications, federating sensor networks, sensor data and related context capturing techniques, challenges in cloud-based management, storing, archiving and processing of sensor data. can be collected, analysed and interpreted. Further, European Commission  predicts that the present 'Internet of PCs' will move towards an 'Internet of Things' in which 50 to 100 billion devices will be connected to the Internet by Keywords Big Data, Sensing as a Service, Internet of Things, Large Scale Sensor Networks, Cloud Computing, Data Management. 1. Introduction The modern technology-savvy world is full of devices comprising sensors, actuators, and data processors. Such concentration of computational resources enables sensing, capturing, collection and processing of real time data from billions of connected devices serving many different applications including environmental monitoring, industrial applications, business and human-centric pervasive applications. These developments have brought us to the era of Internet of Things (IoT)  by introducing IoT in 1998as concept . However, sensing the environment around us and objects populating this environment became synonymous with the introduction of pervasive or ubiquitous computing by the paper The Computer for 21 st Century  in 1991 in the same year where World Wide Web became available. The major enabler of IoT is sensor networks. IoT has three unique features : intermittent sensing, regular data collection, and Sense-Compute-Actuate (SCA) loops. Figure 1: The total amount of data generated on earth exceeded one zettabyte in 2010.It is predicted that data volume will grow exponentially as depicted 1. In 2010, the total amount of data on earth exceeded one zettabyte (ZB) ,  (see figure 1). By end of 2011, the number grew up to 1.8 ZB . Further, it is expected that this number will reach 35 ZB in As in many cases with ICT, this estimate may prove to be too conservative. IEEE Spectrum  recognises both sensors and big data as to of the five technologies that will shape the world (figure 2). According to Gartner Research , by 2015, wirelessly networked sensors in everything we own will form a new Web. But it will only be of value if the 'terabyte' of data it generates Figure 2: Data generated from the Internet of Things will grow exponentially as the number of connected nodes increases. Estimated numbers of connected nodes based on different sectors are presented in Millions . 1
2  defines IoT as the growing and largely invisible web of interconnected smart objects that promises to transform the way we interact with everyday things. Due to recent development in sensor devices and other related technologies, the cost of data acquisition has dramatically come down , . Even though sometime, initial costs are high, continues data acquisition remains very cheap. Initial costs are also going down with recent developments in sensor technologies such as Arduino (arduino.com). According to the BCC Research , global market for sensors was around $56.3 billion in In 2011, it was around $62.8 billion. Global market for sensors is expected to increase up to $91.5 billion by 2016, at a compound annual growth rate of 7.8%. IBM attributed the growing amount of big data towards instrumented, interconnected, intelligent world which is envisioned by Internet of Things . Infosys  acknowledges intelligence, cloud computing, and sensor networks as three significant themes in the evolution of pervasive computing. 2. Sensing Big Data Big data is not new concept or idea. However, earlier notions of big data was limited to few organizations such as Google, Yahoo, Microsoft, and European Organization for Nuclear Research (CERN). However, with recent developments in technologies such as sensors, computer hardware and the Cloud, the storage and processing power increase and the cost comes down rapidly. As a result, many sources (sensors, humans, applications) start generating data and organizations tend to store them for long time due to inexpensive storage and processing capabilities. Once that big data is stored, a number of challenges arise such as processing and analysing. Thus big data has become a buzz word in industry. There is no clear definition for Big Data. It is defined based on some of its characteristics. The big data does not mean the size. There are three characteristics that can be used to define big data, as also known as 3V s : volume, variety, and velocity (figure 3). Volume: Volume relates to size of the data such as terabytes (TB), petabytes (PB), zettabytes (ZB), etc. Variety: Variety means the types of data. In addition, difference sources will produce big data such as sensors, devices, social networks, the web, mobile phones, etc. For example, data could be web logs, RFID sensor readings, unstructured social networking data, streamed video and audio. Velocity: This means how frequently the data is generated. For example, every millisecond, second, minute, hour, day, week, month, year. Processing frequency may also differ from the user requirements. Some data need to be processed real-time and some may only be processed when needed. Typically, we can identify three main categories: occasional, frequent, and real-time. Volume Megabyte to Zetabyte Veriety Velocity Batch to Streaming Data Text, video, audio, images, Structured, Unstrctured Fig 3: Characteristics of Big Data (V 3 )  Some researchers consider Value also as a main characteristic of big data. This means that somewhere within that data, there is some valuable information golden data to extract, though most of the pieces of data individually may seem valueless. EMC  has defined big data as any attribute that challenges constraints of a system capability or business need. For example, EMC recognizes 40MB PowerPoint presentation as a big data, because 40MB is big compared to typical size of a PowerPoint presentation. Further, 1PB animation and a 1TB medical image are considered as big data as they are big in size compared to the typical size of each. Further, the data we consider big today may not be considered big tomorrow due to the advances in processing, storage and other system capabilities. There is a number of challenges related to big data. Due to its characteristics some of the inherited challenges in big data are capture, storage, search, analysis, and virtualization. Some of the typical areas that produce big data are meteorology, genomics, physics, simulations, biology and environmental science. It is predicted that vehicles will be a major source that would produce big data . The number of vehicles used and the growing rates around the world justify the prediction. Further, some of the potential application areas of big data analytics are smart homes, finance, log analysis, security, traffic control, telecommunications, search quality, manufacturing, trade analysis, fraud and risk , . Big data is everywhere, even in jogging. One example of application of sensing technologies in our everyday life is Nike+iPod/iPhone application . It is an application that collects and tracks information such as workout details,
3 distance, calories burnt, etc. of a jogger using Nike footwear and iphone/ipod. Another similar application is ismoothrun (www.ismoothrun.com ). Further, it allows uploading data to the fitness social networks such as RunKeeper.com. The data becomes big when we consider millions of users. The state of California  has developed a greenhouse gas sensor network located throughout California where it collects massive amounts of real-time information about greenhouse gases and their behaviour. The St. Anthony Falls Bridge in Minneapolis  consists of over 200 sensors embedded in strategic points. These sensors provide fully detailed sensor readings regarding the bridge to a monitoring system. This system collects variety of different measurements such as a change in temperature and the bridge s concrete reaction to that change is available for analysis. Sensors collect information in natural disaster situations in order to optimally utilize resources and manage supply chains . For example, during the disaster at Japan s Fukushima Daichi nuclear plant in 2011, sensors were used to take radiation measurements without human intervention . The following statistics can be used to further explain the big data. For example, more than 12TB (155 million tweets) of tweets in Twitter and 25TB of log data (30 billion pieces of content) in Facebook are generated every day. Further, 30 billion RFID tags and 4.6 billion camera phones are used around the world today. In addition, 200 million smart meters to be operated in Moreover, there were 2 billion people on web in 2011 , . Four detectors in Large Hadron Collider (LHS) produced 13 petabyte (PB) of data in 2010 . Wal-Mart generates 2.5 petabyte (PB) of customer transaction data every hour. Further, if Wal-Mart operates RFID on item level, it is expected to generate 7 terabytes (TB) of data every day . In future, large radio astronomy arrays, for example, Square Kilometre Array (SKA), will generate data at rates far higher than that can be processed or stored affordably using current technology . Boeing Jet  generates 10 terabytes (TB) of data per engine every 30 minutes. A single six hour flight would generate 240 terabyte (TB) of data. In addition, there are about commercial flights in the sky in United States on any given day. These statistics allow us to understand the scale of the sensor data generated. However, these collected data may not be processed all the time. Usually the information are closely analysed only when airplane get crashed. The big data has become an industry itself which is worth around USD 100 billion and growing at a 10% rate annually , where major players such as Oracle, Microsoft, SAP, and IBM spend significantly due to the potential value of big data. Ninja Blocks (ninjablocks.com) can be identified as latest development in where IoT and cloud computing is merging together. Ninja Blocks are small and cheap devices which encompass variety of different sensors such as acceleration, temperature, current, humidity, motion, distance, sound, light and even capture video. In addition, these block are capable of connect themselves to the Ninja cloud where the data can be processes and stored. It has the capability to identify some of the event occur based on the sensor data and perform actions such as tweet, sms, , Facebook or Dropbox uploading. Another similar start-up is i-voltmeter (i-voltmeter.com). These types of devices have the potential to generate big data in coming decade. SFpark (sfpark.org) has deployed sensors over 8200 parking space in several San Francisco neighbourhoods in order to track real-time inventory and to perform changes to the pricing based on the consumer behaviour. School in Vittoria da Conquista, Brazil  has equipped uniforms with RFID chips allowing their parent to verify that their children are show up at school. Very large scale sensor networks domain  use cloud computing to process data in the cloud. Such data can be characterised as polymorphous, heterogeneous, large in quantity and time-limited. In a very large scale sensor network, manage the sensing resources and computational resources, and store and process these data are identified as key challenges. The potential applications are vehicle tracking, traffic dispatching, and expressway planning. London  has deployed significant amount of sensor roadways in order to monitor traffic in real-time to optimum traffic management targeting 2012 Olympics. In general, sensors in roadways can be used to optimize route for fuel efficiency. Let s assume that governments deploy and own these sensors. However, there would be several other organizations that would interest on the data collected by those roadway sensors such as traffic update providers and weather information providers . Government may deploy roadways sensors with the focus of collecting traffic information. However, several other organizations would like to access those sensor data so they can analyse sensor data in different perspectives and provide their customer with value added services. Further, these third party organizations can pay a fee to the government and get access to the sensor data provided by roadway sensors. In summary, government can provide sensing data as a service. Therefore, this Sensing as a service model can be applied well here. The map in figure 4 shows how the big data have been generated and stored across the world. 3. Why Big Data? Big data is important for us in many perspectives. The significant amount of data generated allows us to make decisions in timely manner where money can be saved and operations can be more optimized in both public and private
4 sector. For example, in retail business, consumer behaviour and preferences can be understand by analysing the big data which includes, customer movement in the store or online webs site, transactions, product searches, etc. . Big data allows Data-Driven Decision Making . United States Healthcare Big Data World , which comprises records of over 50 million patients, uses data-driven concept to find out the challenges in healthcare sector. The potential queries that they are expecting to process over big data are very complex. The following statistical facts summarise the value of big data . $300 billion potential annual value to US health care. 250 billion potential annual value to Europe s public sector administration. $600 billion potential annual consumer surplus from using personal location data globally. 60% potential increase in retailers operating margins possible with big data 140, ,000 more deep analytical talent positions. 1.5 million more data-savvy managers needed to take full advantage of big data in the United States. Figure 4: Amount of new data stored varies across geography. New data stored (in Petabytes (PB)) by geography in New data stored is defined as the amount of available storage used in a given year . Today s supply chain management poses many problems. It performs poorly in responding to market demands, supplier condition, etc. in real-time. The big data collected through IoT infrastructure based on sensing as a service has the potential to diminish these inefficiencies. Research on supply chain management  has identified five technologies that will make the process more efficient: mobility, Internet of Things, big data, predictive analysis, and cloud computing. These five areas are strongly interconnected. Internet of Things and mobility provide the sensors that can sense in real-time even while moving. These sensors will produce big data, that is high volume, high variety data in high velocity. So these collected data need to be analysed in order to extract knowledge. Predictive analysis techniques will do the job. However, it cannot be done in traditional computing environment where processing, and storing power are limited and expensive. An elastic infrastructure, such as the cloud, needs to be used to perform such techniques over big data. In other terms, the cloud binds to the Internet of Things . In business perspective, big data has the potential to generate more revenue, reduce risk, and predict future outcomes with greater confidence in low cost . IBM  has identified some of the challenges that can be addressed by using big data. For example, analyse multi-channel customer sentiment and experience, detect life-threatening conditions at hospitals in time to intervene, predict weather patterns to plan optimal wind turbine usage, and optimize capital expenditure on asset placement, make risk decisions based on real-time transactional data, identify criminals and threats from disparate video, audio, and data feeds. 4. Technologies around Big Data McKinsey Global Institute  discusses some of the existing technologies such as machine learning techniques that need to be extended to be used in big data. Further, McKinsey report identifies additional technologies such as massive parallelprocessing (MPP) databases, distributed file systems, cloud computing technologies, etc. that complement the big data management. The techniques need to be developed to tackle the challenges such as extracting, transformation, integrating, sorting and manipulating data . Basic techniques of meaningful data extraction from big data consists five steps : define, search, transform, entity resolution, answer the query. These conceptual steps are more related to traditional data mining research domain. However, technology behind these conceptual steps will be varying significantly due to the unique characteristics of big data. SAP s Hana tool (sap.com/hana) provides analysis of massive amount of data 3,600 times faster for real-time business. Scalable and distributed data management has been a key priority among the database research community for more than three decades. Due to lack of cloud feature and high costs in relational database management systems are less favour to be used in big data and cloud environment . NoSQL movement has fuelled the big data significantly. As storage is essential component in big data, there are numbers of commercial and open source solutions available that supports big data requirements. Commercial products are Greenplum (greenplum.com), IBM DB2 (ibm.com/software/data/db2) or Netezza (netezza.com), Microsoft SQL Server (microsoft.com/sqlserver), MySQL (mysql.com), Oracle (oracle.com), or Teradata (teradata.com). In NoSQL 2 technologies, there are four varieties: Key-value, document store, wide column stores, and graph databases. Out of these four technologies, graph databases are favoured to be used to store information with complex relationships among pieces of data such as social network data, semantic and linked data. Neo4J is an example for this type of database. 2 nosql-database.org
5 Other three technologies are more favoured to be used commonly across in big data, cloud and IoT applications. Popular key-value based data storage products are Dynamo (aws.amazon.com/dynamodb), Redis (redis.io), Riak (basho.com/riak), Amazon SimpleDB (aws.amazon.com/simpledb), and Windows Azure Table Storage (windowsazure.com). Popular document store technology based products are CouchDB (couchdb.apache.org), and MongoDB (mongodb.org). Popular wide column based products are Apache HBase (hbase.apache.org), Hadoop (hadoop.apache.org), and Cassandra (cassandra.apache.org). However, the one getting the most attention is Hadoop as it based on popular distributed processing friendly MapReduce 3 technology. However, there is no single perfect data management solution for the cloud to manage big data. Due to high number of big data producers and high frequency of data generation, the gap between data available to organization and data an organization can process is getting wider all the time as illustrated in figure 5 . 5. Sensing as a service Model Providing everything as a service is model that emerged with cloud computing. Garter defines  cloud computing as a style of computing in which massively scalable IT-related capabilities are provided as a service using Internet technologies to multiple external customers. Cloud computing is expected to play a significant role in IoT paradigm. The cloud storage and processing capabilities are essential to make the IoT vision a reality. Cloud computing consists of three main layers or model, Infrastructure as a Service (IaaS), Platform as a Service (PaaS), and Software as a Service (SaaS). Each layer has been discussed comprehensively in . In addition to the above main layer, some other layers are also introduced and discussed in literature such as Database as a Service (DBaaS), Data as a Service (DaaS), Ethernet as a Service (EaaS), Network as a Service (NaaS), Identity and Policy Management as a Service (IPMaaS), and Sensing as a service (SaaS). In general, all these models are called XaaS, which means X can be virtually anything. In this paper we discuss only, Sensing as a service (SaaS) model , . IoT envision that sensors to be attached everywhere. In such an environment, owner sill be able publish their sensor data and earn a fee or discount as a return. To visualize in this concept in real world let s consider a scenario. Fig 5: the volumes of data available to organizations today in on rise rapidly, while the volumes of data that can be processed by an organization rise slowly. This creates a gap between the two. In Other words, the percentage (%) of data an organization can analyse is on the decline . Pervasive systems produce high-dimensional data with the use of very sophisticated and advanced sensors. They require high computational power to be processed and storage. For example, high-dimensional data such as brain signal (EEG) are usually processed using client-server architecture. However,  has shown that map-reduce based scalable cloud computing architecture can process brain signal more efficiently. In addition, how to manage sensor data in cloud environment has been explained in  in relation with wearable sensor domain. Arduino as sensor platform, Google App Engine (appengine.google.com) as cloud platform, and Google Datastore as the database to store sensor data has been employed in the experiments. Connecting and managing sensor via cloud is a critical milestone in the process of sharing sensors and sensor data in Sensing as a service model. 3 Mike bought a new refrigerator for his new home. He brought it home and plugged it to the power. Fridge automatically identified the availability of WIFI in the house. Then, prompted Mike for asking permission to be connected to the Internet via the display available in the refrigerator. Once he agreed, refrigerator identified the sensors attached to itself such as RFID reader, temperature, door sensors in refrigerator and deep-freezer, etc. Refrigerator prompted Mike asking whether he would like to publish these sensors on the Internet so interested parties would access them by paying a fee or any other offer. Mike agreed to publish the sensors given that he would return a satisfactory offer. After some time, Mike received an from a company called DairyIceCreeam, an ice cream manufacturer, with and offer. DairyIceCreeam is interested to have access to the RFID reader in Mike s refrigerator and the door sensor attached to the deep-freezer. As a return, DairyIceCreeam will offer either 5% discount on every product sold by DairyIceCreeam or a monthly fee of 2 dollars. As Mike likes DairyIceCreeam products, he agreed the offer of 5% discount instead of the monthly fee. This scenario explains sensing as a service at personal level where similar processes could be applied at public and private organization levels. Owners of sensors (could be a person, private organization, public organization, government, etc) will be able to publish their sensor data and get a return on investment. Privacy and security need to be handled appropriately. Further, at least three categories of people
6 would participate in the process: Sensor owner such as Mike, sensor publishing and managing companies (organizations act as mediators such as ebay), and the sensor data consumers such as DairyIceCreeam cooperation. Further, in DairyIceCreeam corporate perspective, it is much cheaper for them to acquire sensors as a service so they don t need to deploy sensors themselves or manually collect data from users. It also provides real-time and accurate data which is far better than manually collect data regarding customer behaviour via retail stores or directly via customers. By accessing RFID reader and deep-freezers door sensor, DairyIceCreeam can acquire much more detailed and comprehensive set of information which is impossible to collect using traditional methods. For example, with use of RFID tags in ice cream containers, DairyIceCreeam can find out information such as how many times user consumes dairy per week, at which time users are more likely to eat them and so on. In the above scenario, we considered only one person (Mike) and one thing (refrigerator). As IoT envisioned, when 50 billion things connected to the Internet, they will generate big data at unprecedented scale. Further strengthening our idea IEEE spectrum says doesn t be surprised when your fridge joins Facebook . In IoT domain, sensor data are not the only information that will be stored. More importantly relevant context information will be always annotated to the raw sensor data for later retrieval purpose. The context data add more meaning and value to the sensor data. However, it will also increase the storage requirements in many times. For example , for each sensor reading context information such as what properties are measured, which sensors available, where are they located, how are they configured, who is responsible will be stored. Architectural design challenges for sensor web are presented in  as flexibility to facilitate different application requirements, distributed reasoning and decision making, number of layers to be average not low or high, ease of deployment, and availability in term of licensing.  have also survey variety of different architectures and sensor web applications. Sensor web architecture and application can be considered as preliminary approached of sensing as a service model. SaaS can be developed on top of these sensor web architectures. Sensor-Cloud  is an infrastructure that aims at managing physical sensors by connecting them to the cloud. Sensor- Cloud uses SensorML 4 to describe the metadata of physical sensors such as physical sensors description and measurement processes. SensorML is a standard model and an XML encoding mechanism for describing sensors defined OGC (Open Geospatial Consortium). Sensor-Cloud bundles the physical sensors in o virtual sensors where users can combine them together to achieve advanced results. Sensor- Cloud allows users to use sensors without worring about the details. It also provides physical sensor management capabilities such as checking the usage of their physical sensors. However, Sensor-Cloud does not focus on offering sensor data as a service. Instead, it focuses on managing sensors via cloud. SenaaS  (Sensor-as-a-Service) has proposed to encapsulate both physical and virtual sensors in to services according to Service Oriented Architecture (SOA). SenaaS mainly focuses on providing sensor management as a service rather than providing sensor data (collection and dissemination) as a service. It consists three layers: real world access layer, semantic overlay layer, and services virtualization layer. Real world access layer is responsible for communicating with sensor hardware. Semantic overlay layer adds semantic annotation to the sensor configuration process. Services virtualization layer facilitates the users. Similar approach is presented in  where sensors are wrapped in to service in SOA with semantic annotation which leads to automated sensor service discovery and composition OpenIoT  (Open Source blueprint for large scale selforganizing cloud environments for IoT applications) is aimed at developing an open source middleware platform to connect Internet-objects to the cloud. The OpenIoT project will leverage existing IoT middleware solutions such as Global Sensor Network 5 and ASPIRE (fp7-aspire.eu). OpenIoT defines IoT as a combination of sensor data and applications. OpenIoT concentrates on providing a cloud-based middleware infrastructure in order to deliver on-demand access to IoT services, which could be formulated over multiple infrastructure providers . The Objective of IoT is to provide utility based services such as Sensing as a service, Location-as-a-Service, and Traceability-as-a-Service. Every year, Australian grain breeders plant up to 1 million 10 m2 plots across the country to find the best high yielding varieties of wheat and barley. The plots are usually located in remote places often requiring more than four hours travel one-way to reach. The challenge is to monitor the crop performance and growing environment throughout the season and return the information in an easily accessible format. Phenonet wireless sensor networks are now routinely deployed in wheat variety trials throughout Australia by staff at the High Resolution Plant Phenomics Centre (HRPPC) (http://phenonet.com) in order to collect information over a field of experimental crops. Currently, the sensors are deployed around 40 plots. These sensors produce two million data points per week. If and when the sensors are eventually deployed around all or majority of plots, the overall amount of sensed data will really be big data sourceforge.net/apps/trac/gsn/
7 6. Challenges in Big Data Management The challenges in big data can be broadly divided in to two categories: engineering and semantic . Engineering challenge is to perform data management activities such as query, and storage efficiently. Semantic challenge is to extract the meaning of the information from massive volumes of unstructured dirty data. Several other challenges in big data are presented in  with details. The Jet Propulsion Laboratory (JPL) has identified number of major challenges in big data management . High volume of processing using low power consumed digital processing architecture. Power that needs to process the data as well as power that is required to cool the processing system need to be considered together in designing such systems. Discovery of data-adaptive Machine learning techniques that can analyse data in real-time. These techniques are critical specifically, in situation where data, high time and spectral resolution, are produced by many antennas in Very Long Baseline Array that cannot be stored due to size for later analysis. Design scalable data storages that provide efficient data mining. Storing any kind of data is useless unless they cannot be retrieved and extract the knowledge efficiently. For example, if we send all the data sensed to the cloud to be stored at very high sampling rate, the wide-area networks may cause network congestion. They are always a trade of between when we decide where to store and process, specifically in IoT domain. If we choose to process in the cloud, all the data need to be send to the cloud which will resulted unacceptable communication latency. Similarly, if we choose to process data locally, the resource may not be enough to achieve the desired result at desired speed. Patidar et al  have identified a number of key challenges in big data management related to the cloud. Data security and privacy: This is about managing access control to the big data. It is critical in Sensing as a service model that we discussed earlier. Approximate results: Due to the volume and the velocity of big data, approximate results would be order of magnitude faster compared to traditional query execution. It need be decided where and when to use approximation where it will not harm the accuracy of the results or decision that emerged based on approximated results. Data exploration to enable deep analytics: Certainly, new technologies are required to analyse large volumes of data faster with efficient resource and power consumptions. Enterprise data enrichment with web and social media: The big data has more relationships among themselves that ever in history. This is also related to Linked Data and social media. Real value of big data will be merged once we are able to preserver and identify the relationships between data. Query optimization: In order to harvest the knowledge hidden in big data optimised query processing in essential. Optimization need to consider in different perspectives such as energy consumption, required memory, processing time, storage requirements, etc. Parallel processing would be the key in query processing in cloud environments. Performance isolation for multi-tenancy: This is about designing and developing model of performance service level agreements for multitenant data systems that can be metered at low overhead. 7. Conclusions Big data consists of hidden gold (high-valued data) mixed with dirty (noise, erroneous and raw) data. The advanced systems and innovative technologies will allow us to efficiently process massive amounts of dirty data and extract gold out of it. It will require concentrated efforts by ICT researchers and practitioners to meet the challenge of big data and ride the wave of information implosion. Acknowledgment Support from EU FP7 OpenIoT project and SSN TCP, CSIRO is thankfully acknowledged. References  Harald Sundmaeker, Patrick Guillemin, Peter Friess, and Sylvie Woelffle, "Vision and Challenges for Realising the Internet of Things," European Commission Information Society and Media, Tech. rep  Kevin Ashton. (2009, June) That 'Internet of Things' Thing In the real world, things matter more than ideas. [Online].  Mark Weiser, "The computer for the 21st century," SIGMOBILE Mob. Comput. Commun. Rev., vol. 3, no. 3, pp. 3-11, July [Online].  Pankesh Patel, Animesh Pathak, Thiago Teixeira, and Val, "Towards application development for the internet of things,", New York, NY, USA, 2011, pp. 5:1--5:6. [Online].  Paul Zikopoulos. (2012, March) IBM Big Data: What is Big Data Part 1 and 2. [Online].  E. Ackerman and E. Guizzo, "5 technologies that will shape the web," Spectrum, IEEE, vol. 48, no. 6, pp , june  D.A. Reed, D.B. Gannon, and J.R. Larus, "Imagining the Future: Thoughts on Computing," Computer, vol. 45, no. 1, pp , jan  Mark Raskino, Jackie Fenn, and Alexander Linden, "Extracting Value From the Massively Connected World of 2015," Gartner Research, Tech. rep  James Manyika et al., "Big data: The next frontier for innovation, competition, and productivity," McKinsey Global Institute, Tech. rep. May  S. Patidar, D. Rane, and P. Jain, "A Survey Paper on Cloud
8 Computing,", Jan. 2012, pp  Henriette Cramer, Mattias Rost, Frank Bentley, and David Ayman Shamma, "2nd workshop on research in the large. using app stores, wide distribution channels and big data in ubicomp research,", New York, NY, USA, 2011, pp [Online].  BCC Research, "Sensors: Technologies and Global Markets,"  Chris Eaton, Dirk Deroos, Tom Deutsch, George Lapis, and Paul Zikopoulos, Understanding Big Data.: McGraw-Hill Companies, April 2012, 6USEN.pdf.  Infosys Limited, "Pervasive Computing View Point," Infosys, Tech. rep  Patricia Florissi. (2012, February) Emc And Big Data - A Fun Explanation. [Online]. [Accessed on: ]  Katie Fehrenbacher. (2010, August) Car Data As the Next Platform for Innovation. [Online].  D.L. Jones et al., "Big data challenges for large radio arrays,", march 2012, pp  Apple Inc. (2006, May) Nike+iPod Sports Kit. [Online].  Katie Fehrenbacher. (2011, December) California to develop big data greenhouse gas sensor network. [Online].  Geoff Brumfiel, "High-energy physics: Down the petabyte highway," Nature, pp , January  Leonardo Weiss Ferreira Chaves and Christian Decker, "A survey on organic smart labels for the Internet-of-Things,", pp  Stacey Higginbotham. (2010, September) Sensor Networks Top Social Networks for Big Data. [Online]. [Accessed on: ]  The Economist Newspaper Limited. (2010, February) Data, data everywhere. [Online].  Tuan C. Nguyen. (2012, March) School uniform alerts parents when students skip class. [Online].  Jing Liu et al., "An open, flexible and multilevel data storing and processing platform for very large scale sensor network,", feb. 2012, pp  Steve Wildstrom. (2012, April) Better Living Through Big Data. [Online]. Through-Big-Data  Christopher J. Bucholtz. (2012, March) Customers, Big Data, and the Internet of Things. [Online]. [Accessed on: ]  Kevin Benedict, Moneyball, Big Data, The Internet of Things and Enterprise Mobility, March 2012,  Christian Bizer, Peter Boncz, Michael L. Brodie, and Orri Erling, "The meaningful use of big data: four perspectives -- four challenges," SIGMOD Rec., vol. 40, no. 4, pp , #jan# [Online].  Lora Cecere. (2012, June) Five Supply Chain Opportunities in Big Data and Predictive Analytics. [Online].  Basil Yunan. (2012, June) What is Big Data's role in helping companies achieve competitive advantage through analytics. [Online]. https://www- 950.ibm.com/events/wwe/grp/grp004.nsf/v16_agenda?openform&semi nar=9ebkw9es&locale=en_us  Doug Henschen. (2012, October) 3 Big Data Challenges: Expert Advice. [Online].  Divyakant Agrawal, Sudipto Das, and Amr El Abbadi, "Big data and cloud computing: current state and future opportunities,", New York, NY, USA, 2011, pp [Online].  Samah H. Gad, "Cloud computing and mapreduce for reliability and scalability of ubiquitous learning systems,", New York, NY, USA, 2011, pp [Online].  Charalampos Doukas and Ilias Maglogiannis, "Managing Wearable Sensor Data through Cloud Computing,", Washington, DC, USA, 2011, pp [Online].  Bhaskar Prasad Rimal, Eunmi Choi, and Ian Lumb, "A Taxonomy and Survey of Cloud Computing Systems,", Washington, DC, USA, 2009, pp [Online].  Minqi Zhou, Rong Zhang, Dadan Zeng, and Weining Qian, "Services in the Cloud Computing era: A survey,", oct. 2010, pp  Jean-Paul Calbimonte, Hoyoung Jeung, Oscar Corcho, and Karl Aberer, "Semantic Sensor Data Search in a Large-Scale Federated Sensor Network,",  Klaithem Al Nuaimi, Mariam Al Nuaimi, Nader Mohamed, Imad Jawhar, and Khaled Shuaib, "Web-based wireless sensor networks: a survey of architectures and applications,", New York, NY, USA, 2012, pp. 113:1--113:9. [Online].  M. Yuriyama and T. Kushida, "Sensor-Cloud Infrastructure - Physical Sensor Management with Virtualized Sensors on Cloud Computing,", sept. 2010, pp  S. Alam, M. M. R., and J. Noll, "SenaaS: An event-driven sensor virtualization approach for Internet of Things cloud,", pp  J. Ibbotson et al., "Sensors as a Service Oriented Architecture: Middleware for Sensor Networks,", july 2010, pp  OpenIoT Consortium. (2012, January) Open Source Solution for the Internet of Things into the Cloud. [Online]. [Accessed on: ]  Martín Serrano, PROBEIoT Workshop: Challenges for Semantic Interoperability, March 2012, content/uploads/2012/04/7- OpenIoT_Semantic_Interoperability_ _Final_Version.pdf.
Big Data a threat or a chance? Helwig Hauser University of Bergen, Dept. of Informatics Big Data What is Big Data? well, lots of data, right? we come back to this in a moment. certainly, a buzz-word but
CIS492 Special Topics: Cloud Computing د. منذر الطزاونة Big Data Definition No single standard definition Big Data is data whose scale, diversity, and complexity require new architecture, techniques, algorithms,
Survey Results Table of Contents Survey Results... 4 Big Data Company Strategy... 6 Big Data Business Drivers and Benefits Received... 8 Big Data Integration... 10 Big Data Implementation Challenges...
So What s the Big Deal? Presentation Agenda Introduction What is Big Data? So What is the Big Deal? Big Data Technologies Identifying Big Data Opportunities Conducting a Big Data Proof of Concept Big Data
Architecting for Big Data Analytics and Beyond: A New Framework for Business Intelligence and Data Warehousing Wayne W. Eckerson Director of Research, TechTarget Founder, BI Leadership Forum Business Analytics
The 3 questions to ask yourself about BIG DATA Do you have a big data problem? Companies looking to tackle big data problems are embarking on a journey that is full of hype, buzz, confusion, and misinformation.
Industry 4.0 and Big Data Marek Obitko, firstname.lastname@example.org Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and
Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development
Hadoop Beyond Hype: Complex Adaptive Systems Conference Nov 16, 2012 Viswa Sharma Solutions Architect Tata Consultancy Services 1 Agenda What is Hadoop Why Hadoop? The Net Generation is here Sizing the
From Internet Data Centers to Data Centers in the Cloud This case study is a short extract from a keynote address given to the Doctoral Symposium at Middleware 2009 by Lucy Cherkasova of HP Research Labs
Big Data, Why All the Buzz? (Abridged) Anita Luthra, February 20, 2014 Defining Big Not Just Massive Data Big data refers to data sets whose size is beyond the ability of typical database software tools
Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08
Piyush Chaudhary Technical Computing Solutions Data Centric Computing Revisited SPXXL/SCICOMP Summer 2013 Bottom line: It is a time of Powerful Information Data volume is on the rise Dimensions of data
Tutorial: Big Data Algorithms and Applications Under Hadoop KUNPENG ZHANG SIDDHARTHA BHATTACHARYYA http://kzhang6.people.uic.edu/tutorial/amcis2014.html August 7, 2014 Schedule I. Introduction to big data
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics February 11, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
What is Big Data Outline What is Big data and where they come from? How we deal with Big data? Big Data Everywhere! As a human, we generate a lot of data during our everyday activity. When you buy something,
What is big data? Raul F. Chong Senior program manager Big data, DB2, and Cloud IM Cloud Computing Center of Competence - IBM Toronto Lab, Canada 1 2011 IBM Corporation Agenda The world is changing What
Danny Wang, Ph.D. Vice President of Business Strategy and Risk Management Republic Bank Agenda» Overview» What is Big Data?» Accelerates advances in computer & technologies» Revolutionizes data measurement»
DATA STORAGE Martin Strohbach, AGT International (R&D) THE DATA VALUE CHAIN Value Chain Data Acquisition Data Analysis Data Curation Data Storage Data Usage Structured data Unstructured data Event processing
BIG DATA & SOCIAL INNOVATION KENNETH THOMAS, CLIENT MANAGER 1 MAKING THE RIGHT DECISSION AT THE RIGHT PLACE AT THE RIGHT TIME 2 THE DATA MULTIPLIER EFFECT AT WORK BUSINESS DRIVEN HUMAN DRIVEN MACHINE DRIVEN
White Paper A Real-time Data Hub For Smarter City Applications Intelligent Transportation Innovation for Real-time Traffic Flow Analytics with Dynamic Congestion Management 2 Understanding traffic flow
ITU Workshop on "Future Trust and Knowledge Infrastructure", Phase 1 Geneva, Switzerland, 24 April 2015 Session 4 Cloud computing for future ICT Knowledge platforms Olivier Le Grand, Senior Standardization
Surfing the Data Tsunami: A New Paradigm for Big Data Processing and Analytics Dr. Liangxiu Han Future Networks and Distributed Systems Group (FUNDS) School of Computing, Mathematics and Digital Technology,
III Big Data Technologies Today, new technologies make it possible to realize value from Big Data. Big data technologies can replace highly customized, expensive legacy systems with a standard solution
Exploiting Data at Rest and Data in Motion with a Big Data Platform Sarah Brader, email@example.com What is Big Data? Where does it come from? 12+ TBs of tweet data every day 30 billion RFID tags
ABSTRACT The emergence of big data technology and analytics Bernice Purcell Holy Family University The Internet has made new sources of vast amount of data available to business executives. Big data is
Software Engineering for Big Data CS846 Paulo Alencar David R. Cheriton School of Computer Science University of Waterloo Big Data Big data technologies describe a new generation of technologies that aim
Big Data Are You Ready? Jorge Plascencia Solution Architect Manager Big Data: The Datafication Of Everything Thoughts Devices Processes Thoughts Things Processes Run the Business Organize data to do something
Big Data & QlikView Democratizing Big Data Analytics David Freriks Principal Solution Architect TDWI Vancouver Agenda What really is Big Data? How do we separate hype from reality? How does that relate
Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores Composite Software October 2010 TABLE OF CONTENTS INTRODUCTION... 3 BUSINESS AND IT DRIVERS... 4 NOSQL DATA STORES LANDSCAPE...
1300 530 335 firstname.lastname@example.org www.c3businesssolutions.com GPO Box 589 Melbourne VIC 3001 Australia ABN 35 122 885 465 White Paper Big Data Changing the face of Business Intelligence & Information
Big Data Analytics DAMA NY DAMA Day October 17, 2013 IBM 590 Madison Avenue 12th floor New York, NY Tom Haughey InfoModel, LLC 868 Woodfield Road Franklin Lakes, NJ 07417 201 755 3350 email@example.com
AGENDA What is BIG DATA? What is Hadoop? Why Microsoft? The Microsoft BIG DATA story Hadoop PDW Our BIG DATA Roadmap BIG DATA? Volume 59% growth in annual WW information 1.2M Zetabytes (10 21 bytes) this
Impact of Big Data in Oil & Gas Industry Pranaya Sangvai Reliance Industries Limited 04 Feb 15, DEJ, Mumbai, India. New Age Information 2.92 billions Internet Users in 2014 Twitter processes 7 terabytes
The future: Big Data, IoT, VR, AR Leif Granholm Tekla / Trimble buildings Senior Vice President / BIM Ambassador What is Big Data? 2 Big Data is when the amount of data becomes part of the problem 3 Big
BIG DATA TRENDS AND TECHNOLOGIES THE WORLD OF DATA IS CHANGING Cloud WHAT IS BIG DATA? Big data are datasets that grow so large that they become awkward to work with using onhand database management tools.
The Internet of Things Giles Norman MobileFirst Consulting Manager, IBM Daniel Dombach Director EMEA, Industry Solutions, Zebra Technologies Internet of Things Daniel Dombach Director EMEA Industry Solutions
White Paper BIG DATA-AS-A-SERVICE What Big Data is about What service providers can do with Big Data What EMC can do to help EMC Solutions Group Abstract This white paper looks at what service providers
Big Data Putting data to productive use Fast Forward What is big data, and why should you care? Get familiar with big data terminology, technologies, and techniques. Getting started with big data to realize
Chapter 1 Understanding Big Data Analytics In This Chapter Defining Big Data Understanding Big Data Analytics Contrasting traditional and visual analytics approaches The era of Big Data is upon us. The
Informix The Intelligent Database for IoT Kiran Challapalli Informix Competitive Technology & Enablement firstname.lastname@example.org +91-80431-91802 Agenda What is Internet of Things (IoT) Why it matters IoT
W H I T E P A P E R Deriving Intelligence from Large Data Using Hadoop and Applying Analytics Abstract This white paper is focused on discussing the challenges facing large scale data processing and the
Bringing Big Data into the Enterprise Overview When evaluating Big Data applications in enterprise computing, one often-asked question is how does Big Data compare to the Enterprise Data Warehouse (EDW)?
The benefits and implications of the Cloud and Software as a Service (SaaS) for the Location Services Market John Caulfield Solutions Director What Is Cloud Computing Cloud Computing Everyone Is Talking
Next Internet Evolution: Getting Big Data insights from the Internet of Things Internet of things are fast becoming broadly accepted in the world of computing and they should be. Advances in Cloud computing,
End to End Solution to Accelerate Data Warehouse Optimization Franco Flore Alliance Sales Director - APJ Big Data Is Driving Key Business Initiatives Increase profitability, innovation, customer satisfaction,
5 Keys to Unlocking the Big Data Analytics Puzzle Anurag Tandon Director, Product Marketing March 26, 2014 1 A Little About Us A global footprint. A proven innovator. A leader in enterprise analytics for
Eric Ledu, The Createch Group, a BELL company Intelligence Analytics maturity Past Present Future Predictive Modeling Optimization What is the best that could happen? Raw Data Cleaned Data Standard Reports
Big Data and the Cloud Trends, Applications, and Training Stavros Christodoulakis MUSIC/TUC Lab School of Electronic and Computer Engineering Technical University of Crete email@example.com Data Explosion
beyond possible Big Use End-user applications Big Analytics Visualisation tools Big Analytical tools Big management systems The 4 Pillars of Technosoft s Big Practice Overview Businesses have long managed
NextGen Infrastructure for Big DATA Analytics. So What is Big Data? Data that exceeds the processing capacity of conven4onal database systems. The data is too big, moves too fast, or doesn t fit the structures
FOR A FEW TERABYTES MORE THE GOOD, THE BAD and THE BIG DATA Cenk Kiral Senior Director of BI&EPM solutions ECEMEA region Big Data Buzz The promise of big data Intelligent Utility - 8/28/11 Big data, analytics
Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical
Deploying Big Data to the Cloud: Roadmap for Success James Kobielus Chair, CSCC Big Data in the Cloud Working Group IBM Big Data Evangelist. IBM Data Magazine, Editor-in- Chief. IBM Senior Program Director,
Big Data and Data Science: Behind the Buzz Words Peggy Brinkmann, FCAS, MAAA Actuary Milliman, Inc. April 1, 2014 Contents Big data: from hype to value Deconstructing data science Managing big data Analyzing
Are You Ready for Big Data? Jim Gallo National Director, Business Analytics April 10, 2013 Agenda What is Big Data? How do you leverage Big Data in your company? How do you prepare for a Big Data initiative?
VIEWPOINT High Performance Analytics Industry Context and Trends In the digital age of social media and connected devices, enterprises have a plethora of data that they can mine, to discover hidden correlations
What happens when Big Data and Master Data come together? Jeremy Pritchard Master Data Management fgdd 1 What is Master Data? Master data is data that is shared by multiple computer systems. The Information
GigaSpaces Real-Time Analytics for Big Data GigaSpaces makes it easy to build and deploy large-scale real-time analytics systems Rapidly increasing use of large-scale and location-aware social media and
Mr Dinesh G Umale Saraswati College,Shegaon (Department of MCA) CLOUD COMPUTING IN HIGHER EDUCATION Abstract Technology has grown rapidly with scientific advancement over the world in recent decades. Therefore,
An Introduction to the Internet of Things (IoT) Part 1. of The IoT Series November 2013 Lopez Research LLC 2269 Chestnut Street #202 San Francisco, CA 94123 T (866) 849-5750 E firstname.lastname@example.org W
Brochure More information from http://www.researchandmarkets.com/reports/2643647/ Big Data and Telecom Analytics Market: Business Case, Market Analysis & Forecasts 2014-2019 Description: Big Data refers
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A SURVEY ON BIG DATA ISSUES AMRINDER KAUR Assistant Professor, Department of Computer
BIG DATA CHALLENGES AND PERSPECTIVES Meenakshi Sharma 1, Keshav Kishore 2 1 Student of Master of Technology, 2 Head of Department, Department of Computer Science and Engineering, A P Goyal Shimla University,
Towards a Thriving Data Economy: Open Data, Big Data, and Data Ecosystems Volker Markl email@example.com dima.tu-berlin.de dfki.de/web/research/iam/ bbdc.berlin Based on my 2014 Vision Paper On
Chapter 7 Using Hadoop Cluster and MapReduce Modeling and Prototyping of RMS for QoS Oriented Grid Page 152 7. Using Hadoop Cluster and MapReduce for Big Data Problems The size of the databases used in
Forecast of Big Data Trends Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 3 September 2014 Big Data transforms Business 2 Data created every minute Source http://mashable.com/2012/06/22/data-created-every-minute/
G-Cloud Big Data Suite Powered by Pivotal December 2014 G-Cloud service definitions TABLE OF CONTENTS Service Overview... 3 Business Need... 6 Our Approach... 7 Service Management... 7 Vendor Accreditations/Awards...
How to Leverage Big Data in the Cloud to Gain Competitive Advantage James Kobielus, IBM Big Data Evangelist Editor-in-Chief, IBM Data Magazine Senior Program Director, Product Marketing, Big Data Analytics
Understanding the impact of the connected revolution Vodafone Power to you 02 Introduction With competitive pressures intensifying and the pace of innovation accelerating, recognising key trends, understanding
NOSQL, BIG DATA AND GRAPHS Technology Choices for Today s Mission- Critical Applications 2 NOSQL, BIG DATA AND GRAPHS NOSQL, BIG DATA AND GRAPHS TECHNOLOGY CHOICES FOR TODAY S MISSION- CRITICAL APPLICATIONS
Data Modeling s Role in the Evolving World of Data Management Donna Burbank CA Technologies Who am I? More than more than 15 years of experience in the areas of data management, metadata management, and
Better Decision Making Big Data Analytics Webinar, November 2013 Dr. Wolfgang Martin Analyst and Member of the Boulder BI Brain Trust Better Decision Making Process Oriented Businesses. Decision Making:
INTERNATIONAL JOURNAL OF PURE AND APPLIED RESEARCH IN ENGINEERING AND TECHNOLOGY A PATH FOR HORIZING YOUR INNOVATIVE WORK A REVIEW ON BIG DATA MANAGEMENT AND ITS SECURITY PRUTHVIKA S. KADU 1, DR. H. R.
The future of Big Data A United Hitachi View Alex van Die Pre-Sales Consultant 1 Oktober 2014 1 Agenda Evolutie van Data en Analytics Internet of Things Hitachi Social Innovation Vision and Solutions 2
Cloud beyond the obvious, an approach for innovation Christian Verstraete Chief Technologist Cloud Strategy Our World is Changing Living in the age of tectonic shifts, and welcome to the new style of IT
Now, Next and the Future: IT, Big Data and other Implications for RIM Agenda for This Afternoon Now: What trends are creating implications within the profession? Next: Why is IT now concerned about RIM?
2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Big Data: Using ArcGIS with Apache Hadoop David Kaiser Erik Hoel Offering 1330 Esri UC2013. Technical Workshop.
Analytics March 2015 White paper Why NoSQL? Your database options in the new non-relational world 2 Why NoSQL? Contents 2 New types of apps are generating new types of data 2 A brief history of NoSQL 3
Technology Insight Paper Converged, Real-time Analytics Enabling Faster Decision Making and New Business Opportunities By John Webster February 2015 Enabling you to make the best technology decisions Enabling
Volker Markl Professor and Chair Database Systems and Information Management (DIMA), Technische Universität Berlin www.dima.tu-berlin.de Big Data Analytics Chances and Challenges Volker Markl DIMA BDOD
Big Data Are You Ready? Thomas Kyte http://asktom.oracle.com The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated
Why NoSQL? Your database options in the new non- relational world 2015 IBM Cloudant 1 Table of Contents New types of apps are generating new types of data... 3 A brief history on NoSQL... 3 NoSQL s roots
Orange County Convention Center Orlando, Florida June 3-5, 2014 Big Data and the SAP Data Platform Including SAP HANA and Apache Hadoop Balaji Krishna, SAP HANA Product Management @balajivkrishna LEARNING
Trends and Research Opportunities in Spatial Big Data Analytics and Cloud Computing NCSU GeoSpatial Forum Siva Ravada Senior Director of Development Oracle Spatial and MapViewer 2 Evolving Technology Platforms