Dynamic Data Analytics for the Internet of Things: Challenges and Opportunities Payam Barnaghi Institute for Communication Systems (ICS) University of Surrey/CityPulse Consortium Guildford, United Kingdom IoT Large-Scale Analytics Workshop IoT Week Lisbon, June 2015 1
Contextual Challenges Data Volume Security, Reliability, Trust and Privacy AnyThing Networking and Communication AnyPlace AnyTime Services and Applications Societal Impacts, Economic Values and Viability 2
IoT Data- Challanges Multi-modal and heterogeneous Noisy and incomplete Time and location dependent Dynamic and varies in quality Crowed sourced data can be unreliable Requires (near-) real-time analysis Privacy and security are important issues Data can be biased- we need to know our data! 3
Relying merely on data from sources that are unevenly distributed, without considering background information or social context, can lead to imbalanced interpretations and decisions. It s also about automation in addition to insight and information extraction.? 4
Data Lifecycle Source: The IET Technical Report, Digital Technology Adoption in the Smart Built Environment: Challenges and opportunities of data driven systems for building, community and city-scale applications, http://www.theiet.org/sectors/built-environment/resources/digital-technology.cfm 5
IoT environments are usually dynamic and (near-) real-time Off-line Data analytics Data analytics in dynamic environments Image sources: ABC Australia and 2dolphins.com 6
IoT Data 7
Deep IoT 8
The ultimate goal is transforming the raw data to insights and actionable knowledge and/or creating effective representation forms for machines and also human users and creating automation. This usually requires data from multiple sources, (near-) real time analytics and visualisation and/or semantic representations. 9
Data will come from various source and from different platforms and various systems. This requires an ecosystem of IoT systems with several backend support components (e.g. pub/sub, storage, discovery, and access services). Semantic interoperability is also a key requirement. 10
Search on the Internet/Web in the early days 11
IoT discovery engines? Working across different systems and various platforms is a key requirement. Internet search engines work very well with textual data, but IoT data comes in various forms and often as streams. This requires an ecosystem of IoT systems with several backend support components (e.g. pub/sub, storage, discovery, and access services). 12
IoT discovery engines? To make it more complex, IoT resources are often mobile and/or transient. Quality and trust (and obviously privacy) are among the other key challenges. This requires efficient distributed index and update mechanisms, quality-aware an resourceaware selection and ranking, and privacy control and preservation methods (and governance models). 13
Accessing IoT data The internet/web norm (for now) is usually searching for the data; the search engines are usually information locators return the link to the information; IoT data access is more opportunistic and context aware. This requires context-aware and opportunistic push mechanism, dynamic device/resource associations and (software-defined) data routing networks. 14
Web search is already adapting this model Image credits: the Economist 15
A discovery engine for the IoT A. HosseiniTabatabaie, P. Barnaghi, C. Wang, L. Dong, R. Tafazolli, "Method and Apparatus for Scalable Data Discovery in IoT Systems, US Patents, May 2014. 16
CityPulse demo 17
KAT- Knowledge Acquisition Toolkit http://kat.ee.surrey.ac.uk/
The future: borders will blend Source: IEEE Internet Computing, Special issue on Physical-Cyber-Social Computing 19
In conclusion IoT data analytics is different from common big data analytics. Data collection in the IoT comes at the cost of bandwidth, network, energy and other resources. Data collection, delivery and processing is also depended on multiple layers of the network. We need more resource-aware data analytics methods and cross-layer optimisations (Deep IoT). The solutions should work across different systems and multiple platforms (Ecosystem of systems). Data sources are more than physical (sensory) observation. The IoT requires integration and processing of physical-cyber-social data. The extracted insights and information should be converted to a feedback and/or actionable information. 20
Smart city datasets http://iot.ee.surrey.ac.uk:8080 21
IET sector briefing report Available at: http://www.theiet.org/sectors/built-environment/resources/digital-technology.cfm 22
Q&A Thank you. EU FP7 CityPulse Project: http://www.ict-citypulse.eu/ @pbarnaghi p.barnaghi@surrey.ac.uk