Mega Modeling for Scien/fic Big Data Processing

Size: px
Start display at page:

Download "Mega Modeling for Scien/fic Big Data Processing"

Transcription

1 Mega Modeling for Scien/fic Big Data Processing Stefano Ceri, Emanuele Della Valle (Politecnico di Milano) Dino Pedreschi, Roberto Trasar/ (ISTI- CNR and University of Pisa) 1

2 The context 2

3 Scenario BIG DATA: A new data revolu/on. Data is reshaping every individual and collec/ve ac/vity of people s life. - Sensors and people produce huge amounts of data - Data is becoming accessible everywhere via the Web Scien/fic big data is changing our avtude towards science, from specialized to massive experiments and from focused to broad ques/ons. A data- centric vision goes towards Horizon 2020 s objec/ves. 3

4 Examples of Big Data A. London Traffic 4

5 Challenges of Scien/fic Big Data Processing Smart Ci/es Ci/es are becoming smarter, as governments, businesses, and communi/es increasingly rely on technology to overcome the challenges from rapid urbaniza/on. Typical ques/ons for smart ci/es: Where in the city are people converging during a typical week day? Or during weekends? Is public transporta/on dynamically adap/ng to people s density? Is a traffic jam going to happen on this road? And is it then convenient to reallocate travellers based upon the forecast? Where are all my friends mee/ng? Can I reach them? Should I use public transports or go by car? 5

6 B. Pulse of the Na/on inferred from Twicer [source hcp:// ] 6

7 C. Facebook World s Geography The social network behind Facebook! 7

8 Challenges of Scien/fic Big Data Processing Social Mining Using user- generated content for discovering and analyzing emergent social behaviors, by combining sensing of personal micro- data (tweets, web logs, mobile phones traces) and par/cipatory sensing (via crowdsourcing, GWAP, ). Typical ques/ons for social mining: Who will win US elec/ons? What s the elector s current inten/on of vote? How reliable is it? Which are the indicators of social well- being (beyond GDP) and how can they be computed and monitored? How is the aging popula/on effec/vely helped by the social par/cipa/on to digital community services? What is the link between media ownership and media content? Is there bias in news repor/ng? And in content reviews? Is an infec/ve disease emerging? How is its diffusion model? 8

9 D. Genomic Data 9

10 Challenges of Scien/fic Big Data Processing Genomic Compu/ng The context: thanks to Fast DNA Sequencing, personalized genomic medicine will become possible: aner a blood sample, with a cost below 100$ and within hours or minutes of compu/ng /me, have the en/re genome of each individual available at a genome browser New ques/ons and scenarios: Am I the carrier of gene/c muta/ons? Will I develop cancer? How obesity correlates with breast cancer? Which computa/onal approach can discriminate between "driver" or "passenger" cancer DNA muta/ons? How can specific target genes be assigned to epigene/cally defined regulatory regions? How do epigene/c modifica/ons affect DNA synthesis during the replica/on of genomes? 10

11 All the scenarios require MODELS MODEL Representa/on of the problem space in the ICT vocabulary (concepts, data, processes, systems). Computa/onal abstrac/ons extrac/ng relevant data from input data Models can: Based upon analy/cal/sta/s/cal laws Based upon simula/ons, extrac/ng general behaviors from many observa/ons of the behavior of individuals Based upon induc/ve methods applied to data Challenge: convergence of three types of models 11

12 Mo/va/ng Context: FutureICT Flagship SCIENCE: The ul/mate goal of the FuturICT flagship project is to understand and manage complex, global, socially interac/ve systems, with a focus on sustainability and resilience. POLICY: FuturICT will build a Living Earth Plasorm, a simula/on, visualiza/on and par/cipa/on plasorm to support decision- making of policy- makers, business people and ci/zens. TECHNOLOGY: Integra/ng ICT, Complexity Science and the Social Sciences will create a paradigm shin, facilita/ng a symbio/c co- evolu/on of ICT and society. 12

13 FuturICT Vision 13

14 A s/mulus from FuturICT vision: World- of- Modeling Plasorm THEORY Classify models by type and describe each type s proper/es. Define (type- aware) strong interoperability within the elements of the same class Define model interoperability among models of different classes PRACTICE Build language abstrac/ons and sonware plasorms suppor/ng them 14

15 Mega- Modeling Concept 15

16 Mega- Modeling for Scien/fic Data General goal: Building a model of models - which describes each model s proper/es and interac/ons - for suppor/ng opera/ons upon models, such as selec/on, inspec/on, composi/on, subs/tu/on, reduc/on, extension, and search. Keywords: big data, data pacerns, management of complexity, uncertainty, dynamic composi/on, adapta/on. Chris Welty (Jeopardy): Increasingly computa/onal tasks require inexact solu/ons that combine mul/ple methods in unpredictable ways (WWW 2012, Lyon) 16

17 Which scien/fic computa/ons? Mathema=cal model: uses mathema/cal concepts and language. Analy=cal Model: mathema/cal models that have a closed form solu/on Numerical Model: mathema/cal models that are solved by numerical approxima/on Sta=s=cal model: uses sta/s/cal concepts and language, e.g. probability distribu/on func/ons. Data mining model: extracts pacerns from large data sets. Simula=on model: predicts the expected behavior of a system. Agent- based model: simulates the ac/ons and interac/ons of autonomous agents (represen/ng individuals, groups or organiza/ons) 17

18 How should they be modeled? By embedding scien/fic computa/ons within a conceptual/ontological model of reality that serves the purpose of defining how computa/onal models share and exchange data, with a clear seman/cs 18

19 The root: Mega- Programming Wiederhold- Wegner- Ceri, CACM, Nov Mega- module: Internally homogeneous, independently maintained sonware system. Each mega- module describes its externally accessible data structures and opera/ons. Megaprogramming language MPL A form of programming in the large It developed into: mediators, web services, Workflow / business process languages, seman/c web services, web

20 Useful ideas of mega- programming Every mega- module exposes a data model and certain opera/ons to a mega- program: SUPPLY: provide data in model- compa/ble format INVOKE: ac/vate computa/on through entry points EXTRACT: provides mega- module results EXAMINE: makes access to internal state variables ESTIMATE: gets informa/on about execu/on comple/on LIMIT: constraints execu/on /me & cost 20

21 Previous Uses of Mega- Modeling Term BEZEVIN- VALDURIEZ: On the need for megamodels (2004), emphasis on meta- models and model registry. BEZIVIN: Model of models (2004), a model of rela/onships between models. FAVRE: Meta- model of model transforma/ons (2005), models linked by rela/onships such as representa(onof, conformsto, istransformedin. SEIBEL et al. (2010) dynamic hierarchical data models for traceability emphasis on dependencies between model ar/facts. SEIBEL et al. (2011) mega- models for modeling run/me behavior 21

22 Data- driven computa/on paradigms Data analysis: process of extrac/ng useful informa/on from input data by using any kind of model (including data mining). Data mining: automa/c or semi- automa/c analysis of large data sets to extract previously unknown interes(ng paeerns (emphasis on induc/on). 22

23 On the meaning of pacern PaEern type = context- independent data format for expressing the results of data analysis and data mining ac/vi/es e.g. trajectories PaEern instance = context- specific data item compliant to the pacern type - e.g. my trajectory from office to home today PaEern = context- specific popula/on of pacern instances, featuring an intensional descrip/on (name, pacern type, qualifying parameters, including quality parameters) and an extension (set of pacern instances) e.g. the cluster of trajectories leading to Linate airport through the highway PaEern extrac=on = compu/ng pacerns in a given context, by first evalua/ng pacern instances and then abstrac/ng the common proper/es that collec/vely describe a popula/on 23

24 The authors history of pacerns 24

25 MineRule Operator (associa/on rules) Data type Tabular representa/on of associa/on rules (HEAD, BODY, SUPPORT, CONFIDENCE) Pacern type Associa/on rule HEAD - > BODY, featuring sta/s/cal proper/es of confidence, support Paradigm Mine Rule Operator: SQL- based language for extrac/ng associa/on rules and puvng them into a tabular format, with built- in variables HEAD, BODY, SUPPORT, CONFIDENCE 25

26 Mine Rule Pacern MINE RULE PurchaseBasket AS SELECT DISTINCT l..n item AS BODY, I..1 item AS HEAD, SUPPORT, CONFIDENCE FROM Purchase WHERE DATE BETWEEN AND GROUP BY Transac/on HAVING COUNT(*) >= 3 EXTRACTING RULES WITH SUPPORT: 0.2, CONFIDENCE: 0.2 Associations body head support confidence ski_pants jacket hiking_boots jacket ski_pants, hiking_boots jacket col_shirt jacket col_shirt,hiking_boots jacket

27 Stream Reasoning Data Types RDF Stream: unbound sequence of /mestamped RDF triples Window (sliding or tumbling): top por/on of the RDF stream Time stamp func/on: associated to triples Pacern Type Computa/on of a new stream from data and streams Paradigm Addi/on to standard Sparql of new data types and of con/nuous seman/cs (i.e., streams and registered queries over streams) 27

28 An Example of C-SPARQL Stream Who are the opinion makers? i.e., the users who are likely to influence the behaviour of other users who follow them REGISTER STREAM OpinionMakers COMPUTED EVERY 5m AS CONSTRUCT {?opinionmaker sd:about?resource } FROM STREAM < [RANGE 30m STEP 5m] WHERE { }?opinionmaker?opinion?resource.?follower sioc:follows?opinionmaker.?follower?opinion?resource. FILTER ( cs:timestamp(?follower) > cs:timestamp(?opinionmaker) &&?opinion!= sd:accesses ) HAVING ( COUNT(DISTINCT?follower) > 3 ) ER Stefano Ceri 28

29 M- Atlas Interoperability for trajectories Data types Points, lines, polygons, trajectories (moving points) Pacerns Clusters: trajectories of points with the same label Flows: trajectories moving between regions Flocks: spa/o- temporal coincidence of flows Paradigm SQL- like language for building pacerns and for querying, transforming, composing and visualizing them. 29

30 M- Atlas queries for social mining How do people leave Milan s city center toward suburban areas? CREATE MODEL MilanODMatrix AS MINE ODMATRIX FROM (SELECT t.id, t.trajectory FROM TrajectoryTable t), (SELECT orig.id, orig.area FROM MunicipalityTable orig), (SELECT dest.id, dest.area FROM MunicipalityTable dest) CREATE RELATION CenterToNESuburbTrajectories USING ENTAIL FROM (SELECT t.id, t.trajectory FROM TrajectoryTable t, MilanODMatrix m WHERE m.origin = Milan AND m.des/na/on IN (Monza,..., Brugherio)) CREATE MODEL ClusteringTable AS MINE T- CLUSTERING FROM (Select t.id, t.trajectory from CenterToNESuburbTrajectories t) SET T- CLUSTERING.FUNCTION = ROUTE_SIMILARITY AND T- CLUSTERING.EPS = 400 AND T- CLUSTERING.MIN_PTS = 5 30

31 Search Compu/ng Data type: Ranked data services with input/output parameters Pacern type: Service combina/ons obtained by compu/ng top- k join queries Paradigm: SeCoQL, a query language and protocol suppor/ng ranked queries on services and exploratory search 31

32 Search Compu/ng Queries DEFINE QUERY NightPlan($X:String, $Y: string, $Z:Integer, $U:String, $V:String) AS SELECT M.*, T.*, R.*, TotalPrice=T.Price + R.AvgPrice FROM ((Movie (igenre: $X, icountry: Y, iyear: $Z) AS M USING IMDB_MOVIES, JOIN Theatre (iaddress: $U, icity: $V, icountry: $Y) AS T USING GOOGLE_DISPLAYING ON M.Title=T.Title) JOIN Restaurant (icountry: $Y, icategory: "Italian Restaurant") AS R USING YQL_LOCAL ON T.address=R.Address AND T.city=R.City) WHERE R.Ra/ng>3 RANK BY (R=0.4, T=0.3, M=0.3) LIMIT 20 TUPLES AND 50 CALLS 32

33 CrowdSearcher Data type: List of search items with a regular schema (possibly produced by a conven/onal search system) Pacern types: Annota/ons on search items (like, dislike, recommend, tag, score, order, group, top, insert delete, correct, connect) Paradigm: Use of crowd for adding pacerns to search items 33

34 CrowdSearcher Model Data type: collec/on of tuples Query type: Like, Add, Sort / Rank, Comment, Modify 34

35 Example of crowdsourcing 35

36 Crowdsearcing results

37 Common aspects of five pacerns High- level data representa/on through tables High- level data manipula/on language as an extension of major rela/onal languages, one of: SQL, Sparql, Datalog+- Recipe: Expose a tabular representa/on Use a rela/onal language extension for computa/on & composi/on 37

38 (just a bit more) Systema/c view 38

39 Pacerns for classifica/on & clustering CLASSIFICATION. The computa/on extracts classes from a popula/on, each class has a name and sta/s/cs from simple frequencies up. Data: Popula/on(Item) Pacern: Class(Name, AggrStats) CLUSTERING. The computa/on extracts clusters from a collec/on, each cluster has a name, an extent (consis/ng of its elements), a centroid element, and sta/s/cs from cardinali/es up. Data: Pacern: Collec/on(Item) Cluster(Name, Extent: [Item], CentroidItem, AggrStats) 39

40 Pacerns for Streams STREAMING. Stream compu/ng aggregates data of a given type from a stream; it associates each type with a valid /me interval, typically the most recent, and aggregate proper/es. Data: Stream(TimeStamp, Item) Pacern: StreamStats(ItemType, TimeInterval, AggrStats) STREAMING WITH WINDOWS. The stream is subdivided in windows, stream compu/ng associates a given type and window with aggregate proper/es. Data: Stream(Window, StartTimeStamp, EndTimeStamp, Content:[Item]) Pacern: WindowedStats(Window, ItemType, AggrStats) 40

41 Pacerns for Associa/on Rules ASSOCIATION RULES. They solve the basket analysis problem; each associa/on rule has an head and a body describing item sets, and then sta/s/cal proper/es of support and confidence defining the rule s interest. Data Basket(Tid,Item) Pacern: Rule(Head:[Item], Body:[Item], Support, Confidence) 41

42 Pacerns for Trees TREE. Classical computa/ons provide the descendants or ancestors of a given node, or classify a new node rela/ve to a taxonomy, by returning the path from the root to the most similar node Data: Tree (Item, Children: [Item]) Pacern: Descendants(Item, To: [Item]) Ancestors(Item, From: [Item]) Classify (Item, Path[Item]) 42

43 Pacerns for Graphs GRAPH. Classical computa/ons provide a decomposi/on of a graph into components or find the friend nodes which are at a given nearness from a given node. Data: Pacern: Graph(FromItem, ToItem) Components(Name, Components: [Node]) Friends(FromItem, NearnessLevel, To: [Item]) DISTANCE- GRAPH. Shortest path between any two items expressed as a sequence of nodes connec/ng them and a totaldistance. Data: Pacern: D- Graph(FromItem, ToItem, Distance) ShortestPath(OriginItem, Des/na/onItem, Path: [Item], TotalDistance) 43

44 Pacerns for Moving Points MOVING POINTS. Reconstruc/on of the trajectories as sequences of loca/ons which are traversed by the same item. Data: Pacern: Point(Item, Time, Loca/on) Trajectory(Item, FromLoca/on, ToLoca/on, Steps:[Loca/on], StepCount: Number) FLOCKS. Combina/on of trajectories together to recognize flocks, i.e. simultaneous movements of groups of individuals across regions. Data: Trajectory(Item, FromLoca/on, ToLoca/on, Steps:[Loca/on], StepCount: Number) Pacern: Flock(FlockName, FromRegion, ToRegion, TimeInterval, Objects: [Items], ObjectCount: Number) 44

45 (eventually) Mega- modules 45

46 Mega- modules 46

47 Format Data prepara/on Purpose: assembling input objects typically applica/on- specific Techniques: abstrac/on, seman/c enrichment, noise reduc/on Computa/on complexity: low (a data scan or sort) Data analysis Purpose: performing the core scien/fic processing, compu/ng output objects applica/on- independent Techniques: computa/onal models Computa/on complexity: as required (par//oning and streaming recommended) Data evalua/on Purpose: extrac/ng & presen/ng results typically applica/on- specific Techniques: quality assessment, filtering, significance measuring, diversifica/on, ranking Computa/on complexity: as required (object transforma/ons to fit needs) 47

48 Inspec/ons and controls Megamodule inspec/on Aner prepara/on: view of input objects Aner execu/on: view of output objects Megamodule controls Based upon inspec/on May alter behavior, suspend, resume, terminate 48

49 Ra/onale Data analysis: reusable transforma/on of input objects into output objects Classical mathema/cal/sta/s/cal algorithms compute output data Simula/on algorithms predict output data Data mining methods induce output data Applica/on- independent input and output objects compliant with pacern types 49

50 Rela/onal View of Mega- Modules Input/output objects for data analysis in object- rela/onal format? Poten/al for high- level declara/ve data analysis descrip/on using extended rela/onal query language Easing inspec/on and control Easing data analysis reuse 50

51 Example: M- Atlas 51

52 Running Example Data prepara/on GPS observa/ons of the same individual are assembled into a trajectory Data analysis Trajectories are assembled and reported as simultaneous movements of groups of people (flocks) Data evalua/on Flocks which are most relevant (above threshold) are reported upon a map 52

53 Composi/on Abstrac/ons Used for assembling mega- modules into higher order computa/ons If appropriately chosen, are key to mega- module reuse Ideal design process = top- down, recursive applica/on of (de)composi/on abstrac/ons up to finding the appropriate mega- modules within a repository 53

54 Composi/on Abstrac/ons (so far) General- purpose Pipeline Parallel/Itera/ve Recurrent What- if control Drin control 54

55 Pipeline 55

56 Parallel/Itera/ve 56

57 Map- Reduce 57

58 What- If 58

59 Drin Control 59

60 Graph Decomposi/on 60

61 Summary of ICT Requirements for Scien/fic Big Data Management In the small (modules, each processing terabytes of data) Iden/fy reusable data formats as pacern types Iden/fy reusable computa/ons as data analysis models Iden/fy appropriate data transforma/ons for data prepara/on Iden/fy appropriate quality assessments for data evalua/on In the large (composing mega- modules) Foster composi/on through appropriate composi/on abstrac/ons + infrastructures Allow for assessing proper/es of the mega- module composi/on Correctness, reliability, etc. Allow for inspec/on of mega- modules during processing Assessing current state, intermediate results, etc. Allow for dynamic reconfigura/on of each mega- module Scale up and down in response to the load, recover a computa/on aner a fault, etc. 61

62 Examples of applica/ons through composi/ons of MegaModules 62

63 BOTTARI: restaurant recommender based on geo- aware social media analy/cs ER Stefano Ceri 63

64 BOTTARI as a Mega- Model Composi/on Explicit module structure with input- output rela/onships Outputs Inputs Geo-Spatial Model BOTTARI Predictive Model Social Media Crawler and Miner Temporal Model 64

65 BOTTARI Models Geo- spa(al model Input: User posi/on, seman/c + geo- spa/al descrip/on of restaurants Output: a list of matching restaurants ranked by distance from the user Temporal model Input: stream of liked restaurants Output: ranking of restaurants in like order in the last week/month/ quarter Predic(ve model Input: materialized stream of liked restaurants Output: predic/on of the restaurant which will be chosen by the user as best- fit Social Media Crawler and Miner Input: stream of tweets of people about restaurants Output: stream of most liked restaurant aner named en/ty recogni/on and sen/ment mining 65

66 Mega- modulariza/on of Bocari 66

67 Mobility analysis system 67

68 Mobility Manager Service How do driver get to Linate? Two alterna/ve routes to Linate Airport Trajectories that entails the clusters whose des/na/on is Linate GPS Tracks 68

69 End- User Service User s Mobility Profiling for Car Pooling Trajectories that entail the cluster Home- Work Spa/o- Temporal User s mobility profile User s GPS Tracks Home = most frequent loca/on Work = second most frequent loca/on Trajectories that entail the cluster Work- Home 69

70 Mega- modulariza/on of Trajectory Clustering Input GPS data TRAJECTORY RECONSTRUCTION & SELECTION TRAJECTORY CLUSTERING Geography, Zoning and Road Network CLUSTER EVALUATION Clustered Trajectories Cluster Statistics 70

71 Trajectory Clustering Megamodule Usages End- user Service Mobility Mng. Service 71

72 Mega- modulariza/on for Mobility Manager Service All Users Trajectories Trajectory Clusters Spatio-Temporal Observations Destination e.g., Linate Routes to Linate DATA CLEANING TRAJECTORIES RECONSTRUCTION Semantic of a Stop TRAJECTORIES FILTERING TRAJECTORY CLUSTERING Spatio-temporal Distance function ROUTES IDENTIFICATION 72

73 Mega- modulariza/on of Trajectory Clustering for Car Pooling Single User s Trajectories Single User s Trajectory Clusters Spatio-Temporal Observations User s Mobility Profile DATA CLEANING TRAJECTORIES RECONSTRUCTION Semantic of a Stop TRAJECTORIES FILTERING TRAJECTORY CLUSTERING Spatio-temporal Distance function CLUSTERING DECOMPOSITIO N USER MOBILITY PROFILE COMPUTATION Spatio-Temporal Thresholds PROFILE AGGREGATION 73

74 Research ques/ons & agenda Express a large collec/on of pacerns through suitable (rela/onal) language extensions Build an ontology of mega- models, support reasoning upon the ontology for deriving proper/es of mega- models Define/classify composi/on abstrac/ons and define the mega- modeling composi/on language Consider research problems related to: Op/miza/on (inter vs intra) Orchestra/on Inspec/on Adapta/on Build the sonware engineering tools and environment for building and composing mega- models 74

75 Summary of the talk Mo/va/ons Examples of big scien/fic data, FuturICT Typical research ques/ons Why MegaModelling? History of the term What should be solved What is a pacern Applica/on- independent, tabular, composable What is a mega- module Ingredients: Prepara/on / Analysis / Evalua/on Composi/on abstrac/ons Examples of mega- modulariza/ons To- do list 75

Data Warehousing. Yeow Wei Choong Anne Laurent

Data Warehousing. Yeow Wei Choong Anne Laurent Data Warehousing Yeow Wei Choong Anne Laurent Databases Databases are developed on the IDEA that DATA is one of the cri>cal materials of the Informa>on Age Informa>on, which is created by data, becomes

More information

An Open Dynamic Big Data Driven Applica3on System Toolkit

An Open Dynamic Big Data Driven Applica3on System Toolkit An Open Dynamic Big Data Driven Applica3on System Toolkit Craig C. Douglas University of Wyoming and KAUST This research is supported in part by the Na3onal Science Founda3on and King Abdullah University

More information

How To Use A Webmail On A Pc Or Macodeo.Com

How To Use A Webmail On A Pc Or Macodeo.Com Big data workloads and real-world data sets Gang Lu Institute of Computing Technology, Chinese Academy of Sciences BigDataBench Tutorial MICRO 2014 Cambridge, UK INSTITUTE OF COMPUTING TECHNOLOGY 1 Five

More information

Keeping Pace with Big Data

Keeping Pace with Big Data - A Data Mining Perspec>ve Huan Liu, Tempe, AZ hep://www.public.asu.edu/~huanliu NSF Workshop on Big Data Analy6cs for Infrastructure and Building Resilience and Sustainability, Beijing, China Sept 19-20,

More information

ANALYTICAL TECHNIQUES FOR DATA VISUALIZATION

ANALYTICAL TECHNIQUES FOR DATA VISUALIZATION ANALYTICAL TECHNIQUES FOR DATA VISUALIZATION CSE 537 Ar@ficial Intelligence Professor Anita Wasilewska GROUP 2 TEAM MEMBERS: SAEED BOOR BOOR - 110564337 SHIH- YU TSAI - 110385129 HAN LI 110168054 SOURCES

More information

Data Management in the Cloud: Limitations and Opportunities. Annies Ductan

Data Management in the Cloud: Limitations and Opportunities. Annies Ductan Data Management in the Cloud: Limitations and Opportunities Annies Ductan Discussion Outline: Introduc)on Overview Vision of Cloud Compu8ng Managing Data in The Cloud Cloud Characteris8cs Data Management

More information

Language Resources, Language Technology, Text Mining, the Seman8c Web: How interoperability of machines can help humans in the mul8lingual web

Language Resources, Language Technology, Text Mining, the Seman8c Web: How interoperability of machines can help humans in the mul8lingual web Language Resources, Language Technology, Text Mining, the Seman8c Web: How interoperability of machines can help humans in the mul8lingual web Felix Sasaki DFKI / University of Appl. Sciences Potsdam W3C

More information

The Library (Big) Data scien4st

The Library (Big) Data scien4st The Library (Big) Data scien4st IFLA/ALA webinar: Big Data: new roles and opportuni4es for new librarians June 15 th 2016 IFLA Big Data Special Interest Group (SIG) Wouter Klapwijk, Stellenbosch University,

More information

How To Understand The Big Data Paradigm

How To Understand The Big Data Paradigm Big Data and Its Empiricist Founda4ons Teresa Scantamburlo The evolu4on of Data Science The mechaniza4on of induc4on The business of data The Big Data paradigm (data + computa4on) Cri4cal analysis Tenta4ve

More information

METHODS AND TECHNIQUES OF PREDICTION OF KEY PERFORMANCE INDICATORS FOR IMPLEMENTATION OF CHANGES IN MAINTENANCE ORGANISATION

METHODS AND TECHNIQUES OF PREDICTION OF KEY PERFORMANCE INDICATORS FOR IMPLEMENTATION OF CHANGES IN MAINTENANCE ORGANISATION Management Systems in Production Engineering 0, No (5), pp 5 9 METHODS AND TECHNIQUES OF PREDICTION OF KEY PERFORMANCE INDICATORS FOR IMPLEMENTATION OF CHANGES IN MAINTENANCE ORGANISATION Andrzej WIECZOREK

More information

Web Services and Development of Semantic Applications

Web Services and Development of Semantic Applications Web Services and Development of Semantic Applications Trish Whetzel Outreach Coordinator THE NATIONAL CENTER FOR BIOMEDICAL ONTOLOGY Na#onal Center for Biomedical Ontology Mission To create software for

More information

Modeling and mining large scale biological seman0c networks using NEO4J

Modeling and mining large scale biological seman0c networks using NEO4J Modeling and mining large scale biological seman0c networks using NEO4J Junaid Gamieldien Principal Inves.gator Clinical Sequencing and Biomarker Discovery Neo4J Graph database Graph is composed of two

More information

CS 5150 So(ware Engineering Evalua4on and User Tes4ng

CS 5150 So(ware Engineering Evalua4on and User Tes4ng Cornell University Compu1ng and Informa1on Science CS 5150 So(ware Engineering Evalua4on and User Tes4ng William Y. Arms Usability: The Analyze/Design/Build/Evaluate Loop Analyze requirements Design User

More information

Cloudian The Storage Evolution to the Cloud.. Cloudian Inc. Pre Sales Engineering

Cloudian The Storage Evolution to the Cloud.. Cloudian Inc. Pre Sales Engineering Cloudian The Storage Evolution to the Cloud.. Cloudian Inc. Pre Sales Engineering Agenda Industry Trends Cloud Storage Evolu4on of Storage Architectures Storage Connec4vity redefined S3 Cloud Storage Use

More information

Opportuni)es and Challenges of Textual Big Data for the Humani)es

Opportuni)es and Challenges of Textual Big Data for the Humani)es Opportuni)es and Challenges of Textual Big Data for the Humani)es Dr. Adam Wyner, Department of Compu)ng Prof. Barbara Fennell, Department of Linguis)cs THiNK Network Knowledge Exchange in the Humani)es

More information

Social Media Analy.cs (SMA)

Social Media Analy.cs (SMA) Social Media Analy.cs (SMA) Emanuele Della Valle DEIB - Politecnico di Milano emanuele.dellavalle@polimi.it hap://emanueledellavalle.org What's social media? haps://www.youtube.com/watch?v=sgniiud_oqg

More information

Big Data in medical image processing

Big Data in medical image processing Big Data in medical image processing Konstan3n Bychenkov, CEO Aligned Research Group LLC bychenkov@alignedresearch.com Big data in medicine Genomic Research Popula3on Health Images M- Health hips://cloud.google.com/genomics/v1beta2/reference/

More information

Data Mining. Supervised Methods. Ciro Donalek donalek@astro.caltech.edu. Ay/Bi 199ab: Methods of Computa@onal Sciences hcp://esci101.blogspot.

Data Mining. Supervised Methods. Ciro Donalek donalek@astro.caltech.edu. Ay/Bi 199ab: Methods of Computa@onal Sciences hcp://esci101.blogspot. Data Mining Supervised Methods Ciro Donalek donalek@astro.caltech.edu Supervised Methods Summary Ar@ficial Neural Networks Mul@layer Perceptron Support Vector Machines SoLwares Supervised Models: Supervised

More information

Data Obesity: Ethics, Law or Regulation?

Data Obesity: Ethics, Law or Regulation? Data Obesity: Ethics, Law or Regulation? Mireille Hildebrandt Chair of Smart Environments, Data Protec:on and the Rule of Law, RU Nijmegen Professor of Technology Law and Law in Technology, Vrije Universiteit

More information

Urban Big Data Centre

Urban Big Data Centre Urban Big Data Centre Piyushimita Thakuriah (Vonu) Director, UBDC Professor and Ch2M Chair of Transport UNIVERSITY OF GLASGOW November 12, 2015 July 10, 2015 UBDC Partners Funded by ESRC Big Data Network

More information

Making Sense of Big Data. Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl.

Making Sense of Big Data. Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl. Making Sense of Big Data Dr. Thomas E. Potok Computa2onal Data Analy2cs Group Leader Oak Ridge Na2onal Laboratory potokte@ornl.gov 865-574- 0834 ORNL s Big Data Legacy Science National Security Energy

More information

Ins+tuto Superior Técnico Technical University of Lisbon. Big Data. Bruno Lopes Catarina Moreira João Pinho

Ins+tuto Superior Técnico Technical University of Lisbon. Big Data. Bruno Lopes Catarina Moreira João Pinho Ins+tuto Superior Técnico Technical University of Lisbon Big Data Bruno Lopes Catarina Moreira João Pinho Mo#va#on 2 220 PetaBytes Of data that people create every day! 2 Mo#va#on 90 % of Data UNSTRUCTURED

More information

Theo JD Bothma Department of Informa1on Science theo.bothma@up.ac.za

Theo JD Bothma Department of Informa1on Science theo.bothma@up.ac.za Theo JD Bothma Department of Informa1on Science theo.bothma@up.ac.za Reflec1ons on the role of corpora and big data in e- lexicography in rela1on to end user informa1on needs CILC 2015 7th Interna1onal

More information

Interac(ve Broker (UK) Limited Webinar: Proprietary Trading Groups

Interac(ve Broker (UK) Limited Webinar: Proprietary Trading Groups Interac(ve Broker (UK) Limited Webinar: Proprietary Trading Groups Presenter Gerald Perez Managing Director London, United Kingdom E- mail: gperez@interac=vebrokers.com Important Informa=on: The risk of

More information

Seman&c Web: Benefits For Clinical Decision Support At The Bedside. Emory Fry, MD SemTechBiz 2013

Seman&c Web: Benefits For Clinical Decision Support At The Bedside. Emory Fry, MD SemTechBiz 2013 Seman&c Web: Benefits For Clinical Decision Support At The Bedside Emory Fry, MD SemTechBiz 2013 Clinical Decision Support (CDS) A system providing knowledge and person specific or popula8on informa8on

More information

Ibis: Scaling Python Analy=cs on Hadoop and Impala

Ibis: Scaling Python Analy=cs on Hadoop and Impala Ibis: Scaling Python Analy=cs on Hadoop and Impala Wes McKinney, Budapest BI Forum 2015-10- 14 @wesmckinn 1 Me R&D at Cloudera Serial creator of structured data tools / user interfaces Mathema=cian MIT

More information

An Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style

An Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style An Integrated Approach to Manage IT Network Traffic - An Overview Click to edit Master /tle style Agenda A quick look at ManageEngine Tradi/onal Traffic Analysis Techniques & Tools Changing face of Network

More information

Introduc)on to the IoT- A methodology

Introduc)on to the IoT- A methodology 10/11/14 1 Introduc)on to the IoTA methodology Olivier SAVRY CEA LETI 10/11/14 2 IoTA Objec)ves Provide a reference model of architecture (ARM) based on Interoperability Scalability Security and Privacy

More information

Social Network Mining

Social Network Mining SSIIM - Seminários de Sistemas Inteligentes, Interacção e Mul8média, MIEIC Social Network Mining Eduarda Mendes Rodrigues Assistant Professor DEI- FEUP, Universidade do Porto hhp://www.fe.up.pt/~eduarda

More information

1 Actuate Corpora-on 2013. Big Data Business Analy/cs

1 Actuate Corpora-on 2013. Big Data Business Analy/cs 1 Big Data Business Analy/cs Introducing BIRT Analy3cs Provides analysts and business users with advanced visual data discovery and predictive analytics to make better, more timely decisions in the age

More information

Better Transnational Access and Data Sharing to Solve Common Questions

Better Transnational Access and Data Sharing to Solve Common Questions Better Transnational Access and Data Sharing to Solve Common Questions Julia Lane American Ins0tutes for Research University of Strasbourg University of Melbourne Overview Common Ques0ons New kinds of

More information

DTCC Data Quality Survey Industry Report

DTCC Data Quality Survey Industry Report DTCC Data Quality Survey Industry Report November 2013 element 22 unlocking the power of your data Contents 1. Introduction 3 2. Approach and participants 4 3. Summary findings 5 4. Findings by topic 6

More information

How to Measure Progress & Impact: Network Mapping

How to Measure Progress & Impact: Network Mapping How to Measure Progress & Impact: Network Mapping Professor Robyn Keast Chair Collaborative Research Network: Policy and Planning for Regional Sustainability, Southern Cross University Measuring Collec/ve

More information

Project Management Introduc1on

Project Management Introduc1on Project Management Introduc1on Session 1 Part I Introduc1on By Amal Le Collen, PMP Dr. Lauren1u Neamtu, PMP Session outline 1. PART I: Introduc1on 1. The Purpose of the PMBOK Guide 2. What is a project?

More information

XML, Seman9c Web and Content Analy9cs

XML, Seman9c Web and Content Analy9cs XML, Seman9c Web and Content Analy9cs XML Prague Pre- conference 2014 Felix Sasaki DFKI / W3C Fellow 1 What do you need to follow this session? Ideal: a computer with internet access, to be able to provide

More information

OVERVIEW OF DATA EXPLORATION TECHNIQUES. Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri SIGMOD 2015, Melbourne

OVERVIEW OF DATA EXPLORATION TECHNIQUES. Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri SIGMOD 2015, Melbourne OVERVIEW OF DATA EXPLORATION TECHNIQUES Stratos Idreos, Olga Papaemmanouil, Surajit Chaudhuri SIGMOD 2015, Melbourne USER INTERACTION express interests query/results recommendasons annotate collaborate

More information

Tim Blevins Execu;ve Director Labor and Revenue Solu;ons. FTA Technology Conference August 4th, 2015

Tim Blevins Execu;ve Director Labor and Revenue Solu;ons. FTA Technology Conference August 4th, 2015 Tim Blevins Execu;ve Director Labor and Revenue Solu;ons FTA Technology Conference August 4th, 2015 Governance and Organiza;onal Strategy PaIerns of Fraud and Abuse in Government What tools can we use

More information

Expanding Assessment of Analy3cal Skills among Biology Majors: From Introductory labs to Upper Division Elec3ves

Expanding Assessment of Analy3cal Skills among Biology Majors: From Introductory labs to Upper Division Elec3ves Expanding Assessment of Analy3cal Skills among Biology Majors: From Introductory labs to Upper Division Elec3ves Presented by Kathleen McAuley PI: Serena Moseman- Val3erra, Ph.D. Department of Biological

More information

Big Data and Clouds: Challenges and Opportuni5es

Big Data and Clouds: Challenges and Opportuni5es Big Data and Clouds: Challenges and Opportuni5es NIST January 15 2013 Geoffrey Fox gcf@indiana.edu h"p://www.infomall.org h"p://www.futuregrid.org School of Informa;cs and Compu;ng Digital Science Center

More information

MSc Data Science at the University of Sheffield. Started in September 2014

MSc Data Science at the University of Sheffield. Started in September 2014 MSc Data Science at the University of Sheffield Started in September 2014 Gianluca Demar?ni Lecturer in Data Science at the Informa?on School since 2014 Ph.D. in Computer Science at U. Hannover, Germany

More information

DNS Big Data Analy@cs

DNS Big Data Analy@cs Klik om de s+jl te bewerken Klik om de models+jlen te bewerken! Tweede niveau! Derde niveau! Vierde niveau DNS Big Data Analy@cs Vijfde niveau DNS- OARC Fall 2015 Workshop October 4th 2015 Maarten Wullink,

More information

Return on Experience on Cloud Compu2ng Issues a stairway to clouds. Experts Workshop Nov. 21st, 2013

Return on Experience on Cloud Compu2ng Issues a stairway to clouds. Experts Workshop Nov. 21st, 2013 Return on Experience on Cloud Compu2ng Issues a stairway to clouds Experts Workshop Agenda InGeoCloudS SoCware Stack InGeoCloudS Elas2city and Scalability Elas2c File Server Elas2c Database Server Elas2c

More information

Run$me Query Op$miza$on

Run$me Query Op$miza$on Run$me Query Op$miza$on Robust Op$miza$on for Graphs 2006-2014 All Rights Reserved 1 RDF Join Order Op$miza$on Typical approach Assign es$mated cardinality to each triple pabern. Bigdata uses the fast

More information

Extrac'ng People s Hobby and Interest Informa'on from Social Media Content

Extrac'ng People s Hobby and Interest Informa'on from Social Media Content Extrac'ng People s Hobby and Interest Informa'on from Social Media Content Thomas Forss, Shuhua Liu and Kaj- Mikael Björk Dept of Business Administra?on and Analy?cs Arcada University of Applied Sciences

More information

Introduc)on to urika. Mul)threading. SPARQL Database. urika Appliance. XMT- 2 Programming. Use Cases

Introduc)on to urika. Mul)threading. SPARQL Database. urika Appliance. XMT- 2 Programming. Use Cases 1 Introduc)on to urika Mul)threading SPARQL Database urika Appliance XMT- 2 Programming Use Cases 2 MTA- 1 1998 Gallium arsenide: Proof of concept First produc,on implementa,on of latency- tolerant mul,threading

More information

.nl ENTRADA. CENTR-tech 33. November 2015 Marco Davids, SIDN Labs. Klik om de s+jl te bewerken

.nl ENTRADA. CENTR-tech 33. November 2015 Marco Davids, SIDN Labs. Klik om de s+jl te bewerken Klik om de s+jl te bewerken Klik om de models+jlen te bewerken Tweede niveau Derde niveau Vierde niveau.nl ENTRADA Vijfde niveau CENTR-tech 33 November 2015 Marco Davids, SIDN Labs Wie zijn wij? Mijlpalen

More information

Road Public Transport Informa5on Management Program BIG DATA. Market Consulta5on August 7, 2015

Road Public Transport Informa5on Management Program BIG DATA. Market Consulta5on August 7, 2015 Road Public Transport Informa5on Management Program BIG DATA Market Consulta5on August 7, 2015 How does DOTC collect transport data? 1. Data is collected either through household interviews (HIS) or on-

More information

Honeycomb Crea/ve Works is financed by the European Union s European Regional Development Fund through the INTERREG IVA Cross- border Programme

Honeycomb Crea/ve Works is financed by the European Union s European Regional Development Fund through the INTERREG IVA Cross- border Programme Honeycomb Crea/ve Works is financed by the European Union s European Regional Development Fund through the INTERREG IVA Cross- border Programme managed by the Special EU Programmes Body. Web Analy*cs In

More information

The Emerging Discipline of Data Science. Principles and Techniques For Data- Intensive Analysis

The Emerging Discipline of Data Science. Principles and Techniques For Data- Intensive Analysis The Emerging Discipline of Data Science Principles and Techniques For Data- Intensive Analysis What is Big Data Analy9cs? Is this a new paradigm? What is the role of data? What could possibly go wrong?

More information

FUTURE URBAN SYSTEMS: THE CONVERGENCE OF A SMART INTEGRATED INFRASTRUCTURE

FUTURE URBAN SYSTEMS: THE CONVERGENCE OF A SMART INTEGRATED INFRASTRUCTURE FUTURE URBAN SYSTEMS: THE CONVERGENCE OF A SMART INTEGRATED INFRASTRUCTURE RICK AZER DIRECTOR OF DEVELOPMENT SCOTT STALLARD VICE PRESIDENT SMART ANALYTICS SMART INTEGRATED INFRASTRUCTURE INTRODUCTIONS

More information

Big Data. The Big Picture. Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas

Big Data. The Big Picture. Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas Big Data The Big Picture Our flexible and efficient Big Data solu9ons open the door to new opportuni9es and new business areas What is Big Data? Big Data gets its name because that s what it is data that

More information

Welcome! Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research. Andrea Heckert, PhD, MPH Program Officer, Science

Welcome! Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research. Andrea Heckert, PhD, MPH Program Officer, Science Accelera'ng Pa'ent- Centered Outcomes Research and Methodological Research Emily Evans, PhD, MPH Program Officer, Science Andrea Heckert, PhD, MPH Program Officer, Science June 22, 2015 Welcome! Emily

More information

Introduc8on to Apache Spark

Introduc8on to Apache Spark Introduc8on to Apache Spark Jordan Volz, Systems Engineer @ Cloudera 1 Analyzing Data on Large Data Sets Python, R, etc. are popular tools among data scien8sts/analysts, sta8s8cians, etc. Why are these

More information

The Data Reservoir. 10 th September 2014. Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons

The Data Reservoir. 10 th September 2014. Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Informa4on Solu4ons Mandy Chessell FREng CEng FBCS Dis4nguished Engineer, Master Inventor Chief Architect, Solu4ons The Reservoir 10 th September 2014 A growing demand Business Teams want Open access to more informa4on More

More information

TRANSLATING TECHNOLOGY INTO BUSINESS. Let s make money from Big Data!

TRANSLATING TECHNOLOGY INTO BUSINESS. Let s make money from Big Data! TRANSLATING TECHNOLOGY INTO BUSINESS Let s make money from Big Data! JUNE, 2014 About Transla.ng Technology into Business B Spot helps clients transform technology ideas into business concepts. As part

More information

How To Use Splunk For Android (Windows) With A Mobile App On A Microsoft Tablet (Windows 8) For Free (Windows 7) For A Limited Time (Windows 10) For $99.99) For Two Years (Windows 9

How To Use Splunk For Android (Windows) With A Mobile App On A Microsoft Tablet (Windows 8) For Free (Windows 7) For A Limited Time (Windows 10) For $99.99) For Two Years (Windows 9 Copyright 2014 Splunk Inc. Splunk for Mobile Intelligence Bill Emme< Director, Solu?ons Marke?ng Panos Papadopoulos Director, Product Management Disclaimer During the course of this presenta?on, we may

More information

Scalus A)ribute Workshop. Paris, April 14th 15th

Scalus A)ribute Workshop. Paris, April 14th 15th Scalus A)ribute Workshop Paris, April 14th 15th Content Mo=va=on, objec=ves, and constraints Scalus strategy Scenario and architectural views How the architecture works Mo=va=on for this MCITN Storage

More information

How To Improve Transporta0On Resilience

How To Improve Transporta0On Resilience Transportation Resilience, planning land use and mobility management for unpredictable events Wawira Njoka Briefing paper contribu0ng to Urban Resilience : what can Urban Governance contribute? superintended

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:

More information

Mission. To provide higher technological educa5on with quality, preparing. competent professionals, with sound founda5ons in science, technology

Mission. To provide higher technological educa5on with quality, preparing. competent professionals, with sound founda5ons in science, technology Mission To provide higher technological educa5on with quality, preparing competent professionals, with sound founda5ons in science, technology and innova5on, commi

More information

Managed Services. An essen/al set of tools for today's businesses

Managed Services. An essen/al set of tools for today's businesses Managed Services An essen/al set of tools for today's businesses Manage your enterprise better with a holis/c solu/on to all your IT worries only at Infolob What are Managed Services? By far the most cu/ng

More information

Splunk and Big Data for Insider Threats

Splunk and Big Data for Insider Threats Copyright 2014 Splunk Inc. Splunk and Big Data for Insider Threats Mark Seward Sr. Director, Public Sector Company Company (NASDAQ: SPLK)! Founded 2004, first sohware release in 2006! HQ: San Francisco

More information

Big Data + Big Analytics Transforming the way you do business

Big Data + Big Analytics Transforming the way you do business Big Data + Big Analytics Transforming the way you do business Bryan Harris Chief Technology Officer VSTI A SAS Company 1 AGENDA Lets get Real Beyond the Buzzwords Who is SAS? Our PerspecDve of Big Data

More information

Founda'onal IT Governance A Founda'onal Framework for Governing Enterprise IT Adapted from the ISACA COBIT 5 Framework

Founda'onal IT Governance A Founda'onal Framework for Governing Enterprise IT Adapted from the ISACA COBIT 5 Framework Founda'onal IT Governance A Founda'onal Framework for Governing Enterprise IT Adapted from the ISACA COBIT 5 Framework Steven Hunt Enterprise IT Governance Strategist NASA Ames Research Center Michael

More information

Performance Management. Ch. 9 The Performance Measurement. Mechanism. Chiara Demar8ni UNIVERSITY OF PAVIA. mariachiara.demar8ni@unipv.

Performance Management. Ch. 9 The Performance Measurement. Mechanism. Chiara Demar8ni UNIVERSITY OF PAVIA. mariachiara.demar8ni@unipv. UNIVERSITY OF PAVIA Performance Management Ch. 9 The Performance Measurement Mechanism Chiara Demar8ni mariachiara.demar8ni@unipv.it Master in Interna+onal Business and Economics Defini8on Performance

More information

B2B Offerings. Helping businesses op2mize. Infolob s amazing b2b offerings helps your company achieve maximum produc2vity

B2B Offerings. Helping businesses op2mize. Infolob s amazing b2b offerings helps your company achieve maximum produc2vity B2B Offerings Helping businesses op2mize Infolob s amazing b2b offerings helps your company achieve maximum produc2vity What is B2B? B2B is shorthand for the sales prac4ce called business- to- business

More information

Community and Economic Development: Collaborative Leadership To Promote Regional Workforce Development

Community and Economic Development: Collaborative Leadership To Promote Regional Workforce Development Community and Economic Development: Collaborative Leadership To Promote Regional Workforce Development Presented By: Todd Greene Vice President ATLANTA SKYLINE Photo by Chuck Koehler, Creative Commons

More information

Exchange of experience from a SuccessFactors LMS Implementa9on

Exchange of experience from a SuccessFactors LMS Implementa9on Exchange of experience from a SuccessFactors LMS Implementa9on Seen from a user perspective Hanne Vasshus Ask Competency Management Cau9onary Statement The following presenta9on includes forward- looking

More information

Geo- social Network Analysis and Applica5ons

Geo- social Network Analysis and Applica5ons Geo- social Network Analysis and Applica5ons Cecilia Mascolo joint work with C. Brown, A. Noulas and S. Scellato Big Data and Social Media Workshop February 2013, Glasgow, UK. My Interests Geo- social

More information

Phone Systems Buyer s Guide

Phone Systems Buyer s Guide Phone Systems Buyer s Guide Contents How Cri(cal is Communica(on to Your Business? 3 Fundamental Issues 4 Phone Systems Basic Features 6 Features for Users with Advanced Needs 10 Key Ques(ons for All Buyers

More information

Data Stream Algorithms in Storm and R. Radek Maciaszek

Data Stream Algorithms in Storm and R. Radek Maciaszek Data Stream Algorithms in Storm and R Radek Maciaszek Who Am I? l Radek Maciaszek l l l l l l Consul9ng at DataMine Lab (www.dataminelab.com) - Data mining, business intelligence and data warehouse consultancy.

More information

Architec;ng Splunk for High Availability and Disaster Recovery

Architec;ng Splunk for High Availability and Disaster Recovery Copyright 2014 Splunk Inc. Architec;ng Splunk for High Availability and Disaster Recovery Dritan Bi;ncka BD Solu;on Architecture Disclaimer During the course of this presenta;on, we may make forward- looking

More information

The Big Integra-on Simula'on Pla,orms for Low Carbon Decision Making

The Big Integra-on Simula'on Pla,orms for Low Carbon Decision Making The Big Integra-on Simula'on Pla,orms for Low Carbon Decision Making Dr. Ma;hias Berger Role of Informa'on & BigData Interac've Tools for Decision Making Urban Planning @ FCL Beyond Smart Ci'es Background

More information

Nodes, Ties and Influence

Nodes, Ties and Influence Nodes, Ties and Influence Chapter 2 Chapter 2, Community Detec:on and Mining in Social Media. Lei Tang and Huan Liu, Morgan & Claypool, September, 2010. 1 IMPORTANCE OF NODES 2 Importance of Nodes Not

More information

Home Selling Marke/ng Proposal

Home Selling Marke/ng Proposal Home Selling Marke/ng Proposal Presented by: Nate Harimoto & Shane Haas Aviara Real Estate 2555 Townsgate Road, Suite 200 Westlake Village, CA 91361 www.aviararealestate.net Nate Office & Fax: (805) 418-2675

More information

Protec'ng Informa'on Assets - Week 8 - Business Continuity and Disaster Recovery Planning. MIS 5206 Protec/ng Informa/on Assets Greg Senko

Protec'ng Informa'on Assets - Week 8 - Business Continuity and Disaster Recovery Planning. MIS 5206 Protec/ng Informa/on Assets Greg Senko Protec'ng Informa'on Assets - Week 8 - Business Continuity and Disaster Recovery Planning MIS5206 Week 8 In the News Readings In Class Case Study BCP/DRP Test Taking Tip Quiz In the News Discuss items

More information

Program Model: Muskingum University offers a unique graduate program integra6ng BUSINESS and TECHNOLOGY to develop the 21 st century professional.

Program Model: Muskingum University offers a unique graduate program integra6ng BUSINESS and TECHNOLOGY to develop the 21 st century professional. Program Model: Muskingum University offers a unique graduate program integra6ng BUSINESS and TECHNOLOGY to develop the 21 st century professional. 163 Stormont Street New Concord, OH 43762 614-286-7895

More information

San Jacinto College Banner & Enterprise Applica5on Review Task Force Report. November 01, 2011 FINAL

San Jacinto College Banner & Enterprise Applica5on Review Task Force Report. November 01, 2011 FINAL San Jacinto College Banner & Enterprise Applica5on Review Task Force Report November 01, 2011 FINAL 1 Content Review goal and approach 3 Barriers to effec5ve use of Banner: Consultant observa5ons 10 Consultant

More information

UQ pipeline implementa,on and so0ware integra,on

UQ pipeline implementa,on and so0ware integra,on UQ pipeline implementa,on and so0ware integra,on michael aivazis psaap review 28 29 october 2009 Table of contents 1. Introduc,on people, computa,onal resources 2. Overview of the UQ pipeline problem scope

More information

A Tutorial Introduc/on to Big Data. Hands On Data Analy/cs over EMR. Robert Grossman University of Chicago Open Data Group

A Tutorial Introduc/on to Big Data. Hands On Data Analy/cs over EMR. Robert Grossman University of Chicago Open Data Group A Tutorial Introduc/on to Big Data Hands On Data Analy/cs over EMR Robert Grossman University of Chicago Open Data Group Collin BenneE Open Data Group November 12, 2012 1 Amazon AWS Elas/c MapReduce allows

More information

Strategy and Architecture to Establish 'Smart Plants'

Strategy and Architecture to Establish 'Smart Plants' Strategy and Architecture to Establish 'Smart Plants' About Intrigo We are a solu*on provider of Business Applica:ons focused on orchestra*ng Customer Value Networks in the changing SAP Enterprise technology

More information

Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding

Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding Network Maps for End Users: Collect, Analyze, Visualize and Communicate Network Insights with Zero Coding A project from the Social Media Research Founda8on: h:p://www.smrfounda8on.org About Me Introduc8ons

More information

Contrail : Open Compu0ng Infrastructures For Elas0c Services Un approccio federa0vo alla creazione di pia=aforme Cloud affidabili

Contrail : Open Compu0ng Infrastructures For Elas0c Services Un approccio federa0vo alla creazione di pia=aforme Cloud affidabili CONSIGLIO NAZIONALE DELLE RICERCHE Massimo Coppola Contrail : Open Compu0ng Infrastructures For Elas0c Services Un approccio federa0vo alla creazione di pia=aforme Cloud affidabili 26 e 27 Maggio, 2014

More information

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10 1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom

More information

ARTIST Methodology and Tooling. Jesus Gorroñogoitia - Atos SOC Crete, 1 st July 2015

ARTIST Methodology and Tooling. Jesus Gorroñogoitia - Atos SOC Crete, 1 st July 2015 ARTIST Methodology and Tooling Jesus Gorroñogoitia - Atos SOC Crete, 1 st July 2015 Motivation: From SaaP to SaaS So#ware as a Product based Company So#ware as a Service based Company : Cloud Computing

More information

Cloud Data Management System (CDMS)

Cloud Data Management System (CDMS) Cloud Management System (CMS) Wiqar Chaudry Solu9ons Engineer Senior Advisor CMS Overview he OpenStack cloud data management system features a canonical data modeling framework designed to broker context

More information

Discovering Computers Fundamentals, 2010 Edition. Living in a Digital World

Discovering Computers Fundamentals, 2010 Edition. Living in a Digital World Discovering Computers Fundamentals, 2010 Edition Living in a Digital World Objec&ves Overview Discuss the importance of project management, feasibility assessment, documenta8on, and data and informa8on

More information

The Development of a Strategic Planning Framework for VCU s College of Humani?es and Sciences

The Development of a Strategic Planning Framework for VCU s College of Humani?es and Sciences The Development of a Strategic Planning Framework for VCU s College of Humani?es and Sciences Data Analysis and Representa?on Interpreta?on U?liza?on Why are we here? During the fall 0 CHS retreat, Dean

More information

RESTful or RESTless Current State of Today's Top Web APIs

RESTful or RESTless Current State of Today's Top Web APIs RESTful or RESTless Current State of Today's Top Web APIs Frederik Buelthoff, Maria Maleshkova AIFB, Karlsruhe Ins-tute of Technology (KIT), Germany [1] Growing Number of Web APIs Challenges Scalability

More information

SDN- based Mobile Networking for Cellular Operators. Seil Jeon, Carlos Guimaraes, Rui L. Aguiar

SDN- based Mobile Networking for Cellular Operators. Seil Jeon, Carlos Guimaraes, Rui L. Aguiar SDN- based Mobile Networking for Cellular Operators Seil Jeon, Carlos Guimaraes, Rui L. Aguiar Background The data explosion currently we re facing with has a serious impact on current cellular networks

More information

Compu4ng Privacy Requirements

Compu4ng Privacy Requirements Security Requirements Security in Compu4ng, Chapters 1 & 10. 1 Topics What are the key requirements to implement a secure system? Privacy Anonymity Authen4ca4on & Authorisa4on Integrity Audit 2 Privacy

More information

Systema(c Literature Review Challenges and Opportuni(es. Ivica Crnkovic ivica.crnkovic@mdh.se

Systema(c Literature Review Challenges and Opportuni(es. Ivica Crnkovic ivica.crnkovic@mdh.se Systema(c Literature Review Challenges and Opportuni(es Ivica Crnkovic ivica.crnkovic@mdh.se Empirical SE Ques(ons? The ques(ons similar to those an anthropologist might ask during first contact with a

More information

Power to the People: Analy0cs for All

Power to the People: Analy0cs for All Arijit Sengupta CEO, BeyondCore, Inc. Power to the People: Analy0cs for All " Ten patents related to Advanced Analytics, Privacy/Security and BPaaS. " Previously worked at Oracle, Microsoft, Yankee Group

More information

Kaseya Fundamentals Workshop DAY THREE. Developed by Kaseya University. Powered by IT Scholars

Kaseya Fundamentals Workshop DAY THREE. Developed by Kaseya University. Powered by IT Scholars Kaseya Fundamentals Workshop DAY THREE Developed by Kaseya University Powered by IT Scholars Kaseya Version 6.5 Last updated March, 2014 Day Two Overview Day Two Lab Review Patch Management Configura;on

More information

Research at the Department of Computer Science and Software Engineering. Professor Yong Yue BEng, PhD, CEng, FIET, FIMechE 17 October 2014

Research at the Department of Computer Science and Software Engineering. Professor Yong Yue BEng, PhD, CEng, FIET, FIMechE 17 October 2014 Research at the Department of Computer Science and Software Engineering Professor Yong Yue BEng, PhD, CEng, FIET, FIMechE 17 October 2014 Research Areas Ar%ficial intelligence Robo%cs Data mining Image

More information

So#ware quality assurance - introduc4on. Dr Ana Magazinius

So#ware quality assurance - introduc4on. Dr Ana Magazinius So#ware quality assurance - introduc4on Dr Ana Magazinius 1 What is quality? 2 What is a good quality car? 2 and 2 2 minutes 3 characteris4cs 3 What is quality? 4 What is quality? How good or bad something

More information

Project Por)olio Management

Project Por)olio Management Project Por)olio Management Important markers for IT intensive businesses Rest assured with Infolob s project management methodologies What is Project Por)olio Management? Project Por)olio Management (PPM)

More information

The Right BI Tool for the Job in a non- SAP Applica9on Environment

The Right BI Tool for the Job in a non- SAP Applica9on Environment September 9 11, 2013 Anaheim, California The Right BI Tool for the Job in a non- SAP Applica9on Environment Speaker Name(s): Ty Miller Full Spectrum Business Intelligence Self Service Dashboards and Apps

More information