BBM467 Data Intensive ApplicaAons
|
|
|
- Martina Pearson
- 10 years ago
- Views:
Transcription
1 Hace7epe Üniversitesi Bilgisayar Mühendisliği Bölümü BBM467 Data Intensive ApplicaAons Dr. Fuat Akal
2 Why Graphs? Why now? Big Data is the trend! NOSQL is the answer.
3 Everyone is Talking About Graphs Facebook Open Graph Google Knowledge Graph Twi7er Interest Graph How things are connected!
4 What is a Graph? Formally, a graph is just a collecaon of ver$ces and edges. Or, in less inamidaang language, a set of nodes and the rela$onships that connect them. Graphs represent enaaes as nodes and the ways in which those enaaes relate to the world as relaaonships.
5 Kinds of Graphs
6 Kinds of Graphs
7 Graphs are Everywhere Graphs are extremely useful in understanding a wide diversity of datasets in fields such as science, government, and business. For example, Twi7er s data is easily represented as a graph.
8 RelaAonal Databases Lack RelaAonships
9 RelaAonal Databases Lack RelaAonships Foreign key constraints add addiaonal development and maintenance overhead just to make the database work. Several expensive joins are needed just to discover what a customer bought. Reciprocal queries are even more costly. What products did a customer buy? is relaavely cheap compared to which customers bought this product?, which is the basis of recommendaaon systems.
10 Recording RelaAonships in RDBMs Bob s friends SELECT p1.person FROM Person p1 JOIN PersonFriend ON PersonFriend.FriendID = p1.id JOIN Person p2 ON PersonFriend.PersonID = p2.id WHERE p2.person = 'Bob' Who is friends with Bob? SELECT p1.person FROM Person p1 JOIN PersonFriend ON PersonFriend.PersonID = p1.id JOIN Person p2 ON PersonFriend.FriendID = p2.id WHERE p2.person = 'Bob'
11 Recording RelaAonships in RDBMs Things might get complicated! Alice s friends-of-friends SELECT p1.person AS PERSON, p2.person AS FRIEND_OF_FRIEND FROM PersonFriend pf1 JOIN Person p1 ON pf1.personid = p1.id JOIN PersonFriend pf2 ON pf2.personid = pf1.friendid JOIN Person p2 ON pf2.friendid = p2.id WHERE p1.person = 'Alice' AND pf2.friendid <> p1.id
12 Graphs Embrace RelaAonships SQL/NOSQL examples have dealt with implicitly connected data. Users infer semanac dependencies between enaaes, but the data models and the databases themselves are blind to these connecaons. We want a cohesive picture of the whole, including the connecaons between elements. In contrast to the SQL/NOSQL data stores we looked at before, in the graph world, connected data is stored as connected data.
13 What is a Graph Database? A graph database is NOT for charts & diagrams, or vector artwork J It is for storing data that is structured as a graph. A rela$onal database may tell you the average age of everyone in this place, but a graph database will tell you who is most likely to buy you a beer.
14 Neo4J Neo4J is a NOSQL Graph Database Reliable with ACID transacaons Scalable: 32 Billion nodes h7p://neo4j.org
15 A Graph Contains Nodes and RelaAonships A Graph records data in Nodes which have ProperAes
16 A Graph Contains Nodes and RelaAonships Nodes are organized by RelaAonships which also have ProperAes RelaAonships organize Nodes into arbitrary structures, allowing a Graph to resemble a List, a Tree, a Map, or a compound EnAty. Any of which can be combined into yet more complex, richly inter- connected structures.
17 A Graph Contains Nodes and RelaAonships Nodes are grouped by Labels into Sets Labels are a means of grouping the nodes in the graph. They can be used to restrict queries to subsets of the graph, as well as enabling opaonal model constraints and indexing rules.
18 Query a Graph with Traversal A Traversal navigates a Graph; it idenafies Paths which order Nodes A Traversal is how you query a Graph, navigaang from starang Nodes to related Nodes according to an algorithm. Finding answers to quesaons like what music do my friends like that I don t yet own, or if this power supply goes down, what web services are affected?
19 Indexes Look- up Nodes or RelaAonships An Index maps from ProperAes to either Nodes or RelaAonships Ooen, you want to find a specific Node or RelaAonship according to a Property it has. Rather than traversing the enare graph, use an Index to perform a look- up, for quesaons like find the Account for username master- of- graphs.
20 A Graph Database Stores Data in a Graph A Graph Database manages a Graph and also manages related Indexes OpAmized for graph structures instead of tables. Your applicaaon gets all the expressiveness of a graph, with all the dependability you expect out of a database.
21 Graph Databases Embrace RelaAonships
22 Graph Databases Embrace RelaAonships In this social network, the connecaons between enaaes don t exhibit uniformity across the domain. A social network is a popular example of a densely connected, semi- structured network. It resists being captured by a one- size- fits- all schema. The flexibility of the graph model has allowed us to add new nodes and new rela$onships without compromising the exisang network or migraang data
23 Graph Databases Embrace RelaAonships The graph offers a much richer picture of the network. We can see who LOVES whom (and whether that love is requited). We can see who is a COLLEAGUE_OF of whom. Who is BOSS_OF them all. We can see who s off the market, because they re MARRIED_TO someone else.
24 Graph vs. RelaAonal An RDBMS is opamized for aggregated data. Graph database is opamized for highly connected data.
25 Graph vs. Key- Value Store A Key- Value model is great for lookups of simple values or lists. When the values are themselves interconnected, you ve got a graph. K* represents a key, V* a value. Note that some keys point to other keys as well as plain values.
26 Graph vs. Document Store The container hierarchy of a document database accommodates nice, schema- free data that can easily be represented as a tree. Which is of course a graph. Refer to other documents (or document elements) within that tree and you have a more expressive representaaon of the same data. When in a graph database, those relaaonships are easily navigable. D=Document, S=Subdocument, V=Value, D2/S2 = reference to subdocument in (other) document.
27 How to Query the Graph Database? Cypher: A graph query language Pa7ern- matching query language DeclaraAve grammar with clauses (like SQL) AggregaAon, ordering, limits Create, read, update, delete
28 Example Query Who are John s friends? MATCH (john {name: 'John'})-[:friend]->()-[:friend]->(fof) RETURN john, fof Query Results :
29 EnAty- RelaAonship Model for TV Show Database
30 Create the Graph CREATE (himym:tvshow { name: "How I Met Your Mother" }) CREATE (himym_s1:season { name: "HIMYM Season 1" }) CREATE (himym_s1_e1:episode { name: "Pilot" }) CREATE (ted:character { name: "Ted Mosby" }) CREATE (joshradnor:actor { name: "Josh Radnor" }) CREATE UNIQUE (joshradnor)-[:played_character]->(ted) CREATE UNIQUE (himym)-[:has_season]->(himym_s1) CREATE UNIQUE (himym_s1)-[:has_episode]->(himym_s1_e1) CREATE UNIQUE (himym_s1_e1)-[:featured_character]->(ted) CREATE (himym_s1_e1_review1 { title: "Meet Me At The Bar In 15 Minutes & Suit Up, content: "It was awesome" }) CREATE (wakenpayne:user { name: "WakenPayne" }) CREATE (wakenpayne)-[:wrote_review]->(himym_s1_e1_review1)<- [:HAS_REVIEW]-(himym_s1_e1)
31 Inside the Graph Database
32 Performance of Dealing Connected Data RelaAonships in a graph naturally form paths. Querying or traversing the graph involves following paths. Because of the fundamentally path- oriented nature of the data model, the majority of path- based graph database operaaons are highly aligned with the way in which the data is laid out, making them extremely efficient.
33 Performance of Dealing with Connected Data Partner and VukoAc s experiment (Graph Databases from O Reilly) seeks to find friends- of- friends in a social network, to a maximum depth of five. Given any two persons chosen at random, is there a path that connects them that is at most five relaaonships long? For a social network containing 1,000,000 people, each with approximately 50 friends.
34 When to Use Graph Databases? There are two properaes of graph databases you should consider when invesagaang graph database technologies: The underlying storage: Some graph databases use na$ve graph storage that is opamized and designed for storing and managing graphs. Some serialize the graph data into a relaaonal database, an object- oriented database, or some other general- purpose data store. The processing engine: Some definiaons require that a graph database use index- free adjacency, meaning that connected nodes physically point to each other in the database.
35 Acknowledgement - 1 The course material used for this class is mostly taken and/or adopted* from the course materials of the Big Data class given by Nesime Tatbul and Donald Kossmann at ETH Zurich (h7p:// (*) Original course material is reduced somehow to fit the needs of BBM467. Therefore, original slides were not used as they are.
36 Acknowledgement - 2 Some material used for this lecture is taken and/or adopted from h7p://docs.neo4j.org/chunked/milestone/index.html Max De Marzi. Michael Hunger from Neo Technology.
BBM467 Data Intensive ApplicaAons
Hace7epe Üniversitesi Bilgisayar Mühendisliği Bölümü BBM467 Data Intensive ApplicaAons Dr. Fuat Akal [email protected] Problem How do you scale up applicaaons? Run jobs processing 100 s of terabytes
BBM467 Data Intensive ApplicaAons
Hace7epe Üniversitesi Bilgisayar Mühendisliği Bölümü BBM467 Data Intensive ApplicaAons Dr. Fuat Akal [email protected] Overview What is Cloud CompuAng? VirtualizaAon Service Oriented CompuAng What is
Graph Databases Mean Business
Graph Databases Mean Business Andreas Kollegger & Rik Van Bruggen September 2012 2012 Neo Technology http://neotechnology.com Table of Contents Graph Databases Mean Business! 2 The Big Data Business! 2
A Little Graph Theory for the Busy Developer. Dr. Jim Webber Chief Scientist, Neo Technology @jimwebber
A Little Graph Theory for the Busy Developer Dr. Jim Webber Chief Scientist, Neo Technology @jimwebber Roadmap Imprisoned data models Why most NoSQL stores and RDBMS are clumsy for connected data Labeled
Why NoSQL? Your database options in the new non- relational world. 2015 IBM Cloudant 1
Why NoSQL? Your database options in the new non- relational world 2015 IBM Cloudant 1 Table of Contents New types of apps are generating new types of data... 3 A brief history on NoSQL... 3 NoSQL s roots
BBM467 Data Intensive ApplicaAons
Hace7epe Üniversitesi Bilgisayar Mühendisliği Bölümü BBM467 Data Intensive ApplicaAons Dr. Fuat Akal [email protected] FoundaAons of Data[base] Clusters Database Clusters Hardware Architectures Data
Powering Recommendations with a Graph Database
Powering Recommendations with a Graph Database 2014 Neo Technology neo4j.com "YOU MAY ALSO LIKE " Product recommendations help businesses maximize their online revenue. It requires advanced technology,
Database Design Patterns. Winter 2006-2007 Lecture 24
Database Design Patterns Winter 2006-2007 Lecture 24 Trees and Hierarchies Many schemas need to represent trees or hierarchies of some sort Common way of representing trees: An adjacency list model Each
Visualizing a Neo4j Graph Database with KeyLines
Visualizing a Neo4j Graph Database with KeyLines Introduction 2! What is a graph database? 2! What is Neo4j? 2! Why visualize Neo4j? 3! Visualization Architecture 4! Benefits of the KeyLines/Neo4j architecture
Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world
Analytics March 2015 White paper Why NoSQL? Your database options in the new non-relational world 2 Why NoSQL? Contents 2 New types of apps are generating new types of data 2 A brief history of NoSQL 3
NoSQL systems: introduction and data models. Riccardo Torlone Università Roma Tre
NoSQL systems: introduction and data models Riccardo Torlone Università Roma Tre Why NoSQL? In the last thirty years relational databases have been the default choice for serious data storage. An architect
NoSQL and Graph Database
NoSQL and Graph Database Biswanath Dutta DRTC, Indian Statistical Institute 8th Mile Mysore Road R. V. College Post Bangalore 560059 International Conference on Big Data, Bangalore, 9-20 March 2015 Outlines
GRAPH DATABASE SYSTEMS. h_da Prof. Dr. Uta Störl Big Data Technologies: Graph Database Systems - SoSe 2016 1
GRAPH DATABASE SYSTEMS h_da Prof. Dr. Uta Störl Big Data Technologies: Graph Database Systems - SoSe 2016 1 Use Case: Route Finding Source: Neo Technology, Inc. h_da Prof. Dr. Uta Störl Big Data Technologies:
Graph Database Proof of Concept Report
Objectivity, Inc. Graph Database Proof of Concept Report Managing The Internet of Things Table of Contents Executive Summary 3 Background 3 Proof of Concept 4 Dataset 4 Process 4 Query Catalog 4 Environment
So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)
Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we
How graph databases started the multi-model revolution
How graph databases started the multi-model revolution Luca Garulli Author and CEO @OrientDB QCon Sao Paulo - March 26, 2015 Welcome to Big Data 90% of the data in the world today has been created in the
Lecture Data Warehouse Systems
Lecture Data Warehouse Systems Eva Zangerle SS 2013 PART C: Novel Approaches in DW NoSQL and MapReduce Stonebraker on Data Warehouses Star and snowflake schemas are a good idea in the DW world C-Stores
Big Data Management. Big Data Management. (BDM) Autumn 2013. Povl Koch September 2, 2013 01-09-2013 1
Big Data Management Big Data Management (BDM) Autumn 2013 Povl Koch September 2, 2013 01-09-2013 1 Overview Today s program 1. Little more practical details about this course 2. Chapter 2 & 3 in NoSQL
MEAP Edition Manning Early Access Program Neo4j in Action MEAP version 3
MEAP Edition Manning Early Access Program Neo4j in Action MEAP version 3 Copyright 2012 Manning Publications For more information on this and other Manning titles go to www.manning.com brief contents PART
Graph Databases: Neo4j
Course NDBI040: Big Data Management and NoSQL Databases Practice 05: Graph Databases: Neo4j Martin Svoboda 5. 1. 2016 Faculty of Mathematics and Physics, Charles University in Prague Outline Graph databases
Graph Databases. Ian Robinson, Jim Webber, and Emil Eifrem
Graph Databases Ian Robinson, Jim Webber, and Emil Eifrem Graph Databases by Ian Robinson, Jim Webber, and Emil Eifrem Copyright 2013 Neo Technology, Inc.. All rights reserved. Printed in the United States
CS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #5: En-ty/Rela-onal Models- - - Part 1
CS 4604: Introduc0on to Database Management Systems B. Aditya Prakash Lecture #5: En-ty/Rela-onal Models- - - Part 1 Announcements- - - Project Goal: design a database system applica-on with a web front-
A1 and FARM scalable graph database on top of a transactional memory layer
A1 and FARM scalable graph database on top of a transactional memory layer Miguel Castro, Aleksandar Dragojević, Dushyanth Narayanan, Ed Nightingale, Alex Shamis Richie Khanna, Matt Renzelmann Chiranjeeb
Lecture 10: HBase! Claudia Hauff (Web Information Systems)! [email protected]
Big Data Processing, 2014/15 Lecture 10: HBase!! Claudia Hauff (Web Information Systems)! [email protected] 1 Course content Introduction Data streams 1 & 2 The MapReduce paradigm Looking behind the
Modeling and mining large scale biological seman0c networks using NEO4J
Modeling and mining large scale biological seman0c networks using NEO4J Junaid Gamieldien Principal Inves.gator Clinical Sequencing and Biomarker Discovery Neo4J Graph database Graph is composed of two
How To Improve Performance In A Database
Some issues on Conceptual Modeling and NoSQL/Big Data Tok Wang Ling National University of Singapore 1 Database Models File system - field, record, fixed length record Hierarchical Model (IMS) - fixed
1-Oct 2015, Bilbao, Spain. Towards Semantic Network Models via Graph Databases for SDN Applications
1-Oct 2015, Bilbao, Spain Towards Semantic Network Models via Graph Databases for SDN Applications Agenda Introduction Goals Related Work Proposal Experimental Evaluation and Results Conclusions and Future
Big Data Analytics. Rasoul Karimi
Big Data Analytics Rasoul Karimi Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany Big Data Analytics Big Data Analytics 1 / 1 Introduction
Copyright 2011 Pearson Education, Inc. Publishing as Prentice Hall. Objectives
Exploring Microsoft Access 2010 by Robert Grauer, Keith Mast, Mary Anne Poatsy Chapter 1 Introduction to Access Copyright 2011 Pearson Education, Inc. Publishing as Prentice Hall. 1 Objectives Navigate
Your Master Data Is a Graph: Are You Ready?
Karen Lopez, InfoAdvisors March 2015 www.datamodel.com [email protected] @datachick This white paper is sponsored by Neo Technology Inc. Neo4j is a registered trademark of Neo Technology Inc
INTRODUCTION TO CASSANDRA
INTRODUCTION TO CASSANDRA This ebook provides a high level overview of Cassandra and describes some of its key strengths and applications. WHAT IS CASSANDRA? Apache Cassandra is a high performance, open
Big Systems, Big Data
Big Systems, Big Data When considering Big Distributed Systems, it can be noted that a major concern is dealing with data, and in particular, Big Data Have general data issues (such as latency, availability,
these three NoSQL databases because I wanted to see a the two different sides of the CAP
Michael Sharp Big Data CS401r Lab 3 For this paper I decided to do research on MongoDB, Cassandra, and Dynamo. I chose these three NoSQL databases because I wanted to see a the two different sides of the
Graph Databases. Prad Nelluru, Bharat Naik, Evan Liu, Bon Koo
Graph Databases Prad Nelluru, Bharat Naik, Evan Liu, Bon Koo 1 Why are graphs important? Modeling chemical and biological data Social networks The web Hierarchical data 2 What is a graph database? A database
REAL-TIME BIG DATA ANALYTICS
www.leanxcale.com [email protected] REAL-TIME BIG DATA ANALYTICS Blending Transactional and Analytical Processing Delivers Real-Time Big Data Analytics 2 ULTRA-SCALABLE FULL ACID FULL SQL DATABASE LeanXcale
NoSQL. Thomas Neumann 1 / 22
NoSQL Thomas Neumann 1 / 22 What are NoSQL databases? hard to say more a theme than a well defined thing Usually some or all of the following: no SQL interface no relational model / no schema no joins,
Domain driven design, NoSQL and multi-model databases
Domain driven design, NoSQL and multi-model databases Java Meetup New York, 10 November 2014 Max Neunhöffer www.arangodb.com Max Neunhöffer I am a mathematician Earlier life : Research in Computer Algebra
Chapter 2. Data Model. Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel
Chapter 2 Data Model Database Systems: Design, Implementation, and Management, Sixth Edition, Rob and Coronel 1 In this chapter, you will learn: Why data models are important About the basic data-modeling
Cloud Computing and Advanced Relationship Analytics
Cloud Computing and Advanced Relationship Analytics Using Objectivity/DB to Discover the Relationships in your Data By Brian Clark Vice President, Product Management Objectivity, Inc. 408 992 7136 [email protected]
CSC/ECE 574 Computer and Network Security. What Is PKI. Certification Authorities (CA)
Computer Science CSC/ECE 574 Computer and Network Security Topic 7.2 Public Key Infrastructure (PKI) CSC/ECE 574 Dr. Peng Ning 1 What Is PKI Informally, the infrastructure supporting the use of public
10/6/2015 PKI. What Is PKI. Certificates. Certification Authorities (CA) PKI Models. Certificates
PKI IT Network Security Administration Instructor: Bo Sheng What Is PKI Informally, the infrastructure supporting the use of public key cryptography. A PKI consists of Certificate Authority () Certificates
NoSQL Databases. Polyglot Persistence
The future is: NoSQL Databases Polyglot Persistence a note on the future of data storage in the enterprise, written primarily for those involved in the management of application development. Martin Fowler
Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing. October 29th, 2015
E6893 Big Data Analytics Lecture 8: Spark Streams and Graph Computing (I) Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science IBM Chief Scientist, Graph Computing
Big Data Management and Analytics
Big Data Management and Analytics Lecture Notes Winter semester 2015 / 2016 Ludwig-Maximilians-University Munich Prof. Dr. Matthias Renz 2015 Based on lectures by Donald Kossmann (ETH Zürich), as well
! E6893 Big Data Analytics Lecture 9:! Linked Big Data Graph Computing (I)
! E6893 Big Data Analytics Lecture 9:! Linked Big Data Graph Computing (I) Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science Mgr., Dept. of Network Science and
SYSM 6304: Risk and Decision Analysis Lecture 5: Methods of Risk Analysis
SYSM 6304: Risk and Decision Analysis Lecture 5: Methods of Risk Analysis M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: [email protected] October 17, 2015 Outline
Database Security. The Need for Database Security
Database Security Public domain NASA image L-1957-00989 of people working with an IBM type 704 electronic data processing machine. 1 The Need for Database Security Because databases play such an important
Big Data Database Revenue and Market Forecast, 2012-2017
Wikibon.com - http://wikibon.com Big Data Database Revenue and Market Forecast, 2012-2017 by David Floyer - 13 February 2013 http://wikibon.com/big-data-database-revenue-and-market-forecast-2012-2017/
Overview on Graph Datastores and Graph Computing Systems. -- Litao Deng (Cloud Computing Group) 06-08-2012
Overview on Graph Datastores and Graph Computing Systems -- Litao Deng (Cloud Computing Group) 06-08-2012 Graph - Everywhere 1: Friendship Graph 2: Food Graph 3: Internet Graph Most of the relationships
SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford
SQL VS. NO-SQL Adapted Slides from Dr. Jennifer Widom from Stanford 55 Traditional Databases SQL = Traditional relational DBMS Hugely popular among data analysts Widely adopted for transaction systems
Big Data. Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich
Big Data Donald Kossmann & Nesime Tatbul Systems Group ETH Zurich Goal of Today What is Big Data? introduce all major buzz words What is not Big Data? get a feeling for opportunities & limitations Answering
White Paper: Big Data and the hype around IoT
1 White Paper: Big Data and the hype around IoT Author: Alton Harewood 21 Aug 2014 (first published on LinkedIn) If I knew today what I will know tomorrow, how would my life change? For some time the idea
Conceptual Design Using the Entity-Relationship (ER) Model
Conceptual Design Using the Entity-Relationship (ER) Model Module 5, Lectures 1 and 2 Database Management Systems, R. Ramakrishnan 1 Overview of Database Design Conceptual design: (ER Model is used at
Modern Databases. Database Systems Lecture 18 Natasha Alechina
Modern Databases Database Systems Lecture 18 Natasha Alechina In This Lecture Distributed DBs Web-based DBs Object Oriented DBs Semistructured Data and XML Multimedia DBs For more information Connolly
Extending Data Processing Capabilities of Relational Database Management Systems.
Extending Data Processing Capabilities of Relational Database Management Systems. Igor Wojnicki University of Missouri St. Louis Department of Mathematics and Computer Science 8001 Natural Bridge Road
The Sierra Clustered Database Engine, the technology at the heart of
A New Approach: Clustrix Sierra Database Engine The Sierra Clustered Database Engine, the technology at the heart of the Clustrix solution, is a shared-nothing environment that includes the Sierra Parallel
IC05 Introduction on Networks &Visualization Nov. 2009. <[email protected]>
IC05 Introduction on Networks &Visualization Nov. 2009 Overview 1. Networks Introduction Networks across disciplines Properties Models 2. Visualization InfoVis Data exploration
NoSQL Databases. Nikos Parlavantzas
!!!! NoSQL Databases Nikos Parlavantzas Lecture overview 2 Objective! Present the main concepts necessary for understanding NoSQL databases! Provide an overview of current NoSQL technologies Outline 3!
MongoDB in the NoSQL and SQL world. Horst Rechner [email protected] Berlin, 2012-05-15
MongoDB in the NoSQL and SQL world. Horst Rechner [email protected] Berlin, 2012-05-15 1 MongoDB in the NoSQL and SQL world. NoSQL What? Why? - How? Say goodbye to ACID, hello BASE You
An Introduction to KeyLines and Network Visualization
An Introduction to KeyLines and Network Visualization 1. What is KeyLines?... 2 2. Benefits of network visualization... 2 3. Benefits of KeyLines... 3 4. KeyLines architecture... 3 5. Uses of network visualization...
Introduction to Inbound Marketing
Introduction to Inbound Marketing by Kevin Carney of Inbound Marketing University Page 1 of 20 InboundMarketingUniversity.biz InboundMarketingUniversity Published by Inbound Marketing University No part
www.objectivity.com Ibrahim Sallam Director of Development
www.objectivity.com Ibrahim Sallam Director of Development Graphs, what are they and why? Graph Data Management. Why do we need it? Problems in Distributed Graph How we solved the problems Simple Graph
Exploring Big Data in Social Networks
Exploring Big Data in Social Networks [email protected] ([email protected]) INWEB National Science and Technology Institute for Web Federal University of Minas Gerais - UFMG May 2013 Some thoughts about
Scaling Up HBase, Hive, Pegasus
CSE 6242 A / CS 4803 DVA Mar 7, 2013 Scaling Up HBase, Hive, Pegasus Duen Horng (Polo) Chau Georgia Tech Some lectures are partly based on materials by Professors Guy Lebanon, Jeffrey Heer, John Stasko,
Comparison of the Frontier Distributed Database Caching System with NoSQL Databases
Comparison of the Frontier Distributed Database Caching System with NoSQL Databases Dave Dykstra [email protected] Fermilab is operated by the Fermi Research Alliance, LLC under contract No. DE-AC02-07CH11359
Understanding Neo4j Scalability
Understanding Neo4j Scalability David Montag January 2013 Understanding Neo4j Scalability Scalability means different things to different people. Common traits associated include: 1. Redundancy in the
IT and CRM A basic CRM model Data source & gathering system Database system Data warehouse Information delivery system Information users
1 IT and CRM A basic CRM model Data source & gathering Database Data warehouse Information delivery Information users 2 IT and CRM Markets have always recognized the importance of gathering detailed data
How To Make A Route Map On Bpg More Efficient
NIRA: A New Inter-Domain Routing Architecture Xiaowei Yang, David Clark, Arthur W. Berger Rachit Agarwal (Results are by others, any errors are by me) ( Animated slides shamelessly stolen from Prasad s
2.1.5 Storing your application s structured data in a cloud database
30 CHAPTER 2 Understanding cloud computing classifications Table 2.3 Basic terms and operations of Amazon S3 Terms Description Object Fundamental entity stored in S3. Each object can range in size from
Business Case Development for Credit and Debit Card Fraud Re- Scoring Models
Business Case Development for Credit and Debit Card Fraud Re- Scoring Models Kurt Gutzmann Managing Director & Chief ScienAst GCX Advanced Analy.cs LLC www.gcxanalyacs.com October 20, 2011 www.gcxanalyacs.com
Quiz! Database Indexes. Index. Quiz! Disc and main memory. Quiz! How costly is this operation (naive solution)?
Database Indexes How costly is this operation (naive solution)? course per weekday hour room TDA356 2 VR Monday 13:15 TDA356 2 VR Thursday 08:00 TDA356 4 HB1 Tuesday 08:00 TDA356 4 HB1 Friday 13:15 TIN090
How To Make A Trustless Certificate Authority Secure
Network Security: Public Key Infrastructure Guevara Noubir Northeastern University [email protected] Network Security Slides adapted from Radia Perlman s slides Key Distribution - Secret Keys What if
Performance Management of SQL Server
Performance Management of SQL Server Padma Krishnan Senior Manager When we design applications, we give equal importance to the backend database as we do to the architecture and design of the application
Clustering through Decision Tree Construction in Geology
Nonlinear Analysis: Modelling and Control, 2001, v. 6, No. 2, 29-41 Clustering through Decision Tree Construction in Geology Received: 22.10.2001 Accepted: 31.10.2001 A. Juozapavičius, V. Rapševičius Faculty
PostgreSQL Concurrency Issues
PostgreSQL Concurrency Issues 1 PostgreSQL Concurrency Issues Tom Lane Red Hat Database Group Red Hat, Inc. PostgreSQL Concurrency Issues 2 Introduction What I want to tell you about today: How PostgreSQL
Simulating a File-Sharing P2P Network
Simulating a File-Sharing P2P Network Mario T. Schlosser, Tyson E. Condie, and Sepandar D. Kamvar Department of Computer Science Stanford University, Stanford, CA 94305, USA Abstract. Assessing the performance
Cassandra A Decentralized Structured Storage System
Cassandra A Decentralized Structured Storage System Avinash Lakshman, Prashant Malik LADIS 2009 Anand Iyer CS 294-110, Fall 2015 Historic Context Early & mid 2000: Web applicaoons grow at tremendous rates
Big Graph Processing: Some Background
Big Graph Processing: Some Background Bo Wu Colorado School of Mines Part of slides from: Paul Burkhardt (National Security Agency) and Carlos Guestrin (Washington University) Mines CSCI-580, Bo Wu Graphs
Data Modeling for Big Data
Data Modeling for Big Data by Jinbao Zhu, Principal Software Engineer, and Allen Wang, Manager, Software Engineering, CA Technologies In the Internet era, the volume of data we deal with has grown to terabytes
Infrastructures for big data
Infrastructures for big data Rasmus Pagh 1 Today s lecture Three technologies for handling big data: MapReduce (Hadoop) BigTable (and descendants) Data stream algorithms Alternatives to (some uses of)
Hadoop Ecosystem B Y R A H I M A.
Hadoop Ecosystem B Y R A H I M A. History of Hadoop Hadoop was created by Doug Cutting, the creator of Apache Lucene, the widely used text search library. Hadoop has its origins in Apache Nutch, an open
A COMPARATIVE STUDY OF NOSQL DATA STORAGE MODELS FOR BIG DATA
A COMPARATIVE STUDY OF NOSQL DATA STORAGE MODELS FOR BIG DATA Ompal Singh Assistant Professor, Computer Science & Engineering, Sharda University, (India) ABSTRACT In the new era of distributed system where
Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
Project and Production Management Prof. Arun Kanda Department of Mechanical Engineering Indian Institute of Technology, Delhi
Project and Production Management Prof. Arun Kanda Department of Mechanical Engineering Indian Institute of Technology, Delhi Lecture - 9 Basic Scheduling with A-O-A Networks Today we are going to be talking
Outline. What is Big data and where they come from? How we deal with Big data?
What is Big Data Outline What is Big data and where they come from? How we deal with Big data? Big Data Everywhere! As a human, we generate a lot of data during our everyday activity. When you buy something,
Hierarchical Data Visualization
Hierarchical Data Visualization 1 Hierarchical Data Hierarchical data emphasize the subordinate or membership relations between data items. Organizational Chart Classifications / Taxonomies (Species and
