db4o Teching Software-Engineering Great stuff! 20 years with Relational Databases PostgreSQL, SQLite, Oracle...



Similar documents
Lecture Data Warehouse Systems

SQL VS. NO-SQL. Adapted Slides from Dr. Jennifer Widom from Stanford

The NoSQL Ecosystem, Relaxed Consistency, and Snoop Dogg. Adam Marcus MIT CSAIL

MongoDB in the NoSQL and SQL world. Horst Rechner Berlin,

Preparing Your Data For Cloud

Cloud Computing at Google. Architecture

Where We Are. References. Cloud Computing. Levels of Service. Cloud Computing History. Introduction to Data Management CSE 344

Introduction to NOSQL

Overview of Databases On MacOS. Karl Kuehn Automation Engineer RethinkDB

Advanced Data Management Technologies

Using Object Database db4o as Storage Provider in Voldemort

Challenges for Data Driven Systems

NoSQL Data Base Basics

Can the Elephants Handle the NoSQL Onslaught?

Integrating Big Data into the Computing Curricula

Big Data Technologies. Prof. Dr. Uta Störl Hochschule Darmstadt Fachbereich Informatik Sommersemester 2015

NoSQL. Thomas Neumann 1 / 22

NoSQL in der Cloud Why? Andreas Hartmann

In Memory Accelerator for MongoDB

Cloud & Big Data a perfect marriage? Patrick Valduriez

Not Relational Models For The Management of Large Amount of Astronomical Data. Bruno Martino (IASI/CNR), Memmo Federici (IAPS/INAF)

<Insert Picture Here> Oracle NoSQL Database A Distributed Key-Value Store

GRAPH DATABASE SYSTEMS. h_da Prof. Dr. Uta Störl Big Data Technologies: Graph Database Systems - SoSe

Cloud Scale Distributed Data Storage. Jürmo Mehine

Structured Data Storage

X4-2 Exadata announced (well actually around Jan 1) OEM/Grid control 12c R4 just released

High Throughput Computing on P2P Networks. Carlos Pérez Miguel

On- Prem MongoDB- as- a- Service Powered by the CumuLogic DBaaS Platform

CS 4604: Introduc0on to Database Management Systems. B. Aditya Prakash Lecture #13: NoSQL and MapReduce

An Approach to Implement Map Reduce with NoSQL Databases

The evolution of database technology (II) Huibert Aalbers Senior Certified Executive IT Architect

CISC 432/CMPE 432/CISC 832 Advanced Database Systems

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1


Slave. Master. Research Scholar, Bharathiar University

Composite Data Virtualization Composite Data Virtualization And NOSQL Data Stores

extensible record stores document stores key-value stores Rick Cattel s clustering from Scalable SQL and NoSQL Data Stores SIGMOD Record, 2010

Logistics. Database Management Systems. Chapter 1. Project. Goals for This Course. Any Questions So Far? What This Course Cannot Do.

Introduction to Big Data Training

Вовченко Алексей, к.т.н., с.н.с. ВМК МГУ ИПИ РАН

NoSQL replacement for SQLite (for Beatstream) Antti-Jussi Kovalainen Seminar OHJ-1860: NoSQL databases

A COMPARATIVE STUDY OF NOSQL DATA STORAGE MODELS FOR BIG DATA

Report Data Management in the Cloud: Limitations and Opportunities

Analytics March 2015 White paper. Why NoSQL? Your database options in the new non-relational world

Oracle s Big Data solutions. Roger Wullschleger. <Insert Picture Here>

A Survey of Distributed Database Management Systems

NewSQL: Towards Next-Generation Scalable RDBMS for Online Transaction Processing (OLTP) for Big Data Management

A Comparison of Current Graph Database Models

Data Management in the Cloud

Evaluating NoSQL for Enterprise Applications. Dirk Bartels VP Strategy & Marketing

A survey of big data architectures for handling massive data

How graph databases started the multi-model revolution

Cloud DBMS: An Overview. Shan-Hung Wu, NetDB CS, NTHU Spring, 2015

GridGain In- Memory Data Fabric: UlCmate Speed and Scale for TransacCons and AnalyCcs

bigdata Managing Scale in Ontological Systems

MongoDB Developer and Administrator Certification Course Agenda

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

CPS 516: Data-intensive Computing Systems. Instructor: Shivnath Babu TA: Zilong (Eric) Tan

LARGE-SCALE DATA STORAGE APPLICATIONS

Crazy NoSQL Data Integration with Pentaho

Practical Cassandra. Vitalii

The Synergy Between the Object Database, Graph Database, Cloud Computing and NoSQL Paradigms

Big Data Primer. 1 Why Big Data? Alex Sverdlov alex@theparticle.com

An Open Source NoSQL solution for Internet Access Logs Analysis

Benchmarking and Analysis of NoSQL Technologies

MySQL Cluster: HA als StorageEngine

Hypertable Architecture Overview

Introduction to NoSQL Databases. Tore Risch Information Technology Uppsala University

Cassandra A Decentralized Structured Storage System

Facebook: Cassandra. Smruti R. Sarangi. Department of Computer Science Indian Institute of Technology New Delhi, India. Overview Design Evaluation

NoSQL Database Systems and their Security Challenges

A Comparison of Approaches to Large-Scale Data Analysis

Oracle Architecture, Concepts & Facilities

Database Scalability and Oracle 12c

How To Use Big Data For Telco (For A Telco)

Big Data With Hadoop

Referential Integrity in Cloud NoSQL Databases

Introduction to Polyglot Persistence. Antonios Giannopoulos Database Administrator at ObjectRocket by Rackspace

Hadoop and Map-Reduce. Swati Gore

A Scalable Relational Database Model for Cloud Computing

Ground up Introduction to In-Memory Data (Grids)

A Review of Column-Oriented Datastores. By: Zach Pratt. Independent Study Dr. Maskarinec Spring 2011

nosql and Non Relational Databases

Making Sense ofnosql A GUIDE FOR MANAGERS AND THE REST OF US DAN MCCREARY MANNING ANN KELLY. Shelter Island

Hadoop: A Framework for Data- Intensive Distributed Computing. CS561-Spring 2012 WPI, Mohamed Y. Eltabakh

.NET User Group Bern

Chapter 11 Map-Reduce, Hadoop, HDFS, Hbase, MongoDB, Apache HIVE, and Related

Big Data Analytics: Hadoop-Map Reduce & NoSQL Databases

Apache Hadoop. Alexandru Costan

Big Data Database Revenue and Market Forecast,

Database Management System Choices. Introduction To Database Systems CSE 373 Spring 2013

BRAC. Investigating Cloud Data Storage UNIVERSITY SCHOOL OF ENGINEERING. SUPERVISOR: Dr. Mumit Khan DEPARTMENT OF COMPUTER SCIENCE AND ENGEENIRING

Databases 2 (VU) ( )

Introduction to Hadoop. New York Oracle User Group Vikas Sawhney

Transcription:

select fun, profit from real_world where rational = false Teching Software-Engineering 20 years with Relational Databases PostgreSQL, SQLite, Oracle... Great stuff! db4o 2006 2004

Reallygreat stuff! 2006 2008 Sorry. No books available! nosqlberlin.de nosqlfrankfurt.de nosql powerdays 2010 Oracle, IBM, etc. wollen auch non-relational Und nicht vergessen Einfache Installation / Handling + Fun!

Scale-out! Open-Source nur indirekte finanzielle Interessen Web-Scale = Historie Schema free Strange Loop Pattern: Design for Crash!

Replication fun? kommt noch nosqltapes.com! NoSQL is specialization!

NoSQL foundations: Paralellization Contracts Stonebraker A A giant step back! Imcompatible, missing features, not new, compile, analyze, optimize Starke Konkurrenz: Stratosphere (TUB), epic, SwissBox, etc. auf einer atmenden Cloud!

Eventually Consistent Consistency Models ACID Amazon Dynamo MySQL Replikation BASE Wilfried Springer NoSQL Rollercoaster CAP Theoreme Don t throw C away so easy! It s complex. Pick 2! ACID / Isolation Consistency Clients see equal data System is always on Availability Clients find replicas Klassiker NoSQL Partition Tolerance What you really have is: 1. Applicationerrors 2. Repetable DBMS errors 3. Unrepeatable DBMS errors 4. Operating System errors 5. Hardware failure in cluster 6. Network partition in local cluster 7. A disaster 8. WAN failure

6 = Network Partition is rare 3,4,5,6 is mostly a Single Node Algorithms can help! Consistent Hashing M:[0,5) R:[25,30) N:[5,10) HASH KNOTEN REPLIKAT 2 M N,O 8 N O,P 10 O P,Q 17 P Q,R 22 Q R,M 26 R M,N give up Prather than sacrificing C. Use VoltDB or NimbusDB Q:[20,25) W = 2*W R = 1*R P:[15,20) O:[10,15) ausfallsicher leicht erweiterbar gut verteilt / vnodes pessimistisches Locking?

Anna laufen A:1 L:1 Paul surfen P:1 L:1 surfen L:2 P:1 A:0 laufen A:1 L:1 P:0 => surfen P:2 A:1 L:2 Laura laufen L:1 surfen L:2 P:1 Google Protocol Buffers automatic RPC generation =>

Column Family DocumentDBs Key/ValueDBs Voldemort, Chordless, Scalaris, Dynamo / Dynomite GraphDBs andere db4o, Versant, Objectivity, Gemstone, Progress, Mark Logic, EMC Momentum, Tamino, GigaSpaces, Hazelcast, Terracotta, Wide Column Stores / Column Families + Skalierung = new node + Community + API - Replikation - Aufsetzen, Optimierung, Wartung + stressfreie SaaS Lösung + transparent scaling -UTF UTF-8 String - Daten liegen bei Amazon -kein tuning / config + Skalierung = new node + Replikation + Konfiguration (r, w) - Dokumentation -Abfragen -(storage (storage-conf.xml)

Document DBs Views + memory mapped, indexes, queries, marketing - durability, single instance design 1 URL Replikation 67 GB 2 Days off NoSQL Divergenz: CouchDB = NoNoSQL NoSQL Konvergenz: K/V-Stores Data Structure Server -> + sehr schnell > 100.000 /sek + konfigurierbarer Disc sync + API für eigene Anbindung + einfache Replikation + hash, list, set, sorted set, messages + Installation UNIX: 38 sek Windows: 18 sek - noch nicht skalierbar (2.*)

Property Graph Neo4j, Sones, HyperGraphDB, InfiniteGraph, InfoGrid, Dex, VertexDB, Filament, OrientDB X DataModel, Query Methods, Languaes, License, Protocol DB Key / ValueDB + DocumentDB + ObjectDB + GraphDB +schneller

> 220 DBs Entscheider 10 Seiten Analyse Bitten um Hilfe Bauchentscheidung nichtfunktionales Requirement! NoSQL Consulting Graph Start-Ups 1. Data 2. Transactions 3. Performance etablierte Unternehmen 4. Queries 5. Architecture 6. other Non-Functional Requirements

Analyse your Data Domain-Data, Log-Data, Event-Data, Message-Data, critical Data, Business-Data, Meta-Data, temp Data, Session-Data, Geo Data, etc. Data- / Storage-Model: relational, column-o, doc-alike, graphs, objects, etc. What Types / Type-System? Data-Navigation, Data Amount, Data Komplexity (Deep XML?) ACID vs. BASE vs. Mixture? CAP decisions Performance Dimension Analysis Latency, Request behaviour, Throughput Scale-Up vs Scale-Out Query Requirements Typical queries, Tools, Ad-Hoc Queries, SQL / LINQ needed, Map/Reduce? NoSQL Lessons learned Distribution Architecture local, parallel, distributed / grid, service, cloud, mobile, p2p, Data Access Patterns read / write distribution, random / sequential, Access Design Patterns Non Functional Requirements: Replication, Refactoring Frequency, DB-Support, Qualification / simplicity, Company restrictions, DB diversity (allowed?), Security, Safety / Backup & Restore, Crash Resistance, Licence Think outside the MyOracleSql Box RAM + SSD rocks RethinkDB VoltDB Vertica GenieDB MonetDB Hadoop++... Lot s of >1 PT RAM DBs in CA! Queries in FPGA!

Tonnen cleverer hybrid Lösungen da! The world is diverse! Act accordingly! Excel! Relational & SQL! Map& Reduce! Document! XML. Coffee? Tupel! OO-Model! Graphs! Key-Value! N SQL DaaS=> best Mix! Polyglot Persistence http://edlich.de This talk is supported by they want you for Cloud Computing!