NoSQL in der Cloud Why? Andreas Hartmann 17.04.2013
Source: http://res.sys-con.com/story/mar12/2188748/cloudbigdata_0_0.jpg
Why Cloud???
What does Big Data mean???
What is the Problem with Big Data?
  Source: http://www.codefutures.com/database-sharding/
NoSQL Basics: CAP Theorem
  Consistency: Each client always has the same view of the data
  Availability: All clients can always read and write
  Partition tolerance: The system continues to operate despite arbitrary message loss
  Source: http://blog.nahurst.com/visual-guide-to-nosql-systems
ACID-BASE
  ACID
    Atomicity: all or nothing rule
    Consistency: any transaction the database performs will take it from one consistent state to another
    Isolation: no transaction should be able to interfere with another transaction at all
    Durability: once a transaction has been committed, it will remain so
    Source: http://en.wikipedia.org/wiki/acid
  BASE
    Basically Available: the system does guarantee availability, in terms of the CAP theorem
    Soft-State: the state of the system may change over time, even without input
    Eventual Consistency: the system will become consistent over time, given that the system doesn't receive input during that time
    Source: http://stackoverflow.com/questions/3342497/explanation-ofbase-terminology
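The BASE properties can be made concrete with a toy model. The sketch below is not taken from any real database: `Replica`, `write`, `read`, and `syncFrom` are hypothetical names, and "newest timestamp wins" is just one assumed merge rule among several possible ones.

```javascript
// Toy model of eventual consistency (hypothetical names, not a real database API).
// Each replica keeps key -> { value, ts }; on merge, the newest timestamp wins.
function Replica(name) {
  this.name = name;
  this.data = {};
}

// Accept a write locally without coordinating with other replicas.
Replica.prototype.write = function (key, value, ts) {
  var current = this.data[key];
  if (!current || ts > current.ts) {
    this.data[key] = { value: value, ts: ts };
  }
};

// Read whatever this replica currently knows; the answer may be stale.
Replica.prototype.read = function (key) {
  return this.data[key] ? this.data[key].value : undefined;
};

// Synchronisation step: pull every entry from another replica, newest wins.
Replica.prototype.syncFrom = function (other) {
  for (var key in other.data) {
    this.write(key, other.data[key].value, other.data[key].ts);
  }
};

var a = new Replica("a");
var b = new Replica("b");

// Both replicas stay available and accept writes independently.
a.write("title", "NoSQL", 1);
b.write("title", "NoSQL in the Cloud", 2);

// Soft state: before synchronisation the replicas disagree.
var staleRead = a.read("title");

// Once writes stop and the replicas exchange state, they converge.
a.syncFrom(b);
b.syncFrom(a);
var converged = a.read("title") === b.read("title");
```

In this model a client reading from replica `a` before the exchange sees the older value, which is the price paid for accepting writes without coordination.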
Things to think about!!!
  Scalability, High Availability, Caching, Architecture, Load Balancing, Queuing, Data Handling, Federation, Serving Files, Storing Files
  Source: http://cloudscaling.com/blog/cloud-computing/up-out-centralized-and-decentralized
Open Your Mind ;-)
  Source: http://images.tribe.net/tribe/upload/photo/deb/074/deb074db-81fc-4b8a-bfbd-b18b922885cb
NoSQL Tagcloud
Datastore Types
  Key/Value Store
  Document Store
  Extensible Record Stores (Wide-Column Stores)
  Graph Database
What it means
  Query
    Data is easily and quickly read/stored using the primary key
    Denormalize data for commonly used queries
    Schema design is optimized for the most common use cases
  Developer
    More technologies to have fun with
    Broader choice of persistence stores
  Probably Polyglot Persistence
    Store name, first name etc. in an RDBMS
    Store followers in a graph database
    Store content in an RDBMS
    Store user-generated content in a document database
  Source: http://www.slideshare.net/adessoag/no-sql-9355109
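The point about denormalizing for common queries can be sketched in plain JavaScript. Everything here is illustrative: the `users` and `posts` objects, their field names, and the sample values are assumptions and do not come from the talk.

```javascript
// Normalized shape: showing a post with its author's name needs two lookups.
var users = { u1: { name: "Lovelace", firstname: "Ada" } };
var posts = { p1: { authorId: "u1", text: "Why NoSQL?" } };

function readNormalized(postId) {
  var post = posts[postId];
  var author = users[post.authorId];   // second lookup, a join in an RDBMS
  return { text: post.text, author: author.firstname + " " + author.name };
}

// Denormalized shape: the author's name is copied into the post document,
// so the common query becomes a single read by primary key.
var postsDenormalized = {
  p1: { text: "Why NoSQL?", author: "Ada Lovelace" }
};

function readDenormalized(postId) {
  return postsDenormalized[postId];
}

var viaJoin = readNormalized("p1");
var viaSingleRead = readDenormalized("p1");
```

Both reads return the same data. The trade-off is that the copied name must be updated in every post if the user's name ever changes.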
Which is the right one?
  Source: http://www.slideshare.net/emileifrem/nosql-east-a-nosql-overview-and-the-benefits-of-graph-databases
Which is the right one?
  Source: http://martinfowler.com/articles/nosql-intro.pdf
Architecture Requirements
  Business Critical Data / Read/Write / Data should be consistent
  Rarely Changed Data / Various Reads / Slight inconsistencies are OK
MongoDB as an Example
  Document-oriented and schemaless
  JSON-like documents
  Written in C++
  Open source under the AGPL v3.0 licence
MongoDB in Detail
  MongoDB Basics, Security and Authentication, Indexes, Replication, Scaling, Map/Reduce, Binary Data Sets, Monitoring, Backup, Schema Design, Connectivity, Ecosystem
MongoDB Replication
  Replica Sets
    A replica set consists of two or more nodes that are copies of each other
    The replica set automatically selects a primary (master)
    Drivers can automatically detect when a replica set primary changes and will begin sending writes to the new primary
  Why Replica Sets
    Automated Failover
    Read Scaling (slaveokay Method)
    Maintenance
    Disaster Recovery
  Source: http://www.mongodb.org/display/docs/replica+sets
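The failover behaviour described above can be traced with a small simulation. This is a toy model under loud assumptions: `ReplicaSet`, `elect`, `fail`, and `writeTo` are hypothetical names, and "first node that is still up becomes primary" stands in for the real election, which involves voting and heartbeats that are not modelled here.

```javascript
// Toy model of replica-set failover (hypothetical names, not the MongoDB API).
function ReplicaSet(nodeNames) {
  this.nodes = nodeNames.map(function (name) {
    return { name: name, up: true, log: [] };
  });
  this.primary = this.nodes[0];
}

// Assumed election rule: the first node that is still up becomes primary.
ReplicaSet.prototype.elect = function () {
  var candidates = this.nodes.filter(function (n) { return n.up; });
  this.primary = candidates.length > 0 ? candidates[0] : null;
};

// Mark a node as down and trigger an election if it was the primary.
ReplicaSet.prototype.fail = function (name) {
  this.nodes.forEach(function (n) {
    if (n.name === name) { n.up = false; }
  });
  if (this.primary && !this.primary.up) { this.elect(); }
};

// Driver-like client: sends each write to whichever node is primary right now.
function writeTo(replicaSet, doc) {
  if (!replicaSet.primary) { throw new Error("no primary available"); }
  // Copy the write to every node that is up, mimicking replication.
  replicaSet.nodes.forEach(function (n) {
    if (n.up) { n.log.push(doc); }
  });
  return replicaSet.primary.name;
}

var demoSet = new ReplicaSet(["node1", "node2", "node3"]);
var firstWrite = writeTo(demoSet, { title: "A" });   // handled by node1
demoSet.fail("node1");
var secondWrite = writeTo(demoSet, { title: "B" });  // handled by node2
```

The client code never names a specific node, which is the property that lets writes continue after the primary changes.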
MongoDB Scaling
  Sharding
    Horizontal scaling across multiple nodes
  Sharding Key Characteristics
    Inserts are balanced between shards
    Common queries are routed to a subset of the shards
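Both shard key characteristics can be shown with a toy range-based router. This is a sketch, not MongoDB's routing implementation: the `shards` table, the key ranges, the choice of `author` as shard key, and the sample books are all assumptions made for illustration.

```javascript
// Toy range-based shard router (hypothetical, not MongoDB's router).
// Each shard owns a half-open range [min, max) of lower-cased shard-key values.
var shards = [
  { name: "shard0", min: "a", max: "h", docs: [] },
  { name: "shard1", min: "h", max: "p", docs: [] },
  { name: "shard2", min: "p", max: "{", docs: [] }  // "{" sorts just after "z"
];

// Find the shard whose range contains the given shard-key value.
// Keys that do not start with a letter fall outside all ranges and throw.
function shardFor(keyValue) {
  var k = keyValue.toLowerCase();
  for (var i = 0; i < shards.length; i++) {
    if (k >= shards[i].min && k < shards[i].max) { return shards[i]; }
  }
  throw new Error("no shard owns key " + keyValue);
}

// Inserts are spread over the shards according to the shard key (author).
function insert(doc) {
  shardFor(doc.author).docs.push(doc);
}

// A query that includes the shard key is routed to a single shard;
// any other query has to ask every shard.
function find(query) {
  var targets = query.author ? [shardFor(query.author)] : shards;
  var results = [];
  targets.forEach(function (shard) {
    shard.docs.forEach(function (doc) {
      var matches = Object.keys(query).every(function (field) {
        return doc[field] === query[field];
      });
      if (matches) { results.push(doc); }
    });
  });
  return { shardsContacted: targets.length, docs: results };
}

insert({ author: "Adams", pages: 200 });
insert({ author: "Knuth", pages: 650 });
insert({ author: "Tolkien", pages: 300 });

var targeted = find({ author: "Knuth" });
var broadcast = find({ pages: 300 });
```

Here `targeted` contacts one shard while `broadcast` contacts all three, which is why the most common queries should include the shard key.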
MongoDB Scaling
  Replica Sets come together with Sharding
  Source: http://www.mongodb.org/display/docs/sharding+introduction
MongoDB Map/Reduce
  Parallel processing of huge datasets on distributed systems
  [Figure: several data blocks feed into a MAP phase, followed by a REDUCE phase]
MongoDB Map/Reduce: Map
  // Called once per document; "this" is the current document.
  var map = function() {
      // Emit the author as key and the page count as value.
      emit( this.author, { pages: this.pages } );
  };
MongoDB Map/Reduce: Reduce
  // Called per key with all values emitted for that key.
  var reduce = function( key, values ) {
      var sum = 0;
      values.forEach( function( doc ) {
          sum += doc.pages;
      } );
      // Same shape as the emitted values, so results can be reduced again.
      return { "pages": sum };
  };
MongoDB Map/Reduce: Execute
  // Run the job on the bookstore collection and write the output to a collection.
  db.bookstore.mapReduce( map, reduce, { out: "myresultcollection" } );

  {
      "result" : "myresultcollection",
      "timeMillis" : 156,
      "counts" : { "input" : 7, "emit" : 7, "reduce" : 3, "output" : 3 },
      "ok" : 1,
  }
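The map, group, and reduce flow can be traced without a database. The sketch below reuses the map and reduce functions from the slides and emulates the rest in plain JavaScript: `runMapReduce`, the global `emit`, and the sample `books` array are assumptions for illustration, not part of the MongoDB shell, and the distributed execution is not modelled.

```javascript
// Emitted values are collected per key, as the server does between map and reduce.
var groups = {};
function emit(key, value) {
  if (!groups[key]) { groups[key] = []; }
  groups[key].push(value);
}

// Map and reduce as on the slides.
var map = function () {
  emit(this.author, { pages: this.pages });
};

var reduce = function (key, values) {
  var sum = 0;
  values.forEach(function (doc) {
    sum += doc.pages;
  });
  return { pages: sum };
};

// Hypothetical helper that emulates the job on an in-memory array.
function runMapReduce(docs) {
  groups = {};
  // Map phase: call map with each document bound to "this".
  docs.forEach(function (doc) { map.call(doc); });
  // Reduce phase: one output document per key.
  return Object.keys(groups).map(function (key) {
    var values = groups[key];
    // As in MongoDB, reduce is skipped for keys with a single emitted value.
    var value = values.length === 1 ? values[0] : reduce(key, values);
    return { _id: key, value: value };
  });
}

// Sample data (invented for this sketch).
var books = [
  { author: "Adams", pages: 200 },
  { author: "Adams", pages: 250 },
  { author: "Knuth", pages: 650 }
];

var result = runMapReduce(books);
```

Returning the same `{ pages: ... }` shape from reduce that map emits matters in practice, because MongoDB may feed earlier reduce results back into reduce for the same key.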
Case Studies
  Source: http://res.sys-con.com/story/mar12/2188748/cloudbigdata_0_0.jpg
  Source: http://www.techweekeurope.co.uk/wp-content/uploads/2011/03/big-data.jpg
Thank you for your attention. info@adesso.de www.adesso.de