Practical Data Integrity Protection in Network-Coded Cloud Storage



Similar documents
NCCloud: A Network-Coding-Based Storage System in a Cloud-of-Clouds

EMPOWER DATA PROTECTION AND DATA STORAGE IN CLOUD COMPUTING USING SECURE HASH ALGORITHM (SHA1)

Secure Distribution of File on Cloud

NCCloud: Applying Network Coding for the Storage Repair in a Cloud-of-Clouds

Secure Way of Storing Data in Cloud Using Third Party Auditor

Index Terms Cloud Storage Services, data integrity, dependable distributed storage, data dynamics, Cloud Computing.

Security of Data from Public - Using Cloud Storage

OVERVIEW OF SECURITY ISSUES IN CLOUD COMPUTING

Functional-Repair-by-Transfer Regenerating Codes

Data storage security in Cloud

Erasure correcting to enhance data security in cloud data storage

Ensuring Data Storage Security in Cloud Computing

How To Encrypt Data With A Power Of N On A K Disk

INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) PERCEIVING AND RECOVERING DEGRADED DATA ON SECURE CLOUD

Data Corruption In Storage Stack - Review

Verification of Data Reliability and Secure Service for Dynamic Data in Cloud Storage

HAIL: A High-Availability and Integrity Layer for Cloud Storage

Auditing for Distributed Storage Systems

Data Storage Security in Cloud Computing for Ensuring Effective and Flexible Distributed System

Verifying Correctness of Trusted data in Clouds

Remote Data Checking for Network Coding-based Distributed Storage Systems

Review On Regenerating Code Based Secure Cloud Storage Using Public Auditing Mr.Satish Shelar 1, Prof.S.Y.Raut 2

Data Storage Security in Cloud Computing

RIGOROUS PUBLIC AUDITING SUPPORT ON SHARED DATA STORED IN THE CLOUD BY PRIVACY-PRESERVING MECHANISM

Currency and Correctness of Content in Object Storage Networks

Near Sheltered and Loyal storage Space Navigating in Cloud

Ensuring Data Storage Security in Cloud Computing By IP Address Restriction & Key Authentication

Performance Evaluation Panda for Data Storage and Sharing Services in Cloud Computing

Reliability and Fault Tolerance in Storage

Improving data integrity on cloud storage services

Cloud storage with Apache jclouds

SECURE AND TRUSTY STORAGE SERVICES IN CLOUD COMPUTING

DESIGN AND IMPLEMENTATION OF A SECURE MULTI-CLOUD DATA STORAGE USING ENCRYPTION

Cloud Storage. Parallels. Performance Benchmark Results. White Paper.

Energy Efficiency in Secure and Dynamic Cloud Storage

Keywords: - Ring Signature, Homomorphic Authenticable Ring Signature (HARS), Privacy Preserving, Public Auditing, Cloud Computing.

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

Ensuring Data Storage Security in Cloud Computing

AN EXPOSURE TO RELIABLE STORAGE SERVICES IN CLOUD COMPUTING

Index Terms : cloud computing, Distributed Storage, error detection, data recovery, SHA, dynamic block operations

PORs: Proofs of Retrievability for Large Files

A Secure and Dependable Cloud Storage Service in Cloud Computing

February. ISSN:

TECHNICAL WHITE PAPER: ELASTIC CLOUD STORAGE SOFTWARE ARCHITECTURE

Peer-to-peer Cooperative Backup System

Surveying Cloud Storage Correctness using TPA with BLS

Magnus: Peer to Peer Backup System

A Survey on Secure Storage Services in Cloud Computing

DepSky Dependable and Secure Storage in a Cloud-of-Clouds Alysson Bessani, Miguel Correia, Bruno Quaresma, Fernando André, Paulo Sousa

Distributing, Ensuring and Recovery of Data Stored in Cloud

SECURITY ENHANCEMENT OF GROUP SHARING AND PUBLIC AUDITING FOR DATA STORAGE IN CLOUD

Enhancing Data Security in Cloud Storage Auditing With Key Abstraction

Today s Papers. RAID Basics (Two optional papers) Array Reliability. EECS 262a Advanced Topics in Computer Systems Lecture 4

A Comprehensive Data Forwarding Technique under Cloud with Dynamic Notification

Secure Storage Services and Erasure Code Implementation in Cloud Servers

Coding Techniques for Efficient, Reliable Networked Distributed Storage in Data Centers

Data Protection Technologies: What comes after RAID? Vladimir Sapunenko, INFN-CNAF HEPiX Spring 2012 Workshop

A Secure & Efficient Data Integrity Model to establish trust in cloud computing using TPA

PRIVACY ASSURED IMAGE STACK MANAGEMENT SERVICE IN CLOUD

Enhance Data Security in Cloud Computing using Layered Interleaving Approach

Load Re-Balancing for Distributed File. System with Replication Strategies in Cloud

Service Description Cloud Storage Openstack Swift

Exercise 2 : checksums, RAID and erasure coding

Data Security And Integrity Of Cloud Storage In Cloud Computing

How To Check If A Data File Is Safe In A Cloud Archive

Technical Overview Simple, Scalable, Object Storage Software

CERN Cloud Storage Evaluation Geoffray Adde, Dirk Duellmann, Maitane Zotes CERN IT

ESPRESSO: An Encryption as a Service for Cloud Storage Systems

A Secure Cloud Backup System with Assured Deletion and Version Control

A Survey on Secure Auditing and Deduplicating Data in Cloud

How To Ensure Data Integrity In Cloud Computing

Chapter 13 File and Database Systems

Chapter 13 File and Database Systems

IMPLEMENTATION CONCEPT FOR ADVANCED CLIENT REPUDIATION DIVERGE AUDITOR IN PUBLIC CLOUD

AN INVESTIGATION OF SECURITY THEME FOR CLOUD COMPUTING

Distributed Storage Networks and Computer Forensics

Building Storage Clouds for Online Applications A Case for Optimized Object Storage

A Survey on Data Integrity of Cloud Storage in Cloud Computing

Secure Collaborative Privacy In Cloud Data With Advanced Symmetric Key Block Algorithm

Algorithms and Methods for Distributed Storage Networks 5 Raid-6 Encoding Christian Schindelhauer

Aspera Direct-to-Cloud Storage WHITE PAPER

Survey on Secrecy-Protective Communal Accounting for Restoring-Key-Based Cloud Storage

Transcription:

Practical Data Integrity Protection in Network-Coded Cloud Storage Henry C. H. Chen Department of Computer Science and Engineering The Chinese University of Hong Kong

Outline Introduction FMSR in NCCloud FMSR-DIP Publications Yuchong Hu, Henry C. H. Chen, Patrick P. C. Lee, and Yang Tang NCCloud: Applying Network Coding for the Storage Repair in a Cloud of-clouds Proceedings of the 10th USENIX Conference on File and Storage Technology (FAST 12) Henry C. H. Chen, and Patrick P. C. Lee Practical Data Integrity Protection in Regenerating-Coding-Based Storage To appear in the 31st International Symposium on Reliable Distributed Systems (SRDS 12)

Outline Introduction FMSR in NCCloud FMSR-DIP

Cloud Storage On-demand storage outsourcing Supports RESTful APIs: PUT, GET, DELETE, LIST

Problems in the Cloud

Data Integrity Protection Corruption detection Addressed in this work Fault-tolerance and repair Addressed in NCCloud Desirable properties Minimize cost Works on thin clouds (i.e., clouds with only basic file access semantics)

Data Integrity Protection Corruption detection Addressed in this work Fault-tolerance and repair Addressed in NCCloud Desirable properties Minimize cost Works on thin clouds (i.e., clouds with only basic file access semantics)

Data Integrity Protection Corruption detection Addressed in this work (FMSR-DIP) Fault-tolerance and repair Addressed in NCCloud Desirable properties Minimize cost Works on thin clouds (i.e., clouds with only basic file access semantics)

Related Work Single node, smart clouds PDP [Ateniese et al. 07] POR [Juels et al. 07] Multi-node, different storage schemes MR-PDP [Curtmola et al. 08] HAIL [Bowers et al. 09]

Our Work Build FMSR-DIP, a corruption detection scheme that allows byte-sampling Works on thin clouds Works on functional minimum storage regenerating (FMSR) code Targets on long-term archives

Outline Introduction FMSR in NCCloud FMSR-DIP

NCCloud Cloud 1 Users file upload file download Proxy Cloud 2 Cloud 3 Cloud 4

Contributions of NCCloud Propose an implementable design of functional minimum storage regenerating (FMSR) code Support basic read/write operations and the repair function on thin clouds Preserve storage requirements as in optimal erasure codes, while reducing repair traffic Implement and evaluate in real cloud storage

Repairing a Failed Cloud How to repair: Proxy Cloud 1 Cloud 2 Cloud 3 Cloud 4 Cloud 5 Repair traffic = + + Goal: minimize repair traffic

Reed-Solomon Codes Node 1 Node 2 A B A B File of size M Proxy Reed Solomon codes Repair traffic = M Node 3 Node 4 A+B A+2B B A+B A A Conventional repair: n = 4, k = 2 (n, k) MDS code: Any k out of n storage nodes (clouds) can rebuild original file. Reconstruct whole file and generate data in new node

FMSR in NCCloud Node 1 Node 2 P1 P2 P3 P4 A B C D File of size M Proxy F-MSR codes Repair traffic = 0.75M Node 3 Node 4 P5 P6 P7 P8 P3 P5 P7 P1 P2 P1 P2 n = 4, k = 2 Code chunk P i = linear combination of original native chunks Repair in FMSR: Download one code chunk from each surviving node Reconstruct new code chunks (via random linear combination) in new node

FMSR Property Proxy File partition k(n-k) chunks A B C D encode n(n-k) chunks P1 P2 P3 P4 P5 P6 P7 P8 distribute Storage nodes P1 P2 P3 P4 P5 P6 P7 P8 n=4, k=2

FMSR Property Proxy File partition k(n-k) chunks A B C D encode n(n-k) chunks P1 P2 P3 P4 P5 P6 P7 P8 distribute Storage nodes P1 P2 P3 P4 P5 P6 P7 P8 c 1,1 c 1,2 c 1,3 c 1,4... c 8,1 c 8,2 c 8,3 c 8,4 A B C D P1 P2 P3 P4 P5 P6 P7 P8 n=4, k=2 Encoding matrix rank = k(n-k) Native chunks Code chunks

NCCloud: Experiments Testbed environment Local cloud Openstack Swift 1.4.2 1 proxy node connected to 15 storage nodes (LAN) NCCloud deployed on proxy node Commercial cloud Microsoft Azure Storage schemes (4,2)-Reed-Solomon vs. (4,2)-FMSR

Response time (s) REPAIR Response time (s) DOWNLOAD Response time (s) UPLOAD 50 40 30 20 10 0 12 10 8 6 4 2 0 Response time: Local Cloud 1 10 50 100 200 300 400 500 1 10 50 100 200 300 400 500 RAID-6 RS F-MSR FMSR File size (MB) RAID-6 RS F-MSR FMSR File size (MB) FMSR has higher response time due to encoding/decoding overhead FMSR has slightly less response time in repair, due to less data download 35 30 25 20 15 10 5 0 1 10 50 100 200 300 400 500 RAID-6(native) RS chunk repair) RAID-6(parity) RS (code chunk repair) F-MSR FMSR File size (MB) 20

Response time (s) REPAIR Response time (s) DOWNLOAD Response time (s) UPLOAD Response time: Commercial Cloud 6 RAID-6 RS 4 F-MSR FMSR 2 0 1 2 5 10 File size (MB) 2.5 2 1.5 1 0.5 0 1 2 5 10 RAID-6 RS F-MSR FMSR File size (MB) No distinct response time difference, as network fluctuations play a bigger role in actual response time 6 5 4 3 RAID-6(native) RS chunk repair) RAID-6(parity) RS (code chunk repair) F-MSR FMSR 2 1 0 1 2 5 10 File size (MB) 21

Outline Introduction FMSR in NCCloud FMSR-DIP

FMSR-DIP: Design Goals Preserves advantage of FMSR Works on thin clouds Supports sampling to minimize cost Works against a Byzantine, mobile adversary Exhibits arbitrary behaviors Corrupts different subsets of servers over time

FMSR-DIP: Overview Cloud 1 Users file upload file download Proxy FMSR- DIP Cloud 2 Cloud 3 Cloud 4 Four operations: Upload, Check, Download and Repair

FMSR-DIP: Upload 8 FMSR code chunks, 3 bytes each

FMSR-DIP: Upload Apply error-correcting code (ECC) to each chunk individually

FMSR-DIP: Upload XOR each byte with a pseudorandom value

FMSR-DIP: Upload For each chunk, calculate the MAC of the first 3 bytes

FMSR-DIP: Upload Upload the chunks to clouds Encrypt the metadata from NCCloud (which contains the encoding matrix) Append all MACs to metadata Replicate metadata on all nodes

FMSR-DIP: Check Pick a row to check

FMSR-DIP: Check XOR with the previous pseudorandom values, and check their consistency

Recall: FMSR Encoding c 1,1 c 1,2 c 1,3 c 1,4 P 1 c 2,1 c 2,2 c 2,3 c 2,4 c 3,1 c 3,2 c 3,3 c 3,4 A P 2 P 3 c 4,1 c 4,2 c 4,3 c 4,4 B P 4 c 5,1 c 5,2 c 5,3 c 5,4 C P 5 c 6,1 c 6,2 c 6,3 c 6,4 D P 6 c 7,1 c 7,2 c 7,3 c 7,4 c 8,1 c 8,2 c 8,3 c 8,4 Encoding matrix rank = k(n-k) Native chunks P 7 P 8 Code chunks

FMSR-DIP: Download Download chunks from any 2 nodes and verify with their MACs

FMSR-DIP: Download Remove pseudorandom values and pass to NCCloud for decoding

FMSR-DIP: Repair

FMSR-DIP: Repair Download 1 chunk from all other nodes and verify with their MACs

FMSR-DIP: Repair Remove pseudorandom values and pass to NCCloud

FMSR-DIP: Repair NCCloud generates new chunks

FMSR-DIP: Repair Process the newly generated chunks as before

FMSR-DIP: Repair Upload chunks and update metadata on all nodes

FMSR-DIP: Experiments Testbed environment Openstack Swift 1.4.2 1 proxy node connected to 15 storage nodes (LAN) NCCloud and FMSR-DIP deployed on proxy node NCCloud uses RAMDisk as storage Storage scheme (4,2)-FMSR

REPAIR Time taken(s) DOWNLOAD Time taken(s) UPLOAD Time taken (s) 25 20 15 10 5 0 8 6 4 2 Running Time vs. File Size 100MB 50MB 20MB 10MB 5MB 1MB Transfer-Up DIP-Encode FMSR File size Transfer-Down DIP-Decode FMSR FMSR-DIP overhead comparable to network transfer time in a LAN environment 0 20 15 10 5 0 100MB 50MB 20MB 10MB 5MB 1MB 100MB 50MB 20MB 10MB 5MB 1MB File size Transfer-Up Transfer-Down DIP-Encode DIP-Decode FMSR File size

Time taken (s) Time taken (s) The Check Operation 80 70 60 50 40 30 20 10 0 1% check 256B 1KB 4KB 7KB 25KB 256KB Misc. Transfer-Down Rank Checking PRF Download block size Bottleneck in network transfer 30 25 20 15 10 5 256KB download block size Misc. Transfer-Down Rank Checking PRF 0 100% 75% 50% 25% 10% 5% 1% Checking percentage

Conclusions Propose a design for efficient data integrity protection using FMSR on thin clouds Implement and evaluate the efficiency of the design Source code: NCCloud http://ansrlab.cse.cuhk.edu.hk/software/nccloud/ FMSRDIP http://ansrlab.cse.cuhk.edu.hk/software/fmsrdip/

Thank You!

Error Localization Assume each byte is correct in turn

Error Localization Form a system with bytes from k other nodes

Error Localization Mark all involved bytes as correct if system is consistent

Error Localization Try all subsets

Error Localization Try all subsets

Error Localization And so on

Cloud Storage Pricing S3 Rackspace Azure Storage (per GB) $0.125 $0.15 $0.125 Data transfer in (per GB) free free Free Data transfer out (per GB) $0.12 $0.18 $0.12 PUT (per 10,000 requests) $0.10 free $0.01 GET (per 10,000 requests) $0.01 free $0.01 Pricing in US dollars, as of May, 2012