Using Data Classification to Manage File Servers



Similar documents
Using Classification to manage File Servers. Nir Ben-Zvi, Microsoft Corporation

ILM et Archivage Les solutions IBM

Caching SMB Data for Offline Access and an Improved Online Experience

How to Backup and Restore a VM using Veeam

Enterprise IT is complex. Today, IT infrastructure spans the physical, the virtual and applications, and crosses public, private and hybrid clouds.

AvePoint DocAve Online vs Office SharePoint Online Management

Assignment # 1 (Cloud Computing Security)

Enterprise and Standard Feature Compare

Online Transaction Processing in SQL Server 2008

Designing Database Solutions for Microsoft SQL Server 2012 MOC 20465

Windows Server 2012 R2 The Essentials Experience

Level 1: Asigra Cloud Backup Foundation Training

How To Install Powerpoint 6 On A Windows Server With A Powerpoint 2.5 (Powerpoint) And Powerpoint On A Microsoft Powerpoint 4.5 Powerpoint (Powerpoints) And A Powerpoints 2

AvePoint DocAve Online vs Office SharePoint Online Management

CLOUD SERVICES FOR EMS

Top. Reasons Legal Firms Select kiteworks by Accellion

System Services. Engagent System Services 2.06

Vistara Lifecycle Management

McAfee Web Gateway Administration Intel Security Education Services Administration Course Training

Cloud Computing and the Future of Internet Services. Wei-Ying Ma Principal Researcher, Research Area Manager Microsoft Research Asia

c360 Advanced Quote and Order Processing for Microsoft Dynamics CRM 4.0 Installing Guide

A Brief Overview. Delivering Windows Azure Services on Windows Server. Enabling Service Providers

Fight fire with fire when protecting sensitive data

Designing and Deploying Messaging Solutions with Microsoft Exchange Server 2010 Service Pack B; 5 days, Instructor-led

HP Virtualization Performance Viewer

"Charting the Course to Your Success!" MOC B Configuring and Administering Microsoft SharePoint Course Summary

MS 20465: Designing Database Solutions for Microsoft SQL Server 2012

Service Overview CloudCare Online Backup

Security challenges for internet technologies on mobile devices

Securing enterprise collaboration through and file sharing on a unified platform

SharePoint 2010 Performance and Capacity Planning Best Practices

ELO for SharePoint. More functionality for greater effectiveness. ELO ECM for Microsoft SharePoint 2013

Gladinet Cloud Backup V3.0 User Guide

Configuring Celerra for Security Information Management with Network Intelligence s envision

Designing Database Solutions for Microsoft SQL Server 2012

<Insert Picture Here> Oracle Database Security Overview

Microsoft SQL Server 2008 R2 Enterprise Edition and Microsoft SharePoint Server 2010

Embedded System Deployment and Management

Smart Client Deployment with ClickOnce. Brian Noyes IDesign, Inc. (

Windows Server 2012 R2 VDI - Virtual Desktop Infrastructure. Ori Husyt Agile IT Consulting Team Manager orih@agileit.co.il

BackupAssist v6 quickstart guide

Click Studios. Passwordstate. Upgrade Instructions to V7 from V5.xx

Release Notes Scribe Adapter for Microsoft Dynamics

Optimizing the Performance of Your Longview Application

Data Management in an International Data Grid Project. Timur Chabuk 04/09/2007

Monitor and Manage Your MicroStrategy BI Environment Using Enterprise Manager and Health Center

HP Records Manager. Release Notes. Software Version: 8.1. Document Release Date: June 2014

SharePoint 2010

Features of AnyShare

CA ARCserve and CA XOsoft r12.5 Best Practices for protecting Microsoft Exchange

Zmanda Cloud Backup Frequently Asked Questions

This course is intended for IT professionals who are responsible for the Exchange Server messaging environment in an enterprise.

WHITE PAPER: TECHNICAL OVERVIEW. NetBackup Desktop Laptop Option Technical Product Overview

Securing enterprise collaboration through and file sharing on a unified platform

How do you test to determine which backup and restore technology best suits your business needs?

RightsWATCH. Data-centric Security.

Data Classification Technical Assessment

Replication Workshop International Conference. Peter Schroer

Layer2 Knowledge Management Suite for SharePoint V3

RSA Authentication Manager 7.1 to 8.1 Migration Guide: Upgrading RSA SecurID Appliance 3.0 On Existing Hardware

Web Application Deployment in the Cloud Using Amazon Web Services From Infancy to Maturity

Hitachi Content Platform (HCP)

solution brief solution brief storserver.com STORServer, Inc. U.S. (800) : STORServer, Europe 0031 (0)

Virtual Volumes Technical Deep Dive

Tutto quello che c è da sapere su Azure App Service

EMC PERSPECTIVE EMC SourceOne Management

Partner Camp Leistungsstarkes Log-Management für physische, virtuelle und cloud-basierte Umgebungen. Tomas Baublys

KonyOne Server Installer - Linux Release Notes

Tableau Server 7.0 scalability

SharePoint Administrator

PRODUCT DESCRIPTIONS AND METRICS

Kaseya 2. User Guide. Version 7.0. English

How To Use Adobe Software For A Business

Multi-Datacenter Replication

PTC System Monitor Solution Training

AvePoint DocAve 4.0 (Enterprise Solutions for SharePoint) User Guide August 2006

with Managing RSA the Lifecycle of Key Manager RSA Streamlining Security Operations Data Loss Prevention Solutions RSA Solution Brief

How to avoid building a data swamp

Diagram 1: Islands of storage across a digital broadcast workflow

Microsoft Training and Certification Guide. Current as of December 31, 2013

LT Auditor Windows Assessment SP1 Installation & Configuration Guide

Overview of Active Directory Rights Management Services with Windows Server 2008 R2

Office 365 Migration Performance & Server Requirements

Exchange Mailbox Protection Whitepaper

Technical Document Release Version 3.0. Product Sheet. MediaStore Manager. Archive manager Application Module

Securing SharePoint 101. Rob Rachwald Imperva

MySQL Security: Best Practices

ERNW Newsletter 29 / November 2009

The biggest challenges of Life Sciences companies today. Comply or Perish: Maintaining 21 CFR Part 11 Compliance

Server Installation ZENworks Mobile Management 2.7.x August 2013

Sharepoint vs. inforouter

How Exclaimer Mail Archiver Works

The Windows Web Platform. Michael Epprecht Microsoft Switzerland twitter: fastflame

MS-55052: SharePoint 2013 End User Level II

ITPS AG. Aplication overview. DIGITAL RESEARCH & DEVELOPMENT SQL Informational Management System. SQL Informational Management System 1

Server Consolidation with SQL Server 2008

Business 360 Online - Product concepts and features

IT Peace of Mind. Powered by: Secure Backup and Collaboration for Enterprises

Enforce AD RMS Policies for PDF documents in SharePoint Environments Enforce AD RMS Policies for PDF documents in Exchange Environments...

A Websense Research Brief Prevent Data Loss and Comply with Payment Card Industry Data Security Standards

Transcription:

Using Data Classification to Manage File Servers Adi Oltean Senior SDE, Microsoft Corporation Ran Kalach Principal Dev Manager, Microsoft Corporation

Agenda Customer challenges Solution: File Classification Manage data based on business value Grow the ecosystem in classification solutions File Classification Infrastructure The classification pipeline Aggregation, conflict resolution Incremental classification Challenges, Mitigations & Best Practices Conclusions

Customer challenges file servers Storage growth Storage cost Data sharing and search Compliance Security and Information leakage Increasing data management needs / many data management tools HSM Security Archive Backup Encryption Replication Expiration

File shares and business requirements Business IT Need per project share Make sure high business impact files do not leak out Backup files with personal information to encrypted store Expire low business impact files created three years ago and not touched for a year 4

Some time later 5

Classify and apply policy Classification methods Step 1: Classify data IT Scripts Manual Line Of Business application Automatic classification Location Content Owner Step 2: Apply policy based on classification Actions based on classification Backup Archive HSM Reports Expiration Replication Security Encryption Search Leakage prevention

File shares and business requirements Business IT Personal Business Information Impact Need per project share Make sure high business impact files do not leak out Backup files with personal information to encrypted store Expire low business impact files created three years ago and not touched for a year 7

Customer benefits - Summary Apply Policies Based on Classification = Manage data based on business value! Reduce Cost Expire files to reduce storage purchasing needs Move files to less expensive storage Optimize backup SLAs Replicate only business related files Manage risk Find sensitive files on public servers Watermark documents Keep files containing personal information encrypted in backup Apply rights management to high secrecy files Comply with retention policies

Agenda Customer challenges Solution: File Classification Manage data based on business value Grow the ecosystem in classification solutions File Classification Infrastructure The classification pipeline Aggregation, conflict resolution Incremental classification Challenges, Mitigations & Best Practices Conclusions

File Classification Infrastructure Get classification properties API for external applications Set classification properties API for external applications Discover Data Extract classification properties Classify Data Store classification properties Apply Policy based on classification File Classification Extensibility points

Classification pipeline an example This is an example of a pipeline setup with one storage module and two classifiers Property bag object Each component passes property bags to the next one Classification Runtime Process Scanner Gets basic file properties Office Storage [Load] Folder Classifier Content Classifier Office Storage [Save] Reporting Engine Hosting Process Hosting Process Hosting Process discovery load properties classification save properties run policies Property bags can cross processes Security checks are performed on cross-process data transfers Most modules are hosted within a separate process

Aggregation and Conflict Resolution Problem: A classification rule may provide conflicting value with the value already stored in the file Two classification rules may provide conflicting values for the same property Example: Solution: Admin creates a Business Impact property with possible values (LBI, MBI, HBI) A file previously classified as MBI is copied to a folder x:\foo The Folder rule for x:\foo classifies all files as LBI The Content classifier scans the file and classifies it as HBI What is the correct value? Provide several types of classification rules: Default: rule runs only if the property not present in the file. Otherwise: rules can either explicitly aggregate or overwrite previously-stored properties. Value aggregation depends on the property type

Incremental Classification Goal: Minimize re-classification of already classified files Crucial for scalability (large amount of files) Automatic classification (scheduled) Cache classification results in ADS (alternate data stream) ADS contains a hash of certain file properties (last-modify-time, file-path, file-id, etc) ADS contains the last classification time Allows determining whether the cached classification is up-to-date Re-classify the file only if: The file changed or was added since previous classification (hash is different), or A rule has changed since previous classification, or The configuration of a classifier has been updated since previous classification. Get Property API (on-demand) If cache is present and up to date, return cached properties Otherwise (out-of-date classification), application can choose: Accuracy: classify the file on the fly Performance: return stored properties

Challenges, Mitigations & Best Practices 1 - Performance Content classification is expensive (I/O, CPU) Must optimize to scan & classify only when needed Must be able to cache results Minimize performance impact on host of data being classified Classify on another machine When classifying locally, throttle machine resource usage and back out when the machines becomes non-idle Be smart with how you schedule classification, support pause/resume

Challenges, Mitigations & Best Practices 2 - Accuracy Automatic Classification can almost never be 100% accurate Tune your rules for false-positive / false-negative according to the scenario Example: secure files false positive, expire files false negative Policy execution: revert in case of classification error Example: backup files one last time just before you expire them Examine classification results periodically Modify your rules or classifiers till they re optimized for your data-set Enable manual classification Clear and consistent policy for aggregating and resolving conflicts Support flexible rules that allow tuning by administrator or application One answer doesn t fit all!

Challenges, Mitigations & Best Practices 3 - Real-time Classification and Policies Some policies require real-time or near real-time execution Example: removing confidential file from unsecured share Solution: event-based classification File-system activity can be a trigger Need a hook to file-system operations, (many implementation options exist) Consider Classifying only when the file content is stable Avoid overloading the server performance with too aggressive classification

Examples of FCI-enabled solutions Solution Classification solutions Custom classifiers that extract metadata from files Custom storage modules that load/store custom metadata in files Add classification awareness to existing data management solutions. Build intelligent policy-based data management solutions Example An LOB app that maintains special classification rules for PII data it generates. A medical imaging classifier extracts embedded metadata from scanned images Load/store metadata in your custom file formats (example: videos) A backup app can have special backup policies for HBI data Define a policy to automatically apply encrypt HBI data

Opportunities for you Why participate in the File Classification Infrastructure ecosystem? Use FCI for existing software Enhance existing data-producing apps to also attach classification to generated files (ex: LOB applications) Enhance existing data management apps to consume classification Use FCI for new software solutions Develop solutions on top of FCI Develop components for the FCI ecosystem Classifiers Storage modules How I can develop against it? File Classification Infrastructure can be consumed through a rich, scriptable COM API FCI can be extended using C++/C# code, or Powershell scripts When can I start? Now: FCI is part of the latest Server releases (starting with Windows Server 2008 R2)

More information about FCI General information Home page: http://www.microsoft.com/windowsserver2008/en/us/fci.aspx Team blog: http://blogs.technet.com/filecab API documentation on MSDN: http://msdn.microsoft.com/enus/library/bb972746(vs.85).aspx Sample code Windows SDK http://msdn.microsoft.com/enus/windows/bb980924.aspx Sample FCI clients (C++, C#) Sample classifiers (C++, C#) Code Gallery: http://code.msdn.microsoft.com/fci