#MMTM15 #INFOARCHIVE #EMCWORLD 1

Size: px
Start display at page:

Download "#MMTM15 #INFOARCHIVE #EMCWORLD 1"

Transcription

1 #MMTM15 #INFOARCHIVE #EMCWORLD 1 1

2 WHAT'S NEW & WHAT'S NEXT: EMC INFOARCHIVE HARSH HATEKAR PRODUCT MANAGER, 5-MAY #MMTM15 2

3 TWEET LIVE DURING THE SESSION! Connect with us: Sign up for a Hands On Lab 6 th May, 1.30 PM, Galileo 906 Attend Big Data Velocity & Information Governance 6 th May, 3 PM Session Share your thoughts Take the survey In the App or via Set Your Data Free 6 th May, 3 PM, Galileo 1004 #MMTM15 #INFOARCHIVE #EMCWORLD 3

4 REGISTRATIONS ARE NOW OPEN! Online at bit.ly/emchoriz on OR At the Momentum Booth Solutions Expo Booth #803

5 5

6 SAFE HARBOR DISCLAIMER This document contains forward-looking statements as defined under the US Federal Securities Laws. Please note that EMC makes no representations or commitments, and undertakes no obligations with regard to product planning information, anticipated product characteristics, performance specifications and functionality, or anticipated release dates (collectively, Roadmap Information ) as provided. Roadmap Information is provided by EMC as a courtesy to the recipient solely for purposes of discussion and without intending to be bound, and is subject to change without notice. Roadmap information is EMC Confidential Information, and is provided under the terms, conditions and restrictions defined in the EMC Non- Disclosure Agreement in place with your organization. #MMTM15 #INFOARCHIVE #EMCWORLD 6

7 AGENDA InfoArchive Accelerating IT Transformation Use Cases & Customers What s New: InfoArchive 3.1 What s New: InfoArchive 3.2 What s Next: Vision & Investment Themes Resources Questions #MMTM15 #INFOARCHIVE #EMCWORLD 7

8 Reduce IT Complexity Optimize Infrastructure Ensure Regulatory Compliance Extract Value Application Decommissioning Active Archiving EMC s solution to empower organizations unlock the data trapped in applications to lower IT costs, preserve compliance and put application data to work #MMTM15 #INFOARCHIVE #EMCWORLD 8

9 CHANGING THE DISCUSSION IT TRANSFORMATION JOURNEY 9

10 APPLICATION DECOMMISSIONING Move legacy data to InfoArchive Shutdown the legacy applications Meet compliance needs Provide access to legacy data, enable BI and Analytics Content with all associated data Context ACTION All relevant information available Immediate action possible Not application specific Future proofing access #MMTM15 #INFOARCHIVE #EMCWORLD 10

11 ACTIVE ARCHIVING Optimize applications running in production ACTIVE APPLICATIONS READ & WRITE INFRASTRUCTURE & STORAGE Meet backup SLAs Address scalability cost concerns Backup costs App performance Server costs Provide access to data, enable BI and Analytics EMC INFOARCHIVE #MMTM15 #INFOARCHIVE #EMCWORLD 11

12 EXTRACTING VALUE OUT OF DATA #MMTM15 #INFOARCHIVE #EMCWORLD 12

13 ALL ANGLES COVERED ARCHITECTED FOR CHOICE All application data: structured and unstructured Create and manage one central pool of application data One size does not fit all: multiple archive strategies - optimized to source application Tables Structured Data Unstructured Content Compound Objects APPLICATION DECOMMISSIONING DATA RECORDS FILES AND PRINT STREAMS COLLABORATIVE APP ARCHIVING #MMTM15 #INFOARCHIVE #EMCWORLD 13

14 12 Week Time To Value $3.8M Available Annually By Eliminating 1 Main App 80+ More Apps To Be Consolidated Or Decommissioned #MMTM15 #INFOARCHIVE #EMCWORLD 14

15 12 Siloed Applications Decommission ed EPIC Regulated Patient Information Resurfaced 360 View of the Patient to the Clinician X-Rays Treatment History Test Results Progress Notes Immunizations InfoArchive Enterprise Medical Record Application Complete & Patient Centric 15

16 INFOARCHIVE : RELEASE TRACK MAR DEC MAY 2015 #MMTM15 #INFOARCHIVE #EMCWORLD 16

17 INFOARCHIVE DEC, 2014 Extended Compliance and Security Ease of Solution Deployment Extended Search Capabilities Archive Enhancements #MMTM15 #INFOARCHIVE #EMCWORLD 17

18 INFOARCHIVE 3.1 COMPLIANCE Retention Sets Records Hold Sets Retention Policies Management Retention Policies Legal Holds Stronger compliance capabilities Centralized Retention management #MMTM15 #INFOARCHIVE #EMCWORLD Extended and refined retention policies granularity enabling increasing flexibility and reusability on the same holdings 18

19 INFOARCHIVE 3.1 COMPLIANCE Storage SAN Isilon Centera Storage Interface File System NFS CAS IA Date Based Retention features at the applicatio n level IA + RPS Date Based Event Based Legal Hold #MMTM15 #INFOARCHIVE #EMCWORLD 19

20 INFOARCHIVE 3.1 DATA ENCRYPTION Encrypt sensitive data and content at rest Protect and control sensitive information Protect sensitive data from privileged access e.g. Admins Integrate with existing encryption technologies #MMTM15 #INFOARCHIVE #EMCWORLD Partnership with encryption providers is planned 20

21 INFOARCHIVE 3.1 WIZARD Expedite archival setup for a datatype using simple and flexible steps offered in the Wizard Auto generation of configuration files for a archival setup with validation Support for Asynchronous Ingestion save and edit predefined configurations #MMTM15 #INFOARCHIVE #EMCWORLD 21

22 INFOARCHIVE 3.1 SEARCH End user productivity Efficient navigation through search results Locate information quickly Logical Operators filtering, Sort, Column level Filters, Multi- Value Search Criteria Query Criteria in the URL #MMTM15 #INFOARCHIVE #EMCWORLD 22

23 INFOARCHIVE 3.1 EXPORT, REPORTS Ability to export search results Format: xml, csv, txt With or without content Archive Reports Performed Actions Export Action Audit Package Disposition Archive Volume History Current Archive Volume Package Retention #MMTM15 #INFOARCHIVE #EMCWORLD 23

24 INFOARCHIVE MAY, 2015 Compliance, Analytics EMC Storage PrintStream, Wizard, Connectors Table Archiving Enhancements #MMTM15 #INFOARCHIVE #EMCWORLD 24

25 INFOARCHIVE 3.2 COMPLIANCE EMC Isilon OneFS Simple to manage & Massively scalable Single file system, single volume, global namespace Multi protocol support NFS, SMB, HTTP, FTP and HDFS EMC Isilon SmartLock (WORM) Protects critical data against accidental, premature, or malicious alteration or deletion. Enterprise or Compliance mode #MMTM15 #INFOARCHIVE #EMCWORLD 25

26 INFOARCHIVE COMPLIANCE Storage SAN Isilon Centera Storage Interface Retention features at the application level Retention features at the storage level File System NFS CAS IA Date Based IA + RPS Date Based Event Based Legal Hold IA Date Based * IA + RPS Date Based * Legal Hold ** Privileged Delete #MMTM15 #INFOARCHIVE #EMCWORLD 26 With Isilon SmartLock License / ** With Centera ARM License

27 INFOARCHIVE 3.2 ANALYTICS eas_sip.xml eas_pdi.xml account1.pdf account2.pdf account3.pdf eas_sip.xml eas_pdi.xml employee1.pdf employee2.pdf employee3.pdf Compliant Data Lake InfoArchive eas_pdi.xml I N S I G Retention and disposition rules applied on the archived data InfoArchive controls the data exposed in the data lake Isilon HDFS H T Isilon provides access to information being analyzed. e.g. Finance pool, HR pool #MMTM15 #INFOARCHIVE #EMCWORLD 27

28 INFOARCHIVE 3.2 STORAGE ViPR InfoArchive certified with ViPR 2.1 Functional Testing, performance testing on Isilon setup using S3 access Utilized to store unstructured content ECS Hardware appliance with petabyte and global scale managed by ViPR Controller, archived information can be stored on ECS ( unstructured content) #MMTM15 #INFOARCHIVE #EMCWORLD 28

29 INFOARCHIVE 3.2 PRINT STREAM 1 Reception 2 3 Ingestion PDF segments eas_sip.xml eas_pdi.xml Content Retrieval PDF SIP AFP New capabilities Before Third-party connectors like ProArchiver created the SIPs. It is used as ingestion input Limited chain of custody More components in the deployment After AFP file is a input to ingestion process Complete chain of custody Simplified deployment Ability to plugin third-party processing e.g. ProAFP and others Efficient Print Stream processing 29

30 INFOARCHIVE 3.2 WIZARD, ACTIONS Expedite archival setup for a datatype using simple and flexible steps offered in the Wizard Synchronous Ingestion support User actions on the search UI can be controlled by a configuration context Export, Background Search 30

31 INFOARCHIVE CONNECTORS No Additional Cost Documentum SharePoint #MMTM15 #INFOARCHIVE #EMCWORLD 31

32 INFOARCHIVE 3.2 SHAREPOINT CONNECTOR SharePoint Connector for InfoArchive Active Archiving or Decommissioning of items from SharePoint. Configurable rules and filter criteria. Works with SharePoint online and server versions. Based on filter criteria SharePoint objects and attachments are extracted to a SIP SIP Search and access SharePoint archived information Purge Confirmation Delete record what has been archived InfoArchive #MMTM15 #INFOARCHIVE #EMCWORLD 32

33 INFOARCHIVE 3.2 SHAREPOINT CONNECTOR # Features Details 1 SharePoint Server Version Support 2 Accessibility to archive information 3 Type of information archived from SharePoint 4 Selection Criteria for archiving SharePoint 2013 SharePoint 2010 SharePoint Online as a part of Office 365 suite The archived information is searchable via InfoArchive search and retrieve screen. Archive information will not be accessible via SharePoint UI SharePoint Documents (and custom types) SharePoint Lists and Tasks Binary Content (files, documents, attachements) Meta-data driven selection criteria (filters and constraints) Entire site or a subarea of the site Run at some designated interval - polling the SharePoint site or sites for information to be archived All document versions are archived Ability to store query and parameter in the external configuration file 5 Extract Process Extract data from a REST query, results are returned in XML (SharePoint format) 6 Tracking list Tracking list updated to list archived SharePoint items. It can be leveraged by the customers to purge it from the SharePoint site. 7 Output Creates the SIP package #MMTM15 #INFOARCHIVE #EMCWORLD 33

34 INFOARCHIVE 3.2 DOCUMENTUM CONNECTOR Documentum Connector for InfoArchive Archiving of content from Documentum. Configurable rules and filter criteria. ACL Registered Tables Virtual Documents XSL & DQL SIP Based on filter criteria Documentum objects and attachments are extracted to a SIP SIP Search and access Documentum archived information Purge Confirmation Delete record what has been archived InfoArchive #MMTM15 #INFOARCHIVE #EMCWORLD 34

35 INFOARCHIVE 3.2 TABLE ARCHIVING Application decommissioning with Table archiving DATE TIME TITLE LOCATION Self-Paced Hands On Lab: Every day Wed 1:30 PM 2:30 PM IT Transformation By Application Decommissioning InfoArchive Hands On Lab EMC InfoArchive: An Applied Technology Review EMC vlabs in the Village Galileo 906 JDBC Driver Chain of Custody validation ETL templates & Bulk Import Configurable UI Data Masking Authentication - LDAP and SSO Audit Logging Retention and Hold Analytics Samples #MMTM15 #INFOARCHIVE #EMCWORLD 35

36 INFOARCHIVE 3.2 COMPONENTS Table archiving: Only supported on Tomcat. CS and DA are optional components #MMTM15 #INFOARCHIVE #EMCWORLD 36

37 Mobility Centralize d console e- discovery tools INFOARCHIVE VISION Big Data Analytics Compliance Data Lakes Assert compliance over valuable information that is exposed to the Lake Extract Value Enable seamless analytics on the archived information. Compliance Expand EMC s records leadership to a compliant archive EMC InfoArchive Cloud All Data Optimize for the various source applications To and From The Cloud Expose archived data to the cloud for accessing anywhere; Archive cloud applications. #MMTM15 #INFOARCHIVE #EMCWORLD 37

38 NEXT INVESTMENT THEMES Rapid release cadence User Experience Enhanced Analytics Any Data, Any Content Any Application Archive as a Service #MMTM15 #INFOARCHIVE #EMCWORLD 38

39 INFOARCHIVE ROADMAP Q2, 2015 Q3, 2015 Q1, 2016 Q3, 2016 InfoArchive 3.2 InfoArchive Apollo Role based UI Connectors InfoArchive Hercules Analytics 2.0 ViPR HDFS ediscovery enhancements InfoArchive Perseus All data TBD

40 LEARN MORE ABOUT INFOARCHIVE DATE TIME TITLE LOCATION Self-Paced Hands On Lab: EMC vlabs in the Everyday IT Transformation By Application Decommissioning InfoArchive Village 1:30 PM 2:30 Hands On Lab Wednesday Galileo 906 PM EMC InfoArchive: An Applied Technology Review 3:00 PM 4:00 Big Data Governance: Balancing Big Data Velocity & Information Governance Venetian Ballroom A PM 3:00 PM 4:00 Real Stories Galileo 1004 PM EMC InfoArchive - Set Your Data Free! Thursday 9:00 AM 1:00 PM Hackathon: From the Ground Up - Developing an EMC InfoArchive Archiving Solution Galileo 1006 InfoArchive Product Community: //community.emc.com/community/products/infoarchive EMC InfoArchive Support & Enhancement Requests:

41 Reduce IT Complexity Optimize Infrastructure Ensure Regulatory Compliance Extract Value Application Decommissioning Active Archiving EMC s solution to empower organizations unlock the data trapped in applications to lower IT costs, preserve compliance and put application data to work #MMTM15 #INFOARCHIVE #EMCWORLD 41

42 INFOARCHIVE QUESTIONS QUESTIONS #MMTM15 #INFOARCHIVE #EMCWORLD 42

43

44

45 INFOARCHIVE OAIS Open archives information system Developed by the Consultative Committee for Space Data Systems in 2002 and became an ISO standard in 2003 Provides a framework for the understanding and increased awareness of archival concepts needed for long term digital information preservation and access #MMTM15 #INFOARCHIVE #EMCWORLD 45

46 ARCHITECTURAL OVERVIEW Legacy Systems Live Applications Connectors Unstructured Content Structured Data InfoArchive GUI Ingestion Management Data Services Archive Services Archive Access Content Services Storage Platform EMC Isilon, Atmos, Centera + others 46

47 Technical Overview Ingestion Module Archive Services Module Archive Packages Scheduling Validation Indexing Content Classification Source Systems Receipt Confirmation Reject Ingestion tracking Encryption Management Retention Management Legal Holds Traceability & Audit Archive Storage Archive data User focused Archive access portals Archive Access Synchronous access Background search Analytics Access Control Administration Billing/Charge back Schema less database Audit logs Massive scaling through stateless, modular, multi-threaded processing Schema less Architecture 47

48 GROUPING PACKAGED APP VALUE TRANSACTION APPLICATIONS Rich, highly validated transaction data. PRINT STREAMS CONTENT AND IMAGES INTERACTION APPLICATIONS COLLABORATIVE APPLICATIONS CMOD Valuable customer communications and financial reporting history. Images- comprehensive archive of legal agreements. Content- vast quantities of work products. Enriched and semi-structured contentparticularly important source of communications history. #MMTM15 #INFOARCHIVE #EMCWORLD 48

49 PROVIDING DATA FOR THE LAKE Format Considerations Applications Images Print streams Unstructured documents Typical format Structured data- highly normalized table structures Multi-page tiff, with little metadata- and minimal text Large files with proprietary formats (ex. AFP, PCL, Postscript, multiple PDFs) Too many to count Why problematic Very difficult to construct business object Little to no textual information Massive sizes, not easily parsed No structure, little to no way to understand what is documented #MMTM15 #INFOARCHIVE #EMCWORLD 49

50 INFOARCHIVE TECHNICAL VIEW Ingestion Module Archive Services Archive Storage Archive Packages Scheduling Validation Indexing Content Classification Archive data Receipt Confirmation Reject Encryption Management Retention Management Schema less database Ingestion tracking Archive Access Legal Holds Traceability & Audit Audit logs Synchronous access Access Control Background search Analytics Administration Billing/Charge back Hadoop Access Layer Source Systems Archive Portal #MMTM15 #INFOARCHIVE #EMCWORLD 50

51 INFOARCHIVE 3.2 TABLE ARCHIVING Table Archiving Enhancements Description Application decommissioning with Table archiving JDBC Driver Chain of Custody validation ETL templates & Bulk Import Configurable UI Data Masking Authentication - LDAP and SSO Audit Logging Retention and Hold Benefits Fastest way to decommission structured data applications Enables flexible ad hoc queries & reporting on data Minimal deployment footprint with proven xdb technology Configurable UI that provides flexible search and retrieve capabilities Ensures data is not tampered with and there is an unbroken Chain of Custody from data extraction to future usage Provides JDBC driver to leverage existing BI reporting tools #MMTM15 #INFOARCHIVE #EMCWORLD 51