DELETE DUPLICATE EMAILS IN THE EMC EMAILXTENDER ARCHIVE SYSTEM USING THE MSGIDCRACKER UTILITY




White Paper

Abstract

This white paper describes the process of using the EmailXtender Customized MsgIdCracker Utility to delete duplicate emails from the EMC EmailXtender archiving system.

July 2011

Copyright 2011 EMC Corporation. All Rights Reserved.

EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice.

The information in this publication is provided "as is". EMC Corporation makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose.

Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.

For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com. All trademarks used herein are the property of their respective owners.

Part Number H8837

Table of Contents

Executive summary
    Audience
Pre-requisites for using the MsgIDCracker utility
Risk Factors
Using the MsgIDCracker utility
Conclusion
References

Executive summary

The information provided in this white paper will help you use the MsgIDCracker utility to delete duplicate email messages in the EMC EmailXtender archiving system. This paper also includes information about the consequences of deleting duplicates in a production environment.

If you have enabled the Cached Exchange Mode feature in Microsoft Office Outlook, and if you archive email messages in the Sent Items folder using the EmailXtract tool, a duplicate copy of each message is archived. Use the customized MsgIDCracker utility to clean up duplicates after performing the archive operation. For additional information about the EmailXtender product and its utilities, see the relevant product documentation.

Audience

This white paper is intended for customers who want to delete duplicate email messages in their email archiving system. It also describes the risk factors involved in deleting duplicates in the production environment.

Pre-requisites for using the MsgIDCracker utility

Perform the following steps before you use the MsgIDCracker utility:

1. Back up the container files associated with duplicate messages. The backup lets you roll the files back to their original state if inconsistencies occur.

2. Identify the date range of duplicate messages. Use the following SQL query to retrieve the duplicate messages, replacing <table name> with the EmailXtender database table that holds these columns:

    SELECT MD5HashKey, TrackingID, TimeStamp, MsgDate
    FROM <table name>
    WHERE TrackingID IN (SELECT TrackingID
                         FROM <table name>
                         GROUP BY TrackingID
                         HAVING COUNT(TrackingID) > 1)
    GROUP BY MD5HashKey, TrackingID, TimeStamp, MsgDate
    ORDER BY TimeStamp

3. Save the result of the query from Step 2 to a .csv file. Use Microsoft Office Excel to sort the messages by MsgDate to obtain the beginning and end dates (the date range) of the duplicate messages.

4. Using the EmailXtract tool, restore all shortcuts associated with duplicate messages (Journal and Sent Items) within the date range obtained in Step 3.
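As an alternative to sorting in Excel, the date range in Step 3 could be read from the exported file with a short script. The following is a minimal sketch, not part of the EmailXtender tooling. It assumes that the .csv file has a header row with the column names used in the query and that MsgDate is exported as YYYY-MM-DD HH:MM:SS; the function name and the sample rows are hypothetical.

```python
import csv
import io
from datetime import datetime

# Assumed MsgDate layout in the exported file; adjust it to match your export.
MSGDATE_FORMAT = "%Y-%m-%d %H:%M:%S"


def duplicate_date_range(csv_file):
    """Return (earliest, latest) MsgDate found in the exported query result."""
    reader = csv.DictReader(csv_file)
    dates = [
        datetime.strptime(row["MsgDate"].strip(), MSGDATE_FORMAT)
        for row in reader
    ]
    if not dates:
        raise ValueError("no rows found in the exported result")
    return min(dates), max(dates)


# Made-up sample rows standing in for the exported .csv file.
sample = io.StringIO(
    "MD5HashKey,TrackingID,TimeStamp,MsgDate\n"
    "111,9001,1300000200,2011-03-14 09:15:00\n"
    "222,9001,1300000100,2011-03-14 09:15:00\n"
    "333,9002,1300500000,2011-04-02 17:40:00\n"
)
start, end = duplicate_date_range(sample)
print("Duplicates range from", start, "to", end)
```

For a real export, pass an open file object, for example `open("duplicates.csv", newline="")`, in place of the in-memory sample.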

Risk Factors

Running the MsgIDCracker utility has the following risks associated with it:

- Archived Sent Items have all users set as owners of the messages, including Bcc users. If you delete duplicate messages archived from the Sent Items folder, you may lose Bcc information. When users who were blind carbon copied search for their messages, no message is returned in the search results.

- If a user performs the Extract operation on the Sent Items folder after the messages from the Sent Items folder are deleted from the archive, duplicate messages are created again.

Using the MsgIDCracker utility

Perform the following steps to use the MsgIDCracker utility:

1. Execute the following query to obtain a list of duplicate messages, replacing <table name> with the EmailXtender database table that holds these columns:

    CREATE TABLE DuplicateRecords (
        TrackId bigint NOT NULL PRIMARY KEY,
        MD5Hash bigint,
        TimeStm int,
        MsgDt   datetime
    );

    -- Sort by TrackingID first so that rows sharing a TrackingID are adjacent,
    -- newest TimeStamp first. The loop below keeps the first row of each group.
    DECLARE mycur CURSOR GLOBAL FOR
        SELECT MD5HashKey, TrackingID, TimeStamp, MsgDate
        FROM <table name>
        WHERE TrackingID IN (SELECT TrackingID
                             FROM <table name>
                             GROUP BY TrackingID
                             HAVING COUNT(TrackingID) > 1)
        GROUP BY MD5HashKey, TrackingID, TimeStamp, MsgDate
        ORDER BY TrackingID, TimeStamp DESC;

    DECLARE @MD5Key bigint, @Trackid bigint, @Tstamp int,
            @PreTrackid bigint, @MsgDate datetime;
    SET @PreTrackid = -1;

    OPEN mycur;
    FETCH NEXT FROM mycur INTO @MD5Key, @Trackid, @Tstamp, @MsgDate;
    WHILE (@@FETCH_STATUS = 0)
    BEGIN
        -- Record one row per TrackingID.
        IF (@PreTrackid != @Trackid)
        BEGIN
            INSERT INTO DuplicateRecords (TrackId, MD5Hash, TimeStm, MsgDt)
            VALUES (@Trackid, @MD5Key, @Tstamp, @MsgDate);
            SET @PreTrackid = @Trackid;
        END
        FETCH NEXT FROM mycur INTO @MD5Key, @Trackid, @Tstamp, @MsgDate;
    END

    SELECT MD5Hash, TimeStm FROM DuplicateRecords;

    CLOSE mycur;
    DEALLOCATE mycur;

2. Save the result in the duplicates.txt file.

3. Create the C:\CleanUpActivity folder.

4. Execute the delivered utility as follows at the command prompt:

    CmdPrompt:\> <Utility path\MsgIDCracker.exe> /c <File Path\duplicates.txt>

   After you execute this command, 0 KB files with the .delreq extension are created in the C:\CleanUpActivity folder.

5. Back up this folder to use as a reference for the deleted messages.

6. Move all .delreq files from the C:\CleanUpActivity folder to the EX Installed Dir\Archive Deletion folder. All duplicate messages are deleted.

Conclusion

This paper describes the MsgIDCracker solution that customers can use to clean up their email archiving production environment. It also helps customers understand the consequences of deleting duplicate email messages.

References

The Powerlink website (http://powerlink.emc.com) contains the downloadable packages for EmailXtender Archiving Solution product versions, along with the release notes and other relevant documentation associated with each product version.

To locate product documentation, navigate to Support > Technical Documentation and Advisories > Software ~ D ~ Documentation, then select the product name and version number.

Note: Most of the Content Management products are listed under Software D > Documentum ?, where ? = a letter, or letters, in the alphabet. Product documentation that is available online from the application (such as online help) does not appear as a separate item. It is automatically downloaded and installed with the software.
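Supplementary sketch for steps 5 and 6 of "Using the MsgIDCracker utility"

The backup and move in steps 5 and 6 could be scripted so that the backup is always taken before any request file is moved. The following is a minimal sketch under stated assumptions, not an EMC-delivered tool: the function name, the backup folder name, and the sample file names are hypothetical, and the demonstration runs in a temporary directory instead of C:\CleanUpActivity and the Archive Deletion folder.

```python
import shutil
import tempfile
from pathlib import Path


def stage_delete_requests(cleanup_dir, backup_dir, deletion_dir):
    """Back up cleanup_dir, then move its .delreq files into deletion_dir."""
    cleanup_dir = Path(cleanup_dir)
    backup_dir = Path(backup_dir)
    deletion_dir = Path(deletion_dir)

    # Step 5: copy the whole folder first as a record of what will be deleted.
    # copytree fails if backup_dir already exists, so a backup is never overwritten.
    shutil.copytree(cleanup_dir, backup_dir)

    # Step 6: move every request file into the archive deletion folder.
    moved = []
    for request in sorted(cleanup_dir.glob("*.delreq")):
        shutil.move(str(request), str(deletion_dir / request.name))
        moved.append(request.name)
    return moved


# Demonstration with made-up file names in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    cleanup = root / "CleanUpActivity"
    backup = root / "CleanUpActivity_backup"
    deletion = root / "Archive Deletion"
    cleanup.mkdir()
    deletion.mkdir()
    for name in ("a.delreq", "b.delreq"):
        (cleanup / name).touch()  # 0 KB files, like those the utility creates

    moved = stage_delete_requests(cleanup, backup, deletion)
    remaining = sorted(p.name for p in cleanup.iterdir())
    backed_up = sorted(p.name for p in backup.iterdir())
    staged = sorted(p.name for p in deletion.iterdir())
    print("moved:", moved)
```

Copying before moving means the backup folder still lists every request file after the originals have left the working folder.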