Highlight File Format ADOBE SYSTEMS INCORPORATED



Similar documents
Deploying Adobe Acrobat Using Microsoft Windows Terminal Service ADOBE SYSTEMS INCORPORATED

Deploying Adobe Acrobat Using Microsoft Windows Terminal Service ADOBE SYSTEMS INCORPORATED

Adobe Acrobat 9 Deployment on Microsoft Systems Management

bbc Adobe PDF/XML Architecture - Working Samples

bbc Parameters for Opening PDF Files Adobe Acrobat SDK June 2008 Version 9.0

Adobe Acrobat 9 Deployment on Microsoft Windows Group Policy and the Active Directory service

User Document. Adobe Acrobat 7.0 for Microsoft Windows Group Policy Objects and Active Directory

AvePoint Record Rollback for Microsoft Dynamics CRM

PDFSealer User s Guide. ITEKSOFT Corporation Copyright All rights reserved

A Adobe RGB Color Space

Adobe Systems Implementation of Black Point Compensation

AvePoint CRM Migration Manager for Microsoft Dynamics CRM. Release Notes

How to Configure Access Control for Exchange using PowerShell Cmdlets A Step-by-Step guide

Ver USERS MANUAL

AvePoint SearchAll for Microsoft Dynamics CRM

Software Support Registration workflow

Download and Install Crystal Reports for Eclipse via the Eclipse Software Update Manager

bbc Developer FAQ Adobe Acrobat SDK November 2006 Version 8.0

CA Workload Automation Agent for Microsoft SQL Server

Getting started with Adobe Reader

EMC SourceOne Auditing and Reporting Version 7.0

Microsoft Small Business Financials. Small Business Center Integration

Aras Innovator Internet Explorer Client Configuration

ADOBE ACROBAT CONNECT ADD-IN FOR MICROSOFT OUTLOOK USER GUIDE

formerly Help Desk Authority Quest Free Network Tools User Manual

Computer Information Concepts. CIC Data Health Check 3.0

AvePoint Record Rollback for Microsoft Dynamics CRM

AvePoint SearchAll for Microsoft Dynamics CRM

Client Administrator Quick Reference Guide

InSite Prepress Portal

Deploying the Workspace Application for Microsoft SharePoint Online

RedBlack CyBake Online Customer Service Desk

Abila Nonprofit Online. Connection Guide

Product Development. Using Critical Path EVM for the Microsoft Project Desktop Application Readme

Quest SQL Optimizer 6.5. for SQL Server. Installation Guide

Employee Quick Reference Guide

formerly Help Desk Authority HDAccess User Manual

Unified Infrastructure Management Compatibility Matrix April 4, 2016

CA Nimsoft Monitor. Probe Guide for NT Event Log Monitor. ntevl v3.8 series

NCD ThinPATH Load Balancing Startup Guide

Pipeliner CRM Phaenomena Guide Add-In for MS Outlook Pipelinersales Inc.

TIBCO MFT BusinessWorks MFT Palette

Wimba Create. Version 2.5. Installation Guide

Pipeliner CRM Phaenomena Guide Opportunity Management Pipelinersales Inc.

8.7. Resource Kit User Guide

Utilities for downloading and uploading OO ABAP classes in XML format

StreamLink 5.0. StreamLink Configuration XML Reference. November 2009 C O N F I D E N T I A L

Citrix Access Gateway Enterprise Edition Citrix Access Gateway Plugin for Java User Guide. Citrix Access Gateway 8.1, Enterprise Edition

Nimsoft Monitor Compatibility Matrix October 17, 2013

Pipeliner CRM Phaenomena Guide Getting Started with Pipeliner Pipelinersales Inc.

bbc Developing Plug-ins and Applications Adobe Acrobat SDK April 2007 Version 8.1

System Requirements and Platform Support Guide

How To Configure MDM to Work with Oracle ASM-Based Products

Acrobat JavaScript Scripting Guide ADOBE SYSTEMS INCORPORATED

Microsoft Dynamics GP. Engineering Data Management Integration Administrator s Guide

Business Portal for Microsoft Dynamics GP. Requisition Management User s Guide Release 10.0

bbc Understanding the Differences Between Static and Dynamic PDF Forms Adobe LiveCycle Designer ES July 2008 Version 8.2

Technical Brief for Windows Home Server Remote Access

Ricoh HotSpot. Printing App. User s Guide. HotSpot Printing App supports:

Debugging Portal Applications

Security Analytics Engine 1.0. Help Desk User Guide

System Requirements for Microsoft Dynamics NAV 2013 R2

Data Archiving in CRM: a Brief Overview

Alpha Motherboards Software Developer s Kit Read Me First

Labour Market Programs Support System. LaMPSS Computer Compatibility Guide

IBM SmartCloud Analytics - Log Analysis. Anomaly App. Version 1.2

SQL Server 2005 Reporting Services (SSRS)

Microsoft Dynamics NAV

Adobe Acrobat X Pro Creating & Working with PDF Documents

Pipeliner CRM Phaenomena Guide Sales Pipeline Management Pipelinersales Inc.

DeviceAnywhere Automation for Smartphones Setup Guide Windows Mobile

Telephony System Integrator s Guide for ShoreTel. Citrix EasyCall Gateway 3.0

Operating Instructions Software (Device Monitor)

Foglight Experience Monitor and Foglight Experience Viewer

Installation and User Guide. Leaf Capture Remote Version 2.1.1

Policy Based Encryption E. Administrator Guide

Policy Based Encryption E. Administrator Guide

Sage CRM 2016 R1 Patch Release Notes. Revision: SYS-REA-ENG Updated: June 2016

AvePoint Tags 1.1 for Microsoft Dynamics CRM. Installation and Configuration Guide

WHAT S NEW EXPENSE MANAGEMENT SERVICE. US Administrator s Guide IN YOUR. Release 3.1

Microsoft Dynamics GP. Payroll Connect

Quest Privilege Manager Console Installation and Configuration Guide

etrust Audit Using the Recorder for Check Point FireWall-1 1.5

Dell NetVault Backup Plug-in for Advanced Encryption 2.2. User s Guide

Patch Assessment Content Update Release Notes for CCS Version: Update

An Oracle White Paper May Creating Custom PDF Reports with Oracle Application Express and the APEX Listener

Business Portal for Microsoft Dynamics GP. Electronic Document Delivery Release 10.0

Lab Answer Key for Module 6: Configuring and Managing Windows SharePoint Services 3.0. Table of Contents Lab 1: Configuring and Managing WSS 3.

Adobe Acrobat 9 Digital Signatures, Changes and Improvements

bbc Installing Your Development Environment Adobe LiveCycle ES July 2007 Version 8.0

Tivoli Endpoint Manager for Security and Compliance Analytics. Setup Guide

Defender Delegated Administration. User Guide

Transcription:

bbc Highlight File Format Technical Note #5172 Version : Acrobat 6.0 ADOBE SYSTEMS INCORPORATED Corporate Headquarters 345 Park Avenue San Jose, CA 95110-2704 (408) 536-6000 http://partners.adobe.com July 2003

Copyright 2003 Adobe Systems Incorporated. All rights reserved. NOTICE: All information contained herein is the property of Adobe Systems Incorporated. No part of this publication (whether in hardcopy or electronic form) may be reproduced or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written consent of the Adobe Systems Incorporated. PostScript is a registered trademark of Adobe Systems Incorporated. All instances of the name PostScript in the text are references to the PostScript language as defined by Adobe Systems Incorporated unless otherwise stated. The name PostScript also is used as a product trademark for Adobe Systems implementation of the PostScript language interpreter. Except as otherwise stated, any reference to a PostScript printing device, PostScript display device, or similar item refers to a printing device, display device or item (respectively) that contains PostScript technology created or licensed by Adobe Systems Incorporated and not to devices or items that purport to be merely compatible with the PostScript language. Adobe, the Adobe logo, Acrobat, the Acrobat logo, Acrobat Capture, Distiller, PostScript, the PostScript logo and Reader are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States and/or other countries. Apple, Macintosh, and Power Macintosh are trademarks of Apple Computer, Inc., registered in the United States and other countries. PowerPC is a registered trademark of IBM Corporation in the United States. ActiveX, Microsoft, Windows, and Windows NT are either registered trademarks or trademarks of Microsoft Corporation in the United States and/or other countries. UNIX is a registered trademark of The Open Group. All other trademarks are the property of their respective owners. This publication and the information herein is furnished AS IS, is subject to change without notice, and should not be construed as a commitment by Adobe Systems Incorporated. Adobe Systems Incorporated assumes no responsibility or liability for any errors or inaccuracies, makes no warranty of any kind (express, implied, or statutory) with respect to this publication, and expressly disclaims any and all warranties of merchantability, fitness for particular purposes, and noninfringement of third party rights.

Contents Highlight File Format.................................. 5 Introduction........................................................ 5 File Format......................................................... 5 URL Format......................................................... 7 Debugging Highlight Files............................................... 7 Highlight File Format 3

Contents 4 Highlight File Format

Highlight File Format Introduction This technical note describes the file format and URL specification that allows a Web server to highlight text in a PDF file being displayed by version 3.0 and above of the Adobe Acrobat viewers (Acrobat and Reader). To use the highlight file, the Acrobat viewer must include the highlight plug-in and the External Window Handler (EWH) plug-in. In addition, the Web browser must use the Adobe Acrobat plug-in for Netscape Navigator any Web browser that accepts this Netscape Navigator plug-in can be used. Highlight files can also be used with the ActiveX control for Internet Explorer 4.0 and later. The highlight file is an ASCII file that is downloaded separately by the Web browser and used by Acrobat to highlight words in the PDF file. If you received this technical note without obtaining the entire Acrobat Software Development Kit (SDK), you can get the complete SDK by visiting: http://partners.adobe.com/asn/developer/acrosdk/main.html File Format Although the file begins with <XML> and is specified in the URL with #xml, the file format is not true XML and only supports the syntax and keywords as described in this document. The highlight file must begin with the line: <XML> followed by the line: <Body units=[characters words ] version=vernum> In this line, a vertical bar ( ) within brackets separates choices from which one and only one must be used. The brackets should not be included in the highlight file. The value of the units key indicates whether highlights are given as character offsets on the page (characters) or as word offsets on the page (words). N OTE: Because of a bug in the word finding algorithm of the 3.01 Viewer (Reader and Exchange), it is strongly suggested that character offsets be used in the highlight file. If word offsets are used, incorrect words may be highlighted. This occurs when the word offsets are calculated using other (non 3.01) versions of the viewer, the text extraction toolkits or the PDF Library and then used to highlight text in PDF Highlight File Format 5

Highlight File Format File Format files displayed in the 3.01 viewers. For a CJK text, it is necessary to use a character offset. The value of the version key is used when units is set to words. It indicates the version of the Acrobat viewer s word finding algorithm for which word offsets are specified. Because the characters that make up a word may not be stored consecutively in the PDF file, an algorithm must be used to find words; different versions of the algorithm (corresponding to different versions of Acrobat) may give different word offsets. The version for Acrobat 3.0, 3.01, and 4.0 is 2. N OTE: Previous versions of the Highlight File Format documented two other keys in this section of the file. These were color and mode. Color was documented as specifying the color of the highlight and mode was documented as determining whether the viewer would go to the first page containing highlighted text when opening the PDF file. This functionality was never implemented and therefore documentation of these keys has been removed. The highlights will always use the highlight color as set by the operating system and the viewer will always display the page containing the first highlighted text. The next line must be: <Highlight> followed by one or more highlight ranges. Each highlight range is a line of the form: <loc pg=p pos=o len=l> The value of pg is an integer that specifies the page on which the highlight is located. Pages are numbered sequentially, with the first page in a file having a page number of zero. The value of pos is an integer that specifies the offset of the highlight on the page. Offsets are specified either in words or characters, depending on the value of units in the Body line described previously. The offset of the first character/word on a page is zero. The value of len is an integer that specifies the number of words or characters to highlight. Note that version 3.0, 3.01 and 4.0 of the Acrobat viewer only support highlighting of entire words; the entire word is highlighted if at least one of its characters are highlighted. Following the list of highlight ranges are the following lines: </Highlight> </Body> </XML> The example below shows a sample a highlight file. This file contains two highlights, specified as character offsets (units=characters). The Acrobat viewer will automatically display the first page that contains a highlight. No word finding algorithm is specified because the file uses character offsets. The first highlight range contains 10 characters, beginning with the first character on the document s first page. The second 6 Highlight File Format

Highlight File Format URL Format highlight range contains two characters, beginning with the fifth character on the document s second page. <XML> <Body units=characters> <Highlight> <loc pg=0 pos=0 len=10> <loc pg=1 pos=4 len=2> </Highlight> </Body> </XML> URL Format The URL that specifies the PDF file must also specify the highlight file, as illustrated in the following example: http://www.adobe.com/a.pdf#xml=http://www.adobe.com/a.txt The URL consists of two pieces, separated by a pound sign (#). The first part of the URL specifies the PDF file (a.pdf in this example). The second part specifies the highlight file (a.txt) and the auxiliary data handler to which its contents should be sent (xml, the name of the highlight plug-in s auxiliary data handler). Debugging Highlight Files If you have a Web server, the easiest and most thorough way to debug a highlight file is to place the files on your server and specify an extended URL. You can, however, write a simple plug-in for Acrobat which will pass a highlight file to the highlight plug-in. Because this method bypasses the EWH plug-in, it does not enable the Next Highlight and Previous Highlight toolbar buttons. The following code reads in data from a highlight file and uses it to highlight text in the currently displayed document. { ASBool status; AVDoc avdoc = AVAppGetActiveDoc(); FILE *stream; char auxdata[255]; ASInt32 numchar; } char * filename = "C:\\test.txt"; stream = fopen (filename, "r"); numchar = fread (auxdata, sizeof(char),255,stream); status = AVDocSendAuxData (avdoc, ASAtomFromString ("xml"), auxdata, numchar); Highlight File Format 7

Highlight File Format Debugging Highlight Files 8 Highlight File Format