Committee on WIPO Standards (CWS)



Similar documents
Module 1. Sequence Formats and Retrieval. Charles Steward

EGYPTIAN PATENT OFFICE

Biological Sequence Data Formats

Annex I Common Requirements for All Types of Documents

Bioinformatics Resources at a Glance

Regulations under the Patent Cooperation Treaty. (as in force from July 1, 2015)

Agreement. between the Indian Patent Office and the International Bureau of the World Intellectual Property Organization

XML. CIS-3152, Spring 2013 Peter C. Chapin

Sharing Data from Large-scale Biological Research Projects: A System of Tripartite Responsibility

PATENT COOPERATION TREATY (PCT)

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

M The Nucleus M The Cytoskeleton M Cell Structure and Dynamics

A Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Standard Registry Development and Publication Process


STATE INTELLECTUAL PROPERTY OFFICE OF THE PEOPLE S REPUBLIC OF CHINA

Annex 6: Nucleotide Sequence Information System BEETLE. Biological and Ecological Evaluation towards Long-Term Effects

VIRTUAL LABORATORY: MULTI-STYLE CODE EDITOR

BIO 3350: ELEMENTS OF BIOINFORMATICS PARTIALLY ONLINE SYLLABUS

Self Testing and Product Qualification Processes

Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness

Table 1: General overview of the PCT procedure Legend:

Chapter 1800 Patent Cooperation Treaty

STATUTE OF THE POLISH ACCREDITATION COMMITTEE

Theme 3: Filing a PCT Application Electronically

SRA File Formats Guide

INDIAN PATENT OFFICE

The Role of Intellectual Property Offices (IPOs) in Promoting Innovation, Business Competitiveness and Economic Growth

SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications

Patentability of inventions in the field of food industry: search and examination practice at the EPO

Joint Position on the Disclosure of Clinical Trial Information via Clinical Trial Registries and Databases 1 Updated November 10, 2009

Module 10: Bioinformatics

Biological Databases and Protein Sequence Analysis

Training for Patent Professionals EQE BASIC LEGAL DISTANCE LEARNING

INTELLECTUAL PROPERTY OFFICE (SERBIA)

U.S. FDA Title 21 CFR Part 11 Compliance Assessment of SAP Records Management

EMBL-EBI Web Services

BIOLOGY 3IR3 / MOLECULAR BIOLOGY 3I03 Independent Research Project

Agreement. (as in force from July 1, 2014)*

EUROPASS DIPLOMA SUPPLEMENT

Debian Med. Integrated software environment for all medical purposes based on Debian GNU/Linux. Andreas Tille. OSWC, Malaga Debian.

NEMA Standards Publication PS 3 Supplement 41. Digital Imaging and Communications in Medicine (DICOM) Digital Signatures

Simplify the HR systems integration and provide a single version of employee data. C

Definition of Minimum Performance Requirements for Analytical Methods of GMO Testing European Network of GMO Laboratories (ENGL)

SEARCHING BIOTECHNOLOGY INFORMATION IN THE 2010s: Section II (Databases & Search Strategies)

SRCSB General Web Development Policy Guidelines Jun. 2010

ASF Commission Affacturage 02/08/2011 BUSINESS JUSTIFICATION

KEY BIOTECHNOLOGY INDICATORS (December 2011)

Hundred and seventy-fourth session DIRECTIVES CONCERNING THE USE OF THE NAME, ACRONYM, LOGO AND INTERNET DOMAIN NAMES OF UNESCO SUMMARY

TABLE OF CONTENTS. CHAPTER 1: THIS GUIDE AND ITS ANNEXES Introduction CHAPTER 2: WHAT IS THE PCT?

SUBJECT: New Features in Version 5.3

COMMISSION STAFF WORKING DOCUMENT. Elements of the Union greenhouse gas inventory system and the Quality Assurance and Control (QA/QC) programme

ETSI TS V1.1.1 ( ) Technical Specification

REGULATION (EEC) No 2309/93

Course Specification

ectd Digital Handbook Table of Contents

Invenio: A Modern Digital Library for Grey Literature

Credible IP Solutions

Global Networking of Collections WFCC and GBRCN perspectives. EMbaRC Seminar David Smith Cantacuzino Institute, Bucharest, Romania 8-9 March 2010

EUROPEAN COMMISSION Directorate-General for Research & Innovation. Guidelines on Data Management in Horizon 2020

[ENGLISH TRANSLATION BY WIPO] Spanish Patent and Trademark Office, Ministry of Industry, Energy and Tourism

INTELLECTUAL PROPERTY CORPORATION OF MALAYSIA

ETSO Modelling Methodology for the Automation of Data Interchange of Business Processes (EMM)

Standards Designation and Organization Manual

Compliance Response Edition 07/2009. SIMATIC WinCC V7.0 Compliance Response Electronic Records / Electronic Signatures. simatic wincc DOKUMENTATION

INVENTION OFFICE OF THE DEMOCRATIC PEOPLE S REPUBLIC OF KOREA

STN Frequently Asked Questions

Managing XML Documents Versions and Upgrades with XSLT

Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM

The Intelligent Content Framework

TENDER INSTRUCTIONS Consulting services and auction software in connection with the 1800 MHz auction. 23 March 2015 Case no.

How to Read a Biotech Patent

PCT-SAFE User Reference Manual - PCT- EASY Mode

DTD Tutorial. About the tutorial. Tutorial

EURASIAN PATENT OFFICE (EAPO)

Cataloguing of Articles in Periodicals : Database Approach Model in A Biomedical Library

APIS IT d.o.o. Fiscalization technical aspects

Configuring Firewalls An XML-based Approach to Modelling and Implementing Firewall Configurations

Global Dossier Active Components JPO s Prospects on Active Component. 2 nd GDTF Meeting January 2015 JAPAN PATENT OFFICE

Transcription:

E CWS/1/5 ORIGINAL: ENGLISH DATE: OCTOBER 13, 2010 Committee on WIPO Standards (CWS) First Session Geneva, October 25 to 29, 2010 PROPOSAL FOR THE PREPARATION OF A NEW WIPO STANDARD ON THE PRESENTATION OF NUCLEOTIDE AND AMINO ACID SEQUENCE LISTINGS USING EXTENSIBLE MARKUP LANGUAGE (XML) Document prepared by the Secretariat 1. On September 17, 2010, the European Patent Office (EPO) sent a document to the Secretariat requesting the initiation by the Committee on WIPO Standards (CWS) at its first session in October 2010 of discussions on a proposal for a new standard on the presentation of nucleotide and amino acid sequence listings based on extensible Markup Language (XML). The request by the EPO is reproduced in the Annex to this document. 2. In accordance with the request by the EPO, the Secretariat proposes the following for consideration and approval by the CWS: (a) the creation of a new task whose description would read as follows: Prepare a recommendation on the presentation of nucleotide and amino acid sequence listings based on extensible Markup Language (XML) for adoption as a WIPO standard. The proposal of the new WIPO standard should be presented along with a report on the impact of the said standard on the current WIPO Standard ST.25, including the proposed necessary changes to Standard ST.25 ; (b) (c) the establishment of a new Task Force, with its corresponding Task Force Leader, to handle the new task; and a request that the Task Force present the proposal of the new WIPO standard and necessary changes to Standard ST.25 for consideration and approval by the CWS at its second session to be held in 2011.

page 2 3. The CWS is invited to: (a) note the request by the European Patent Office for a new WIPO standard as reproduced in the Annex to this document; (b) consider and approve the proposal concerning the creation of the task and corresponding time frame referred to in paragraphs 2(a) and (c), above; and (c) consider and approve the establishment of the new Task Force, with its corresponding Task Force Leader, referred to in paragraph 2(b), above; [Annex follows]

ANNEX REQUEST BY THE EPO TO ESTABLISH A WIPO TASK FORCE EPO proposal for a new standard for the presentation of sequence listings based on XML format BACKGROUND AND REQUEST 1. Biological sequences in patent applications are disclosed as sequence listings. Sequence listings are currently presented in accordance with WIPO Standard ST.25 both within the framework of the PCT (Annex C of the Administrative Instructions) and under most national or regional patent procedures. 2. The EPO is of the view that, for a number of technical and practical reasons, WIPO Standard ST.25 should be replaced, or at least supplemented, by a new standard based on XML format. Such new standard would mitigate the shortcomings of WIPO Standard ST.25 and provide additional advantages for both applicants and Offices since the drafting and submission of high quality sequence listings will enable more efficient downstream processes. 3. The EPO intends to submit its proposal for a new WIPO standard by the end of 2010 for adoption in 2011. The EPO therefore hereby requests that a specific Task Force be established by the Committee on WIPO Standards (CWS) with the mandate to draft the new standard based on the EPO s proposal. In support to the present request, the EPO wishes to provide the preliminary information, below. PROBLEMS ENCOUNTERED WITH THE CURRENT WIPO STANDARD ST.25 4. It has been years since WIPO Standard ST.25 was approved whilst life sciences is a fast changing technical domain. WIPO Standard ST.25 is a format unique to the patent world. It is unfamiliar for inventors working directly with biological sequences, who are generally more familiar with the presentation of sequences according to the formats of public repositories. Hence it is unfeasible to leverage open source tools with WIPO Standard ST.25. 5. WIPO Standard ST.25 does not express the latest scientific requirements, e.g.: There are more location types supported by public database providers (European Molecular Biology Laboratory (EMBL), DNA Databank of Japan (DDBJ), National Center for Biotechnology Information (NCBI)) than by WIPO Standard ST.25. Applicants must find out unconventional and frequently inaccurate ways to annotate sequences. Features keys and qualifiers (controlled vocabularies used to depict characteristics of sequences) used in WIPO Standard ST.25 are not in use by the world sequence repositories. Some of the International Union of Pure and Applied Chemistry (IUPAC) standard abbreviations are also not supported.

Annex, page 2 6. WIPO Standard ST.25 is error-prone. As it is designed to be both human and computer readable, it is easy to make mistakes, and those are difficult to detect. The relative ease of creating a sequence listing by non-dedicated software, such as text editors, leads to many errors, e.g.: invalid sequences, wrong number of sequences, invalid organism names, unauthorized characters, inadequate features, syntax and scientific information incorrectness due to the complexity of the so called mixed mode presentation of biological sequences. 7. It follows that public database providers drop the information provided by applicants and regenerate it. Offices which provide sequence listings to public repositories spend therefore much time and efforts to correct the data. THE PROPOSED NEW STANDARD 8. The new standard proposed by the EPO would solve the inconveniences faced with WIPO Standard ST.25 and help to foster the quality of sequences in patent and public sequence databases. It is also noted that WIPO ST.36 recommends the usage of industry standards 1. 9. The new standard will therefore have the following features: Universality universality: a common format for patent and non patent communities, robustness and flexibility: an XML format based on an agreed DTD. 10. A sequence listing consists in a general information part depicting information related to the patent application and a sequence part consisting in a variable number of biological sequences. 11. The public data providers, which form the International Nucleotide Sequence Database Collaboration (INSDC) (consisting of the European Bioinformatics Institute (EBI), DDBJ, NCBI), have designed an XML based format for the exchange of sequences and related information called INSDSeq. This format was designed to facilitate the addition of the relevant patent related information. 1 See 93 of WIPO Standard ST.36: Where appropriate to the content of a document, that is, where the content is not unique to the industrial property domain, use industry-standard DTDs.

Annex, page 3 12. The new standard for the presentation of sequence listings should therefore be using INSDSeq to disclose the sequence information and amend it with patent related information (general information of sequence listing) in accordance with WIPO ST.36. The aim is to ensure the synchronization of the sequence part of WIPO s sequence listing format with INSDSeq. 13. In effect, the new standard based on XML format would be more user-friendly as scientists would be using almost the same format for both patent applications and submission to public databases with no need to make conversions. The mere familiarity with the syntax of the standard could in itself lead to fewer errors. Robustness and flexibility 14. The syntax of the new standard provided by the DTD will be both more precise and easier to verify by means of automatic tools. It could be verified by a vast set of existing freeware. 15. The contents of an XML file, while more complicated for humans to read without a style sheet, is actually easier to manipulate by computers, and there are extensive libraries for this purpose. STATUS 16. A draft DTD is under discussion between the Trilateral Offices. The agreed DTD will form the basis of the proposal. 17. The EPO has developed, in cooperation with the EBI, a client software for the submission of biological sequences (Biological Sequence Submission Application for Patents (BiSSAP)) which includes a verification module and which contains an option to create XML files. This software is therefore both WIPO Standard ST.25 and INSDSeq / WIPO ST.36 XML compliant. The software has been tested by European users in August with positive results. The time schedule for implementation and the making available of this new tool to the public is currently under discussion at the EPO. [End of Annex and of document]