Basic Course on Bioinformatics tools for Next Generation Sequencing data mining 11-12 June, 2015 Istituto Superiore di Sanità, SIDBAE Training Room



Similar documents
EU Reference Laboratory for E. coli Department of Veterinary Public Health and Food Safety Unit of Foodborne Zoonoses Istituto Superiore di Sanità

Typing in the NGS era: The way forward!

Use of Whole Genome Sequencing (WGS) of food-borne pathogens for public health protection

Molecular typing of VTEC: from PFGE to NGS-based phylogeny

VTEC Network preparedness for molecular typing data collection

FLURISK Development of a risk assessment methodological framework for potentially pandemic influenza strains. Ilaria Capua

2 Short biographies and contact information of the workshop organizers

Use of Whole Genome. of food-borne pathogens for public health protection. Efsa Scientific Colloquium Summary Report June 2014, Parma, Italy

Whole genome sequencing of foodborne pathogens: experiences from the Reference Laboratory. Kathie Grant Gastrointestinal Bacteria Reference Unit

Experimental Regulatory Audit and National Seminar on standards and criteria for the inspection of blood establishments Ancona, 12 th 13 th May 2010

4th Workshop Rapid NGS for Clinical, Public Health, and Food Microbiology

Demographic indicators

Fiscal federalism in Italy at a glance

REAL TIME SURVEILLANCE STATION OF DRINKING WATER QUALITY IN THE PROVINCE OF TERAMO (ABRUZZO REGION - ITALY)

Europass curriculum vitae. Personal information

Identification and Characterization of Foodborne Pathogens by Whole Genome Sequencing: A Shift in Paradigm

Workshop Rapid NGS for Public Health Microbiology

Building Bioinformatics Capacity in Africa. Nicky Mulder CBIO Group, UCT

CALL FOR DATA ON VEROTOXIGENIC ESCHERICHIA COLI (VTEC) / SHIGATOXIGENIC E. COLI (STEC)

LIQUID BIOPSY: TRACKING CANCER

Avian Influenza: the moving target

Kickoff Meeting of the Italian Medical Council

Next Generation Sequencing; Technologies, applications and data analysis

Università Politecnica delle Marche

Scientific Programme

Rifiuti tra crescita e decrescita

Delegated to CNR on December 23rd, New synchronous registration system from September 28 th, 2009

workshop The challenge of Bio-districts during the programming period

MENU 2013 SECOND CIRCULAR

Europass curriculum vitae

NATI PER LEGGERE. Nati per leggere: A national programme to enhance literacy and health in small children through reading aloud

The Moroccan American Pharmaceutical Sciences & Education Network Group (PharMaSeng), University Hassan II MohammediaCasablanca & FST Mohammedia

Open Source Intelligence Dissemination Conference, Rome, Wednesday 8 th July 2015

General Services Administration Federal Supply Service Authorized Federal Supply Schedule Price List

5 February Bari, Italy

Next Generation Sequencing; Technologies, applications and data analysis

Next Generation Sequencing; Technologies, applications and data analysis

SCIENTIFIC REPORT OF EFSA

Updated ECDC Public Health Microbiology Strategy and Work Plan

Local Organizing Committee Geremia Gios, Chair Roberta Raffaelli Sandra Notaro

Call for proposals to host. the. ECSITE ANNUAL CONFERENCE 2018 or ECSITE ANNUAL CONFERENCE 2019

Master in Health Services Management

Delivering the power of the world s most successful genomics platform

International Symposium on. Enterococci in Foods. Functional and Safety Aspects. 2nd Announcement and Call for Papers

Best regards President of Italian Dance Sport Federation Christian Zamblera

PRESENTATION ITALIAN PRISON SYSTEM. Italia, LPPS November 2010

Soil Biological Communities and Aboveground Resilience

29 January Rome, Italy

3 rd National Conference on Science and Technology

Traineeships Regulation in Italy after the Fornero Labour Market Reform

Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers

Establishing a Whole Genome Sequence-Based national network for the detection and traceback of Foodborne Pathogens

BioFluMeN. Biological Fluid Mechanics Network. BioFluMeN - IV. Aula Bovet Viale Regina Elena 299 Istituto Superiore di Sanità

Genomics in Hematology

Horizon2020 La partecipazione italiana allo Strumento PMI

IT infrastructure and user interface: The Galaxy architecture and ARIES cluster

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik

Data Envelopment Analysis (DEA) assessment of composite indicators. of infrastructure endowment.

Scientific Committee Hilary Calvert UCL Cancer Institute, London, United Kingdom

INTERNATIONAL RETREAT OF PhD STUDENTS IN IMMUNOLOGY June 2015

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

No. prev. doc.: 9392/08 SAN 77 DENLEG 48 VETER 5 Subject: EMPLOYMENT, SOCIAL POLICY, HEALTH AND CONSUMER AFFAIRS COUNCIL MEETING ON 9 AND 10 JUNE 2008

Regional Young Investigator SIC meeting. 5-6 March 2015

e INTESA: L'uso di sistemi italiani di telemedicina e loro Integrazione nel Sistema Sanitario Nazionale" L. Guerriero e R. Bedini

I Module Crime Scene & Forensic Genetics

Integration of Protein-protein Interaction Data in a Genomic and proteomic Data Warehouse

Next Generation Sequencing; Technologies, applications and data analysis

Workshop Rapid NGS for Clinical & Public Health Microbiology

@ Biblioteca Salaborsa, Auditorium Enzo Biagi, Piazza del Nettuno, 3, Bologna. tbc Ministero dei Beni e delle Attività Culturali e del Turismo

DIRECT FUNDS STRUCTURAL FUNDS EU BUDGET NO FRAUD. Rome, October 2015

Introduction to NGS data analysis

Pharmacogenomics and theranostics in practice

Bacterial Next Generation Sequencing - nur mehr Daten oder auch mehr Wissen? Dag Harmsen Univ. Münster, Germany dharmsen@uni-muenster.

5,000,000,000 Covered Bond Programme

How To Organize An Efpp Conference

AN OVERVIEW OF THE UK OUTBOUND MARKET

FIRST INTERNATIONAL CONFERENCE ON ANIMAL WELFARE EDUCATION

Analisi in silicoe relazione tra enterotossine stafilococciche e tossine ipotetiche

February Monitor of Bankruptcies, Insolvency Proceedings and Business Closures FourthQuarter 2012

Gruppo Intesa Network

The University is comprised of seven colleges and offers 19. including more than 5000 graduate students.

pulmonary hypertension bologna 2012 Royal Hotel Carlton - Bologna - Italy Under the aegis of alma mater studiorum università di bologna

INTENSIVE COURSE ON GEODIVERSITY AND GEOHERITAGE: EVALUATION AND INTERPRETATION

Practical Solutions for Big Data Analytics

Biogenic Amines 2011

The National Antimicrobial Resistance Monitoring System (NARMS)

EURAPS Annual Meeting

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

Transcription:

EU Reference Laboratory for E. coli Department of Veterinary Public Health and Food Safety Unit of Foodborne Zoonoses Basic Course on Bioinformatics tools for Next Generation Sequencing data mining 11-12 June, 2015, SIDBAE Training Room Content Scientific Report Course agenda and list of participants (Annex 1) Scientific Report The rapid development of next generation sequencing (NGS) platforms and the parallel development of bioinformatics tools for NGS data management and analyses make the genome sequence-based investigation a realistic alternative to conventional molecular typing of bacterial isolates. In particular, the approach promise to become a credible alternative to PFGE and its designated successor for molecular surveillance of VTEC infections. Discussion groups have been recently organized by the European Food Safety Authority (EFSA) and the European Centre for Disease Control (ECDC) to define the most appropriate context for the introduction of NGS as the standard technology for molecular surveillance of foodborne infections. In this respect, a recent inventory of the molecular typing methods and IT applications available within the network of the National Reference Laboratories (NRL) for E. coli showed that some NRLs have access to NGS facilities. In this framework, the European Reference Laboratory for E. coli (EU-RL VTEC) organized a course on the use of bioinformatics for assembling, mining, and analyzing NGS data, 1

with the aim of consolidating the knowledge on the NGS technology and to increase the level of skill and awareness within the NRLs E. coli network. The event consisted in a two-days course and was held in Rome on 11-12 June 2015, at the IT-training room of the (ISS). Nine representatives of eight NRLs and five from as many Italian Official Laboratories participated in the event. Two other scientists from NRL Italy, three scientists from other ISS structures, and two EFSA representatives also took part in the course as observers. The travel and accommodation costs of the NRLs participants were supported with a dedicated budget by DG-SANTE. The course was organized in sessions, each composed of interactive oral presentations, and by hands-on training regarding the subjects discussed. Each participant was placed in front of a workstation for carrying out the practical sessions. The presentations and the exercises were managed by the EU-RL and ISS staff. Dr. Massimiliano Orsini, a Bioinformatician from the Istituto Zooprofilattico Sperimentale dell'abruzzo e del Molise, Teramo, Italy, who is collaborating with EU-RL in the development of bioinformatics tools, also participated in the course management, either giving specific presentations or providing help in the hands-on sessions. An EFSA representative participated as invited speaker. The Course was opened by Dr. Stefano Morabito, Deputy Director of the EU-RL VTEC, who welcomed the participants and briefly introduced the workshop s aims and program. Dr. Morabito also gave an overview of the tools for genomics analysis available as software suites or webservers and introduced the ARIES webserver, a shared workspace for intensive data analyses developed within a collaboration among the EU-RL VTEC, the ISS IT service (Dr. Arnold Knijn), and the Istituto Zooprofilattico Sperimentale dell'abruzzo e del Molise (Dr. M. Orsini). Dr. M. Teresa da Silva Felicio, from EFSA, Parma, presented the current Whole Genome Sequencing (WGS) procurement activities in place at EFSA and the conclusions of the discussion groups from EFSA s Scientific Colloquium on the use of WGS of food-borne pathogens for public health protection. The Colloquium was held in Parma on June 2014 and the report is available for consultation on the EFSA website. Dr. introduced the NGS technology, considering the most widely used platforms Illumina and Ion Torrent, with particular attention to the methodology underlying the two different systems, their throughput, and the possible problems arising with the two technologies. Moreover, she gave an overview of the first steps in the management of NGS data, introducing the concepts of checking the quality of the raw sequencing reads and assembling them in contigs. 2

Dr Massimiliano Orsini presented the advantages deriving from the use of servers for NGS data analyses, and underlined the possible problems deriving from the use of local computing capacities, for limits either in the computational power or in the capacity of storage of the resulting data. Other talks provided a deeper explanation of the assembling processes, and an overview of the assembly pipelines available, with particular attention to those recommended for the different NGS systems, and introducing the steps of the assembly project. Dr. Arnold Knijn, from the ISS IT-service, introduced the informatics infra-structure of ISS, the Galaxy webserver architecture, and the advantages deriving from the use of a Galaxy cluster, increasing the computational activity by portioning the workload among several computing nodes. Dr. Knijn showed the user interface of the Aries Galaxy Cluster at ISS, which will be soon made available for external users. The rest of the course consisted in hands-on sessions, lead by Dr. Michelacci, with the support of Dr. Antonella Maugliani and Dr. Rosangela Tozzoli, both from EU-RL VTEC, and Dr. Orsini. The practical sessions regarded: The quality check of the sequence files. The assembly of the reads in contigs by different webservers (i.e. Centre for Genomic Epidemiology and Aries Galaxy Server). The possibilities of BLAST searches to identify virulence genes and define the serotype of an E. coli strain based on the whole genome sequences. The use of the Prokka tool for annotating contigs. The progressive alignments among sequences from multiple strains with MAUVE software. The possibilities of typing with NGS data. On the second training day, Dr Michelacci gave a short presentation on the current approaches used for typing bacterial strains starting from NGS data, introduced the problems arising in reproducibility and nomenclature assignment, and proposed some novel strategies for designing cross-platform and cross-generational typing strategies. The course was concluded by Dr. Morabito, who presented a wrap up of the topics discussed during the two days. Some of the course presentations are available in the EU-RL VTEC web site (www.iss.it/vtec), section: E. coli Genomics. 3

Annex 1 EU Reference Laboratory for E. coli Department of Veterinary Public Health and Food Safety Unit of Foodborne Zoonoses Basic Course on Bioinformatics tools for Next Generation Sequencing data mining 11-12 June, 2015 SIDBAE Training Room (Building 1, Floor B) Viale Regina Elena, 299 Rome, Italy Organized by: The EU Reference Laboratory for E. coli The ISS IT Service (SIDBAE) Funded by the European Commission DG SANTE 4

DIRECTOR OF THE COURSE Stefano MORABITO EU Reference Laboratory for E. coli Dipartimento di Sanità Pubblica Veterinaria e Sicurezza Alimentare SPEAKERS EU Reference Laboratory for E. coli, ISS, Rome, Italy Alfredo CAPRIOLI Antonella MAUGLIANI Valeria MICHELACCI Stefano MORABITO Rosangela TOZZOLI IT Service (SIDBAE), ISS, Rome, Italy Arnold KNIJN Istituto Zooprofilattico Sperimentale dell'abruzzo e del Molise, Teramo, Italy Massimiliano ORSINI European Food Safety Agency, Parma, Italy Maria Teresa DA SILVA FELICIO Susan BABSA, Clarissa FERRERI TECHNICAL SECRETARIAT EU Reference Laboratory for E. coli Dipartimento di Sanità Pubblica Veterinaria e Sicurezza Alimentare Rome, Italy Fabio GALATI Servizio Informatico, documentazione, biblioteca ed attività editoriali (SIDBAE) Rome, Italy GENERAL INFORMATION Venue:, SIDBAE Training Room (Building 1, Floor B) Viale Regina Elena 299, 00161 Rome This event is part of the scientific and tutorial activities of the EU-RL VTEC, funded by the European Commission DG SANTE For any information regarding the event, please send an email to crl.vtec@iss.it 5

Thursday 11 June 9.00 Registration 9.15 Welcome, housekeeping, and general overview on the training course Alfredo Caprioli Stefano Morabito Fabio Galati Session 1 9.45 Recent and on-going WGS initiatives and activities of EFSA 10.00 Next generation sequencers: from the bacterial culture to raw data M. Teresa da Silva Felicio 10.30 Coffee break 11.00 Analysis of NGS data: local and remote options, closed and custom platforms (Galaxy) 11.30 IT infrastructure and user interface: The Galaxy architecture and ARIES cluster Massimiliano Orsini Arnold Knijn 12.00 Assembly and alignment of NGS data Massimiliano Orsini 13.00 Lunch 14.00 Hands-on exercises: DTU Center for Genomic Epidemiology remote closed approach 15.00 Hands-on exercises: remote and custom analysis on ARIES (Assembly of Illumina and IonTorrent reads, QC, Bowtie2 mapping) Rosangela Tozzoli Antonella Maugliani Massimiliano Orsini Arnold Knijn 16.00 Blast searches, annotations, visualization of genome comparisons 6

Friday 12 June 9.30 Session 2 Hands-on exercises: E. coli Virulotyper and Serotyper, Prokka annotation and Mauve alignments. 11.00 Coffee break 11.30 Typing in the NGS era: The way forward! 12.15 12.45 Hands-on exercises: MLST, different tools for phylogenetic trees (SNPs, NDtree) Concluding remarks Rosangela Tozzoli Stefano Morabito Stefano Morabito Alfredo Caprioli 13.00 Closure of the Course and Lunch Participants Roslen BONDÌ, NRL Italy Maria Grazia DE FALCO, IZS del Mezzogiorno Portici (NA) Sarah DENAYER, NRL Belgium (IPH) Federica GIGLIUCCI, NRL Italy Sirpa HEINIKAINEN, NRL Finland Ivana KOLÁČKOVÁ, NRL Czech Republic Guerrino MACORI, IZS Piemonte, Liguria e Valle d'aosta - Torino Eleonora MASTRORILLI, IZS delle Venezie Legnaro (PD) Antonio PARISI, IZS Puglia e Basilicata - Putignano (BA) Michael POLEMIS, NRL Greece Stefano REALE, IZS della Sicilia Sabine SCHLAGER, NRL Austria Gitte SØRENSEN, NRL Denmark Michael WRIGHT, NRL UK 7

Observers Paola CHIANI, NRL Italy Paola DE SANTIS, IZS Lazio e Toscana - Roma Ilaria DI BARTOLO, ISS Federica FLAMINI, NRL Italy Beatriz GUERRA, EFSA Francesca LEONI, IZS Umbria e Marche - Ancona Angelo ROMANO, IZS Piemonte, Liguria e Valle d'aosta - Torino Gabriele VACCARI, ISS 8