Primetime for KNIME:



Similar documents
#jenkinsconf. Jenkins as a Scientific Data and Image Processing Platform. Jenkins User Conference Boston #jenkinsconf

BIOINFORMATICS Supporting competencies for the pharma industry

Cell Discovery 360: Explore more possibilities.

MultiQuant Software 2.0 for Targeted Protein / Peptide Quantification

Thermo Scientific ArrayScan XTI High Content Analysis Reader. revolutionizing cell biology with the power of high content

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

2012 LABVANTAGE Solutions, Inc. All Rights Reserved.

Dicer Substrate RNAi Design

Teaching Computational Thinking using Cloud Computing: By A/P Tan Tin Wee

Delivering the power of the world s most successful genomics platform

Table of Contents INTRODUCTION Prerequisites... 3 Audience... 3 Report Metrics... 3

Document Management. Document Management for the Agile Enterprise. AuraTech Pte Ltd

PreciseTM Whitepaper

MRMPilot Software: Accelerating MRM Assay Development for Targeted Quantitative Proteomics

DMBI: Data Management for Bio-Imaging.

Biorepository and Biobanking

RT 2 Profiler PCR Array: Web-Based Data Analysis Tutorial

Fundamentals of LoadRunner 9.0 (2 Days)

Integration of DiscoveryQuant Software into Automated In-Vitro ADME Assay Workflows

Validating Methods using Waters Empower TM 2 Method. Validation. Manager

KNIME Enterprise server usage and global deployment at NIBR

Service-Oriented Architecture and Software Engineering

SOFTWARE TESTING TRAINING COURSES CONTENTS

Dr Alexander Henzing

a measurable difference

Pipeline Pilot Enterprise Server. Flexible Integration of Disparate Data and Applications. Capture and Deployment of Best Practices

Call 2014: High throughput screening of therapeutic molecules and rare diseases

SOA REFERENCE ARCHITECTURE: WEB TIER

G E N OM I C S S E RV I C ES

GC3 Use cases for the Cloud

岑 祥 股 份 有 限 公 司 技 術 專 員 費 軫 尹

TIBCO Spotfire Helps Organon Bridge the Data Gap Between Basic Research and Clinical Trials

What s New in Pathway Studio Web 11.1

GE Healthcare. Centricity * PACS with Universal Viewer. Universal Viewer. Where it all comes together.

High Availability Essentials

Informatics and Knowledge Management at the Novartis Institutes for BioMedical Research (NIBR)

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

User Manual/Hand book. qpcr mirna Arrays ABM catalog # MA003 (human) and MA004 (mouse)

Excel at anything Expertise without limits

Schools Remote Access Server

Implementing and Managing Microsoft Desktop Virtualization

Outline. interfering RNA - What is dat? Brief history of RNA interference. What does it do? How does it work?

Hybrid Development and Test USE CASE

What s New in Analytics: Fall 2015

2311A: Advanced Web Application Development using Microsoft ASP.NET Course 2311A Three days Instructor-led

PANDORA FMS NETWORK DEVICE MONITORING

Leading Genomics. Diagnostic. Discove. Collab. harma. Shanghai Cambridge, MA Reykjavik

Chapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes

NaviCell Data Visualization Python API

How to Run the PacBio RS II Instrument. 1. Run Instrument. 1.1 How to Run the RS II Instrument

Introduction To Real Time Quantitative PCR (qpcr)

Automation in genomics High-throughput management of biological samples and nucleic acids. Valentina Gualdi Operational Scientist PGP

GE Healthcare. Centricity PACS and PACS-IW with Universal Viewer* Where it all comes together

What s New in Analytics: Fall 2015

Ultimus Adaptive BPM Suite V8

DAISY PRODUCER: AN INTEGRATED PRODUCTION MANAGEMENT SYSTEM FOR ACCESSIBLE MEDIA

GXP WebView GEOSPATIAL EXPLOITATION PRODUCTS (GXP )

GE Healthcare. Centricity* PACS and PACS-IW with Universal Viewer. Universal Viewer. Where it all comes together.

W H I T E P A P E R. Flexible Automation for Application Workflows. Flexible Automation for Application Workfl ows Date:

RNAi Shooting the Messenger!

Advanced Web Application Development using Microsoft ASP.NET

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

PANDORA FMS NETWORK DEVICES MONITORING

Euro-BioImaging European Research Infrastructure for Imaging Technologies in Biological and Biomedical Sciences

TOOLS sirna and mirna. User guide

Six Trends in Robotics in the Life Sciences

Microsoft Visual Basic Scripting Edition and Microsoft Windows Script Host Essentials

Software Requirements Specification. Schlumberger Scheduling Assistant. for. Version 0.2. Prepared by Design Team A. Rice University COMP410/539

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

Make the Most of Big Data to Drive Innovation Through Reseach

The full setup includes the server itself, the server control panel, Firebird Database Server, and three sample applications with source code.

Cellular Imaging Solutions Imaging with a vision

Using NI Vision & Motion for Automated Inspection of Medical Devices and Pharmaceutical Processes. Morten Jensen 2004

NNMi120 Network Node Manager i Software 9.x Essentials

Automated Library Preparation for Next-Generation Sequencing

1 What Are Web Services?

Globus Research Data Management: Introduction and Service Overview

Chapter 2 TOPOLOGY SELECTION. SYS-ED/ Computer Education Techniques, Inc.

Implementing and Managing Microsoft Desktop Virtualization en

Reprogramming, Screening and Validation of ipscs and Terminally Differentiated Cells using the qbiomarker PCR Array System

JOURNAL OF OBJECT TECHNOLOGY

Virtualization s Evolution

Aaron Ponti. Single Cell Unit, D-BSSE ETHZ (Basel) h

GeneProf and the new GeneProf Web Services

PRODUCT INFORMATION...

Optimally Manage the Data Center Using Systems Management Tools from Cisco and Microsoft

Advanced Web Application Development using Microsoft ASP.NET

CLC Sequence Viewer USER MANUAL

Integrating Automated Systems for Regulated Bioanalysis Farmen, R.H., Struwe, P. and Groeschl, M.

Scientific and Technical Applications as a Service in the Cloud

Transcription:

Primetime for KNIME: Towards an Integrated Analysis and Visualization Environment for RNAi Screening Data F. Oliver Gathmann, Ph. D. Director IT, Cenix BioScience Presentation for: KNIME User Group Meeting 2011 Zürich, March 3rd 2011

Overview Explain RNAi Screening IT infrastructure for HT-HCS (High-Throughput, High-Content Screening) at Cenix: past, present, and future

Explain RNAi Screening How RNAi works sirna RISC Unwinding of sirna Target mrna Target mrna recognition Degradation of mrna First Take Home Message: RNAi allows you to investigate the function of genes by knocking them down selectively

Explain RNAi Screening The Drug Discovery Pipeline Target Discovery (in vitro) Direct Direct LoF LoF Screens Screens Modifier Modifier Screens Screens Target Validation (in vitro) Phenotypic Phenotypic Profiling Profiling Target Discovery Phenotypic Titration Target Target Lead Lead Validation Validation Identification Optimization in vivo in vitro ADME/ Tox Clinical Phase I Clinical Phase II Clinical Phase III Registration Second Take Home Message: Early In The Drug Discovery Pipeline means highthroughput and lots of data

Explain RNAi Screening Information Layers Metabolic Pathway Gene network, disease conditions Gene Sequence, species, pathway annotations, transcripts Silencing Reagent Structure, targeted Gene(s), stock and order information Experiment Meta data (sample and control positions), production data Phenotype Cell images, morphology data Hit Phenotype annotations, knock down, reproducibility, significance Last Take Home Message: High-Content means complex data structures

IT Infrastructure for HT-HCS Cenix LDAP Database Scientist Workstations LIMS Tube Handler Pipetting Robot Farm File Automated Microscope

Terminology: Workflows Process-centric Workflows vs. Data-centric Workflows Process-centric: mapping a work process in the physical world; focused on data acquisition Data-centric: mapping an algorithm; focused on data processing Not always clear-cut, but still useful distinction

Primordial Process Workflows: Design

Primordial Process Workflows: Implementation

Data Analysis Workflows: Excel In the beginning, there was Excel. + Advantages: Ubiquitous and easy to use Full flexibility for the end user (in theory, anyways) Disadvantages: Hard to debug Nightmarish version control Slow and cumbersome

Data Analysis Workflows: Excel Load phenotype data files; run analysis; generate graphs Engines Submit image processing job Job Store image data Run image Image Processing processing job; store phenotype data Excel Img. Analysis Client Data Analysis Store experiment data; track experiment; wait for image data LIMS qpcr Design experiment Excel Plate reader LIMS Client Autoscope Data Acquisition Post image data File Storage Database Submit experiment Experiment Design

Data Analysis Workflows: Web Tools Next: Web tools with tabular data as input and output. + Advantages: Encapsulation of complex functionality Centralized administration Executed on server Disadvantages: Low flexibility Frugal web interface

Data Analysis Workflows: Web Tools Load result data files; generate graphs Run analysis Download result data files Engines Web Tools Spotfire Browser Upload phenotype and design data files Img. Analysis Job Client Image Processing Data Analysis LIMS qpcr Excel Plate reader LIMS Client Autoscope Data Acquisition File Storage Database Experiment Design

Data Analysis Workflows: KNIME! KNIME: A giant leap forward Flexible and easy to use and yet robust, scalable, performant and extensible! Current KNIME infrastructure: Centrally administered Windows and Mac installations, configured to point to a user-specific workspace on the file server Workflow curation policy: Versioned reference workflows for each project, owned by power users Experiment meta data provided through database nodes, raw data through files Complex statistics implemented with (remote) R scripting nodes

Data Analysis Workflows: KNIME! Load result data files; generate graphs Engines Spotfire KNIME Job Run workflow on Img. Analysis phenotype data and Client experiment design Image Processing Data Analysis LIMS qpcr Excel Plate reader LIMS Client Autoscope Data Acquisition File Storage Database Experiment Design

Primetime: Requirements Streamlining the Screening Pipeline Analysis has become the bottleneck: Potential for 10-20 % increase in overall throughput Even Higher Content: More parameters using advanced analysis methods Single object rather than population data Integrate gene annotations and pathway data Enable customers to explore and (re-)analyze delivered data sets Selecting/weighing parameters Tight integration with Spotfire, including raw data

Primetime: IRIS Integrated computational environment for high throughput RNA Interference Screening Engines Post phenotype data; run workflow on phenotype data and experiment design; post result data Submit image analysis job; wait for phenotype data Spotfire Post phenotype data KNIME Job Store image data; KNIME launch image processing workflow Image Processing Retrieve result data; run Spotfire Data Analysis LIMS qpcr Excel Plate reader LIMS Client Autoscope Data Acquisition File Storage Database Experiment Design

Primetime: Beyond IRIS Use KNIME for process-centric workflows as well This would require Standard interface to the LIMS server to drive the business logic (REST) Easily configurable User Interfaces to parameterize processing steps (something like RGG?)

Primetime: Beyond IRIS KNIME solutions : Hide complexity of workflows by exposing only a few knobs to the end user Features: Again, a User Interface generator to make it easy for non-it power users to create new solutions Ideally, a way to publish the solution to a server and run it remotely

Conclusions KNIME has quickly become an integral part of the HT-HCS screening pipeline at Cenix Current work on the data analysis infrastructure around KNIME is focused on tight integration with the LIMS server, with Definiens for image processing, and with Spotfire for data visualization Further down the road, we plan to use KNIME for all workflows at Cenix and to build pre-packaged solutions

Thank you! Any questions?