Web-based Gene Expression Handling with the Genetic Data Warehouse




Jörg Lange, Toralf Kirsten
Microarray-Workshop, June 2006

Outline
- Requirements for Gene Expression Analyses
  - Intensity values
  - MIAME
- Genetic Data Warehouse
- Usage
  - Chip Handling
  - Chip and Gene Annotations
  - Analyses and Report

Intensity values
- Huge amounts of data with every new chip
- Data type: numeric
- Must be interpreted and analyzed with statistical and empirical methods
- Need annotations for interpretation and verification
  - Gene annotation: publicly available in databases such as GenBank, UniProt, KEGG
  - Chip/experiment annotation: entered manually by the experimenter; includes sample data, array data and laboratory data

MIAME
- Minimum Information About a Microarray Experiment: the information needed to interpret the results of an experiment unambiguously and, potentially, to reproduce the experiment
- Checklist of chip annotations that an experimenter has to capture
- Standard developed by the Microarray Gene Expression Data Society (MGED)
- Required for publication, e.g. in Nature Genetics, Bioinformatics
- http://www.mged.org/workgroups/miame/miame.html

Components of MIAME

MIAME Example Entries
- Experiment design: goal of the experiment, experimental factors and design
- Sample: origin and characteristics (name, provider, gender, age) and manipulations (growth condition, treatments)
- Hybridization: protocols and conditions used (temperature, duration)
- Measurement data: raw and normalized data, image scanning hardware and software used, and processing procedures
- Array: platform, location of each spot [not necessary for standard arrays]

MIAMExpress
- Exact implementation of MIAME for the annotation of microarray data
- Developed at the European Bioinformatics Institute (EBI), Hinxton
- Uses controlled vocabularies
- Annotation export to the public gene expression repository ArrayExpress
- Disadvantages
  - Many input fields => error-prone, because the same entities can be described in different ways
  - No query function
  - No import function
  - Not extendable for further annotations, e.g. those captured in clinical studies

Genetic Data Warehouse
- Developed at IZBI Leipzig
- Handling, analysis and storage of large amounts of chip-based genetic data
  - Microarray-based gene expression data (Affymetrix)
  - Matrix-CGH (Array-CGH) data
- Web-based interfaces for data import and export and for running analysis methods
  - Load CEL files of chips and preprocessed data
  - Chip annotation using predefined and extendable templates
  - Visualize intensity values
  - Generate statistical reports
  - Integration of public annotation data
  - Data export in tab-delimited form

www.izbi.de/geware
- A user logs in to a user group to which she has access

Current Applications
- GeWare is used in two collaborative cancer research studies
  - Molecular Mechanism in Malignant Lymphoma: http://www.lymphome.de/projekte/mmml
  - German Glioma Network: http://www.gliomnetzwerk.de/
- Collaboration: clinical, pathological and molecular-genetics centers across Germany
- Heterogeneous data for hundreds of patients
- Harmonized study design managed by clinical trial software => integrated as chip annotation
- GeWare is open to all researchers who want to use its functions

Apply Chips
- Create the requested number of chips
- Create experiments, or append to existing ones, as collections of chips

Available Chip Types in GeWare
- Gene expression: Affymetrix GeneChips
  - Human: HG-U95A, HG-U95Av2, HG-U133A, HuGeneFl, Hu35KsubA, HG-U133_Plus_2
  - Mouse: MG_U74Av2, MOE430A, Mouse430_2
  - Rat: RAE230A
  - C. elegans
  - Further chip types on demand (the CDF file is then loaded)
- Matrix-CGH
  - Laboratory chips produced in Ulm within the MMML project
  - Uploaded by the user as tab-delimited files

Chip Annotation
- Goal: a uniform and comprehensive annotation for later analysis
- Focus-dependent annotation data in different clinical studies, e.g. Lymphoma vs. Glioma
- Annotation templates
  - Collections of annotation categories (parameters) for which annotation values have to be captured
  - Generic management of metadata and values
  - Hierarchical arrangement of categories
  - Definition of MIAME-compliant templates
  - Controlled vocabularies (predefined terms)
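The template idea above (hierarchical categories, optionally bound to a controlled vocabulary) can be sketched as a small data structure. This is only an illustration, not GeWare's actual data model: the class `Category`, the function `validate`, and the example categories and terms are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Category:
    """One annotation category (parameter) of a hypothetical template."""
    name: str
    vocabulary: Optional[frozenset] = None   # None means free text is allowed
    children: list = field(default_factory=list)


def validate(category, values, path=""):
    """Return a list of problems found in `values` for this category subtree."""
    full = f"{path}/{category.name}"
    problems = []
    if category.children:
        # Inner node: descend into the nested dictionary of sub-values.
        sub = values.get(category.name, {})
        for child in category.children:
            problems += validate(child, sub, full)
        return problems
    # Leaf node: a value must be present and, if a vocabulary is defined,
    # must be one of its predefined terms.
    value = values.get(category.name)
    if value is None:
        problems.append(f"{full}: missing value")
    elif category.vocabulary is not None and value not in category.vocabulary:
        problems.append(f"{full}: '{value}' not in controlled vocabulary")
    return problems


# Hypothetical template with a hierarchy of categories.
template = Category("Sample", children=[
    Category("Gender", vocabulary=frozenset({"female", "male", "unknown"})),
    Category("Age"),
    Category("Treatment", children=[Category("Growth condition")]),
])

# Hypothetical annotation of one chip; "f" is not a predefined term.
annotation = {"Sample": {"Gender": "f", "Age": "54", "Treatment": {}}}
problems = validate(template, annotation)
for problem in problems:
    print(problem)
```

Restricting leaf values to predefined terms addresses the weakness noted for free-text entry, where the same entity ends up described in several different ways.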

Chip Annotation (2)
- List of chips with filters to reduce the number of entries
- Navigation bar
- Select values from the specified controlled vocabulary

Browse Chip Annotation
- Search for relevant molecular-biological data in GeWare using clinical data
- Group data for later reuse in other analyses

Browse Gene Annotation
- Fetch probe set names and chip type by querying gene annotations such as Gene Symbol, Map Location, OMIM
- Annotation attributes are appended to the result

Preprocessing
- List of available preprocessing methods
- Selection of the chips with CEL files

Group Management
- Inputs: GE data, CGH data, chip annotation (e.g. clinical data)
- Groups: chip groups, gene and clone groups, parameter groups
- Used in: analyses, visualization, export

Visualization
- Display the signal values of a gene group and a chip group in a line plot
- Display an M/A plot of two selected chips
  - Signal difference (Chip1 - Chip2) on the y axis and signal sum (Chip1 + Chip2) on the x axis
- Draw a heatmap of a chip group and a gene group
  - Computed by the statistical software R
  - Selected chip annotation as class label
  - Output: *.png or *.pdf
- Also available for Matrix-CGH
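The M/A plot coordinates described above can be sketched in a few lines. This is a minimal illustration, not GeWare's implementation. It assumes that signals are log2-transformed before the difference and the sum are taken, which the slide does not state. The conventional M/A plot uses the mean of the two log signals on the x axis, which is half the sum used here. The function name `ma_values` and the probe set names are hypothetical.

```python
import math


def ma_values(chip1, chip2):
    """Return {probe set: (sum, difference)} for probe sets on both chips.

    chip1, chip2: dict mapping probe set name -> raw signal (must be > 0).
    The log2 transform is an assumption of this sketch.
    """
    points = {}
    for probe in chip1.keys() & chip2.keys():
        s1 = math.log2(chip1[probe])
        s2 = math.log2(chip2[probe])
        # x axis: signal sum (Chip1 + Chip2); y axis: signal difference (Chip1 - Chip2)
        points[probe] = (s1 + s2, s1 - s2)
    return points


# Hypothetical signals of two chips; probe_c is missing on chip1 and is skipped.
chip1 = {"probe_a": 8.0, "probe_b": 16.0}
chip2 = {"probe_a": 2.0, "probe_b": 16.0, "probe_c": 4.0}
points = ma_values(chip1, chip2)
for probe in sorted(points):
    print(probe, points[probe])
```

Probe sets with equal signal on both chips fall on the horizontal line y = 0, so systematic deviations from that line hint at an intensity-dependent bias between the two chips.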

Visualization in Heatmaps (1)
- List of gene groups matching the selected chips and chip type, e.g. HG-U133A
- Additional: chip annotation as class label

Visualization in Heatmaps (2)
- Heatmap including hierarchical cluster analysis
- Axes: chips / patients and genes
- Additional annotation class label: Stage (# infected lymph nodes)

Statistical Reports
- Standard error
- Chip and gene groups to filter
- Generation of new gene groups
- Flexible report extension with annotation attributes
  - Available NetAffx annotation attributes
  - Further annotation attributes
- Downloadable results
- Screenshot labels
  - Chip group filter (Lymphome, Controls)
  - Gene group filter
  - View further gene annotations
  - Selected annotation attributes of interest
  - Selection to store genes in a group
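As an illustration of a report restricted by a chip group and a gene group, the following sketch computes the mean and the standard error of the mean per probe set. Which statistics the GeWare report actually contains beyond the standard error is not stated on the slide. The function names, the layout of `matrix` and all data are hypothetical.

```python
import math
import statistics


def standard_error(values):
    """Standard error of the mean: sample standard deviation / sqrt(n)."""
    return statistics.stdev(values) / math.sqrt(len(values))


def report(matrix, chip_group, gene_group):
    """Return one (probe set, mean, standard error) row per probe set.

    matrix: dict probe set -> dict chip -> signal (hypothetical layout).
    Only the chips in chip_group and the probe sets in gene_group are used.
    """
    rows = []
    for probe in gene_group:
        values = [matrix[probe][chip] for chip in chip_group]
        rows.append((probe, statistics.mean(values), standard_error(values)))
    return rows


# Hypothetical expression matrix with three chips.
matrix = {
    "probe_a": {"chip1": 2.0, "chip2": 4.0, "chip3": 6.0},
    "probe_b": {"chip1": 5.0, "chip2": 5.0, "chip3": 5.0},
}
rows = report(matrix, ["chip1", "chip2", "chip3"], ["probe_a", "probe_b"])
for probe, mean, sem in rows:
    print(f"{probe}\t{mean:.3f}\t{sem:.3f}")
```

A new gene group could then be derived from such rows, for example by keeping only the probe sets whose standard error stays below a chosen threshold.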

Statistical Reports - Correlation
- Specify one probe set whose correlation with a set of probe sets should be computed
- Additional annotation
- Probe sets of the group sorted by correlation
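The correlation report can be sketched as follows. The slide does not say which correlation measure GeWare uses; this sketch assumes the Pearson coefficient. The function names, the layout of `matrix` and the data are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences.

    Assumes that neither sequence is constant (otherwise the
    denominator is zero).
    """
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    dev_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    dev_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (dev_x * dev_y)


def correlation_report(matrix, reference, gene_group, chips):
    """Correlate `reference` with each probe set of gene_group over `chips`.

    Returns (probe set, correlation) rows sorted by descending correlation.
    """
    ref = [matrix[reference][chip] for chip in chips]
    rows = [
        (probe, pearson(ref, [matrix[probe][chip] for chip in chips]))
        for probe in gene_group
        if probe != reference
    ]
    rows.sort(key=lambda row: row[1], reverse=True)
    return rows


# Hypothetical signals of four probe sets on three chips.
chips = ["chip1", "chip2", "chip3"]
matrix = {
    "probe_ref": dict(zip(chips, [1.0, 2.0, 3.0])),
    "probe_b": dict(zip(chips, [2.0, 4.0, 6.0])),
    "probe_c": dict(zip(chips, [3.0, 2.0, 1.0])),
    "probe_d": dict(zip(chips, [1.0, 3.0, 2.0])),
}
rows = correlation_report(matrix, "probe_ref", ["probe_b", "probe_c", "probe_d"], chips)
for probe, r in rows:
    print(f"{probe}\t{r:+.3f}")
```

Sorting in descending order puts the probe sets that behave most like the reference at the top of the list and the anti-correlated ones at the bottom.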

Data Export
- All experimental files imported or generated by the user's own group
  - CEL files, Affymetrix reports (RPT) if available
  - Files generated by analyses, e.g. TIF
  - Other data
- Intensity values (expression matrix) determined by chip group and gene group
  - Default: tab-delimited; other separators can be specified
  - Flexible extension by chip annotation data
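The export of an expression matrix can be sketched with Python's standard `csv` module. This is an assumption-laden illustration, not GeWare's export format: the function `export_matrix`, the header label `probe_set`, the placement of annotation rows directly below the header, and all data are hypothetical.

```python
import csv
import io


def export_matrix(matrix, chip_group, gene_group,
                  chip_annotation=None, annotation_keys=(), delimiter="\t"):
    """Return the expression matrix as delimited text.

    Columns are the chips of chip_group, rows the probe sets of gene_group.
    For each key in annotation_keys an extra row with the chip annotation
    values is written below the header (layout chosen for this sketch).
    """
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=delimiter, lineterminator="\n")
    writer.writerow(["probe_set"] + list(chip_group))
    for key in annotation_keys:
        writer.writerow([key] + [chip_annotation[chip].get(key, "")
                                 for chip in chip_group])
    for probe in gene_group:
        writer.writerow([probe] + [matrix[probe][chip] for chip in chip_group])
    return buffer.getvalue()


# Hypothetical matrix and chip annotation.
matrix = {"probe_a": {"chip1": 7.5, "chip2": 8.25}}
annotation = {"chip1": {"Diagnosis": "Lymphoma"}, "chip2": {"Diagnosis": "Control"}}
text = export_matrix(matrix, ["chip1", "chip2"], ["probe_a"],
                     chip_annotation=annotation, annotation_keys=["Diagnosis"])
print(text, end="")
```

Passing `delimiter=";"` or another single character switches the separator, which corresponds to the option of specifying a separator other than the tab default.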

Outlook
- Integration of further genetic data into GeWare
  - Single nucleotide polymorphisms (SNP)
  - Tiling arrays
- More analysis methods based on project members' needs
  - Differential gene expression analysis
  - Chip quality control by M. Rosolowski

Thank you for your attention!