Integrative Analysis of Genomic Copy Number. Cancer.

Size: px
Start display at page:

Download "Integrative Analysis of Genomic Copy Number. Cancer."

Transcription

1 Integrative Analysis of Genomic Copy Number and Gene Expression Data in Metastatic Prostate Cancer. Elise Chang Agilent Technologies

2 Agenda Introduction Features of Copy Number Workflow SNPs.. SNPs.. Case study- Integrative Analysis CNVs.. of Genomic copy number CNVs.. and Gene Expression Data in Metastatic Prostate Cancer CNPs CNPs CNVRs.. CNVRs..

3 Copy Number Variation- Understanding the Relevance to Human Diseases Copy number variation (CNV): DNA segments in which copy-number varies between two or more genomes Ranges from 1 Kb to millions of DNA bases in size CNVs have been associated with susceptibility to disease, complex behavioral traits, and other phenotypic variability Identifying significant CNVs is important in understanding the underlying mechanism of disease and disease susceptibility

4 Supported Array Platforms Affymetrix: 100K (50K Xba, 50K Hind) 500K (250K Nsp, 250K Sty) SNP 5.0 SNP 6.0 Illumina: GenomeStudio outputs for all SNP/CNV arrays GeneSpring GX plugin for GenomeStudio used to export data in format GeneSpring GX will support (plug-in located in: INSTALLDIR\app\Illumina\GX.Genotyping.Export.dll to Genomestudio\modules \ BSGT \ ReportPlugins\) -Instructions for installation are in section of the manual.

5 Supported Arrays Affymetrix Technology available on Agilent server. Experiment creation involves importing the CEL files, summarization and normalization GX11 computes log ratio, CN and LOH GX11 uses the CN values to get ASCN, PSCN and to run GISTIC Illumina Technology created on the fly. Experiment creation involves import from GenomeStudio Log ratios, CN values and LOH are imported from GenomeStudio GX11 uses the CN values to get ASCN, PSCN and to run GISTIC

6 Experimental Designs Identification of variation requires comparison to either a reference DNA source, a reference dataset or a reference genome sequence. This is important for Affymetrix experiment creation 1. Analysis against a reference: The control is generated from a pool of individuals. All the test samples are then compared against a common, pooled control, also known as reference. HapMap samples are packaged as Standard Reference Custom Reference can be created 2. Paired Analysis: Control and the test DNA are from the same individual Pairing is defined during experiment grouping

7 Custom Reference Creation Menu: Tools> Create Custom Reference Typically need reference samples for accurate genotype calls on non-reference Once Custom Reference is created, it will be saved for future experiment creation

8 Reference Creation References contain: Averaged summarised intensities for probe sets from PLIER For Affymetrix 50/100K Set Statistics from BRLMM For 250/500K Set and SNP5.0 Affymetrix arrays Statistics from BirdSeed Algorithm Clusters from BirdSeed Algorithm (and median and s.d. of clusters) For SNP6.0 Affymetrix arrays Statistics from BirdSeed Algorithm Clusters from BirdSeed Algorithm (and median and s.d. of clusters) Clusters from CANARY (and median and s.d. of clusters)

9 Experimental Set-up for Paired Normal Design For paired-normal experimental designs, two parameters must be specified Group indicates a set of paired samples Condition indicates which sample(s) to use as reference (Normal) for test sample(s) (Tumor) Parameters must be Group and Condition for GeneSpring GX to recognize it as a paired design Interpretation using Group and Condition must be used for Copy Number Computation

10 Copy Number Analysis Workflow in GeneSpring GX 11 QC / Batch Correction Copy NumberAnalysis: (CN, LOH, ASCN, Log ratio) GISTIC for Identification of Statistically Common CN variation within a set of samples Filter for Regions of Interest Biological Contextualization of Genes in Regions of interest * QC/Batch correction step is not available for Illumina workflow

11 Quality Control on Samples This window should look familiar to current GeneSpringGX users.

12 Quality Control Tools - PCA and Batch Effect Quality Control PCA- -identifies potential sample outliers Batch Effect -identifies and corrects for systematic error when different samples are processed on different days or different conditions.

13 Batch Correction Select interpretation that groups samples into their respective batches Minimum samples per batch Minimum m number of samples per batch to be considered for correction P-value T-test p-value cutoff for each probe Percentage of bad batches allowed If percent bad batches below userspecified value, do not perform correction for probe Each batch is T-tested against a pool of all remaining batches. Correction for each flagged entity is Correction for each flagged entity is performed using a reference batch.

14 Copy Number Computation Copy NumberAnalysis: (CN, LOH, ASCN, Log ratio, LOD score)

15 Copy Number Analysis for Affymetrix Data Computation actually computing: (1) Log ratio values Against Reference design: Normalized intensity of sample/ Normalized intensity of reference Paired design: Normalized intensity of Case/ Normalized intensity of Control (2) Genomic Copy Number Circular Binary Segmentation to identify segments Log ratio values to estimate genomic copy number Confidence value give as log10 of p-value (3) Allele-specific copy number (ascn) information Fawkes algorithm used to assign allele-specific copy number using SNP probes (4) Parent-specific copy number (pscn) information (5) Loss of Heterozygosity (LOH) Hidden Markov Model (HMM) used to calculate LOH score

16 Log Ratio and Copy Number Computation Copy Number computation (paired or against reference) is determined by the interpretation selected: First Log 2 ratios are calculated for every probe: Against Reference design: Normalized intensity of sample/ Normalized intensity of reference Paired design: Normalized intensity of Case/ Normalized intensity of Control

17 Copy Number Computation Circular Binary Segmentation Smooths outliers Finds change points in each sample using a statistic to identify a segment break Validation of change point using t-test test with p value cut off < Outputs are segment break points and mean log ratio for segment Segment Break Points

18 Copy Number Computation Once segments are identified by CBS then copy numbers and confidence scores need to be assigned to them Copy Number: HapMap dataset is used to generate a median map Using the birdseed and CANARY outputs for each possible copy number (0,1,2,3,4) the median and s.d log ratios across all probes is calculated Log ratios for segments from CBS are compared to the median map and copy numbers are assigned Homozygous and Hemizygous deletions are given values of 0 and1 Amplifications are given CN values of 3 and 4. Copy Number Confidence: Copy Numbers between 1.5 and 2.5 are assigned a p value of '1' For any other copy number a T test t against zero of log ratios is performed with multiples l testing ti correction Negative logarithm to the base 10 of the final p value reported as confidence.

19 Copy Number Computation Median Map Copy Number Assigned Genome- Wide Human SNP Array 6.0 Genome-Wide Human SNP Array 5.0 Mean Log Ratio that is mapped Human Mapping 500K Array Set - NSP Human Mapping 500K Array Set - STY Mapping 100k array set Same as Genome Wide Human SNP Array

20 Copy Number Analysis Log ratios are smoothed to give CN values. CN segments are created using Circular Binary Segmentation (CBS) algorithm. CN values log ratios F ti l ll di t CN l i d i Fractional as well as discrete CN values are assigned, in the range of 0-4

21 1. Paired Analysis CN computation Condition-Type Interpretation 2. Each tumor is paired against the Normal of its group 3. All Normals are compared against the reference All samples against reference comparison Only one set of CN Analysis results can be stored.

22 Allele-specific Copy Number Given segment with copy number = 3, which allele was duplicated? Example output: AAB = A2: B1

23 Parent-specific Copy Number Consider a section of a Chromosome with haplotypes: ChrCopy1: A 1B 2A 3B 4B 5 B (after duplication): A 1B 2A 3B 4B 5 B A 1B 2A 3B 4B 5 B ChrCopy2: A 1 A 2 B 3 A 4 B 5 Suppose Copy1 gets duplicated 2 additional times (CN of region =4), the ascn become: A 1 :4 B 1 :0 and pscn = 4-0 A 2 :1 B 2 :3 and pscn = 3-1 A 3 :3 B 3 :1 and pscn = 3-1 A 4 :1 B 4 :3 and pscn = 3-1 A 5 :0 B 5 :4 and pscn = 4-0 PSCN is a measure of allelic imbalance

24 Copy Number Computation for Illumina Arrays Copy Number, Log ratio, and LOH scores calculated in GenomeStudio and imported into GeneSpring GX The following are computed in GeneSpring GX: ASCN information PSCN information

25 Analysis and Filtering Once you have identified regions of genomic alteration in individual sample how can you find meaningful events in groups of samples? Find Common Genomic Variant Regions Filter By Regions Identify Copy Neutral LOH Filter By PSCN

26 Finding Common Genomic Variant Regions Across asetofsamples Samples Genomic Identification of Significant ifi Targets in Cancer (GISTIC)

27 Find Common Genomic Variant Regions Many tumour samples have large numbers of chromosomal abberations. GISTIC was developed to try and distinguish meaningful or driver mutation events from random background somatic or passenger events Driver mutations are functionally important events which confer advantageous biological properties to the tumour allowing it to initiate grow or persist and are more likely to drive cancer pathogenesis GISTIC can also be applied to non cancer datasets where you want to find common genomic variant regions

28 Common Genomic Variant Regions Choose Fine or Coarse Mode Amplified Regions Deleted Regions

29 Common Variation Results Once GISTIC has identified aberrant regions it uses the biological genome to find overlapping genes for amplified and deleted segments For each probeset within the region, the upstream and downstream 1000 bases are scanned and the genes are identified G l i th Genes overlapping the significant regions identified and stored in the Project Navigator

30 Use of Filters to identify genomic landscape prevalent in metastatic prostate cancer

31 Results Analysis 31 Confidentialit March

32 Biological Contextualization of Copy Number Data 32 Confidentialit March

33 Case Study

34 Integrative Analysis of Metastatic Prostate Cancer Prostate Cancer is the most common cancer in men. Primary tumors are thought to be composed of multiple genetically distinct cancer cell clones. Both the primary and the metastatic prostate cancers are p y p heterogenous in nature, posing therapeutic challenges.

35 Datasets Used Expression: GSE metastatic samples from 4 patients and 18 normal samples Genomic Copy Number: GSE metastatic locations from 14 patients and 16 subject paired non-cancerous samples Liu et al, Nat Med May;15(5):559-65

36 Copy Number Analysis in Prostate Cancer Samples 36 Confidentialit March

37 Expression Analysis in Prostate Cancer Samples 37 Confidentialit March

38 PCA- Genotyping Data Shape by Condition: Tumor Normal Color by Patient Color by Patient Group

39 PCA- Expression Data Normal Metastatic QC using PCA shows separation of the Normal and the Metastatic samples of GSE6919

40 Histogram view of data tracks in Genome Browser showing deletions as green blocks and amplifications as red dblocks Published data Chr. 6 Deletion- Pateint #17 Chromosome 6 Validated d in GX11

41 Joint Analysis of Gene Expression and Genomic Copy Number Data in Metastatic Prostate Cancer Copy Number Gene Expression Prostate Cancer Studies Controlled for regions and metastatic tissues 41 Confidentialit March

42 Deletions present in chr.6 of patient 17: An Integrative Analysis

43 Analysis workflow Expression: Genotyping: T-test Standard Reference FC 2.0 p-value: 0.05 Differentially expressed 441 entities Copy Number computation Filters Genome Browser

44 Deletion of PLAGL Fold Downregulation of PLAGL1 in Metastasis Data xpression Ex Genomic Data

45 PLAGL1 Candidate Tumor suppressor gene, with anti-proliferative activities Zinc finger protein with transactivation and DNA binding activity Presence of splice variants which allow differential regulation of apoptosis induction and cell cycle arrest Frequently deleted in many solid tumors-breast, ovarian and renal cell carcinomas Also known as LOT or Lost On Transformation

46 PLAG1-network analysis

47 First order expansion of PLAG1 network and overlay with FC data

48 TCF21 Genomic Data Expression Data TCF21 TCF21 CN=2 No genomic aberration of TCF21 Down regulation of Down-regulation of expression levels of TCF21

49 TCF21 First Order Expansion of the PLAGL1 network identified TCF21, a ts gene, to be down regulated in the expression analysis. The CN of TCF21 remains at 2, unlike that of PLAGL1. TCF21 is known to be frequently silenced epigenetically in head and neck cancer. Consistent with this, TCF21 did not show any deletion in the samples examined, raising the possibility that TFC21 could be epigenetically pg regulated in prostate cancer.

50 Conclusions 1. Using GX11, we could validate the presence of ERG- TMPRSS2 in several of metastatic prostate cancer samples 2. Significant Aberration found in PTEN, FGF18, TRIB3 by GISTIC indicates that these could be driver mutations of prostate cancer. 3. Additional candidates were identified by combined use of filters to identify amplified regions and regions of allelic imbalance. 4. Integrative ti analysis using expression and genotyping data has identified PLAGL1, a candidate ts gene, and TCF21, a ts gene, to be having a possible role in prostate cancer. 5. PLAGL1 deletion, though present in a small percentage of population, is an early event, occurring at a pre-metastatic stage

Simplifying Data Interpretation with Nexus Copy Number

Simplifying Data Interpretation with Nexus Copy Number Simplifying Data Interpretation with Nexus Copy Number A WHITE PAPER FROM BIODISCOVERY, INC. Rapid technological advancements, such as high-density acgh and SNP arrays as well as next-generation sequencing

More information

Core Facility Genomics

Core Facility Genomics Core Facility Genomics versatile genome or transcriptome analyses based on quantifiable highthroughput data ascertainment 1 Topics Collaboration with Harald Binder and Clemens Kreutz Project: Microarray

More information

Comparative genomic hybridization Because arrays are more than just a tool for expression analysis

Comparative genomic hybridization Because arrays are more than just a tool for expression analysis Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Comparative genomic hybridization Because arrays are more than just a tool for expression analysis Carsten Friis ( with several slides from

More information

CNV Univariate Analysis Tutorial

CNV Univariate Analysis Tutorial CNV Univariate Analysis Tutorial Release 8.1 Golden Helix, Inc. March 18, 2014 Contents 1. Overview 2 2. CNAM Optimal Segmenting 4 A. Performing CNAM Optimal Segmenting..................................

More information

DNA Copy Number and Loss of Heterozygosity Analysis Algorithms

DNA Copy Number and Loss of Heterozygosity Analysis Algorithms DNA Copy Number and Loss of Heterozygosity Analysis Algorithms Detection of copy-number variants and chromosomal aberrations in GenomeStudio software. Introduction Illumina has developed several algorithms

More information

Focusing on results not data comprehensive data analysis for targeted next generation sequencing

Focusing on results not data comprehensive data analysis for targeted next generation sequencing Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

Single-Cell Whole Genome Sequencing on the C1 System: a Performance Evaluation

Single-Cell Whole Genome Sequencing on the C1 System: a Performance Evaluation PN 100-9879 A1 TECHNICAL NOTE Single-Cell Whole Genome Sequencing on the C1 System: a Performance Evaluation Introduction Cancer is a dynamic evolutionary process of which intratumor genetic and phenotypic

More information

PREDA S4-classes. Francesco Ferrari October 13, 2015

PREDA S4-classes. Francesco Ferrari October 13, 2015 PREDA S4-classes Francesco Ferrari October 13, 2015 Abstract This document provides a description of custom S4 classes used to manage data structures for PREDA: an R package for Position RElated Data Analysis.

More information

Agilent CytoGenomics Software A Complete Solution for Cytogenetic Research Data Analysis

Agilent CytoGenomics Software A Complete Solution for Cytogenetic Research Data Analysis Agilent CytoGenomics Software A Complete Solution for Cytogenetic Research Data Analysis Technical Overview Streamlines the cytogenetic research workflow for finding CNCs, LOH, and UPD Enables manual sample

More information

Globally, about 9.7% of cancers in men are prostate cancers, and the risk of developing the

Globally, about 9.7% of cancers in men are prostate cancers, and the risk of developing the Chapter 5 Analysis of Prostate Cancer Association Study Data 5.1 Risk factors for Prostate Cancer Globally, about 9.7% of cancers in men are prostate cancers, and the risk of developing the disease has

More information

Step by Step Guide to Importing Genetic Data into JMP Genomics

Step by Step Guide to Importing Genetic Data into JMP Genomics Step by Step Guide to Importing Genetic Data into JMP Genomics Page 1 Introduction Data for genetic analyses can exist in a variety of formats. Before this data can be analyzed it must imported into one

More information

GenomeStudio Data Analysis Software

GenomeStudio Data Analysis Software GenomeStudio Data Analysis Software Illumina has created a comprehensive suite of data analysis tools to support a wide range of genetic analysis assays. This single software package provides data visualization

More information

GenomeStudio Data Analysis Software

GenomeStudio Data Analysis Software GenomeStudio Analysis Software Illumina has created a comprehensive suite of data analysis tools to support a wide range of genetic analysis assays. This single software package provides data visualization

More information

Data Analysis for Ion Torrent Sequencing

Data Analysis for Ion Torrent Sequencing IFU022 v140202 Research Use Only Instructions For Use Part III Data Analysis for Ion Torrent Sequencing MANUFACTURER: Multiplicom N.V. Galileilaan 18 2845 Niel Belgium Revision date: August 21, 2014 Page

More information

UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory. April, 2015

UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory. April, 2015 UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory April, 2015 1 Contents Overview... 3 Rare Variants... 3 Observation... 3 Approach... 3 ApoE

More information

Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data

Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data The Illumina TopHat Alignment and Cufflinks Assembly and Differential Expression apps make RNA data analysis accessible to any user, regardless

More information

Targeted. sequencing solutions. Accurate, scalable, fast TARGETED

Targeted. sequencing solutions. Accurate, scalable, fast TARGETED Targeted TARGETED Sequencing sequencing solutions Accurate, scalable, fast Sequencing for every lab, every budget, every application Ion Torrent semiconductor sequencing Ion Torrent technology has pioneered

More information

Tutorial for proteome data analysis using the Perseus software platform

Tutorial for proteome data analysis using the Perseus software platform Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information

More information

Overview of Genetic Testing and Screening

Overview of Genetic Testing and Screening Integrating Genetics into Your Practice Webinar Series Overview of Genetic Testing and Screening Genetic testing is an important tool in the screening and diagnosis of many conditions. New technology is

More information

SNPbrowser Software v3.5

SNPbrowser Software v3.5 Product Bulletin SNP Genotyping SNPbrowser Software v3.5 A Free Software Tool for the Knowledge-Driven Selection of SNP Genotyping Assays Easily visualize SNPs integrated with a physical map, linkage disequilibrium

More information

Combining Data from Different Genotyping Platforms. Gonçalo Abecasis Center for Statistical Genetics University of Michigan

Combining Data from Different Genotyping Platforms. Gonçalo Abecasis Center for Statistical Genetics University of Michigan Combining Data from Different Genotyping Platforms Gonçalo Abecasis Center for Statistical Genetics University of Michigan The Challenge Detecting small effects requires very large sample sizes Combined

More information

Microarray Data Analysis. A step by step analysis using BRB-Array Tools

Microarray Data Analysis. A step by step analysis using BRB-Array Tools Microarray Data Analysis A step by step analysis using BRB-Array Tools 1 EXAMINATION OF DIFFERENTIAL GENE EXPRESSION (1) Objective: to find genes whose expression is changed before and after chemotherapy.

More information

SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis

SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis Goal: This tutorial introduces several websites and tools useful for determining linkage disequilibrium

More information

Next Generation Sequencing: Technology, Mapping, and Analysis

Next Generation Sequencing: Technology, Mapping, and Analysis Next Generation Sequencing: Technology, Mapping, and Analysis Gary Benson Computer Science, Biology, Bioinformatics Boston University gbenson@bu.edu http://tandem.bu.edu/ The Human Genome Project took

More information

Breast cancer and the role of low penetrance alleles: a focus on ATM gene

Breast cancer and the role of low penetrance alleles: a focus on ATM gene Modena 18-19 novembre 2010 Breast cancer and the role of low penetrance alleles: a focus on ATM gene Dr. Laura La Paglia Breast Cancer genetic Other BC susceptibility genes TP53 PTEN STK11 CHEK2 BRCA1

More information

Contents. molecular biology techniques. - Mutations in Factor II. - Mutations in MTHFR gene. - Breast cencer genes. - p53 and breast cancer

Contents. molecular biology techniques. - Mutations in Factor II. - Mutations in MTHFR gene. - Breast cencer genes. - p53 and breast cancer Contents Introduction: biology and medicine, two separated compartments What we need to know: - boring basics in DNA/RNA structure and overview of particular aspects of molecular biology techniques - How

More information

Frequently Asked Questions Next Generation Sequencing

Frequently Asked Questions Next Generation Sequencing Frequently Asked Questions Next Generation Sequencing Import These Frequently Asked Questions for Next Generation Sequencing are some of the more common questions our customers ask. Questions are divided

More information

Replacing TaqMan SNP Genotyping Assays that Fail Applied Biosystems Manufacturing Quality Control. Begin

Replacing TaqMan SNP Genotyping Assays that Fail Applied Biosystems Manufacturing Quality Control. Begin User Bulletin TaqMan SNP Genotyping Assays May 2008 SUBJECT: Replacing TaqMan SNP Genotyping Assays that Fail Applied Biosystems Manufacturing Quality Control In This Bulletin Overview This user bulletin

More information

Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs)

Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs) Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs) Single nucleotide polymorphisms or SNPs (pronounced "snips") are DNA sequence variations that occur

More information

Interpret software. User guide. version 11

Interpret software. User guide. version 11 Interpret software User guide version 11 This protocol booklet and its contents are Oxford Gene Technology (Operations) Limited 2008. All rights reserved. Reproduction of all or any substantial part of

More information

MUTATION, DNA REPAIR AND CANCER

MUTATION, DNA REPAIR AND CANCER MUTATION, DNA REPAIR AND CANCER 1 Mutation A heritable change in the genetic material Essential to the continuity of life Source of variation for natural selection New mutations are more likely to be harmful

More information

SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE

SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE AP Biology Date SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE LEARNING OBJECTIVES Students will gain an appreciation of the physical effects of sickle cell anemia, its prevalence in the population,

More information

Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study

Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study The data for this study is taken from experiment GSE848 from the Gene Expression

More information

Analysis of FFPE DNA Data in CNAG 2.0 A Manual

Analysis of FFPE DNA Data in CNAG 2.0 A Manual Analysis of FFPE DNA Data in CNAG 2.0 A Manual Table of Contents: I. Background P.2 II. Installation and Setup a. Download/Install CNAG 2.0 P.3 b. Setup P.4 III. Extract Mapping 500K FFPE Data P.7 IV.

More information

GAIA: Genomic Analysis of Important Aberrations

GAIA: Genomic Analysis of Important Aberrations GAIA: Genomic Analysis of Important Aberrations Sandro Morganella Stefano Maria Pagnotta Michele Ceccarelli Contents 1 Overview 1 2 Installation 2 3 Package Dependencies 2 4 Vega Data Description 2 4.1

More information

How many of you have checked out the web site on protein-dna interactions?

How many of you have checked out the web site on protein-dna interactions? How many of you have checked out the web site on protein-dna interactions? Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail. Find and be ready to discuss

More information

An example of bioinformatics application on plant breeding projects in Rijk Zwaan

An example of bioinformatics application on plant breeding projects in Rijk Zwaan An example of bioinformatics application on plant breeding projects in Rijk Zwaan Xiangyu Rao 17-08-2012 Introduction of RZ Rijk Zwaan is active worldwide as a vegetable breeding company that focuses on

More information

Name: Class: Date: ID: A

Name: Class: Date: ID: A Name: Class: _ Date: _ Meiosis Quiz 1. (1 point) A kidney cell is an example of which type of cell? a. sex cell b. germ cell c. somatic cell d. haploid cell 2. (1 point) How many chromosomes are in a human

More information

Overview of Next Generation Sequencing platform technologies

Overview of Next Generation Sequencing platform technologies Overview of Next Generation Sequencing platform technologies Dr. Bernd Timmermann Next Generation Sequencing Core Facility Max Planck Institute for Molecular Genetics Berlin, Germany Outline 1. Technologies

More information

Advances in RainDance Sequence Enrichment Technology and Applications in Cancer Research. March 17, 2011 Rendez-Vous Séquençage

Advances in RainDance Sequence Enrichment Technology and Applications in Cancer Research. March 17, 2011 Rendez-Vous Séquençage Advances in RainDance Sequence Enrichment Technology and Applications in Cancer Research March 17, 2011 Rendez-Vous Séquençage Presentation Overview Core Technology Review Sequence Enrichment Application

More information

Basic Analysis of Microarray Data

Basic Analysis of Microarray Data Basic Analysis of Microarray Data A User Guide and Tutorial Scott A. Ness, Ph.D. Co-Director, Keck-UNM Genomics Resource and Dept. of Molecular Genetics and Microbiology University of New Mexico HSC Tel.

More information

Analysis of ChIP-seq data in Galaxy

Analysis of ChIP-seq data in Galaxy Analysis of ChIP-seq data in Galaxy November, 2012 Local copy: https://galaxy.wi.mit.edu/ Joint project between BaRC and IT Main site: http://main.g2.bx.psu.edu/ 1 Font Conventions Bold and blue refers

More information

Step-by-Step Guide to Basic Expression Analysis and Normalization

Step-by-Step Guide to Basic Expression Analysis and Normalization Step-by-Step Guide to Basic Expression Analysis and Normalization Page 1 Introduction This document shows you how to perform a basic analysis and normalization of your data. A full review of this document

More information

Quality Assessment of Exon and Gene Arrays

Quality Assessment of Exon and Gene Arrays Quality Assessment of Exon and Gene Arrays I. Introduction In this white paper we describe some quality assessment procedures that are computed from CEL files from Whole Transcript (WT) based arrays such

More information

CHAPTER 2: UNDERSTANDING CANCER

CHAPTER 2: UNDERSTANDING CANCER CHAPTER 2: UNDERSTANDING CANCER INTRODUCTION We are witnessing an era of great discovery in the field of cancer research. New insights into the causes and development of cancer are emerging. These discoveries

More information

LESSON 3.5 WORKBOOK. How do cancer cells evolve? Workbook Lesson 3.5

LESSON 3.5 WORKBOOK. How do cancer cells evolve? Workbook Lesson 3.5 LESSON 3.5 WORKBOOK How do cancer cells evolve? In this unit we have learned how normal cells can be transformed so that they stop behaving as part of a tissue community and become unresponsive to regulation.

More information

Genomes and SNPs in Malaria and Sickle Cell Anemia

Genomes and SNPs in Malaria and Sickle Cell Anemia Genomes and SNPs in Malaria and Sickle Cell Anemia Introduction to Genome Browsing with Ensembl Ensembl The vast amount of information in biological databases today demands a way of organising and accessing

More information

Partek Methylation User Guide

Partek Methylation User Guide Partek Methylation User Guide Introduction This user guide will explain the different types of workflow that can be used to analyze methylation datasets. Under the Partek Methylation workflow there are

More information

Human Genome Organization: An Update. Genome Organization: An Update

Human Genome Organization: An Update. Genome Organization: An Update Human Genome Organization: An Update Genome Organization: An Update Highlights of Human Genome Project Timetable Proposed in 1990 as 3 billion dollar joint venture between DOE and NIH with 15 year completion

More information

Roberto Ciccone, Orsetta Zuffardi Università di Pavia

Roberto Ciccone, Orsetta Zuffardi Università di Pavia Roberto Ciccone, Orsetta Zuffardi Università di Pavia XIII Corso di Formazione Malformazioni Congenite dalla Diagnosi Prenatale alla Terapia Postnatale unipv.eu Carrara, 24 ottobre 2014 Legend:Bluebars

More information

8/7/2012. Experimental Design & Intro to NGS Data Analysis. Examples. Agenda. Shoe Example. Breast Cancer Example. Rat Example (Experimental Design)

8/7/2012. Experimental Design & Intro to NGS Data Analysis. Examples. Agenda. Shoe Example. Breast Cancer Example. Rat Example (Experimental Design) Experimental Design & Intro to NGS Data Analysis Ryan Peters Field Application Specialist Partek, Incorporated Agenda Experimental Design Examples ANOVA What assays are possible? NGS Analytical Process

More information

micrornas Non protein coding, endogenous RNAs of 21-22nt length Evolutionarily conserved

micrornas Non protein coding, endogenous RNAs of 21-22nt length Evolutionarily conserved microrna 2 micrornas Non protein coding, endogenous RNAs of 21-22nt length Evolutionarily conserved Regulate gene expression by binding complementary regions at 3 regions of target mrnas Act as negative

More information

Differential privacy in health care analytics and medical research An interactive tutorial

Differential privacy in health care analytics and medical research An interactive tutorial Differential privacy in health care analytics and medical research An interactive tutorial Speaker: Moritz Hardt Theory Group, IBM Almaden February 21, 2012 Overview 1. Releasing medical data: What could

More information

TruSeq Custom Amplicon v1.5

TruSeq Custom Amplicon v1.5 Data Sheet: Targeted Resequencing TruSeq Custom Amplicon v1.5 A new and improved amplicon sequencing solution for interrogating custom regions of interest. Highlights Figure 1: TruSeq Custom Amplicon Workflow

More information

Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6

Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6 Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6 Overview This tutorial outlines how microrna data can be analyzed within Partek Genomics Suite. Additionally,

More information

Organization and analysis of NGS variations. Alireza Hadj Khodabakhshi Research Investigator

Organization and analysis of NGS variations. Alireza Hadj Khodabakhshi Research Investigator Organization and analysis of NGS variations. Alireza Hadj Khodabakhshi Research Investigator Why is the NGS data processing a big challenge? Computation cannot keep up with the Biology. Source: illumina

More information

Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company

Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company Genetic engineering: humans Gene replacement therapy or gene therapy Many technical and ethical issues implications for gene pool for germ-line gene therapy what traits constitute disease rather than just

More information

School of Nursing. Presented by Yvette Conley, PhD

School of Nursing. Presented by Yvette Conley, PhD Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression

More information

Supervised and unsupervised learning - 1

Supervised and unsupervised learning - 1 Chapter 3 Supervised and unsupervised learning - 1 3.1 Introduction The science of learning plays a key role in the field of statistics, data mining, artificial intelligence, intersecting with areas in

More information

Introduction To Epigenetic Regulation: How Can The Epigenomics Core Services Help Your Research? Maria (Ken) Figueroa, M.D. Core Scientific Director

Introduction To Epigenetic Regulation: How Can The Epigenomics Core Services Help Your Research? Maria (Ken) Figueroa, M.D. Core Scientific Director Introduction To Epigenetic Regulation: How Can The Epigenomics Core Services Help Your Research? Maria (Ken) Figueroa, M.D. Core Scientific Director Gene expression depends upon multiple factors Gene Transcription

More information

Factors for success in big data science

Factors for success in big data science Factors for success in big data science Damjan Vukcevic Data Science Murdoch Childrens Research Institute 16 October 2014 Big Data Reading Group (Department of Mathematics & Statistics, University of Melbourne)

More information

What is Cancer? Cancer is a genetic disease: Cancer typically involves a change in gene expression/function:

What is Cancer? Cancer is a genetic disease: Cancer typically involves a change in gene expression/function: Cancer is a genetic disease: Inherited cancer Sporadic cancer What is Cancer? Cancer typically involves a change in gene expression/function: Qualitative change Quantitative change Any cancer causing genetic

More information

Chapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes

Chapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes Chapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes 2.1 Introduction Large-scale insertional mutagenesis screening in

More information

Lecture 3: Mutations

Lecture 3: Mutations Lecture 3: Mutations Recall that the flow of information within a cell involves the transcription of DNA to mrna and the translation of mrna to protein. Recall also, that the flow of information between

More information

BioBoot Camp Genetics

BioBoot Camp Genetics BioBoot Camp Genetics BIO.B.1.2.1 Describe how the process of DNA replication results in the transmission and/or conservation of genetic information DNA Replication is the process of DNA being copied before

More information

Genotyping and quality control of UK Biobank, a large- scale, extensively phenotyped prospective resource

Genotyping and quality control of UK Biobank, a large- scale, extensively phenotyped prospective resource Genotyping and quality control of UK Biobank, a large- scale, extensively phenotyped prospective resource Information for researchers Interim Data Release, 2015 1 Introduction... 3 1.1 UK Biobank... 3

More information

CCR Biology - Chapter 9 Practice Test - Summer 2012

CCR Biology - Chapter 9 Practice Test - Summer 2012 Name: Class: Date: CCR Biology - Chapter 9 Practice Test - Summer 2012 Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Genetic engineering is possible

More information

Identification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation

Identification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation Identification of rheumatoid arthritis and osterthritis patients by transcriptome-based rule set generation Bering Limited Report generated on September 19, 2014 Contents 1 Dataset summary 2 1.1 Project

More information

Collaborative Association Study of Psoriasis. Gonçalo Abecasis, Anne Bowcock, James Elder, Jerry Krueger

Collaborative Association Study of Psoriasis. Gonçalo Abecasis, Anne Bowcock, James Elder, Jerry Krueger Collaborative Association Study of Psoriasis Gonçalo Abecasis, Anne Bowcock, James Elder, Jerry Krueger Psoriasis Chronic, inflammatory skin condition Characteristic lesions, can affect substantial proportion

More information

Autoimmunity and immunemediated. FOCiS. Lecture outline

Autoimmunity and immunemediated. FOCiS. Lecture outline 1 Autoimmunity and immunemediated inflammatory diseases Abul K. Abbas, MD UCSF FOCiS 2 Lecture outline Pathogenesis of autoimmunity: why selftolerance fails Genetics of autoimmune diseases Therapeutic

More information

Wissenschaftliche Highlights der GSF 2007

Wissenschaftliche Highlights der GSF 2007 H Forschungszentrum für Umwelt und Gesundheit GmbH in der Helmholtzgemeinschaft Wissenschaftlich-Technische Abteilung Wissenschaftliche Highlights der GSF 2007 Abfrage Oktober 2007 Institut / Selbst. Abteilung

More information

Online Supplement to Polygenic Influence on Educational Attainment. Genotyping was conducted with the Illumina HumanOmni1-Quad v1 platform using

Online Supplement to Polygenic Influence on Educational Attainment. Genotyping was conducted with the Illumina HumanOmni1-Quad v1 platform using Online Supplement to Polygenic Influence on Educational Attainment Construction of Polygenic Score for Educational Attainment Genotyping was conducted with the Illumina HumanOmni1-Quad v1 platform using

More information

Bio EOC Topics for Cell Reproduction: Bio EOC Questions for Cell Reproduction:

Bio EOC Topics for Cell Reproduction: Bio EOC Questions for Cell Reproduction: Bio EOC Topics for Cell Reproduction: Asexual vs. sexual reproduction Mitosis steps, diagrams, purpose o Interphase, Prophase, Metaphase, Anaphase, Telophase, Cytokinesis Meiosis steps, diagrams, purpose

More information

Package cgdsr. August 27, 2015

Package cgdsr. August 27, 2015 Type Package Package cgdsr August 27, 2015 Title R-Based API for Accessing the MSKCC Cancer Genomics Data Server (CGDS) Version 1.2.5 Date 2015-08-25 Author Anders Jacobsen Maintainer Augustin Luna

More information

European Medicines Agency

European Medicines Agency European Medicines Agency July 1996 CPMP/ICH/139/95 ICH Topic Q 5 B Quality of Biotechnological Products: Analysis of the Expression Construct in Cell Lines Used for Production of r-dna Derived Protein

More information

NATIONAL GENETICS REFERENCE LABORATORY (Manchester)

NATIONAL GENETICS REFERENCE LABORATORY (Manchester) NATIONAL GENETICS REFERENCE LABORATORY (Manchester) MLPA analysis spreadsheets User Guide (updated October 2006) INTRODUCTION These spreadsheets are designed to assist with MLPA analysis using the kits

More information

GWAS Data Cleaning. GENEVA Coordinating Center Department of Biostatistics University of Washington. January 13, 2016.

GWAS Data Cleaning. GENEVA Coordinating Center Department of Biostatistics University of Washington. January 13, 2016. GWAS Data Cleaning GENEVA Coordinating Center Department of Biostatistics University of Washington January 13, 2016 Contents 1 Overview 2 2 Preparing Data 3 2.1 Data formats used in GWASTools............................

More information

Information leaflet. Centrum voor Medische Genetica. Version 1/20150504 Design by Ben Caljon, UZ Brussel. Universitair Ziekenhuis Brussel

Information leaflet. Centrum voor Medische Genetica. Version 1/20150504 Design by Ben Caljon, UZ Brussel. Universitair Ziekenhuis Brussel Information on genome-wide genetic testing Array Comparative Genomic Hybridization (array CGH) Single Nucleotide Polymorphism array (SNP array) Massive Parallel Sequencing (MPS) Version 120150504 Design

More information

DeCyder Extended Data Analysis module Version 1.0

DeCyder Extended Data Analysis module Version 1.0 GE Healthcare DeCyder Extended Data Analysis module Version 1.0 Module for DeCyder 2D version 6.5 User Manual Contents 1 Introduction 1.1 Introduction... 7 1.2 The DeCyder EDA User Manual... 9 1.3 Getting

More information

GSR Microarrays Project Management System

GSR Microarrays Project Management System GSR Microarrays Project Management System A User s Guide GSR Microarrays Vanderbilt University MRBIII, Room 9274 465 21 st Avenue South Nashville, TN 37232 microarray@vanderbilt.edu (615) 936-3003 www.gsr.vanderbilt.edu

More information

Single-Cell DNA Sequencing with the C 1. Single-Cell Auto Prep System. Reveal hidden populations and genetic diversity within complex samples

Single-Cell DNA Sequencing with the C 1. Single-Cell Auto Prep System. Reveal hidden populations and genetic diversity within complex samples DATA Sheet Single-Cell DNA Sequencing with the C 1 Single-Cell Auto Prep System Reveal hidden populations and genetic diversity within complex samples Single-cell sensitivity Discover and detect SNPs,

More information

GeneChip Sequence Analysis Software (GSEQ) is used to analyze data from the Resequencing Arrays

GeneChip Sequence Analysis Software (GSEQ) is used to analyze data from the Resequencing Arrays GeneChip Sequence Analysis Software 4.1 Note For more information, Please refer to the Affymetrix GeneChip Sequence Analysis Software User s Guide Version 4.1 guidebook & Quick Reference Card I. GSEQ Introduction

More information

Consistent Assay Performance Across Universal Arrays and Scanners

Consistent Assay Performance Across Universal Arrays and Scanners Technical Note: Illumina Systems and Software Consistent Assay Performance Across Universal Arrays and Scanners There are multiple Universal Array and scanner options for running Illumina DASL and GoldenGate

More information

Current Motif Discovery Tools and their Limitations

Current Motif Discovery Tools and their Limitations Current Motif Discovery Tools and their Limitations Philipp Bucher SIB / CIG Workshop 3 October 2006 Trendy Concepts and Hypotheses Transcription regulatory elements act in a context-dependent manner.

More information

Genetics Lecture Notes 7.03 2005. Lectures 1 2

Genetics Lecture Notes 7.03 2005. Lectures 1 2 Genetics Lecture Notes 7.03 2005 Lectures 1 2 Lecture 1 We will begin this course with the question: What is a gene? This question will take us four lectures to answer because there are actually several

More information

Tutorial for Windows and Macintosh. Preparing Your Data for NGS Alignment

Tutorial for Windows and Macintosh. Preparing Your Data for NGS Alignment Tutorial for Windows and Macintosh Preparing Your Data for NGS Alignment 2015 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) 1.734.769.7249

More information

A Primer of Genome Science THIRD

A Primer of Genome Science THIRD A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:

More information

Guide for Data Visualization and Analysis using ACSN

Guide for Data Visualization and Analysis using ACSN Guide for Data Visualization and Analysis using ACSN ACSN contains the NaviCell tool box, the intuitive and user- friendly environment for data visualization and analysis. The tool is accessible from the

More information

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Xiaohui Xie 1, Jun Lu 1, E. J. Kulbokas 1, Todd R. Golub 1, Vamsi Mootha 1, Kerstin Lindblad-Toh

More information

Psychoonkology, Sept. 2010 lifestyle factors and epigenetics

Psychoonkology, Sept. 2010 lifestyle factors and epigenetics Psychoonkology, Sept. 2010 lifestyle factors and epigenetics Alexander G. Haslberger Dep. für Ernährungswissenschaften Univ. of Vienna Working group: Food, GI-Microbiology, Epigenetics Content Health:

More information

Release Notes. Agilent CytoGenomics v4.0.2. For Research Use Only. Not for use in diagnostic procedures. Product Number

Release Notes. Agilent CytoGenomics v4.0.2. For Research Use Only. Not for use in diagnostic procedures. Product Number Release Notes Agilent CytoGenomics v4.0.2 Product Number G1662AA CytoGenomics Client 1 year named license (including Feature Extraction). This license supports installation of one client and server (to

More information

Tutorial on gplink. http://pngu.mgh.harvard.edu/~purcell/plink/gplink.shtml. PLINK tutorial, December 2006; Shaun Purcell, shaun@pngu.mgh.harvard.

Tutorial on gplink. http://pngu.mgh.harvard.edu/~purcell/plink/gplink.shtml. PLINK tutorial, December 2006; Shaun Purcell, shaun@pngu.mgh.harvard. Tutorial on gplink http://pngu.mgh.harvard.edu/~purcell/plink/gplink.shtml Basic gplink analyses Data management Summary statistics Association analysis Population stratification IBD-based analysis gplink

More information

Course on Functional Analysis. ::: Gene Set Enrichment Analysis - GSEA -

Course on Functional Analysis. ::: Gene Set Enrichment Analysis - GSEA - Course on Functional Analysis ::: Madrid, June 31st, 2007. Gonzalo Gómez, PhD. ggomez@cnio.es Bioinformatics Unit CNIO ::: Contents. 1. Introduction. 2. GSEA Software 3. Data Formats 4. Using GSEA 5. GSEA

More information

Data Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms

Data Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms Data Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms Introduction Mate pair sequencing enables the generation of libraries with insert sizes in the range of several kilobases (Kb).

More information

Single Nucleotide Polymorphisms (SNPs)

Single Nucleotide Polymorphisms (SNPs) Single Nucleotide Polymorphisms (SNPs) Additional Markers 13 core STR loci Obtain further information from additional markers: Y STRs Separating male samples Mitochondrial DNA Working with extremely degraded

More information

SAP HANA Enabling Genome Analysis

SAP HANA Enabling Genome Analysis SAP HANA Enabling Genome Analysis Joanna L. Kelley, PhD Postdoctoral Scholar, Stanford University Enakshi Singh, MSc HANA Product Management, SAP Labs LLC Outline Use cases Genomics review Challenges in

More information

Cluster software and Java TreeView

Cluster software and Java TreeView Cluster software and Java TreeView To download the software: http://bonsai.hgc.jp/~mdehoon/software/cluster/software.htm http://bonsai.hgc.jp/~mdehoon/software/cluster/manual/treeview.html Cluster 3.0

More information

1 Mutation and Genetic Change

1 Mutation and Genetic Change CHAPTER 14 1 Mutation and Genetic Change SECTION Genes in Action KEY IDEAS As you read this section, keep these questions in mind: What is the origin of genetic differences among organisms? What kinds

More information

SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications

SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Product Bulletin Sequencing Software SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Comprehensive reference sequence handling Helps interpret the role of each

More information