Integrative Analysis of Genomic Copy Number. Cancer.
|
|
- Reynard Simpson
- 7 years ago
- Views:
Transcription
1 Integrative Analysis of Genomic Copy Number and Gene Expression Data in Metastatic Prostate Cancer. Elise Chang Agilent Technologies
2 Agenda Introduction Features of Copy Number Workflow SNPs.. SNPs.. Case study- Integrative Analysis CNVs.. of Genomic copy number CNVs.. and Gene Expression Data in Metastatic Prostate Cancer CNPs CNPs CNVRs.. CNVRs..
3 Copy Number Variation- Understanding the Relevance to Human Diseases Copy number variation (CNV): DNA segments in which copy-number varies between two or more genomes Ranges from 1 Kb to millions of DNA bases in size CNVs have been associated with susceptibility to disease, complex behavioral traits, and other phenotypic variability Identifying significant CNVs is important in understanding the underlying mechanism of disease and disease susceptibility
4 Supported Array Platforms Affymetrix: 100K (50K Xba, 50K Hind) 500K (250K Nsp, 250K Sty) SNP 5.0 SNP 6.0 Illumina: GenomeStudio outputs for all SNP/CNV arrays GeneSpring GX plugin for GenomeStudio used to export data in format GeneSpring GX will support (plug-in located in: INSTALLDIR\app\Illumina\GX.Genotyping.Export.dll to Genomestudio\modules \ BSGT \ ReportPlugins\) -Instructions for installation are in section of the manual.
5 Supported Arrays Affymetrix Technology available on Agilent server. Experiment creation involves importing the CEL files, summarization and normalization GX11 computes log ratio, CN and LOH GX11 uses the CN values to get ASCN, PSCN and to run GISTIC Illumina Technology created on the fly. Experiment creation involves import from GenomeStudio Log ratios, CN values and LOH are imported from GenomeStudio GX11 uses the CN values to get ASCN, PSCN and to run GISTIC
6 Experimental Designs Identification of variation requires comparison to either a reference DNA source, a reference dataset or a reference genome sequence. This is important for Affymetrix experiment creation 1. Analysis against a reference: The control is generated from a pool of individuals. All the test samples are then compared against a common, pooled control, also known as reference. HapMap samples are packaged as Standard Reference Custom Reference can be created 2. Paired Analysis: Control and the test DNA are from the same individual Pairing is defined during experiment grouping
7 Custom Reference Creation Menu: Tools> Create Custom Reference Typically need reference samples for accurate genotype calls on non-reference Once Custom Reference is created, it will be saved for future experiment creation
8 Reference Creation References contain: Averaged summarised intensities for probe sets from PLIER For Affymetrix 50/100K Set Statistics from BRLMM For 250/500K Set and SNP5.0 Affymetrix arrays Statistics from BirdSeed Algorithm Clusters from BirdSeed Algorithm (and median and s.d. of clusters) For SNP6.0 Affymetrix arrays Statistics from BirdSeed Algorithm Clusters from BirdSeed Algorithm (and median and s.d. of clusters) Clusters from CANARY (and median and s.d. of clusters)
9 Experimental Set-up for Paired Normal Design For paired-normal experimental designs, two parameters must be specified Group indicates a set of paired samples Condition indicates which sample(s) to use as reference (Normal) for test sample(s) (Tumor) Parameters must be Group and Condition for GeneSpring GX to recognize it as a paired design Interpretation using Group and Condition must be used for Copy Number Computation
10 Copy Number Analysis Workflow in GeneSpring GX 11 QC / Batch Correction Copy NumberAnalysis: (CN, LOH, ASCN, Log ratio) GISTIC for Identification of Statistically Common CN variation within a set of samples Filter for Regions of Interest Biological Contextualization of Genes in Regions of interest * QC/Batch correction step is not available for Illumina workflow
11 Quality Control on Samples This window should look familiar to current GeneSpringGX users.
12 Quality Control Tools - PCA and Batch Effect Quality Control PCA- -identifies potential sample outliers Batch Effect -identifies and corrects for systematic error when different samples are processed on different days or different conditions.
13 Batch Correction Select interpretation that groups samples into their respective batches Minimum samples per batch Minimum m number of samples per batch to be considered for correction P-value T-test p-value cutoff for each probe Percentage of bad batches allowed If percent bad batches below userspecified value, do not perform correction for probe Each batch is T-tested against a pool of all remaining batches. Correction for each flagged entity is Correction for each flagged entity is performed using a reference batch.
14 Copy Number Computation Copy NumberAnalysis: (CN, LOH, ASCN, Log ratio, LOD score)
15 Copy Number Analysis for Affymetrix Data Computation actually computing: (1) Log ratio values Against Reference design: Normalized intensity of sample/ Normalized intensity of reference Paired design: Normalized intensity of Case/ Normalized intensity of Control (2) Genomic Copy Number Circular Binary Segmentation to identify segments Log ratio values to estimate genomic copy number Confidence value give as log10 of p-value (3) Allele-specific copy number (ascn) information Fawkes algorithm used to assign allele-specific copy number using SNP probes (4) Parent-specific copy number (pscn) information (5) Loss of Heterozygosity (LOH) Hidden Markov Model (HMM) used to calculate LOH score
16 Log Ratio and Copy Number Computation Copy Number computation (paired or against reference) is determined by the interpretation selected: First Log 2 ratios are calculated for every probe: Against Reference design: Normalized intensity of sample/ Normalized intensity of reference Paired design: Normalized intensity of Case/ Normalized intensity of Control
17 Copy Number Computation Circular Binary Segmentation Smooths outliers Finds change points in each sample using a statistic to identify a segment break Validation of change point using t-test test with p value cut off < Outputs are segment break points and mean log ratio for segment Segment Break Points
18 Copy Number Computation Once segments are identified by CBS then copy numbers and confidence scores need to be assigned to them Copy Number: HapMap dataset is used to generate a median map Using the birdseed and CANARY outputs for each possible copy number (0,1,2,3,4) the median and s.d log ratios across all probes is calculated Log ratios for segments from CBS are compared to the median map and copy numbers are assigned Homozygous and Hemizygous deletions are given values of 0 and1 Amplifications are given CN values of 3 and 4. Copy Number Confidence: Copy Numbers between 1.5 and 2.5 are assigned a p value of '1' For any other copy number a T test t against zero of log ratios is performed with multiples l testing ti correction Negative logarithm to the base 10 of the final p value reported as confidence.
19 Copy Number Computation Median Map Copy Number Assigned Genome- Wide Human SNP Array 6.0 Genome-Wide Human SNP Array 5.0 Mean Log Ratio that is mapped Human Mapping 500K Array Set - NSP Human Mapping 500K Array Set - STY Mapping 100k array set Same as Genome Wide Human SNP Array
20 Copy Number Analysis Log ratios are smoothed to give CN values. CN segments are created using Circular Binary Segmentation (CBS) algorithm. CN values log ratios F ti l ll di t CN l i d i Fractional as well as discrete CN values are assigned, in the range of 0-4
21 1. Paired Analysis CN computation Condition-Type Interpretation 2. Each tumor is paired against the Normal of its group 3. All Normals are compared against the reference All samples against reference comparison Only one set of CN Analysis results can be stored.
22 Allele-specific Copy Number Given segment with copy number = 3, which allele was duplicated? Example output: AAB = A2: B1
23 Parent-specific Copy Number Consider a section of a Chromosome with haplotypes: ChrCopy1: A 1B 2A 3B 4B 5 B (after duplication): A 1B 2A 3B 4B 5 B A 1B 2A 3B 4B 5 B ChrCopy2: A 1 A 2 B 3 A 4 B 5 Suppose Copy1 gets duplicated 2 additional times (CN of region =4), the ascn become: A 1 :4 B 1 :0 and pscn = 4-0 A 2 :1 B 2 :3 and pscn = 3-1 A 3 :3 B 3 :1 and pscn = 3-1 A 4 :1 B 4 :3 and pscn = 3-1 A 5 :0 B 5 :4 and pscn = 4-0 PSCN is a measure of allelic imbalance
24 Copy Number Computation for Illumina Arrays Copy Number, Log ratio, and LOH scores calculated in GenomeStudio and imported into GeneSpring GX The following are computed in GeneSpring GX: ASCN information PSCN information
25 Analysis and Filtering Once you have identified regions of genomic alteration in individual sample how can you find meaningful events in groups of samples? Find Common Genomic Variant Regions Filter By Regions Identify Copy Neutral LOH Filter By PSCN
26 Finding Common Genomic Variant Regions Across asetofsamples Samples Genomic Identification of Significant ifi Targets in Cancer (GISTIC)
27 Find Common Genomic Variant Regions Many tumour samples have large numbers of chromosomal abberations. GISTIC was developed to try and distinguish meaningful or driver mutation events from random background somatic or passenger events Driver mutations are functionally important events which confer advantageous biological properties to the tumour allowing it to initiate grow or persist and are more likely to drive cancer pathogenesis GISTIC can also be applied to non cancer datasets where you want to find common genomic variant regions
28 Common Genomic Variant Regions Choose Fine or Coarse Mode Amplified Regions Deleted Regions
29 Common Variation Results Once GISTIC has identified aberrant regions it uses the biological genome to find overlapping genes for amplified and deleted segments For each probeset within the region, the upstream and downstream 1000 bases are scanned and the genes are identified G l i th Genes overlapping the significant regions identified and stored in the Project Navigator
30 Use of Filters to identify genomic landscape prevalent in metastatic prostate cancer
31 Results Analysis 31 Confidentialit March
32 Biological Contextualization of Copy Number Data 32 Confidentialit March
33 Case Study
34 Integrative Analysis of Metastatic Prostate Cancer Prostate Cancer is the most common cancer in men. Primary tumors are thought to be composed of multiple genetically distinct cancer cell clones. Both the primary and the metastatic prostate cancers are p y p heterogenous in nature, posing therapeutic challenges.
35 Datasets Used Expression: GSE metastatic samples from 4 patients and 18 normal samples Genomic Copy Number: GSE metastatic locations from 14 patients and 16 subject paired non-cancerous samples Liu et al, Nat Med May;15(5):559-65
36 Copy Number Analysis in Prostate Cancer Samples 36 Confidentialit March
37 Expression Analysis in Prostate Cancer Samples 37 Confidentialit March
38 PCA- Genotyping Data Shape by Condition: Tumor Normal Color by Patient Color by Patient Group
39 PCA- Expression Data Normal Metastatic QC using PCA shows separation of the Normal and the Metastatic samples of GSE6919
40 Histogram view of data tracks in Genome Browser showing deletions as green blocks and amplifications as red dblocks Published data Chr. 6 Deletion- Pateint #17 Chromosome 6 Validated d in GX11
41 Joint Analysis of Gene Expression and Genomic Copy Number Data in Metastatic Prostate Cancer Copy Number Gene Expression Prostate Cancer Studies Controlled for regions and metastatic tissues 41 Confidentialit March
42 Deletions present in chr.6 of patient 17: An Integrative Analysis
43 Analysis workflow Expression: Genotyping: T-test Standard Reference FC 2.0 p-value: 0.05 Differentially expressed 441 entities Copy Number computation Filters Genome Browser
44 Deletion of PLAGL Fold Downregulation of PLAGL1 in Metastasis Data xpression Ex Genomic Data
45 PLAGL1 Candidate Tumor suppressor gene, with anti-proliferative activities Zinc finger protein with transactivation and DNA binding activity Presence of splice variants which allow differential regulation of apoptosis induction and cell cycle arrest Frequently deleted in many solid tumors-breast, ovarian and renal cell carcinomas Also known as LOT or Lost On Transformation
46 PLAG1-network analysis
47 First order expansion of PLAG1 network and overlay with FC data
48 TCF21 Genomic Data Expression Data TCF21 TCF21 CN=2 No genomic aberration of TCF21 Down regulation of Down-regulation of expression levels of TCF21
49 TCF21 First Order Expansion of the PLAGL1 network identified TCF21, a ts gene, to be down regulated in the expression analysis. The CN of TCF21 remains at 2, unlike that of PLAGL1. TCF21 is known to be frequently silenced epigenetically in head and neck cancer. Consistent with this, TCF21 did not show any deletion in the samples examined, raising the possibility that TFC21 could be epigenetically pg regulated in prostate cancer.
50 Conclusions 1. Using GX11, we could validate the presence of ERG- TMPRSS2 in several of metastatic prostate cancer samples 2. Significant Aberration found in PTEN, FGF18, TRIB3 by GISTIC indicates that these could be driver mutations of prostate cancer. 3. Additional candidates were identified by combined use of filters to identify amplified regions and regions of allelic imbalance. 4. Integrative ti analysis using expression and genotyping data has identified PLAGL1, a candidate ts gene, and TCF21, a ts gene, to be having a possible role in prostate cancer. 5. PLAGL1 deletion, though present in a small percentage of population, is an early event, occurring at a pre-metastatic stage
Simplifying Data Interpretation with Nexus Copy Number
Simplifying Data Interpretation with Nexus Copy Number A WHITE PAPER FROM BIODISCOVERY, INC. Rapid technological advancements, such as high-density acgh and SNP arrays as well as next-generation sequencing
More informationCore Facility Genomics
Core Facility Genomics versatile genome or transcriptome analyses based on quantifiable highthroughput data ascertainment 1 Topics Collaboration with Harald Binder and Clemens Kreutz Project: Microarray
More informationComparative genomic hybridization Because arrays are more than just a tool for expression analysis
Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Comparative genomic hybridization Because arrays are more than just a tool for expression analysis Carsten Friis ( with several slides from
More informationCNV Univariate Analysis Tutorial
CNV Univariate Analysis Tutorial Release 8.1 Golden Helix, Inc. March 18, 2014 Contents 1. Overview 2 2. CNAM Optimal Segmenting 4 A. Performing CNAM Optimal Segmenting..................................
More informationDNA Copy Number and Loss of Heterozygosity Analysis Algorithms
DNA Copy Number and Loss of Heterozygosity Analysis Algorithms Detection of copy-number variants and chromosomal aberrations in GenomeStudio software. Introduction Illumina has developed several algorithms
More informationFocusing on results not data comprehensive data analysis for targeted next generation sequencing
Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes
More informationAGILENT S BIOINFORMATICS ANALYSIS SOFTWARE
ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS
More informationSingle-Cell Whole Genome Sequencing on the C1 System: a Performance Evaluation
PN 100-9879 A1 TECHNICAL NOTE Single-Cell Whole Genome Sequencing on the C1 System: a Performance Evaluation Introduction Cancer is a dynamic evolutionary process of which intratumor genetic and phenotypic
More informationPREDA S4-classes. Francesco Ferrari October 13, 2015
PREDA S4-classes Francesco Ferrari October 13, 2015 Abstract This document provides a description of custom S4 classes used to manage data structures for PREDA: an R package for Position RElated Data Analysis.
More informationAgilent CytoGenomics Software A Complete Solution for Cytogenetic Research Data Analysis
Agilent CytoGenomics Software A Complete Solution for Cytogenetic Research Data Analysis Technical Overview Streamlines the cytogenetic research workflow for finding CNCs, LOH, and UPD Enables manual sample
More informationGlobally, about 9.7% of cancers in men are prostate cancers, and the risk of developing the
Chapter 5 Analysis of Prostate Cancer Association Study Data 5.1 Risk factors for Prostate Cancer Globally, about 9.7% of cancers in men are prostate cancers, and the risk of developing the disease has
More informationStep by Step Guide to Importing Genetic Data into JMP Genomics
Step by Step Guide to Importing Genetic Data into JMP Genomics Page 1 Introduction Data for genetic analyses can exist in a variety of formats. Before this data can be analyzed it must imported into one
More informationGenomeStudio Data Analysis Software
GenomeStudio Data Analysis Software Illumina has created a comprehensive suite of data analysis tools to support a wide range of genetic analysis assays. This single software package provides data visualization
More informationGenomeStudio Data Analysis Software
GenomeStudio Analysis Software Illumina has created a comprehensive suite of data analysis tools to support a wide range of genetic analysis assays. This single software package provides data visualization
More informationData Analysis for Ion Torrent Sequencing
IFU022 v140202 Research Use Only Instructions For Use Part III Data Analysis for Ion Torrent Sequencing MANUFACTURER: Multiplicom N.V. Galileilaan 18 2845 Niel Belgium Revision date: August 21, 2014 Page
More informationUKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory. April, 2015
UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory April, 2015 1 Contents Overview... 3 Rare Variants... 3 Observation... 3 Approach... 3 ApoE
More informationUsing Illumina BaseSpace Apps to Analyze RNA Sequencing Data
Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data The Illumina TopHat Alignment and Cufflinks Assembly and Differential Expression apps make RNA data analysis accessible to any user, regardless
More informationTargeted. sequencing solutions. Accurate, scalable, fast TARGETED
Targeted TARGETED Sequencing sequencing solutions Accurate, scalable, fast Sequencing for every lab, every budget, every application Ion Torrent semiconductor sequencing Ion Torrent technology has pioneered
More informationTutorial for proteome data analysis using the Perseus software platform
Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information
More informationOverview of Genetic Testing and Screening
Integrating Genetics into Your Practice Webinar Series Overview of Genetic Testing and Screening Genetic testing is an important tool in the screening and diagnosis of many conditions. New technology is
More informationSNPbrowser Software v3.5
Product Bulletin SNP Genotyping SNPbrowser Software v3.5 A Free Software Tool for the Knowledge-Driven Selection of SNP Genotyping Assays Easily visualize SNPs integrated with a physical map, linkage disequilibrium
More informationCombining Data from Different Genotyping Platforms. Gonçalo Abecasis Center for Statistical Genetics University of Michigan
Combining Data from Different Genotyping Platforms Gonçalo Abecasis Center for Statistical Genetics University of Michigan The Challenge Detecting small effects requires very large sample sizes Combined
More informationMicroarray Data Analysis. A step by step analysis using BRB-Array Tools
Microarray Data Analysis A step by step analysis using BRB-Array Tools 1 EXAMINATION OF DIFFERENTIAL GENE EXPRESSION (1) Objective: to find genes whose expression is changed before and after chemotherapy.
More informationSeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis
SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis Goal: This tutorial introduces several websites and tools useful for determining linkage disequilibrium
More informationNext Generation Sequencing: Technology, Mapping, and Analysis
Next Generation Sequencing: Technology, Mapping, and Analysis Gary Benson Computer Science, Biology, Bioinformatics Boston University gbenson@bu.edu http://tandem.bu.edu/ The Human Genome Project took
More informationBreast cancer and the role of low penetrance alleles: a focus on ATM gene
Modena 18-19 novembre 2010 Breast cancer and the role of low penetrance alleles: a focus on ATM gene Dr. Laura La Paglia Breast Cancer genetic Other BC susceptibility genes TP53 PTEN STK11 CHEK2 BRCA1
More informationContents. molecular biology techniques. - Mutations in Factor II. - Mutations in MTHFR gene. - Breast cencer genes. - p53 and breast cancer
Contents Introduction: biology and medicine, two separated compartments What we need to know: - boring basics in DNA/RNA structure and overview of particular aspects of molecular biology techniques - How
More informationFrequently Asked Questions Next Generation Sequencing
Frequently Asked Questions Next Generation Sequencing Import These Frequently Asked Questions for Next Generation Sequencing are some of the more common questions our customers ask. Questions are divided
More informationReplacing TaqMan SNP Genotyping Assays that Fail Applied Biosystems Manufacturing Quality Control. Begin
User Bulletin TaqMan SNP Genotyping Assays May 2008 SUBJECT: Replacing TaqMan SNP Genotyping Assays that Fail Applied Biosystems Manufacturing Quality Control In This Bulletin Overview This user bulletin
More informationLecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs)
Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs) Single nucleotide polymorphisms or SNPs (pronounced "snips") are DNA sequence variations that occur
More informationInterpret software. User guide. version 11
Interpret software User guide version 11 This protocol booklet and its contents are Oxford Gene Technology (Operations) Limited 2008. All rights reserved. Reproduction of all or any substantial part of
More informationMUTATION, DNA REPAIR AND CANCER
MUTATION, DNA REPAIR AND CANCER 1 Mutation A heritable change in the genetic material Essential to the continuity of life Source of variation for natural selection New mutations are more likely to be harmful
More informationSICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE
AP Biology Date SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE LEARNING OBJECTIVES Students will gain an appreciation of the physical effects of sickle cell anemia, its prevalence in the population,
More informationAnalyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study
Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study The data for this study is taken from experiment GSE848 from the Gene Expression
More informationAnalysis of FFPE DNA Data in CNAG 2.0 A Manual
Analysis of FFPE DNA Data in CNAG 2.0 A Manual Table of Contents: I. Background P.2 II. Installation and Setup a. Download/Install CNAG 2.0 P.3 b. Setup P.4 III. Extract Mapping 500K FFPE Data P.7 IV.
More informationGAIA: Genomic Analysis of Important Aberrations
GAIA: Genomic Analysis of Important Aberrations Sandro Morganella Stefano Maria Pagnotta Michele Ceccarelli Contents 1 Overview 1 2 Installation 2 3 Package Dependencies 2 4 Vega Data Description 2 4.1
More informationHow many of you have checked out the web site on protein-dna interactions?
How many of you have checked out the web site on protein-dna interactions? Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail. Find and be ready to discuss
More informationAn example of bioinformatics application on plant breeding projects in Rijk Zwaan
An example of bioinformatics application on plant breeding projects in Rijk Zwaan Xiangyu Rao 17-08-2012 Introduction of RZ Rijk Zwaan is active worldwide as a vegetable breeding company that focuses on
More informationName: Class: Date: ID: A
Name: Class: _ Date: _ Meiosis Quiz 1. (1 point) A kidney cell is an example of which type of cell? a. sex cell b. germ cell c. somatic cell d. haploid cell 2. (1 point) How many chromosomes are in a human
More informationOverview of Next Generation Sequencing platform technologies
Overview of Next Generation Sequencing platform technologies Dr. Bernd Timmermann Next Generation Sequencing Core Facility Max Planck Institute for Molecular Genetics Berlin, Germany Outline 1. Technologies
More informationAdvances in RainDance Sequence Enrichment Technology and Applications in Cancer Research. March 17, 2011 Rendez-Vous Séquençage
Advances in RainDance Sequence Enrichment Technology and Applications in Cancer Research March 17, 2011 Rendez-Vous Séquençage Presentation Overview Core Technology Review Sequence Enrichment Application
More informationBasic Analysis of Microarray Data
Basic Analysis of Microarray Data A User Guide and Tutorial Scott A. Ness, Ph.D. Co-Director, Keck-UNM Genomics Resource and Dept. of Molecular Genetics and Microbiology University of New Mexico HSC Tel.
More informationAnalysis of ChIP-seq data in Galaxy
Analysis of ChIP-seq data in Galaxy November, 2012 Local copy: https://galaxy.wi.mit.edu/ Joint project between BaRC and IT Main site: http://main.g2.bx.psu.edu/ 1 Font Conventions Bold and blue refers
More informationStep-by-Step Guide to Basic Expression Analysis and Normalization
Step-by-Step Guide to Basic Expression Analysis and Normalization Page 1 Introduction This document shows you how to perform a basic analysis and normalization of your data. A full review of this document
More informationQuality Assessment of Exon and Gene Arrays
Quality Assessment of Exon and Gene Arrays I. Introduction In this white paper we describe some quality assessment procedures that are computed from CEL files from Whole Transcript (WT) based arrays such
More informationCHAPTER 2: UNDERSTANDING CANCER
CHAPTER 2: UNDERSTANDING CANCER INTRODUCTION We are witnessing an era of great discovery in the field of cancer research. New insights into the causes and development of cancer are emerging. These discoveries
More informationLESSON 3.5 WORKBOOK. How do cancer cells evolve? Workbook Lesson 3.5
LESSON 3.5 WORKBOOK How do cancer cells evolve? In this unit we have learned how normal cells can be transformed so that they stop behaving as part of a tissue community and become unresponsive to regulation.
More informationGenomes and SNPs in Malaria and Sickle Cell Anemia
Genomes and SNPs in Malaria and Sickle Cell Anemia Introduction to Genome Browsing with Ensembl Ensembl The vast amount of information in biological databases today demands a way of organising and accessing
More informationPartek Methylation User Guide
Partek Methylation User Guide Introduction This user guide will explain the different types of workflow that can be used to analyze methylation datasets. Under the Partek Methylation workflow there are
More informationHuman Genome Organization: An Update. Genome Organization: An Update
Human Genome Organization: An Update Genome Organization: An Update Highlights of Human Genome Project Timetable Proposed in 1990 as 3 billion dollar joint venture between DOE and NIH with 15 year completion
More informationRoberto Ciccone, Orsetta Zuffardi Università di Pavia
Roberto Ciccone, Orsetta Zuffardi Università di Pavia XIII Corso di Formazione Malformazioni Congenite dalla Diagnosi Prenatale alla Terapia Postnatale unipv.eu Carrara, 24 ottobre 2014 Legend:Bluebars
More information8/7/2012. Experimental Design & Intro to NGS Data Analysis. Examples. Agenda. Shoe Example. Breast Cancer Example. Rat Example (Experimental Design)
Experimental Design & Intro to NGS Data Analysis Ryan Peters Field Application Specialist Partek, Incorporated Agenda Experimental Design Examples ANOVA What assays are possible? NGS Analytical Process
More informationmicrornas Non protein coding, endogenous RNAs of 21-22nt length Evolutionarily conserved
microrna 2 micrornas Non protein coding, endogenous RNAs of 21-22nt length Evolutionarily conserved Regulate gene expression by binding complementary regions at 3 regions of target mrnas Act as negative
More informationDifferential privacy in health care analytics and medical research An interactive tutorial
Differential privacy in health care analytics and medical research An interactive tutorial Speaker: Moritz Hardt Theory Group, IBM Almaden February 21, 2012 Overview 1. Releasing medical data: What could
More informationTruSeq Custom Amplicon v1.5
Data Sheet: Targeted Resequencing TruSeq Custom Amplicon v1.5 A new and improved amplicon sequencing solution for interrogating custom regions of interest. Highlights Figure 1: TruSeq Custom Amplicon Workflow
More informationAnalyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6
Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6 Overview This tutorial outlines how microrna data can be analyzed within Partek Genomics Suite. Additionally,
More informationOrganization and analysis of NGS variations. Alireza Hadj Khodabakhshi Research Investigator
Organization and analysis of NGS variations. Alireza Hadj Khodabakhshi Research Investigator Why is the NGS data processing a big challenge? Computation cannot keep up with the Biology. Source: illumina
More informationChapter 8: Recombinant DNA 2002 by W. H. Freeman and Company Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company
Genetic engineering: humans Gene replacement therapy or gene therapy Many technical and ethical issues implications for gene pool for germ-line gene therapy what traits constitute disease rather than just
More informationSchool of Nursing. Presented by Yvette Conley, PhD
Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression
More informationSupervised and unsupervised learning - 1
Chapter 3 Supervised and unsupervised learning - 1 3.1 Introduction The science of learning plays a key role in the field of statistics, data mining, artificial intelligence, intersecting with areas in
More informationIntroduction To Epigenetic Regulation: How Can The Epigenomics Core Services Help Your Research? Maria (Ken) Figueroa, M.D. Core Scientific Director
Introduction To Epigenetic Regulation: How Can The Epigenomics Core Services Help Your Research? Maria (Ken) Figueroa, M.D. Core Scientific Director Gene expression depends upon multiple factors Gene Transcription
More informationFactors for success in big data science
Factors for success in big data science Damjan Vukcevic Data Science Murdoch Childrens Research Institute 16 October 2014 Big Data Reading Group (Department of Mathematics & Statistics, University of Melbourne)
More informationWhat is Cancer? Cancer is a genetic disease: Cancer typically involves a change in gene expression/function:
Cancer is a genetic disease: Inherited cancer Sporadic cancer What is Cancer? Cancer typically involves a change in gene expression/function: Qualitative change Quantitative change Any cancer causing genetic
More informationChapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes
Chapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes 2.1 Introduction Large-scale insertional mutagenesis screening in
More informationLecture 3: Mutations
Lecture 3: Mutations Recall that the flow of information within a cell involves the transcription of DNA to mrna and the translation of mrna to protein. Recall also, that the flow of information between
More informationBioBoot Camp Genetics
BioBoot Camp Genetics BIO.B.1.2.1 Describe how the process of DNA replication results in the transmission and/or conservation of genetic information DNA Replication is the process of DNA being copied before
More informationGenotyping and quality control of UK Biobank, a large- scale, extensively phenotyped prospective resource
Genotyping and quality control of UK Biobank, a large- scale, extensively phenotyped prospective resource Information for researchers Interim Data Release, 2015 1 Introduction... 3 1.1 UK Biobank... 3
More informationCCR Biology - Chapter 9 Practice Test - Summer 2012
Name: Class: Date: CCR Biology - Chapter 9 Practice Test - Summer 2012 Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Genetic engineering is possible
More informationIdentification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation
Identification of rheumatoid arthritis and osterthritis patients by transcriptome-based rule set generation Bering Limited Report generated on September 19, 2014 Contents 1 Dataset summary 2 1.1 Project
More informationCollaborative Association Study of Psoriasis. Gonçalo Abecasis, Anne Bowcock, James Elder, Jerry Krueger
Collaborative Association Study of Psoriasis Gonçalo Abecasis, Anne Bowcock, James Elder, Jerry Krueger Psoriasis Chronic, inflammatory skin condition Characteristic lesions, can affect substantial proportion
More informationAutoimmunity and immunemediated. FOCiS. Lecture outline
1 Autoimmunity and immunemediated inflammatory diseases Abul K. Abbas, MD UCSF FOCiS 2 Lecture outline Pathogenesis of autoimmunity: why selftolerance fails Genetics of autoimmune diseases Therapeutic
More informationWissenschaftliche Highlights der GSF 2007
H Forschungszentrum für Umwelt und Gesundheit GmbH in der Helmholtzgemeinschaft Wissenschaftlich-Technische Abteilung Wissenschaftliche Highlights der GSF 2007 Abfrage Oktober 2007 Institut / Selbst. Abteilung
More informationOnline Supplement to Polygenic Influence on Educational Attainment. Genotyping was conducted with the Illumina HumanOmni1-Quad v1 platform using
Online Supplement to Polygenic Influence on Educational Attainment Construction of Polygenic Score for Educational Attainment Genotyping was conducted with the Illumina HumanOmni1-Quad v1 platform using
More informationBio EOC Topics for Cell Reproduction: Bio EOC Questions for Cell Reproduction:
Bio EOC Topics for Cell Reproduction: Asexual vs. sexual reproduction Mitosis steps, diagrams, purpose o Interphase, Prophase, Metaphase, Anaphase, Telophase, Cytokinesis Meiosis steps, diagrams, purpose
More informationPackage cgdsr. August 27, 2015
Type Package Package cgdsr August 27, 2015 Title R-Based API for Accessing the MSKCC Cancer Genomics Data Server (CGDS) Version 1.2.5 Date 2015-08-25 Author Anders Jacobsen Maintainer Augustin Luna
More informationEuropean Medicines Agency
European Medicines Agency July 1996 CPMP/ICH/139/95 ICH Topic Q 5 B Quality of Biotechnological Products: Analysis of the Expression Construct in Cell Lines Used for Production of r-dna Derived Protein
More informationNATIONAL GENETICS REFERENCE LABORATORY (Manchester)
NATIONAL GENETICS REFERENCE LABORATORY (Manchester) MLPA analysis spreadsheets User Guide (updated October 2006) INTRODUCTION These spreadsheets are designed to assist with MLPA analysis using the kits
More informationGWAS Data Cleaning. GENEVA Coordinating Center Department of Biostatistics University of Washington. January 13, 2016.
GWAS Data Cleaning GENEVA Coordinating Center Department of Biostatistics University of Washington January 13, 2016 Contents 1 Overview 2 2 Preparing Data 3 2.1 Data formats used in GWASTools............................
More informationInformation leaflet. Centrum voor Medische Genetica. Version 1/20150504 Design by Ben Caljon, UZ Brussel. Universitair Ziekenhuis Brussel
Information on genome-wide genetic testing Array Comparative Genomic Hybridization (array CGH) Single Nucleotide Polymorphism array (SNP array) Massive Parallel Sequencing (MPS) Version 120150504 Design
More informationDeCyder Extended Data Analysis module Version 1.0
GE Healthcare DeCyder Extended Data Analysis module Version 1.0 Module for DeCyder 2D version 6.5 User Manual Contents 1 Introduction 1.1 Introduction... 7 1.2 The DeCyder EDA User Manual... 9 1.3 Getting
More informationGSR Microarrays Project Management System
GSR Microarrays Project Management System A User s Guide GSR Microarrays Vanderbilt University MRBIII, Room 9274 465 21 st Avenue South Nashville, TN 37232 microarray@vanderbilt.edu (615) 936-3003 www.gsr.vanderbilt.edu
More informationSingle-Cell DNA Sequencing with the C 1. Single-Cell Auto Prep System. Reveal hidden populations and genetic diversity within complex samples
DATA Sheet Single-Cell DNA Sequencing with the C 1 Single-Cell Auto Prep System Reveal hidden populations and genetic diversity within complex samples Single-cell sensitivity Discover and detect SNPs,
More informationGeneChip Sequence Analysis Software (GSEQ) is used to analyze data from the Resequencing Arrays
GeneChip Sequence Analysis Software 4.1 Note For more information, Please refer to the Affymetrix GeneChip Sequence Analysis Software User s Guide Version 4.1 guidebook & Quick Reference Card I. GSEQ Introduction
More informationConsistent Assay Performance Across Universal Arrays and Scanners
Technical Note: Illumina Systems and Software Consistent Assay Performance Across Universal Arrays and Scanners There are multiple Universal Array and scanner options for running Illumina DASL and GoldenGate
More informationCurrent Motif Discovery Tools and their Limitations
Current Motif Discovery Tools and their Limitations Philipp Bucher SIB / CIG Workshop 3 October 2006 Trendy Concepts and Hypotheses Transcription regulatory elements act in a context-dependent manner.
More informationGenetics Lecture Notes 7.03 2005. Lectures 1 2
Genetics Lecture Notes 7.03 2005 Lectures 1 2 Lecture 1 We will begin this course with the question: What is a gene? This question will take us four lectures to answer because there are actually several
More informationTutorial for Windows and Macintosh. Preparing Your Data for NGS Alignment
Tutorial for Windows and Macintosh Preparing Your Data for NGS Alignment 2015 Gene Codes Corporation Gene Codes Corporation 775 Technology Drive, Ann Arbor, MI 48108 USA 1.800.497.4939 (USA) 1.734.769.7249
More informationA Primer of Genome Science THIRD
A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:
More informationGuide for Data Visualization and Analysis using ACSN
Guide for Data Visualization and Analysis using ACSN ACSN contains the NaviCell tool box, the intuitive and user- friendly environment for data visualization and analysis. The tool is accessible from the
More informationSystematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals
Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Xiaohui Xie 1, Jun Lu 1, E. J. Kulbokas 1, Todd R. Golub 1, Vamsi Mootha 1, Kerstin Lindblad-Toh
More informationPsychoonkology, Sept. 2010 lifestyle factors and epigenetics
Psychoonkology, Sept. 2010 lifestyle factors and epigenetics Alexander G. Haslberger Dep. für Ernährungswissenschaften Univ. of Vienna Working group: Food, GI-Microbiology, Epigenetics Content Health:
More informationRelease Notes. Agilent CytoGenomics v4.0.2. For Research Use Only. Not for use in diagnostic procedures. Product Number
Release Notes Agilent CytoGenomics v4.0.2 Product Number G1662AA CytoGenomics Client 1 year named license (including Feature Extraction). This license supports installation of one client and server (to
More informationTutorial on gplink. http://pngu.mgh.harvard.edu/~purcell/plink/gplink.shtml. PLINK tutorial, December 2006; Shaun Purcell, shaun@pngu.mgh.harvard.
Tutorial on gplink http://pngu.mgh.harvard.edu/~purcell/plink/gplink.shtml Basic gplink analyses Data management Summary statistics Association analysis Population stratification IBD-based analysis gplink
More informationCourse on Functional Analysis. ::: Gene Set Enrichment Analysis - GSEA -
Course on Functional Analysis ::: Madrid, June 31st, 2007. Gonzalo Gómez, PhD. ggomez@cnio.es Bioinformatics Unit CNIO ::: Contents. 1. Introduction. 2. GSEA Software 3. Data Formats 4. Using GSEA 5. GSEA
More informationData Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms
Data Processing of Nextera Mate Pair Reads on Illumina Sequencing Platforms Introduction Mate pair sequencing enables the generation of libraries with insert sizes in the range of several kilobases (Kb).
More informationSingle Nucleotide Polymorphisms (SNPs)
Single Nucleotide Polymorphisms (SNPs) Additional Markers 13 core STR loci Obtain further information from additional markers: Y STRs Separating male samples Mitochondrial DNA Working with extremely degraded
More informationSAP HANA Enabling Genome Analysis
SAP HANA Enabling Genome Analysis Joanna L. Kelley, PhD Postdoctoral Scholar, Stanford University Enakshi Singh, MSc HANA Product Management, SAP Labs LLC Outline Use cases Genomics review Challenges in
More informationCluster software and Java TreeView
Cluster software and Java TreeView To download the software: http://bonsai.hgc.jp/~mdehoon/software/cluster/software.htm http://bonsai.hgc.jp/~mdehoon/software/cluster/manual/treeview.html Cluster 3.0
More information1 Mutation and Genetic Change
CHAPTER 14 1 Mutation and Genetic Change SECTION Genes in Action KEY IDEAS As you read this section, keep these questions in mind: What is the origin of genetic differences among organisms? What kinds
More informationSeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications
Product Bulletin Sequencing Software SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Comprehensive reference sequence handling Helps interpret the role of each
More information