Open array data analysis: mirna profiling in blood samples from patient suffering heart diseases

Size: px
Start display at page:

Download "Open array data analysis: mirna profiling in blood samples from patient suffering heart diseases"

Transcription

1 CRG BIOINFORMATICS CORE FACILITIES Open array data analysis: mirna profiling in blood samples from patient suffering heart diseases May 2015 Users: Begona Benito and Marta Tajes Users center: IMIM Analyst: Sarah Bonnin Group leader: Julia Ponomarenko

2 1. Project Scientific background, limitations: Blood samples from patients with either preserved or reduced ejection fraction, with or without atrial fibrillation. Some studies suggest that some mirnas could be directly involved in the development of the disease. Limitations: * Because of the high dilution of target molecules in blood samples, mirnas are present in low concentration in plasma and are therefore difficult to detect. * One of the challenges of mirna profiling from serum or plasma is the lack of established housekeeping genes for data normalization. Technology: Platform: OpenArray, Life Technology Array: TaqMan OpenArray Human MicroRNA Panel, QuantStudio 12K Flex Catalog number: mirnas + 4 controls replicated 16 times 3 samples are loaded on each array, for a total of 15 arrays ran in 4 different batches at the following dates: - batch A, 3 arrays: 13/02/ batch B, 4 arrays: 05/03/ batch C, 4 arrays: 10/03/ batch D, 4 arrays: 11/03/2015 Data and goal: The experiment consists of 45 samples, divided into 5 experimental groups: * 9 technical replicates samples: 9 samples pooled together and then ran as technical replicates, to be able to study the technical variation (TechCtrl) * 9 preserved ejection fraction (PEF = 4) * 9 preserved ejection fraction with atrial fibrillation (PEF+AF = 3) * 9 reduced ejection fraction (REF = 2)

3 * 9 reduced ejection fraction with atrial fibrillation (REF+AF = 1) The users are mostly interested in the change of mirna expression in case of atrial fibrillation in each ejection fraction situation. Hence, we will first focus on comparing PEF+AF vs PEF and REF+AF vs REF. Table 1 shows, for each sample, which experimental group it belongs to, on which array it was run, and in which batch. Sample ID Experimental group Array Batch = REF+AF ROK49 B = REF+AF ROK58 B = REF+AF ROK62 B = REF+AF ROK50 C = REF+AF RON52 C = REF+AF RON60 C = REF+AF RON50 D = REF+AF ROK51 D = REF+AF ROK67 D = REF OMZ01 A = REF OMZ18 A = REF OMZ54 A = REF ROL05 B = REF ROK49 B = REF ROK62 B = REF ROL1 C = REF RON60 C = REF ROK51 D = PEF+AF OMZ18 A = PEF+AF OMZ01 A = PEF+AF OMZ54 A = PEF+AF ROL05 B = PEF+AF ROK58 B = PEF+AF ROL1 C = PEF+AF RON52 C = PEF+AF RON42 D = PEF+AF RON50 D = PEF ROK62 B = PEF ROK50 C = PEF RON52 C = PEF RON60 C = PEF RON42 D = PEF RON50 D = PEF ROK51 D

4 = PEF ROK67 D = PEF ROK67 D 12Q TechCtrl RON42 D 13Q TechCtrl ROL05 B 14Q TechCtrl OMZ01 A 15Q TechCtrl ROK49 B 16Q TechCtrl OMZ18 A 17Q TechCtrl ROK50 C 18Q TechCtrl ROL1 C 19Q TechCtrl OMZ54 A 20Q TechCtrl ROK58 B Table 1

5 2. Preprocessing Extraction of Ct data: Ct data for all mirnas and all samples was extracted from the analysis_result.txt (part of the raw data handed by the users) file for each mirna and each sample. All analysis was performed in the R/Bioconductor environment. In particular, Bioconductor package HTqPCR was used as it is designed for the analysis of high-throughput qpcr data. Quality control: Figure 1 shows the raw Ct distribution for each sample. Figure 1 We observe two clear Ct density peaks : one which summit is located around Ct=25, the other one around Ct=40.

6 mirna transcripts for which Ct is around 40 are too lowly expressed to be considered as actually expressed: we will try to filter out some features in order not to lose too much detection power. Figure 2 shows a hierarchical clustering of samples using all mirnas. Colors represent the experimental groups the samples belong to (a.), or the batches (b.) in which samples were run. a. Clustering colored per experimental group. b. Clustering colored per batch. Figure 2. Dendrograms using raw data. Figure 2.b shows us a slight batch effect: indeed samples that were run in batch A are all grouping together. This is a bias often found when arrays are not processed all in the same batch and/or on the same day.

7 It is to remember that such technical biases are more visible when features are lowly expressed or when few differences are expected between experimental groups. We will try to correct for that bias. Features filtering: Features are tagged as Undetermined if their Ct is beyond 38, and Unreliable if their Ct is below 10 or if their standard variation is above 0.9 across all samples of a same experimental group. We are then filtering out features that are Undetermined/Unreliable in 36 samples or more (we consider that features can potentially be expressed in only one experimental group, i.e. 9 samples here, and not expressed in the 36 remaining samples). Using that filtering, 411 features were removed and we will be working with the 407 remaining ones. Figure 3 shows the density plot (same as Figure 1) of the remaining filtered data: we can see that the second peak of lowly expressed features is well reduced. Figure 3

8 Figure 4 shows the dendrogram (as Figure 2) using the remaining features after filtering. Colors represent the experimental groups the samples belong to (a.), or the batches (b.) in which samples were run. a. Clustering colored per experimental group. b. Clustering colored per batch. Figure 4. Dendrograms using filtered data. Batch effect correction: The ComBat method (Bioconductor package sva ) was applied to try and correct for the batch effect we observe. ComBat allows adjusting for batch effects in a dataset where the batch covariate is known, which is the case here.

9 Figure 5 and 6 show, as in previous steps, the Ct density per sample and dendrograms based on filtered and corrected data, respectively. Figure 5 a. Clustering colored by experimental group.

10 b. Clustering colored by batches. Figure 6. Dendrograms using filtered and batch corrected data. Figure 6b shows us that samples from batch A do not clustered all together as previously observed, so the batch effect seems to have been corrected. Figure 6a does not show a very improved clustering of samples per experimental group, apart maybe slightly for the group of replicated controls (TechCtrl). Normalization: A commonly used and validated method for qpcr normalization is the deltact intra-sample normalization: one or more features within the array are chosen (sufficiently expressed and stable in expression across the whole experiment), and are used as reference feature(s) for raw Ct correction. The Ct data from this (or these) reference feature (s) is (are) then subtracted from all other features, to adjust for intra-sample variability and make samples better comparable. Selection of reference features 4 control features are provided within this array, and are repeated each 16 times in each array: _ath-miR159a_B _RNU48_B _RNU44_B _U6. We will first check their levels of expression and variability within and across samples (on raw data before filtering and ComBat correction).

11 Figure 7 shows boxplots dispaying the Ct distribution of each control feature per sample. Results are displayed only for 4 samples but show the main trends _Ct.txt U6 athmir159 RNU44 RNU48 Figure 7. Figure 8 shows the expression profiles of these control features across samples.

12 control genes Ct _ath mir159a_b _RNU48_B _RNU44_B _U6 rrna_b Figure Q Q Q 15Q 16Q Q 18Q 19Q Q samples Of the 4 control features, _ath-miR159a_B and _RNU44_B have very high Ct values, i.e. very low transcript expression (hence unreliable) _RNU48_B is generally more highly expressed, but seems to be varying in expression across samples quite much _U6 is the most stable in expression across samples, and is sufficiently expressed. Next we tried to find some mirnas within the array which would be suitable (and better than the default controls) as references for deltact normalization: mirnas for which maximum Ct is below or equals 35, and coefficient of variation less than 0.1 across all samples, are selected. This method results in the selection of 61 mirnas. From these 61 mirnas, we decide to select the top 10 mirnas, i.e. the ones that show lowest levels of variation across samples (smallest coefficient of variation): _hsa-miR-10b#_B _hsa-miR-144#_B _HSA-MIR-1291_B

13 000512_hsa-miR-210_A _hsa-miR-10a_A _hsa-miR-193a-5p_A _hsa-miR-660_A _hsa-miR-30a-3p_B _hsa-miR-423-5p_A _hsa-miR-590-5p_A Figure 9 shows the Ct profiles of these 10 mirnas across samples (a.) and their intra-experimental group variation (b.). tested mirna for use as controls Ct _hsa mir 10b#_B _hsa mir 144#_B _HSA MIR 1291_B _hsa mir 210_A _hsa mir 10a_A _hsa mir 193a 5p_A _hsa mir 660_A _hsa mir 30a 3p_B _hsa mir 423 5p_A _hsa mir 590 5p_A Q Q Q 15Q 16Q Q 18Q 19Q Q samples a TechCtrl 30 Ct values for samples b _hsa mir 10a_A _hsa mir 30a 3p_B _hsa mir 210_A _hsa mir 660_A _hsa mir 590 5p_A _hsa mir 144#_B _hsa mir 193a 5p_A _hsa mir 10b#_B _hsa mir 423 5p_A _HSA MIR 1291_B

14 Figure most stable mirnas that will be used for normalization. These 10 samples are used for normalization of our data (filtered and ComBat corrected) using the deltact method. 3. Analysis Differential expression analysis: Remaining control probes (000338_ath-miR159a_B, _RNU48_B, _RNU44_B, _U6) are removed from the dataset before performing differential expression analysis: it will hence be performed on 375 mirnas. A method from HTqPCR based on limma (linear models for microarray data) was used, which uses a moderated t-test to assess differential expression of mirnas between experimental groups. Results: Results (Excel file) can be found in: Using the following credentials: Login: mtajes Password: marta15 Brief description of the columns found in the results file: t.test : The result of the t-test. p.value : The corresponding p.values. adj.p.value : P-values after correcting for multiple testing using the Benjamini- Holm method. ddct : The deltadeltact values = deltadeltact = deltact(target) deltact(calibrator) FC: The fold change; 2^(-ddCt). Target/Calibrator: the first/last experimental group in a pairwise comparison, respectively; for G1 vs G2, G1 is the target, G2 the calibrator.

15 Mean columns: The average Ct across the target/calibrator samples for the given Category columns: all results are assigned to a category, either "OK" or "Unreliable" depending on the input Ct values: the result will be "OK unless at least half of the Ct values for a given gene are unreliable/undetermined. Filtering the data using the adjusted p-value (<0.05) does not yield any result. Table 2 lists the mirnas found when filtering the data using the (unadjusted) p-value (< 0.05). G1 vs G2 G3 vs G4 22 mirnas 12 mirnas _hsa-miR-27b_A _hsa-miR-107_A _hsa-miR-539_A _hsa-miR-645_B _hsa-miR-411_A _hsa-miR-598_A _hsa-miR-505#_B _hsa-miR-636_A _hsa-miR-1_A _hsa-miR-9#_B _hsa-miR-331-5p_A _hsa-miR-142-5p_A _hsa-miR-22#_B _hsa-miR-425#_B _hsa-miR-30d#_B _hsa-miR-483-3p_B _hsa-miR-652_A _hsa-miR-489_A _hsa-miR-628-3p_B _hsa-miR-20a#_B _HSA-MIR-151-5P_B _HSA-MIR-1255B_B Table _hsa-let-7a_A _hsa-miR-107_A _hsa-miR-181c_A _hsa-miR-199b_A _hsa-miR-302c_A _hsa-miR-624_B _hsa-miR-939_B _hsa-miR-889_A _hsa-miR-654-3p_A _hsa-miR-146b-3p_A _hsa-miR-520d-5p_A _hsa-miR-589_A

16 4. References OpenArray: R project: R Development Core Team (2008). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN Bioconductor: Huber, W., Carey, J. V, Gentleman, R., Anders, S., Carlson, M., Carvalho, S. B, Bravo, C. H, Davis, S., Gatto, L., Girke, T., Gottardo, R., Hahne, F., Hansen, D. K, Irizarry, A. R, Lawrence, M., Love, I. M, MacDonald, J., Obenchain, V., Ole's, K. A, Pag'es, H., Reyes, A., Shannon, P., Smyth, K. G, Tenenbaum, D., Waldron, L., Morgan and M. (2015). Orchestrating highthroughput genomic analysis with Bioconductor. Nature Methods, 12(2), pp HTqPCR: Dvinge H and Bertone P (2009). HTqPCR: High - throughput analysis and visualization of quantitative real - time PCR data in R. Bioinformatics, 25(24), pp ComBat: Johnson WE, Rabinovic A, and Li C (2007). Adjusting batch effects in microarray expression data using Empirical Bayes methods. Biostatistics 8(1): limma: Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W and Smyth GK (2015). limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Research, 43(7), pp. e47. Shaffer J, Schlumpberger M and Lader E. mirna profiling from blood challenges and recommendations. From Qiagen:

Gene Expression Analysis

Gene Expression Analysis Gene Expression Analysis Jie Peng Department of Statistics University of California, Davis May 2012 RNA expression technologies High-throughput technologies to measure the expression levels of thousands

More information

Analysis of Illumina Gene Expression Microarray Data

Analysis of Illumina Gene Expression Microarray Data Analysis of Illumina Gene Expression Microarray Data Asta Laiho, Msc. Tech. Bioinformatics research engineer The Finnish DNA Microarray Centre Turku Centre for Biotechnology, Finland The Finnish DNA Microarray

More information

Statistical issues in the analysis of microarray data

Statistical issues in the analysis of microarray data Statistical issues in the analysis of microarray data Daniel Gerhard Institute of Biostatistics Leibniz University of Hannover ESNATS Summerschool, Zermatt D. Gerhard (LUH) Analysis of microarray data

More information

Consistent Assay Performance Across Universal Arrays and Scanners

Consistent Assay Performance Across Universal Arrays and Scanners Technical Note: Illumina Systems and Software Consistent Assay Performance Across Universal Arrays and Scanners There are multiple Universal Array and scanner options for running Illumina DASL and GoldenGate

More information

Tutorial for proteome data analysis using the Perseus software platform

Tutorial for proteome data analysis using the Perseus software platform Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information

More information

Exiqon Array Software Manual. Quick guide to data extraction from mircury LNA microrna Arrays

Exiqon Array Software Manual. Quick guide to data extraction from mircury LNA microrna Arrays Exiqon Array Software Manual Quick guide to data extraction from mircury LNA microrna Arrays March 2010 Table of contents Introduction Overview...................................................... 3 ImaGene

More information

Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6

Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6 Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6 Overview This tutorial outlines how microrna data can be analyzed within Partek Genomics Suite. Additionally,

More information

Identification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation

Identification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation Identification of rheumatoid arthritis and osterthritis patients by transcriptome-based rule set generation Bering Limited Report generated on September 19, 2014 Contents 1 Dataset summary 2 1.1 Project

More information

Introduction To Real Time Quantitative PCR (qpcr)

Introduction To Real Time Quantitative PCR (qpcr) Introduction To Real Time Quantitative PCR (qpcr) SABiosciences, A QIAGEN Company www.sabiosciences.com The Seminar Topics The advantages of qpcr versus conventional PCR Work flow & applications Factors

More information

RT 2 Profiler PCR Array: Web-Based Data Analysis Tutorial

RT 2 Profiler PCR Array: Web-Based Data Analysis Tutorial RT 2 Profiler PCR Array: Web-Based Data Analysis Tutorial Samuel J. Rulli, Jr., Ph.D. qpcr-applications Scientist Samuel.Rulli@QIAGEN.com Pathway Focused Research from Sample Prep to Data Analysis! -2-

More information

Software and Methods for the Analysis of Affymetrix GeneChip Data. Rafael A Irizarry Department of Biostatistics Johns Hopkins University

Software and Methods for the Analysis of Affymetrix GeneChip Data. Rafael A Irizarry Department of Biostatistics Johns Hopkins University Software and Methods for the Analysis of Affymetrix GeneChip Data Rafael A Irizarry Department of Biostatistics Johns Hopkins University Outline Overview Bioconductor Project Examples 1: Gene Annotation

More information

Real-time PCR: Understanding C t

Real-time PCR: Understanding C t APPLICATION NOTE Real-Time PCR Real-time PCR: Understanding C t Real-time PCR, also called quantitative PCR or qpcr, can provide a simple and elegant method for determining the amount of a target sequence

More information

ALLEN Mouse Brain Atlas

ALLEN Mouse Brain Atlas TECHNICAL WHITE PAPER: QUALITY CONTROL STANDARDS FOR HIGH-THROUGHPUT RNA IN SITU HYBRIDIZATION DATA GENERATION Consistent data quality and internal reproducibility are critical concerns for high-throughput

More information

REAL TIME PCR USING SYBR GREEN

REAL TIME PCR USING SYBR GREEN REAL TIME PCR USING SYBR GREEN 1 THE PROBLEM NEED TO QUANTITATE DIFFERENCES IN mrna EXPRESSION SMALL AMOUNTS OF mrna LASER CAPTURE SMALL AMOUNTS OF TISSUE PRIMARY CELLS PRECIOUS REAGENTS 2 THE PROBLEM

More information

From Reads to Differentially Expressed Genes. The statistics of differential gene expression analysis using RNA-seq data

From Reads to Differentially Expressed Genes. The statistics of differential gene expression analysis using RNA-seq data From Reads to Differentially Expressed Genes The statistics of differential gene expression analysis using RNA-seq data experimental design data collection modeling statistical testing biological heterogeneity

More information

Predictive Gene Signature Selection for Adjuvant Chemotherapy in Non-Small Cell Lung Cancer Patients

Predictive Gene Signature Selection for Adjuvant Chemotherapy in Non-Small Cell Lung Cancer Patients Predictive Gene Signature Selection for Adjuvant Chemotherapy in Non-Small Cell Lung Cancer Patients by Li Liu A practicum report submitted to the Department of Public Health Sciences in conformity with

More information

2.500 Threshold. 2.000 1000e - 001. Threshold. Exponential phase. Cycle Number

2.500 Threshold. 2.000 1000e - 001. Threshold. Exponential phase. Cycle Number application note Real-Time PCR: Understanding C T Real-Time PCR: Understanding C T 4.500 3.500 1000e + 001 4.000 3.000 1000e + 000 3.500 2.500 Threshold 3.000 2.000 1000e - 001 Rn 2500 Rn 1500 Rn 2000

More information

Correlation of microarray and quantitative real-time PCR results. Elisa Wurmbach Mount Sinai School of Medicine New York

Correlation of microarray and quantitative real-time PCR results. Elisa Wurmbach Mount Sinai School of Medicine New York Correlation of microarray and quantitative real-time PCR results Elisa Wurmbach Mount Sinai School of Medicine New York Microarray techniques Oligo-array: Affymetrix, Codelink, spotted oligo-arrays (60-70mers)

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

REAL TIME PCR SYBR GREEN

REAL TIME PCR SYBR GREEN REAL TIME PCR SYBR GREEN 1 THE PROBLEM NEED TO QUANTITATE DIFFERENCES IN mrna EXPRESSION SMALL AMOUNTS OF mrna LASER CAPTURE SMALL AMOUNTS OF TISSUE PRIMARY CELLS PRECIOUS REAGENTS 2 THE PROBLEM QUANTITATION

More information

200631 - ADO - Omics Data Analysis

200631 - ADO - Omics Data Analysis Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2016 200 - FME - School of Mathematics and Statistics 1004 - UB - (ENG)Universitat de Barcelona MASTER'S DEGREE IN STATISTICS AND

More information

ExploRase: Multivariate exploratory analysis and visualization for systems biology. 1 Introduction. Abstract

ExploRase: Multivariate exploratory analysis and visualization for systems biology. 1 Introduction. Abstract ExploRase: Multivariate exploratory analysis and visualization for systems biology Michael Lawrence, Dianne Cook, Eun-Kyung Lee Abstract The large datasets being produced by high-throughput biological

More information

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI

STATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members

More information

DeCyder Extended Data Analysis module Version 1.0

DeCyder Extended Data Analysis module Version 1.0 GE Healthcare DeCyder Extended Data Analysis module Version 1.0 Module for DeCyder 2D version 6.5 User Manual Contents 1 Introduction 1.1 Introduction... 7 1.2 The DeCyder EDA User Manual... 9 1.3 Getting

More information

SELDI-TOF Mass Spectrometry Protein Data By Huong Thi Dieu La

SELDI-TOF Mass Spectrometry Protein Data By Huong Thi Dieu La SELDI-TOF Mass Spectrometry Protein Data By Huong Thi Dieu La References Alejandro Cruz-Marcelo, Rudy Guerra, Marina Vannucci, Yiting Li, Ching C. Lau, and Tsz-Kwong Man. Comparison of algorithms for pre-processing

More information

Improving SAS Global Forum Papers

Improving SAS Global Forum Papers Paper 3343-2015 Improving SAS Global Forum Papers Vijay Singh, Pankush Kalgotra, Goutam Chakraborty, Oklahoma State University, OK, US ABSTRACT Just as research is built on existing research, the references

More information

QuantStudio 3D AnalysisSuite Software

QuantStudio 3D AnalysisSuite Software USER GUIDE QuantStudio 3D AnalysisSuite Software for use with QuantStudio 3D Digital PCR System Publication Number MAN0008161 Revision 1.0 For Research Use Only. Not for use in diagnostic procedures. For

More information

Step-by-Step Guide to Basic Expression Analysis and Normalization

Step-by-Step Guide to Basic Expression Analysis and Normalization Step-by-Step Guide to Basic Expression Analysis and Normalization Page 1 Introduction This document shows you how to perform a basic analysis and normalization of your data. A full review of this document

More information

Factors for success in big data science

Factors for success in big data science Factors for success in big data science Damjan Vukcevic Data Science Murdoch Childrens Research Institute 16 October 2014 Big Data Reading Group (Department of Mathematics & Statistics, University of Melbourne)

More information

Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study

Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study The data for this study is taken from experiment GSE848 from the Gene Expression

More information

Validation and Calibration. Definitions and Terminology

Validation and Calibration. Definitions and Terminology Validation and Calibration Definitions and Terminology ACCEPTANCE CRITERIA: The specifications and acceptance/rejection criteria, such as acceptable quality level and unacceptable quality level, with an

More information

Gene expression analysis. Ulf Leser and Karin Zimmermann

Gene expression analysis. Ulf Leser and Karin Zimmermann Gene expression analysis Ulf Leser and Karin Zimmermann Ulf Leser: Bioinformatics, Wintersemester 2010/2011 1 Last lecture What are microarrays? - Biomolecular devices measuring the transcriptome of a

More information

Chapter 2: Supplementary Exercises

Chapter 2: Supplementary Exercises Chapter 2: Supplementary Exercises 2.S.1. A Public Analyst s laboratory routinely measures potable water samples by flame atomic absorption spectrometry to ensure compliance with the EU drinking water

More information

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests

Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Role in quality management system Quality Control (QC) is a component of process control, and is a major element of the quality management

More information

Cluster software and Java TreeView

Cluster software and Java TreeView Cluster software and Java TreeView To download the software: http://bonsai.hgc.jp/~mdehoon/software/cluster/software.htm http://bonsai.hgc.jp/~mdehoon/software/cluster/manual/treeview.html Cluster 3.0

More information

Supplemental Material. Methods

Supplemental Material. Methods Supplemental Material Methods Measurement of lncrnas expression Total RNA was extracted from PAXgene TM tubes using the PAXgene blood RNA kit (Qiagen, Venlo, Netherlands) as described by the manufacturer.

More information

Analysis of gene expression data. Ulf Leser and Philippe Thomas

Analysis of gene expression data. Ulf Leser and Philippe Thomas Analysis of gene expression data Ulf Leser and Philippe Thomas This Lecture Protein synthesis Microarray Idea Technologies Applications Problems Quality control Normalization Analysis next week! Ulf Leser:

More information

Quality Assessment of Exon and Gene Arrays

Quality Assessment of Exon and Gene Arrays Quality Assessment of Exon and Gene Arrays I. Introduction In this white paper we describe some quality assessment procedures that are computed from CEL files from Whole Transcript (WT) based arrays such

More information

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( ) Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates

More information

Profiling of microrna in Blood Serum/Plasma. Guidelines for the mircury LNA TM Universal RT microrna PCR System

Profiling of microrna in Blood Serum/Plasma. Guidelines for the mircury LNA TM Universal RT microrna PCR System Profiling of microrna in Blood Serum/Plasma Guidelines for the mircury LNA TM Universal RT microrna PCR System Table of Contents 2 Introduction.....................................................................................

More information

Frequently Asked Questions Next Generation Sequencing

Frequently Asked Questions Next Generation Sequencing Frequently Asked Questions Next Generation Sequencing Import These Frequently Asked Questions for Next Generation Sequencing are some of the more common questions our customers ask. Questions are divided

More information

edger: differential expression analysis of digital gene expression data User s Guide Yunshun Chen, Davis McCarthy, Mark Robinson, Gordon K.

edger: differential expression analysis of digital gene expression data User s Guide Yunshun Chen, Davis McCarthy, Mark Robinson, Gordon K. edger: differential expression analysis of digital gene expression data User s Guide Yunshun Chen, Davis McCarthy, Mark Robinson, Gordon K. Smyth First edition 17 September 2008 Last revised 8 October

More information

A Comparison of Normalization Methods for High Density Oligonucleotide Array Data Based on Variance and Bias

A Comparison of Normalization Methods for High Density Oligonucleotide Array Data Based on Variance and Bias A Comparison of Normalization Methods for High Density Oligonucleotide Array Data Based on Variance and Bias B. M. Bolstad, R. A. Irizarry 2, M. Astrand 3 and T. P. Speed 4, 5 Group in Biostatistics, University

More information

Two-Way ANOVA tests. I. Definition and Applications...2. II. Two-Way ANOVA prerequisites...2. III. How to use the Two-Way ANOVA tool?...

Two-Way ANOVA tests. I. Definition and Applications...2. II. Two-Way ANOVA prerequisites...2. III. How to use the Two-Way ANOVA tool?... Two-Way ANOVA tests Contents at a glance I. Definition and Applications...2 II. Two-Way ANOVA prerequisites...2 III. How to use the Two-Way ANOVA tool?...3 A. Parametric test, assume variances equal....4

More information

Gene Expression Assay Performance Guaranteed With the TaqMan Assays QPCR Guarantee Program

Gene Expression Assay Performance Guaranteed With the TaqMan Assays QPCR Guarantee Program WHITE PAPER TaqMan Assays QPCR Guarantee Program Gene Expression Assay Performance Guaranteed With the TaqMan Assays QPCR Guarantee Program Real-Time PCR for the Quantification of Gene Expression Real-time

More information

PREDA S4-classes. Francesco Ferrari October 13, 2015

PREDA S4-classes. Francesco Ferrari October 13, 2015 PREDA S4-classes Francesco Ferrari October 13, 2015 Abstract This document provides a description of custom S4 classes used to manage data structures for PREDA: an R package for Position RElated Data Analysis.

More information

Hierarchical Clustering Analysis

Hierarchical Clustering Analysis Hierarchical Clustering Analysis What is Hierarchical Clustering? Hierarchical clustering is used to group similar objects into clusters. In the beginning, each row and/or column is considered a cluster.

More information

Exercise with Gene Ontology - Cytoscape - BiNGO

Exercise with Gene Ontology - Cytoscape - BiNGO Exercise with Gene Ontology - Cytoscape - BiNGO This practical has material extracted from http://www.cbs.dtu.dk/chipcourse/exercises/ex_go/goexercise11.php In this exercise we will analyze microarray

More information

Formalin fixation at low temperature better preserves nucleic acid integrity. Gianni Bussolati. University of Turin

Formalin fixation at low temperature better preserves nucleic acid integrity. Gianni Bussolati. University of Turin Formalin fixation at low temperature better preserves nucleic acid integrity Gianni Bussolati University of Turin Disclosure of interests: G.B. was originally responsible for the invention of the Cold

More information

Real time and Quantitative (RTAQ) PCR. so I have an outlier and I want to see if it really is changed

Real time and Quantitative (RTAQ) PCR. so I have an outlier and I want to see if it really is changed Real time and Quantitative (RTAQ) PCR or.. for this audience so I have an outlier and I want to see if it really is changed Nigel Walker, Ph.D. Laboratory of Computational Biology and Risk Analysis, Environmental

More information

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics

Descriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),

More information

ncounter Leukemia Fusion Gene Expression Assay Molecules That Count Product Highlights ncounter Leukemia Fusion Gene Expression Assay Overview

ncounter Leukemia Fusion Gene Expression Assay Molecules That Count Product Highlights ncounter Leukemia Fusion Gene Expression Assay Overview ncounter Leukemia Fusion Gene Expression Assay Product Highlights Simultaneous detection and quantification of 25 fusion gene isoforms and 23 additional mrnas related to leukemia Compatible with a variety

More information

Quando si parla di PCR quantitativa si intende:

Quando si parla di PCR quantitativa si intende: Quando si parla di PCR quantitativa si intende: A. Una PCR che produce grandi quantità di DNA B. Una PCR che emette quanti di luce C. Una PCR che quantifica il numero di molecole stampo presenti all inizio

More information

manual last update on July 8, 2008

manual last update on July 8, 2008 manual last update on July 8, 2008 1. Introduction The genorm VBA applet for Microsoft Excel determines the most stable reference genes from a set of tested genes in a given cdna sample panel, and calculates

More information

Frozen Robust Multi-Array Analysis and the Gene Expression Barcode

Frozen Robust Multi-Array Analysis and the Gene Expression Barcode Frozen Robust Multi-Array Analysis and the Gene Expression Barcode Matthew N. McCall October 13, 2015 Contents 1 Frozen Robust Multiarray Analysis (frma) 2 1.1 From CEL files to expression estimates...................

More information

Gene Expression Macro Version 1.1

Gene Expression Macro Version 1.1 Gene Expression Macro Version 1.1 Instructions Rev B 1 Bio-Rad Gene Expression Macro Users Guide 2004 Bio-Rad Laboratories Table of Contents: Introduction..................................... 3 Opening

More information

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk

Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:

More information

Microarray Data Analysis. A step by step analysis using BRB-Array Tools

Microarray Data Analysis. A step by step analysis using BRB-Array Tools Microarray Data Analysis A step by step analysis using BRB-Array Tools 1 EXAMINATION OF DIFFERENTIAL GENE EXPRESSION (1) Objective: to find genes whose expression is changed before and after chemotherapy.

More information

The timecourse Package

The timecourse Package The timecourse Package Yu huan Tai October 13, 2015 ontents Institute for Human Genetics, University of alifornia, San Francisco taiy@humgen.ucsf.edu 1 Overview 1 2 Longitudinal one-sample problem 2 2.1

More information

OpenArray Sample Tracker Software

OpenArray Sample Tracker Software QUICK REFERENCE OpenArray Sample Tracker Software For QuantStudio 12K Flex OpenArray Sample Block and the OpenArray Real-Time PCR System Publication Part Number 4460657 Rev. C Revision Date May 2012 Contents

More information

Analytical Test Method Validation Report Template

Analytical Test Method Validation Report Template Analytical Test Method Validation Report Template 1. Purpose The purpose of this Validation Summary Report is to summarize the finding of the validation of test method Determination of, following Validation

More information

QuantStudio 12K Flex Real-Time PCR System. The all-in-one qpcr instrument

QuantStudio 12K Flex Real-Time PCR System. The all-in-one qpcr instrument QuantStudio 12K Flex Real-Time PCR System The all-in-one qpcr instrument Expand the boundaries of your research Life Technologies is taking qpcr to the next level. Designed for maximum throughput, flexibility,

More information

Regression Clustering

Regression Clustering Chapter 449 Introduction This algorithm provides for clustering in the multiple regression setting in which you have a dependent variable Y and one or more independent variables, the X s. The algorithm

More information

Gene Expression Assays

Gene Expression Assays APPLICATION NOTE TaqMan Gene Expression Assays A mpl i fic ationef ficienc yof TaqMan Gene Expression Assays Assays tested extensively for qpcr efficiency Key factors that affect efficiency Efficiency

More information

UNSUPERVISED MACHINE LEARNING TECHNIQUES IN GENOMICS

UNSUPERVISED MACHINE LEARNING TECHNIQUES IN GENOMICS UNSUPERVISED MACHINE LEARNING TECHNIQUES IN GENOMICS Dwijesh C. Mishra I.A.S.R.I., Library Avenue, New Delhi-110 012 dcmishra@iasri.res.in What is Learning? "Learning denotes changes in a system that enable

More information

Quantitative Real Time PCR Protocol. Stack Lab

Quantitative Real Time PCR Protocol. Stack Lab Quantitative Real Time PCR Protocol Stack Lab Overview Real-time quantitative polymerase chain reaction (qpcr) differs from regular PCR by including in the reaction fluorescent reporter molecules that

More information

Export Pricing and Credit Constraints: Theory and Evidence from Greek Firms. Online Data Appendix (not intended for publication) Elias Dinopoulos

Export Pricing and Credit Constraints: Theory and Evidence from Greek Firms. Online Data Appendix (not intended for publication) Elias Dinopoulos Export Pricing and Credit Constraints: Theory and Evidence from Greek Firms Online Data Appendix (not intended for publication) Elias Dinopoulos University of Florida Sarantis Kalyvitis Athens University

More information

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.

Course Objective This course is designed to give you a basic understanding of how to run regressions in SPSS. SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed

More information

Obesity in America: A Growing Trend

Obesity in America: A Growing Trend Obesity in America: A Growing Trend David Todd P e n n s y l v a n i a S t a t e U n i v e r s i t y Utilizing Geographic Information Systems (GIS) to explore obesity in America, this study aims to determine

More information

MS Data Analysis I: Importing Data into Genespring and Initial Quality Control

MS Data Analysis I: Importing Data into Genespring and Initial Quality Control Homework: Session 2 GENESPRING MS ONLINE TRAINING SESSION 2 MS Data Analysis I: Importing Data into Genespring and Initial Quality Control Introduction and Lab Overview: If you need help during completion

More information

DHL Data Mining Project. Customer Segmentation with Clustering

DHL Data Mining Project. Customer Segmentation with Clustering DHL Data Mining Project Customer Segmentation with Clustering Timothy TAN Chee Yong Aditya Hridaya MISRA Jeffery JI Jun Yao 3/30/2010 DHL Data Mining Project Table of Contents Introduction to DHL and the

More information

A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling

A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling Background Bryan Orme and Rich Johnson, Sawtooth Software March, 2009 Market segmentation is pervasive

More information

Microarray Analysis. The Basics. Thomas Girke. December 9, 2011. Microarray Analysis Slide 1/42

Microarray Analysis. The Basics. Thomas Girke. December 9, 2011. Microarray Analysis Slide 1/42 Microarray Analysis The Basics Thomas Girke December 9, 2011 Microarray Analysis Slide 1/42 Technology Challenges Data Analysis Data Depositories R and BioConductor Homework Assignment Microarray Analysis

More information

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set

EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin

More information

Using Excel for inferential statistics

Using Excel for inferential statistics FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied

More information

Scott Reierstad. Field Applications Scientist

Scott Reierstad. Field Applications Scientist Real-time PCR Training Scott Reierstad Field Applications Scientist Topics we ll cover Quick introduction to real-time PCR ABI-supported real-time chemistries Instrumentation Quantitation via Standard

More information

Quantitative proteomics background

Quantitative proteomics background Proteomics data analysis seminar Quantitative proteomics and transcriptomics of anaerobic and aerobic yeast cultures reveals post transcriptional regulation of key cellular processes de Groot, M., Daran

More information

Nucleic Acid Purity Assessment using A 260 /A 280 Ratios

Nucleic Acid Purity Assessment using A 260 /A 280 Ratios Nucleic Acid Purity Assessment using A 260 /A 280 Ratios A common practice in molecular biology is to perform a quick assessment of the purity of nucleic acid samples by determining the ratio of spectrophotometric

More information

Summary of important mathematical operations and formulas (from first tutorial):

Summary of important mathematical operations and formulas (from first tutorial): EXCEL Intermediate Tutorial Summary of important mathematical operations and formulas (from first tutorial): Operation Key Addition + Subtraction - Multiplication * Division / Exponential ^ To enter a

More information

CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS

CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS TABLES, CHARTS, AND GRAPHS / 75 CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS Tables, charts, and graphs are frequently used in statistics to visually communicate data. Such illustrations are also a frequent

More information

Web-based Tools for the Analysis of DNA Microarrays. End of Project Report. Authors: P. Geeleher 1,2, A. Golden 3, J. Hinde 2 and D. G.

Web-based Tools for the Analysis of DNA Microarrays. End of Project Report. Authors: P. Geeleher 1,2, A. Golden 3, J. Hinde 2 and D. G. Web-based Tools for the Analysis of DNA Microarrays End of Project Report Project 5236 Authors: P. Geeleher 1,2, A. Golden 3, J. Hinde 2 and D. G. Morris 1 1 Teagasc, Animal Reproduction Department, Mellows

More information

Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey

Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey Molecular Genetics: Challenges for Statistical Practice J.K. Lindsey 1. What is a Microarray? 2. Design Questions 3. Modelling Questions 4. Longitudinal Data 5. Conclusions 1. What is a microarray? A microarray

More information

A Streamlined Workflow for Untargeted Metabolomics

A Streamlined Workflow for Untargeted Metabolomics A Streamlined Workflow for Untargeted Metabolomics Employing XCMS plus, a Simultaneous Data Processing and Metabolite Identification Software Package for Rapid Untargeted Metabolite Screening Baljit K.

More information

PreciseTM Whitepaper

PreciseTM Whitepaper Precise TM Whitepaper Introduction LIMITATIONS OF EXISTING RNA-SEQ METHODS Correctly designed gene expression studies require large numbers of samples, accurate results and low analysis costs. Analysis

More information

Validating Microarray Data Using RT 2 Real-Time PCR Products

Validating Microarray Data Using RT 2 Real-Time PCR Products Validating Microarray Data Using RT 2 Real-Time PCR Products Introduction: Real-time PCR monitors the amount of amplicon as the reaction occurs. Usually, the amount of product is directly related to the

More information

GenEx User Guide. www.multid.se. Version 1.0. Copyrights 2001-2012 Multid Analyses AB. GenEx - Data Analysis Software

GenEx User Guide. www.multid.se. Version 1.0. Copyrights 2001-2012 Multid Analyses AB. GenEx - Data Analysis Software GenEx User Guide Version 1.0 Copyrights 2001-2012 Multid Analyses AB GenEx - Data Analysis Software www.multid.se 1 Introduction As the qpcr field advances, the design of experiments and the analysis of

More information

Final Project Report

Final Project Report CPSC545 by Introduction to Data Mining Prof. Martin Schultz & Prof. Mark Gerstein Student Name: Yu Kor Hugo Lam Student ID : 904907866 Due Date : May 7, 2007 Introduction Final Project Report Pseudogenes

More information

ABSORBENCY OF PAPER TOWELS

ABSORBENCY OF PAPER TOWELS ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?

More information

Micro RNAs: potentielle Biomarker für das. Blutspenderscreening

Micro RNAs: potentielle Biomarker für das. Blutspenderscreening Micro RNAs: potentielle Biomarker für das Blutspenderscreening micrornas - Background Types of RNA -Coding: messenger RNA (mrna) -Non-coding (examples): Ribosomal RNA (rrna) Transfer RNA (trna) Small nuclear

More information

Visualization of Complex Survey Data: Regression Diagnostics

Visualization of Complex Survey Data: Regression Diagnostics Visualization of Complex Survey Data: Regression Diagnostics Susan Hinkins 1, Edward Mulrow, Fritz Scheuren 3 1 NORC at the University of Chicago, 11 South 5th Ave, Bozeman MT 59715 NORC at the University

More information

Chapter 5: Analysis of The National Education Longitudinal Study (NELS:88)

Chapter 5: Analysis of The National Education Longitudinal Study (NELS:88) Chapter 5: Analysis of The National Education Longitudinal Study (NELS:88) Introduction The National Educational Longitudinal Survey (NELS:88) followed students from 8 th grade in 1988 to 10 th grade in

More information

NCSS Statistical Software

NCSS Statistical Software Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the

More information

Molecular Assessment of Dried Blood Spot Quality during Development of a Novel Automated. Screening

Molecular Assessment of Dried Blood Spot Quality during Development of a Novel Automated. Screening Molecular Assessment of Dried Blood Spot Quality during Development of a Novel Automated in situ TREC qpcr Assay for SCID Screening J Bai, T Henry, J Benfer, S Berberich, T Kreman, and L DesJardin State

More information

Building risk prediction models - with a focus on Genome-Wide Association Studies. Charles Kooperberg

Building risk prediction models - with a focus on Genome-Wide Association Studies. Charles Kooperberg Building risk prediction models - with a focus on Genome-Wide Association Studies Risk prediction models Based on data: (D i, X i1,..., X ip ) i = 1,..., n we like to fit a model P(D = 1 X 1,..., X p )

More information

The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA

The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA Paper 156-2010 The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA Abstract JMP has a rich set of visual displays that can help you see the information

More information

Exploratory data analysis (Chapter 2) Fall 2011

Exploratory data analysis (Chapter 2) Fall 2011 Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,

More information

Practical Differential Gene Expression. Introduction

Practical Differential Gene Expression. Introduction Practical Differential Gene Expression Introduction In this tutorial you will learn how to use R packages for analysis of differential expression. The dataset we use are the gene-summarized count data

More information

Factor Analysis. Sample StatFolio: factor analysis.sgp

Factor Analysis. Sample StatFolio: factor analysis.sgp STATGRAPHICS Rev. 1/10/005 Factor Analysis Summary The Factor Analysis procedure is designed to extract m common factors from a set of p quantitative variables X. In many situations, a small number of

More information

bitter is de pil Linos Vandekerckhove, MD, PhD

bitter is de pil Linos Vandekerckhove, MD, PhD 4//24 Current HIV care HIV copies/ ml plasma Viral load Welcome to the Digital droplet PCR age! bitter is de pil Linos Vandekerckhove, MD, PhD Latent HIV reservoir Time at Ghent University Hospital 2 HIV

More information

Comparing Methods for Identifying Transcription Factor Target Genes

Comparing Methods for Identifying Transcription Factor Target Genes Comparing Methods for Identifying Transcription Factor Target Genes Alena van Bömmel (R 3.3.73) Matthew Huska (R 3.3.18) Max Planck Institute for Molecular Genetics Folie 1 Transcriptional Regulation TF

More information