Open array data analysis: mirna profiling in blood samples from patient suffering heart diseases
|
|
- Sabina Parks
- 7 years ago
- Views:
Transcription
1 CRG BIOINFORMATICS CORE FACILITIES Open array data analysis: mirna profiling in blood samples from patient suffering heart diseases May 2015 Users: Begona Benito and Marta Tajes Users center: IMIM Analyst: Sarah Bonnin Group leader: Julia Ponomarenko
2 1. Project Scientific background, limitations: Blood samples from patients with either preserved or reduced ejection fraction, with or without atrial fibrillation. Some studies suggest that some mirnas could be directly involved in the development of the disease. Limitations: * Because of the high dilution of target molecules in blood samples, mirnas are present in low concentration in plasma and are therefore difficult to detect. * One of the challenges of mirna profiling from serum or plasma is the lack of established housekeeping genes for data normalization. Technology: Platform: OpenArray, Life Technology Array: TaqMan OpenArray Human MicroRNA Panel, QuantStudio 12K Flex Catalog number: mirnas + 4 controls replicated 16 times 3 samples are loaded on each array, for a total of 15 arrays ran in 4 different batches at the following dates: - batch A, 3 arrays: 13/02/ batch B, 4 arrays: 05/03/ batch C, 4 arrays: 10/03/ batch D, 4 arrays: 11/03/2015 Data and goal: The experiment consists of 45 samples, divided into 5 experimental groups: * 9 technical replicates samples: 9 samples pooled together and then ran as technical replicates, to be able to study the technical variation (TechCtrl) * 9 preserved ejection fraction (PEF = 4) * 9 preserved ejection fraction with atrial fibrillation (PEF+AF = 3) * 9 reduced ejection fraction (REF = 2)
3 * 9 reduced ejection fraction with atrial fibrillation (REF+AF = 1) The users are mostly interested in the change of mirna expression in case of atrial fibrillation in each ejection fraction situation. Hence, we will first focus on comparing PEF+AF vs PEF and REF+AF vs REF. Table 1 shows, for each sample, which experimental group it belongs to, on which array it was run, and in which batch. Sample ID Experimental group Array Batch = REF+AF ROK49 B = REF+AF ROK58 B = REF+AF ROK62 B = REF+AF ROK50 C = REF+AF RON52 C = REF+AF RON60 C = REF+AF RON50 D = REF+AF ROK51 D = REF+AF ROK67 D = REF OMZ01 A = REF OMZ18 A = REF OMZ54 A = REF ROL05 B = REF ROK49 B = REF ROK62 B = REF ROL1 C = REF RON60 C = REF ROK51 D = PEF+AF OMZ18 A = PEF+AF OMZ01 A = PEF+AF OMZ54 A = PEF+AF ROL05 B = PEF+AF ROK58 B = PEF+AF ROL1 C = PEF+AF RON52 C = PEF+AF RON42 D = PEF+AF RON50 D = PEF ROK62 B = PEF ROK50 C = PEF RON52 C = PEF RON60 C = PEF RON42 D = PEF RON50 D = PEF ROK51 D
4 = PEF ROK67 D = PEF ROK67 D 12Q TechCtrl RON42 D 13Q TechCtrl ROL05 B 14Q TechCtrl OMZ01 A 15Q TechCtrl ROK49 B 16Q TechCtrl OMZ18 A 17Q TechCtrl ROK50 C 18Q TechCtrl ROL1 C 19Q TechCtrl OMZ54 A 20Q TechCtrl ROK58 B Table 1
5 2. Preprocessing Extraction of Ct data: Ct data for all mirnas and all samples was extracted from the analysis_result.txt (part of the raw data handed by the users) file for each mirna and each sample. All analysis was performed in the R/Bioconductor environment. In particular, Bioconductor package HTqPCR was used as it is designed for the analysis of high-throughput qpcr data. Quality control: Figure 1 shows the raw Ct distribution for each sample. Figure 1 We observe two clear Ct density peaks : one which summit is located around Ct=25, the other one around Ct=40.
6 mirna transcripts for which Ct is around 40 are too lowly expressed to be considered as actually expressed: we will try to filter out some features in order not to lose too much detection power. Figure 2 shows a hierarchical clustering of samples using all mirnas. Colors represent the experimental groups the samples belong to (a.), or the batches (b.) in which samples were run. a. Clustering colored per experimental group. b. Clustering colored per batch. Figure 2. Dendrograms using raw data. Figure 2.b shows us a slight batch effect: indeed samples that were run in batch A are all grouping together. This is a bias often found when arrays are not processed all in the same batch and/or on the same day.
7 It is to remember that such technical biases are more visible when features are lowly expressed or when few differences are expected between experimental groups. We will try to correct for that bias. Features filtering: Features are tagged as Undetermined if their Ct is beyond 38, and Unreliable if their Ct is below 10 or if their standard variation is above 0.9 across all samples of a same experimental group. We are then filtering out features that are Undetermined/Unreliable in 36 samples or more (we consider that features can potentially be expressed in only one experimental group, i.e. 9 samples here, and not expressed in the 36 remaining samples). Using that filtering, 411 features were removed and we will be working with the 407 remaining ones. Figure 3 shows the density plot (same as Figure 1) of the remaining filtered data: we can see that the second peak of lowly expressed features is well reduced. Figure 3
8 Figure 4 shows the dendrogram (as Figure 2) using the remaining features after filtering. Colors represent the experimental groups the samples belong to (a.), or the batches (b.) in which samples were run. a. Clustering colored per experimental group. b. Clustering colored per batch. Figure 4. Dendrograms using filtered data. Batch effect correction: The ComBat method (Bioconductor package sva ) was applied to try and correct for the batch effect we observe. ComBat allows adjusting for batch effects in a dataset where the batch covariate is known, which is the case here.
9 Figure 5 and 6 show, as in previous steps, the Ct density per sample and dendrograms based on filtered and corrected data, respectively. Figure 5 a. Clustering colored by experimental group.
10 b. Clustering colored by batches. Figure 6. Dendrograms using filtered and batch corrected data. Figure 6b shows us that samples from batch A do not clustered all together as previously observed, so the batch effect seems to have been corrected. Figure 6a does not show a very improved clustering of samples per experimental group, apart maybe slightly for the group of replicated controls (TechCtrl). Normalization: A commonly used and validated method for qpcr normalization is the deltact intra-sample normalization: one or more features within the array are chosen (sufficiently expressed and stable in expression across the whole experiment), and are used as reference feature(s) for raw Ct correction. The Ct data from this (or these) reference feature (s) is (are) then subtracted from all other features, to adjust for intra-sample variability and make samples better comparable. Selection of reference features 4 control features are provided within this array, and are repeated each 16 times in each array: _ath-miR159a_B _RNU48_B _RNU44_B _U6. We will first check their levels of expression and variability within and across samples (on raw data before filtering and ComBat correction).
11 Figure 7 shows boxplots dispaying the Ct distribution of each control feature per sample. Results are displayed only for 4 samples but show the main trends _Ct.txt U6 athmir159 RNU44 RNU48 Figure 7. Figure 8 shows the expression profiles of these control features across samples.
12 control genes Ct _ath mir159a_b _RNU48_B _RNU44_B _U6 rrna_b Figure Q Q Q 15Q 16Q Q 18Q 19Q Q samples Of the 4 control features, _ath-miR159a_B and _RNU44_B have very high Ct values, i.e. very low transcript expression (hence unreliable) _RNU48_B is generally more highly expressed, but seems to be varying in expression across samples quite much _U6 is the most stable in expression across samples, and is sufficiently expressed. Next we tried to find some mirnas within the array which would be suitable (and better than the default controls) as references for deltact normalization: mirnas for which maximum Ct is below or equals 35, and coefficient of variation less than 0.1 across all samples, are selected. This method results in the selection of 61 mirnas. From these 61 mirnas, we decide to select the top 10 mirnas, i.e. the ones that show lowest levels of variation across samples (smallest coefficient of variation): _hsa-miR-10b#_B _hsa-miR-144#_B _HSA-MIR-1291_B
13 000512_hsa-miR-210_A _hsa-miR-10a_A _hsa-miR-193a-5p_A _hsa-miR-660_A _hsa-miR-30a-3p_B _hsa-miR-423-5p_A _hsa-miR-590-5p_A Figure 9 shows the Ct profiles of these 10 mirnas across samples (a.) and their intra-experimental group variation (b.). tested mirna for use as controls Ct _hsa mir 10b#_B _hsa mir 144#_B _HSA MIR 1291_B _hsa mir 210_A _hsa mir 10a_A _hsa mir 193a 5p_A _hsa mir 660_A _hsa mir 30a 3p_B _hsa mir 423 5p_A _hsa mir 590 5p_A Q Q Q 15Q 16Q Q 18Q 19Q Q samples a TechCtrl 30 Ct values for samples b _hsa mir 10a_A _hsa mir 30a 3p_B _hsa mir 210_A _hsa mir 660_A _hsa mir 590 5p_A _hsa mir 144#_B _hsa mir 193a 5p_A _hsa mir 10b#_B _hsa mir 423 5p_A _HSA MIR 1291_B
14 Figure most stable mirnas that will be used for normalization. These 10 samples are used for normalization of our data (filtered and ComBat corrected) using the deltact method. 3. Analysis Differential expression analysis: Remaining control probes (000338_ath-miR159a_B, _RNU48_B, _RNU44_B, _U6) are removed from the dataset before performing differential expression analysis: it will hence be performed on 375 mirnas. A method from HTqPCR based on limma (linear models for microarray data) was used, which uses a moderated t-test to assess differential expression of mirnas between experimental groups. Results: Results (Excel file) can be found in: Using the following credentials: Login: mtajes Password: marta15 Brief description of the columns found in the results file: t.test : The result of the t-test. p.value : The corresponding p.values. adj.p.value : P-values after correcting for multiple testing using the Benjamini- Holm method. ddct : The deltadeltact values = deltadeltact = deltact(target) deltact(calibrator) FC: The fold change; 2^(-ddCt). Target/Calibrator: the first/last experimental group in a pairwise comparison, respectively; for G1 vs G2, G1 is the target, G2 the calibrator.
15 Mean columns: The average Ct across the target/calibrator samples for the given Category columns: all results are assigned to a category, either "OK" or "Unreliable" depending on the input Ct values: the result will be "OK unless at least half of the Ct values for a given gene are unreliable/undetermined. Filtering the data using the adjusted p-value (<0.05) does not yield any result. Table 2 lists the mirnas found when filtering the data using the (unadjusted) p-value (< 0.05). G1 vs G2 G3 vs G4 22 mirnas 12 mirnas _hsa-miR-27b_A _hsa-miR-107_A _hsa-miR-539_A _hsa-miR-645_B _hsa-miR-411_A _hsa-miR-598_A _hsa-miR-505#_B _hsa-miR-636_A _hsa-miR-1_A _hsa-miR-9#_B _hsa-miR-331-5p_A _hsa-miR-142-5p_A _hsa-miR-22#_B _hsa-miR-425#_B _hsa-miR-30d#_B _hsa-miR-483-3p_B _hsa-miR-652_A _hsa-miR-489_A _hsa-miR-628-3p_B _hsa-miR-20a#_B _HSA-MIR-151-5P_B _HSA-MIR-1255B_B Table _hsa-let-7a_A _hsa-miR-107_A _hsa-miR-181c_A _hsa-miR-199b_A _hsa-miR-302c_A _hsa-miR-624_B _hsa-miR-939_B _hsa-miR-889_A _hsa-miR-654-3p_A _hsa-miR-146b-3p_A _hsa-miR-520d-5p_A _hsa-miR-589_A
16 4. References OpenArray: R project: R Development Core Team (2008). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN Bioconductor: Huber, W., Carey, J. V, Gentleman, R., Anders, S., Carlson, M., Carvalho, S. B, Bravo, C. H, Davis, S., Gatto, L., Girke, T., Gottardo, R., Hahne, F., Hansen, D. K, Irizarry, A. R, Lawrence, M., Love, I. M, MacDonald, J., Obenchain, V., Ole's, K. A, Pag'es, H., Reyes, A., Shannon, P., Smyth, K. G, Tenenbaum, D., Waldron, L., Morgan and M. (2015). Orchestrating highthroughput genomic analysis with Bioconductor. Nature Methods, 12(2), pp HTqPCR: Dvinge H and Bertone P (2009). HTqPCR: High - throughput analysis and visualization of quantitative real - time PCR data in R. Bioinformatics, 25(24), pp ComBat: Johnson WE, Rabinovic A, and Li C (2007). Adjusting batch effects in microarray expression data using Empirical Bayes methods. Biostatistics 8(1): limma: Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W and Smyth GK (2015). limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Research, 43(7), pp. e47. Shaffer J, Schlumpberger M and Lader E. mirna profiling from blood challenges and recommendations. From Qiagen:
Gene Expression Analysis
Gene Expression Analysis Jie Peng Department of Statistics University of California, Davis May 2012 RNA expression technologies High-throughput technologies to measure the expression levels of thousands
More informationAnalysis of Illumina Gene Expression Microarray Data
Analysis of Illumina Gene Expression Microarray Data Asta Laiho, Msc. Tech. Bioinformatics research engineer The Finnish DNA Microarray Centre Turku Centre for Biotechnology, Finland The Finnish DNA Microarray
More informationStatistical issues in the analysis of microarray data
Statistical issues in the analysis of microarray data Daniel Gerhard Institute of Biostatistics Leibniz University of Hannover ESNATS Summerschool, Zermatt D. Gerhard (LUH) Analysis of microarray data
More informationConsistent Assay Performance Across Universal Arrays and Scanners
Technical Note: Illumina Systems and Software Consistent Assay Performance Across Universal Arrays and Scanners There are multiple Universal Array and scanner options for running Illumina DASL and GoldenGate
More informationTutorial for proteome data analysis using the Perseus software platform
Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information
More informationExiqon Array Software Manual. Quick guide to data extraction from mircury LNA microrna Arrays
Exiqon Array Software Manual Quick guide to data extraction from mircury LNA microrna Arrays March 2010 Table of contents Introduction Overview...................................................... 3 ImaGene
More informationAnalyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6
Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6 Overview This tutorial outlines how microrna data can be analyzed within Partek Genomics Suite. Additionally,
More informationIdentification of rheumatoid arthritis and osteoarthritis patients by transcriptome-based rule set generation
Identification of rheumatoid arthritis and osterthritis patients by transcriptome-based rule set generation Bering Limited Report generated on September 19, 2014 Contents 1 Dataset summary 2 1.1 Project
More informationIntroduction To Real Time Quantitative PCR (qpcr)
Introduction To Real Time Quantitative PCR (qpcr) SABiosciences, A QIAGEN Company www.sabiosciences.com The Seminar Topics The advantages of qpcr versus conventional PCR Work flow & applications Factors
More informationRT 2 Profiler PCR Array: Web-Based Data Analysis Tutorial
RT 2 Profiler PCR Array: Web-Based Data Analysis Tutorial Samuel J. Rulli, Jr., Ph.D. qpcr-applications Scientist Samuel.Rulli@QIAGEN.com Pathway Focused Research from Sample Prep to Data Analysis! -2-
More informationSoftware and Methods for the Analysis of Affymetrix GeneChip Data. Rafael A Irizarry Department of Biostatistics Johns Hopkins University
Software and Methods for the Analysis of Affymetrix GeneChip Data Rafael A Irizarry Department of Biostatistics Johns Hopkins University Outline Overview Bioconductor Project Examples 1: Gene Annotation
More informationReal-time PCR: Understanding C t
APPLICATION NOTE Real-Time PCR Real-time PCR: Understanding C t Real-time PCR, also called quantitative PCR or qpcr, can provide a simple and elegant method for determining the amount of a target sequence
More informationALLEN Mouse Brain Atlas
TECHNICAL WHITE PAPER: QUALITY CONTROL STANDARDS FOR HIGH-THROUGHPUT RNA IN SITU HYBRIDIZATION DATA GENERATION Consistent data quality and internal reproducibility are critical concerns for high-throughput
More informationREAL TIME PCR USING SYBR GREEN
REAL TIME PCR USING SYBR GREEN 1 THE PROBLEM NEED TO QUANTITATE DIFFERENCES IN mrna EXPRESSION SMALL AMOUNTS OF mrna LASER CAPTURE SMALL AMOUNTS OF TISSUE PRIMARY CELLS PRECIOUS REAGENTS 2 THE PROBLEM
More informationFrom Reads to Differentially Expressed Genes. The statistics of differential gene expression analysis using RNA-seq data
From Reads to Differentially Expressed Genes The statistics of differential gene expression analysis using RNA-seq data experimental design data collection modeling statistical testing biological heterogeneity
More informationPredictive Gene Signature Selection for Adjuvant Chemotherapy in Non-Small Cell Lung Cancer Patients
Predictive Gene Signature Selection for Adjuvant Chemotherapy in Non-Small Cell Lung Cancer Patients by Li Liu A practicum report submitted to the Department of Public Health Sciences in conformity with
More information2.500 Threshold. 2.000 1000e - 001. Threshold. Exponential phase. Cycle Number
application note Real-Time PCR: Understanding C T Real-Time PCR: Understanding C T 4.500 3.500 1000e + 001 4.000 3.000 1000e + 000 3.500 2.500 Threshold 3.000 2.000 1000e - 001 Rn 2500 Rn 1500 Rn 2000
More informationCorrelation of microarray and quantitative real-time PCR results. Elisa Wurmbach Mount Sinai School of Medicine New York
Correlation of microarray and quantitative real-time PCR results Elisa Wurmbach Mount Sinai School of Medicine New York Microarray techniques Oligo-array: Affymetrix, Codelink, spotted oligo-arrays (60-70mers)
More informationAGILENT S BIOINFORMATICS ANALYSIS SOFTWARE
ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS
More informationREAL TIME PCR SYBR GREEN
REAL TIME PCR SYBR GREEN 1 THE PROBLEM NEED TO QUANTITATE DIFFERENCES IN mrna EXPRESSION SMALL AMOUNTS OF mrna LASER CAPTURE SMALL AMOUNTS OF TISSUE PRIMARY CELLS PRECIOUS REAGENTS 2 THE PROBLEM QUANTITATION
More information200631 - ADO - Omics Data Analysis
Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2016 200 - FME - School of Mathematics and Statistics 1004 - UB - (ENG)Universitat de Barcelona MASTER'S DEGREE IN STATISTICS AND
More informationExploRase: Multivariate exploratory analysis and visualization for systems biology. 1 Introduction. Abstract
ExploRase: Multivariate exploratory analysis and visualization for systems biology Michael Lawrence, Dianne Cook, Eun-Kyung Lee Abstract The large datasets being produced by high-throughput biological
More informationSTATS8: Introduction to Biostatistics. Data Exploration. Babak Shahbaba Department of Statistics, UCI
STATS8: Introduction to Biostatistics Data Exploration Babak Shahbaba Department of Statistics, UCI Introduction After clearly defining the scientific problem, selecting a set of representative members
More informationDeCyder Extended Data Analysis module Version 1.0
GE Healthcare DeCyder Extended Data Analysis module Version 1.0 Module for DeCyder 2D version 6.5 User Manual Contents 1 Introduction 1.1 Introduction... 7 1.2 The DeCyder EDA User Manual... 9 1.3 Getting
More informationSELDI-TOF Mass Spectrometry Protein Data By Huong Thi Dieu La
SELDI-TOF Mass Spectrometry Protein Data By Huong Thi Dieu La References Alejandro Cruz-Marcelo, Rudy Guerra, Marina Vannucci, Yiting Li, Ching C. Lau, and Tsz-Kwong Man. Comparison of algorithms for pre-processing
More informationImproving SAS Global Forum Papers
Paper 3343-2015 Improving SAS Global Forum Papers Vijay Singh, Pankush Kalgotra, Goutam Chakraborty, Oklahoma State University, OK, US ABSTRACT Just as research is built on existing research, the references
More informationQuantStudio 3D AnalysisSuite Software
USER GUIDE QuantStudio 3D AnalysisSuite Software for use with QuantStudio 3D Digital PCR System Publication Number MAN0008161 Revision 1.0 For Research Use Only. Not for use in diagnostic procedures. For
More informationStep-by-Step Guide to Basic Expression Analysis and Normalization
Step-by-Step Guide to Basic Expression Analysis and Normalization Page 1 Introduction This document shows you how to perform a basic analysis and normalization of your data. A full review of this document
More informationFactors for success in big data science
Factors for success in big data science Damjan Vukcevic Data Science Murdoch Childrens Research Institute 16 October 2014 Big Data Reading Group (Department of Mathematics & Statistics, University of Melbourne)
More informationAnalyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study
Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study The data for this study is taken from experiment GSE848 from the Gene Expression
More informationValidation and Calibration. Definitions and Terminology
Validation and Calibration Definitions and Terminology ACCEPTANCE CRITERIA: The specifications and acceptance/rejection criteria, such as acceptable quality level and unacceptable quality level, with an
More informationGene expression analysis. Ulf Leser and Karin Zimmermann
Gene expression analysis Ulf Leser and Karin Zimmermann Ulf Leser: Bioinformatics, Wintersemester 2010/2011 1 Last lecture What are microarrays? - Biomolecular devices measuring the transcriptome of a
More informationChapter 2: Supplementary Exercises
Chapter 2: Supplementary Exercises 2.S.1. A Public Analyst s laboratory routinely measures potable water samples by flame atomic absorption spectrometry to ensure compliance with the EU drinking water
More informationContent Sheet 7-1: Overview of Quality Control for Quantitative Tests
Content Sheet 7-1: Overview of Quality Control for Quantitative Tests Role in quality management system Quality Control (QC) is a component of process control, and is a major element of the quality management
More informationCluster software and Java TreeView
Cluster software and Java TreeView To download the software: http://bonsai.hgc.jp/~mdehoon/software/cluster/software.htm http://bonsai.hgc.jp/~mdehoon/software/cluster/manual/treeview.html Cluster 3.0
More informationSupplemental Material. Methods
Supplemental Material Methods Measurement of lncrnas expression Total RNA was extracted from PAXgene TM tubes using the PAXgene blood RNA kit (Qiagen, Venlo, Netherlands) as described by the manufacturer.
More informationAnalysis of gene expression data. Ulf Leser and Philippe Thomas
Analysis of gene expression data Ulf Leser and Philippe Thomas This Lecture Protein synthesis Microarray Idea Technologies Applications Problems Quality control Normalization Analysis next week! Ulf Leser:
More informationQuality Assessment of Exon and Gene Arrays
Quality Assessment of Exon and Gene Arrays I. Introduction In this white paper we describe some quality assessment procedures that are computed from CEL files from Whole Transcript (WT) based arrays such
More informationNCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
More informationProfiling of microrna in Blood Serum/Plasma. Guidelines for the mircury LNA TM Universal RT microrna PCR System
Profiling of microrna in Blood Serum/Plasma Guidelines for the mircury LNA TM Universal RT microrna PCR System Table of Contents 2 Introduction.....................................................................................
More informationFrequently Asked Questions Next Generation Sequencing
Frequently Asked Questions Next Generation Sequencing Import These Frequently Asked Questions for Next Generation Sequencing are some of the more common questions our customers ask. Questions are divided
More informationedger: differential expression analysis of digital gene expression data User s Guide Yunshun Chen, Davis McCarthy, Mark Robinson, Gordon K.
edger: differential expression analysis of digital gene expression data User s Guide Yunshun Chen, Davis McCarthy, Mark Robinson, Gordon K. Smyth First edition 17 September 2008 Last revised 8 October
More informationA Comparison of Normalization Methods for High Density Oligonucleotide Array Data Based on Variance and Bias
A Comparison of Normalization Methods for High Density Oligonucleotide Array Data Based on Variance and Bias B. M. Bolstad, R. A. Irizarry 2, M. Astrand 3 and T. P. Speed 4, 5 Group in Biostatistics, University
More informationTwo-Way ANOVA tests. I. Definition and Applications...2. II. Two-Way ANOVA prerequisites...2. III. How to use the Two-Way ANOVA tool?...
Two-Way ANOVA tests Contents at a glance I. Definition and Applications...2 II. Two-Way ANOVA prerequisites...2 III. How to use the Two-Way ANOVA tool?...3 A. Parametric test, assume variances equal....4
More informationGene Expression Assay Performance Guaranteed With the TaqMan Assays QPCR Guarantee Program
WHITE PAPER TaqMan Assays QPCR Guarantee Program Gene Expression Assay Performance Guaranteed With the TaqMan Assays QPCR Guarantee Program Real-Time PCR for the Quantification of Gene Expression Real-time
More informationPREDA S4-classes. Francesco Ferrari October 13, 2015
PREDA S4-classes Francesco Ferrari October 13, 2015 Abstract This document provides a description of custom S4 classes used to manage data structures for PREDA: an R package for Position RElated Data Analysis.
More informationHierarchical Clustering Analysis
Hierarchical Clustering Analysis What is Hierarchical Clustering? Hierarchical clustering is used to group similar objects into clusters. In the beginning, each row and/or column is considered a cluster.
More informationExercise with Gene Ontology - Cytoscape - BiNGO
Exercise with Gene Ontology - Cytoscape - BiNGO This practical has material extracted from http://www.cbs.dtu.dk/chipcourse/exercises/ex_go/goexercise11.php In this exercise we will analyze microarray
More informationFormalin fixation at low temperature better preserves nucleic acid integrity. Gianni Bussolati. University of Turin
Formalin fixation at low temperature better preserves nucleic acid integrity Gianni Bussolati University of Turin Disclosure of interests: G.B. was originally responsible for the invention of the Cold
More informationReal time and Quantitative (RTAQ) PCR. so I have an outlier and I want to see if it really is changed
Real time and Quantitative (RTAQ) PCR or.. for this audience so I have an outlier and I want to see if it really is changed Nigel Walker, Ph.D. Laboratory of Computational Biology and Risk Analysis, Environmental
More informationDescriptive statistics Statistical inference statistical inference, statistical induction and inferential statistics
Descriptive statistics is the discipline of quantitatively describing the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics (or inductive statistics),
More informationncounter Leukemia Fusion Gene Expression Assay Molecules That Count Product Highlights ncounter Leukemia Fusion Gene Expression Assay Overview
ncounter Leukemia Fusion Gene Expression Assay Product Highlights Simultaneous detection and quantification of 25 fusion gene isoforms and 23 additional mrnas related to leukemia Compatible with a variety
More informationQuando si parla di PCR quantitativa si intende:
Quando si parla di PCR quantitativa si intende: A. Una PCR che produce grandi quantità di DNA B. Una PCR che emette quanti di luce C. Una PCR che quantifica il numero di molecole stampo presenti all inizio
More informationmanual last update on July 8, 2008
manual last update on July 8, 2008 1. Introduction The genorm VBA applet for Microsoft Excel determines the most stable reference genes from a set of tested genes in a given cdna sample panel, and calculates
More informationFrozen Robust Multi-Array Analysis and the Gene Expression Barcode
Frozen Robust Multi-Array Analysis and the Gene Expression Barcode Matthew N. McCall October 13, 2015 Contents 1 Frozen Robust Multiarray Analysis (frma) 2 1.1 From CEL files to expression estimates...................
More informationGene Expression Macro Version 1.1
Gene Expression Macro Version 1.1 Instructions Rev B 1 Bio-Rad Gene Expression Macro Users Guide 2004 Bio-Rad Laboratories Table of Contents: Introduction..................................... 3 Opening
More informationAnalysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk
Analysing Questionnaires using Minitab (for SPSS queries contact -) Graham.Currell@uwe.ac.uk Structure As a starting point it is useful to consider a basic questionnaire as containing three main sections:
More informationMicroarray Data Analysis. A step by step analysis using BRB-Array Tools
Microarray Data Analysis A step by step analysis using BRB-Array Tools 1 EXAMINATION OF DIFFERENTIAL GENE EXPRESSION (1) Objective: to find genes whose expression is changed before and after chemotherapy.
More informationThe timecourse Package
The timecourse Package Yu huan Tai October 13, 2015 ontents Institute for Human Genetics, University of alifornia, San Francisco taiy@humgen.ucsf.edu 1 Overview 1 2 Longitudinal one-sample problem 2 2.1
More informationOpenArray Sample Tracker Software
QUICK REFERENCE OpenArray Sample Tracker Software For QuantStudio 12K Flex OpenArray Sample Block and the OpenArray Real-Time PCR System Publication Part Number 4460657 Rev. C Revision Date May 2012 Contents
More informationAnalytical Test Method Validation Report Template
Analytical Test Method Validation Report Template 1. Purpose The purpose of this Validation Summary Report is to summarize the finding of the validation of test method Determination of, following Validation
More informationQuantStudio 12K Flex Real-Time PCR System. The all-in-one qpcr instrument
QuantStudio 12K Flex Real-Time PCR System The all-in-one qpcr instrument Expand the boundaries of your research Life Technologies is taking qpcr to the next level. Designed for maximum throughput, flexibility,
More informationRegression Clustering
Chapter 449 Introduction This algorithm provides for clustering in the multiple regression setting in which you have a dependent variable Y and one or more independent variables, the X s. The algorithm
More informationGene Expression Assays
APPLICATION NOTE TaqMan Gene Expression Assays A mpl i fic ationef ficienc yof TaqMan Gene Expression Assays Assays tested extensively for qpcr efficiency Key factors that affect efficiency Efficiency
More informationUNSUPERVISED MACHINE LEARNING TECHNIQUES IN GENOMICS
UNSUPERVISED MACHINE LEARNING TECHNIQUES IN GENOMICS Dwijesh C. Mishra I.A.S.R.I., Library Avenue, New Delhi-110 012 dcmishra@iasri.res.in What is Learning? "Learning denotes changes in a system that enable
More informationQuantitative Real Time PCR Protocol. Stack Lab
Quantitative Real Time PCR Protocol Stack Lab Overview Real-time quantitative polymerase chain reaction (qpcr) differs from regular PCR by including in the reaction fluorescent reporter molecules that
More informationExport Pricing and Credit Constraints: Theory and Evidence from Greek Firms. Online Data Appendix (not intended for publication) Elias Dinopoulos
Export Pricing and Credit Constraints: Theory and Evidence from Greek Firms Online Data Appendix (not intended for publication) Elias Dinopoulos University of Florida Sarantis Kalyvitis Athens University
More informationCourse Objective This course is designed to give you a basic understanding of how to run regressions in SPSS.
SPSS Regressions Social Science Research Lab American University, Washington, D.C. Web. www.american.edu/provost/ctrl/pclabs.cfm Tel. x3862 Email. SSRL@American.edu Course Objective This course is designed
More informationObesity in America: A Growing Trend
Obesity in America: A Growing Trend David Todd P e n n s y l v a n i a S t a t e U n i v e r s i t y Utilizing Geographic Information Systems (GIS) to explore obesity in America, this study aims to determine
More informationMS Data Analysis I: Importing Data into Genespring and Initial Quality Control
Homework: Session 2 GENESPRING MS ONLINE TRAINING SESSION 2 MS Data Analysis I: Importing Data into Genespring and Initial Quality Control Introduction and Lab Overview: If you need help during completion
More informationDHL Data Mining Project. Customer Segmentation with Clustering
DHL Data Mining Project Customer Segmentation with Clustering Timothy TAN Chee Yong Aditya Hridaya MISRA Jeffery JI Jun Yao 3/30/2010 DHL Data Mining Project Table of Contents Introduction to DHL and the
More informationA Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling
A Procedure for Classifying New Respondents into Existing Segments Using Maximum Difference Scaling Background Bryan Orme and Rich Johnson, Sawtooth Software March, 2009 Market segmentation is pervasive
More informationMicroarray Analysis. The Basics. Thomas Girke. December 9, 2011. Microarray Analysis Slide 1/42
Microarray Analysis The Basics Thomas Girke December 9, 2011 Microarray Analysis Slide 1/42 Technology Challenges Data Analysis Data Depositories R and BioConductor Homework Assignment Microarray Analysis
More informationEM Clustering Approach for Multi-Dimensional Analysis of Big Data Set
EM Clustering Approach for Multi-Dimensional Analysis of Big Data Set Amhmed A. Bhih School of Electrical and Electronic Engineering Princy Johnson School of Electrical and Electronic Engineering Martin
More informationUsing Excel for inferential statistics
FACT SHEET Using Excel for inferential statistics Introduction When you collect data, you expect a certain amount of variation, just caused by chance. A wide variety of statistical tests can be applied
More informationScott Reierstad. Field Applications Scientist
Real-time PCR Training Scott Reierstad Field Applications Scientist Topics we ll cover Quick introduction to real-time PCR ABI-supported real-time chemistries Instrumentation Quantitation via Standard
More informationQuantitative proteomics background
Proteomics data analysis seminar Quantitative proteomics and transcriptomics of anaerobic and aerobic yeast cultures reveals post transcriptional regulation of key cellular processes de Groot, M., Daran
More informationNucleic Acid Purity Assessment using A 260 /A 280 Ratios
Nucleic Acid Purity Assessment using A 260 /A 280 Ratios A common practice in molecular biology is to perform a quick assessment of the purity of nucleic acid samples by determining the ratio of spectrophotometric
More informationSummary of important mathematical operations and formulas (from first tutorial):
EXCEL Intermediate Tutorial Summary of important mathematical operations and formulas (from first tutorial): Operation Key Addition + Subtraction - Multiplication * Division / Exponential ^ To enter a
More informationCHAPTER TWELVE TABLES, CHARTS, AND GRAPHS
TABLES, CHARTS, AND GRAPHS / 75 CHAPTER TWELVE TABLES, CHARTS, AND GRAPHS Tables, charts, and graphs are frequently used in statistics to visually communicate data. Such illustrations are also a frequent
More informationWeb-based Tools for the Analysis of DNA Microarrays. End of Project Report. Authors: P. Geeleher 1,2, A. Golden 3, J. Hinde 2 and D. G.
Web-based Tools for the Analysis of DNA Microarrays End of Project Report Project 5236 Authors: P. Geeleher 1,2, A. Golden 3, J. Hinde 2 and D. G. Morris 1 1 Teagasc, Animal Reproduction Department, Mellows
More informationMolecular Genetics: Challenges for Statistical Practice. J.K. Lindsey
Molecular Genetics: Challenges for Statistical Practice J.K. Lindsey 1. What is a Microarray? 2. Design Questions 3. Modelling Questions 4. Longitudinal Data 5. Conclusions 1. What is a microarray? A microarray
More informationA Streamlined Workflow for Untargeted Metabolomics
A Streamlined Workflow for Untargeted Metabolomics Employing XCMS plus, a Simultaneous Data Processing and Metabolite Identification Software Package for Rapid Untargeted Metabolite Screening Baljit K.
More informationPreciseTM Whitepaper
Precise TM Whitepaper Introduction LIMITATIONS OF EXISTING RNA-SEQ METHODS Correctly designed gene expression studies require large numbers of samples, accurate results and low analysis costs. Analysis
More informationValidating Microarray Data Using RT 2 Real-Time PCR Products
Validating Microarray Data Using RT 2 Real-Time PCR Products Introduction: Real-time PCR monitors the amount of amplicon as the reaction occurs. Usually, the amount of product is directly related to the
More informationGenEx User Guide. www.multid.se. Version 1.0. Copyrights 2001-2012 Multid Analyses AB. GenEx - Data Analysis Software
GenEx User Guide Version 1.0 Copyrights 2001-2012 Multid Analyses AB GenEx - Data Analysis Software www.multid.se 1 Introduction As the qpcr field advances, the design of experiments and the analysis of
More informationFinal Project Report
CPSC545 by Introduction to Data Mining Prof. Martin Schultz & Prof. Mark Gerstein Student Name: Yu Kor Hugo Lam Student ID : 904907866 Due Date : May 7, 2007 Introduction Final Project Report Pseudogenes
More informationABSORBENCY OF PAPER TOWELS
ABSORBENCY OF PAPER TOWELS 15. Brief Version of the Case Study 15.1 Problem Formulation 15.2 Selection of Factors 15.3 Obtaining Random Samples of Paper Towels 15.4 How will the Absorbency be measured?
More informationMicro RNAs: potentielle Biomarker für das. Blutspenderscreening
Micro RNAs: potentielle Biomarker für das Blutspenderscreening micrornas - Background Types of RNA -Coding: messenger RNA (mrna) -Non-coding (examples): Ribosomal RNA (rrna) Transfer RNA (trna) Small nuclear
More informationVisualization of Complex Survey Data: Regression Diagnostics
Visualization of Complex Survey Data: Regression Diagnostics Susan Hinkins 1, Edward Mulrow, Fritz Scheuren 3 1 NORC at the University of Chicago, 11 South 5th Ave, Bozeman MT 59715 NORC at the University
More informationChapter 5: Analysis of The National Education Longitudinal Study (NELS:88)
Chapter 5: Analysis of The National Education Longitudinal Study (NELS:88) Introduction The National Educational Longitudinal Survey (NELS:88) followed students from 8 th grade in 1988 to 10 th grade in
More informationNCSS Statistical Software
Chapter 06 Introduction This procedure provides several reports for the comparison of two distributions, including confidence intervals for the difference in means, two-sample t-tests, the z-test, the
More informationMolecular Assessment of Dried Blood Spot Quality during Development of a Novel Automated. Screening
Molecular Assessment of Dried Blood Spot Quality during Development of a Novel Automated in situ TREC qpcr Assay for SCID Screening J Bai, T Henry, J Benfer, S Berberich, T Kreman, and L DesJardin State
More informationBuilding risk prediction models - with a focus on Genome-Wide Association Studies. Charles Kooperberg
Building risk prediction models - with a focus on Genome-Wide Association Studies Risk prediction models Based on data: (D i, X i1,..., X ip ) i = 1,..., n we like to fit a model P(D = 1 X 1,..., X p )
More informationThe Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA
Paper 156-2010 The Forgotten JMP Visualizations (Plus Some New Views in JMP 9) Sam Gardner, SAS Institute, Lafayette, IN, USA Abstract JMP has a rich set of visual displays that can help you see the information
More informationExploratory data analysis (Chapter 2) Fall 2011
Exploratory data analysis (Chapter 2) Fall 2011 Data Examples Example 1: Survey Data 1 Data collected from a Stat 371 class in Fall 2005 2 They answered questions about their: gender, major, year in school,
More informationPractical Differential Gene Expression. Introduction
Practical Differential Gene Expression Introduction In this tutorial you will learn how to use R packages for analysis of differential expression. The dataset we use are the gene-summarized count data
More informationFactor Analysis. Sample StatFolio: factor analysis.sgp
STATGRAPHICS Rev. 1/10/005 Factor Analysis Summary The Factor Analysis procedure is designed to extract m common factors from a set of p quantitative variables X. In many situations, a small number of
More informationbitter is de pil Linos Vandekerckhove, MD, PhD
4//24 Current HIV care HIV copies/ ml plasma Viral load Welcome to the Digital droplet PCR age! bitter is de pil Linos Vandekerckhove, MD, PhD Latent HIV reservoir Time at Ghent University Hospital 2 HIV
More informationComparing Methods for Identifying Transcription Factor Target Genes
Comparing Methods for Identifying Transcription Factor Target Genes Alena van Bömmel (R 3.3.73) Matthew Huska (R 3.3.18) Max Planck Institute for Molecular Genetics Folie 1 Transcriptional Regulation TF
More information