Package empiricalfdr.deseq2



Similar documents
Package dunn.test. January 6, 2016

Package tagcloud. R topics documented: July 3, 2015

Statistics in Retail Finance. Chapter 2: Statistical models of default

Package GSA. R topics documented: February 19, 2015

Package survpresmooth

Package missforest. February 20, 2015

Package bigdata. R topics documented: February 19, 2015

Package retrosheet. April 13, 2015

Package cpm. July 28, 2015

Package HHG. July 14, 2015

Practical Differential Gene Expression. Introduction

Package MDM. February 19, 2015

Package gazepath. April 1, 2015

From Reads to Differentially Expressed Genes. The statistics of differential gene expression analysis using RNA-seq data

Package TSfame. February 15, 2013

Package ChannelAttribution

Package MBA. February 19, Index 7. Canopy LIDAR data

Package sendmailr. February 20, 2015

Package ERP. December 14, 2015

Tutorial for proteome data analysis using the Perseus software platform

Package metafuse. November 7, 2015

p-values and significance levels (false positive or false alarm rates)

Package hazus. February 20, 2015

Package uptimerobot. October 22, 2015

STATISTICA Formula Guide: Logistic Regression. Table of Contents

Package httprequest. R topics documented: February 20, 2015

Package treemap. February 15, 2013

Optimization of sampling strata with the SamplingStrata package

Gene Expression Analysis

Package EstCRM. July 13, 2015

Comparing Functional Data Analysis Approach and Nonparametric Mixed-Effects Modeling Approach for Longitudinal Data Analysis

Package png. February 20, 2015

Package hier.part. February 20, Index 11. Goodness of Fit Measures for a Regression Hierarchy

Package cgdsr. August 27, 2015

Package pesticides. February 20, 2015

Package nortest. R topics documented: July 30, Title Tests for Normality Version Date

Course on Functional Analysis. ::: Gene Set Enrichment Analysis - GSEA -

Generalized Linear Models

Package CIFsmry. July 10, Index 6

RNA-seq. Quantification and Differential Expression. Genomics: Lecture #12

Statistical issues in the analysis of microarray data

Package changepoint. R topics documented: November 9, Type Package Title Methods for Changepoint Detection Version 2.

Package neuralnet. February 20, 2015

Package benford.analysis

General Method: Difference of Means. 3. Calculate df: either Welch-Satterthwaite formula or simpler df = min(n 1, n 2 ) 1.

Package EasyHTMLReport

NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )

Package translater. R topics documented: February 20, Type Package

Package hive. January 10, 2011

Package trimtrees. February 20, 2015

Package copa. R topics documented: August 9, 2016

Pearson's Correlation Tests

Package erp.easy. September 26, 2015

Package COSINE. February 19, 2015

Package smoothhr. November 9, 2015

Survey, Statistics and Psychometrics Core Research Facility University of Nebraska-Lincoln. Log-Rank Test for More Than Two Groups

Package fimport. February 19, 2015

Prediction of Business Process Model Quality based on Structural Metrics

Package plan. R topics documented: February 20, 2015

Package ATE. R topics documented: February 19, Type Package Title Inference for Average Treatment Effects using Covariate. balancing.

Big Data Analytics. Tools and Techniques

Package lmertest. July 16, 2015

> plot(exp.btgpllm, main = "treed GP LLM,", proj = c(1)) > plot(exp.btgpllm, main = "treed GP LLM,", proj = c(2)) quantile diff (error)

Regression Analysis: A Complete Example

Package TRADER. February 10, 2016

Comparative genomic hybridization Because arrays are more than just a tool for expression analysis

2013 MBA Jump Start Program. Statistics Module Part 3

SAS Software to Fit the Generalized Linear Model

Two-Group Hypothesis Tests: Excel 2013 T-TEST Command

The Variability of P-Values. Summary

Tutorial 5: Hypothesis Testing

ITSM-R Reference Manual

Minería de Datos ANALISIS DE UN SET DE DATOS.! Visualization Techniques! Combined Graph! Charts and Pies! Search for specific functions

Package AMORE. February 19, 2015

Package obs.agree. February 20, 2015

Package SHELF. February 5, 2016

Business Statistics. Successful completion of Introductory and/or Intermediate Algebra courses is recommended before taking Business Statistics.

Package depend.truncation

LOGNORMAL MODEL FOR STOCK PRICES

The Wondrous World of fmri statistics

Factors affecting online sales

Cross Validation techniques in R: A brief overview of some methods, packages, and functions for assessing prediction models.

Step-by-Step Guide to Basic Expression Analysis and Normalization

Package CoImp. February 19, 2015

Package pdfetch. R topics documented: July 19, 2015

Package sjdbc. R topics documented: February 20, 2015

Package CRM. R topics documented: February 19, 2015

Simple Linear Regression Inference

Statistics I for QBIC. Contents and Objectives. Chapters 1 7. Revised: August 2013

Two-Sample T-Tests Assuming Equal Variance (Enter Means)

Package RDAVIDWebService

Package bigrf. February 19, 2015

Package hoarder. June 30, 2015

September Population analysis of the Retriever (Flat Coated) breed

Maximally Selected Rank Statistics in R

The F distribution and the basic principle behind ANOVAs. Situating ANOVAs in the world of statistical tests

Package brewdata. R topics documented: February 19, Type Package

Two-Sample T-Tests Allowing Unequal Variance (Enter Difference)

Package hybridensemble

The power of a test is the of. by using a particular and a. value of the that is an to the value

Transcription:

Type Package Package empiricalfdr.deseq2 May 27, 2015 Title Simulation-Based False Discovery Rate in RNA-Seq Version 1.0.3 Date 2015-05-26 Author Mikhail V. Matz Maintainer Mikhail V. Matz <matz@utexas.edu> Auxiliary functions for the DESeq2 package to simulate read counts according to the null hypothesis (i.e., with empirical sample size factors, per-gene total counts and dispersions, but without effects of predictor variables) and to compute the empirical false discovery rate. License GPL-3 Depends DESeq2,GenomicRanges NeedsCompilation no Repository CRAN Date/Publication 2015-05-27 08:55:36 R topics documented: empiricalfdr.deseq2-package.............................. 2 empiricalfdr........................................ 3 fdrbicurve......................................... 4 fdrtable........................................... 5 simulatecounts....................................... 7 Index 9 1

2 empiricalfdr.deseq2-package empiricalfdr.deseq2-package Simulation-Based False Discovery Rate in RNA-Seq Details Auxiliary functions for the DESeq2 package to simulate read counts according to the null hypothesis (i.e., with empirical sample size factors, per-gene total counts and dispersions, but without effects of predictor variables) and to compute the empirical false discovery rate. Package: empiricalfdr.deseq2 Type: Package Version: 1.0.2 Date: 2015-04-01 License: GPL-3 The key function is simulatecounts, which takes a fitted DESeq2 data object as an input and returns a simulated data object with the same sample size factors, total counts and dispersions for each gene as in real data, but without the effect of predictor variables. Functions fdrtable, fdrbicurve and empiricalfdr compare the DESeq2 results obtained for the real and simulated data, compute the empirical false discovery rate (the ratio of the number of differentially expressed genes detected in the simulated data and their number in the real data) and plot the results. Mikhail V. Matz Maintainer: Mikhail V. Matz <matz@utexas.edu>

empiricalfdr 3 empiricalfdr Computing the p-value cutoff to achieve a given FDR. This function calculates the cutoff at which a particular false discovery rate is observed using loess smoothing and interpolation. Usage empiricalfdr(fdr.table, FDR = 0.1, maxlogp = 5, plot = FALSE, span = 0.1,...) Arguments Value fdr.table FDR The output of fdrtable(): a dataframe listing p-value cutoffs and the number of null hypothesis rejections at each cutoff in the real and simulated datasets. The target false discovery rate. maxlogp Maximal negative decimal logarithm of the p-value for plotting (the default, 5, implies the data for p-values better than 10e-5 will not be plotted) plot span Whether to produce a plot. span parameter for loess smoothing.... Additional plotting parameters. The function returns a single value, which is the p-value cutoff at which the target FDR is observed. With plot = TRUE, also plots the observed experimental FDRs with the loess smoother. Mikhail V. Matz

4 fdrbicurve fdrbicurve Plots the numbers of null hypothesis rejections Plots the numbers of null hypothesis rejections in real data and data simulated under null hypothesis (false positives) Usage fdrbicurve(fdr.table, maxlogp = 5,...) Arguments fdr.table The output of fdrtable(): a dataframe listing p-value cutoffs and the number of null hypothesis rejections at each cutoff in the real and simulated datasets. maxlogp Maximal negative decimal logarithm of the p-value for plotting (the default, 5, implies the data for p-values better than 10e-5 will not be plotted)... Additional plotting parameters.

fdrtable 5 Value The plot is designed to ascertain that the number of discoveries in real data (black line) indeed exceeds the number of false positives (red line) across the range of p-value cutoffs. The grey dotted line is the number of discoveries expected under uniform distribution of p-values. Mikhail V. Matz fdrtable Computes false discovery rates for a series of p-value cutoffs. Given vectors of p-values from real data and data simulated under a null hypothesis, produces a table listing the number of null hypothesis rejections under a range of p-value cutoffs in real and simulated data.

6 fdrtable Usage fdrtable(real.p, sim.p) Arguments real.p sim.p Vector of p-values in the real data. Vector of p-values in the simulated data. Value The function returns a dataframe listing p-value cutoffs, in steps of 0.1 on the decimal log scale, the number of null hypothesis rejections at each cutoff in real and simulated datasets, and their ratio (the false discovery rate). Mikhail V. Matz

simulatecounts 7 simulatecounts Simulating RNA-seq read counts Usage This function takes a fitted DESeq2 data object as an input and returns a simulated data object with the same sample size factors, total counts and dispersions for each gene as in real data, but without the effect of predictor variables. simulatecounts(deseq.object) Arguments deseq.object DESeq2 data object, with estimated size factors and dispersions (output of DE- Seq() function). Details Value For each gene, the total counts are randomly resampled into different samples. The estimated pergene dispersions are understood as the suare of the coefficient of variation and used to simulate random deviations in per-sample assignment probability. The probabilities of per-sample assignment are also weighted by the empirical sample size factors. DESeq2 data object Mikhail V. Matz

8 simulatecounts

Index Topic DESeq2 Topic RNA-seq Topic false discovery rate Topic simulation empiricalfdr.deseq2 (empiricalfdr.deseq2-package), 2 empiricalfdr.deseq2-package, 2 9