Tutorial for the WGCNA package for R: I. Network analysis of liver expression data in female mice
|
|
|
- Maurice Bryan
- 9 years ago
- Views:
Transcription
1 Tutorial for the WGCNA package for R: I. Network analysis of liver expression data in female mice 2.b Step-by-step network construction and module detection Peter Langfelder and Steve Horvath November 25, 2014 Contents 0 Preliminaries: setting up the R session 1 2 Step-by-step construction of the gene network and identification of modules 2 2.b Step-by-step network construction and module detection b.1 Choosing the soft-thresholding power: analysis of network topology b.2 Co-expression similarity and adjacency b.3 Topological Overlap Matrix (TOM) b.4 Clustering using TOM b.5 Merging of modules whose expression profiles are very similar Preliminaries: setting up the R session Here we assume that a new R session has just been started. We load the WGCNA package, set up basic parameters and load data saved in the first part of the tutorial. Important note: The code below uses parallel computation where multiple cores are available. This works well when R is run from a terminal or from the Graphical User Interface (GUI) shipped with R itself, but at present it does not work with RStudio and possibly other third-party R environments. If you use RStudio or other third-party R environments, skip the enablewgcnathreads() call below. # Display the current working directory getwd(); # If necessary, change the path below to the directory where the data files are stored. # "." means current directory. On Windows use a forward slash / instead of the usual \. workingdir = "."; setwd(workingdir); # Load the WGCNA package library(wgcna) # The following setting is important, do not omit. options(stringsasfactors = FALSE); # Allow multi-threading within WGCNA. At present this call is necessary. # Any error here may be ignored but you may want to update WGCNA if you see one. # Caution: skip this line if you run RStudio or other third-party R environments. # See note above. enablewgcnathreads() # Load the data saved in the first part lnames = load(file = "FemaleLiver-01-dataInput.RData");
2 #The variable lnames contains the names of loaded variables. lnames We have loaded the variables datexpr and dattraits containing the expression and trait data, respectively. 2 Step-by-step construction of the gene network and identification of modules This step is the bedrock of all network analyses using the WGCNA methodology. We present three different ways of constructing a network and identifying modules: a. Using a convenient 1-step network construction and module detection function, suitable for users wishing to arrive at the result with minimum effort; b. Step-by-step network construction and module detection for users who would like to experiment with customized/alternate methods; c. An automatic block-wise network construction and module detection method for users who wish to analyze data sets too large to be analyzed all in one. In this tutorial section, we illustrate the step-by-step network construction and module detection. 2.b Step-by-step network construction and module detection 2.b.1 Choosing the soft-thresholding power: analysis of network topology Constructing a weighted gene network entails the choice of the soft thresholding power β to which co-expression similarity is raised to calculate adjacency [1]. The authors of [1] have proposed to choose the soft thresholding power based on the criterion of approximate scale-free topology. We refer the reader to that work for more details; here we illustrate the use of the function picksoftthreshold that performs the analysis of network topology and aids the user in choosing a proper soft-thresholding power. The user chooses a set of candidate powers (the function provides suitable default values), and the function returns a set of network indices that should be inspected, for example as follows: # Choose a set of soft-thresholding powers powers = c(c(1:10), seq(from = 12, to=20, by=2)) # Call the network topology analysis function sft = picksoftthreshold(datexpr, powervector = powers, verbose = 5) # Plot the results: sizegrwindow(9, 5) par(mfrow = c(1,2)); cex1 = 0.9; # Scale-free topology fit index as a function of the soft-thresholding power plot(sft$fitindices[,1], -sign(sft$fitindices[,3])*sft$fitindices[,2], xlab="soft Threshold (power)",ylab="scale Free Topology Model Fit,signed R^2",type="n", main = paste("scale independence")); text(sft$fitindices[,1], -sign(sft$fitindices[,3])*sft$fitindices[,2], labels=powers,cex=cex1,col="red"); # this line corresponds to using an R^2 cut-off of h abline(h=0.90,col="red") # Mean connectivity as a function of the soft-thresholding power plot(sft$fitindices[,1], sft$fitindices[,5], xlab="soft Threshold (power)",ylab="mean Connectivity", type="n", main = paste("mean connectivity")) text(sft$fitindices[,1], sft$fitindices[,5], labels=powers, cex=cex1,col="red") The result is shown in Fig. 1. We choose the power 6, which is the lowest power for which the scale-free topology fit index reaches 0.90.
3 Scale independence Mean connectivity Scale Free Topology Model Fit,signed R^ Soft Threshold (power) Mean Connectivity Soft Threshold (power) Figure 1: Analysis of network topology for various soft-thresholding powers. The left panel shows the scale-free fit index (y-axis) as a function of the soft-thresholding power (x-axis). The right panel displays the mean connectivity (degree, y-axis) as a function of the soft-thresholding power (x-axis). 2.b.2 Co-expression similarity and adjacency We now calculate the adjacencies, using the soft thresholding power 6: softpower = 6; adjacency = adjacency(datexpr, power = softpower); 2.b.3 Topological Overlap Matrix (TOM) To minimize effects of noise and spurious associations, we transform the adjacency into Topological Overlap Matrix, and calculate the corresponding dissimilarity: # Turn adjacency into topological overlap TOM = TOMsimilarity(adjacency); disstom = 1-TOM 2.b.4 Clustering using TOM We now use hierarchical clustering to produce a hierarchical clustering tree (dendrogram) of genes. Note that we use the function hclust that provides a much faster hierarchical clustering routine than the standard hclust function. # Call the hierarchical clustering function genetree = hclust(as.dist(disstom), method = "average"); # Plot the resulting clustering tree (dendrogram) sizegrwindow(12,9) plot(genetree, xlab="", sub="", main = "Gene clustering on TOM-based dissimilarity", labels = FALSE, hang = 0.04); The clustering dendrogram plotted by the last command is shown in Figure 2. In the clustering tree (dendrogram), each leaf, that is a short vertical line, corresponds to a gene. Branches of the dendrogram group together densely interconnected, highly co-expressed genes. Module identification amounts to the identification of individual branches ( cutting the branches off the dendrogram ). There are several methods for branch cutting; our standard method is the Dynamic Tree Cut from the package dynamictreecut. The next snippet of code illustrates its use.
4 Gene clustering on TOM based dissimilarity Height Figure 2: Clustering dendrogram of genes, with dissimilarity based on topological overlap. # We like large modules, so we set the minimum module size relatively high: minmodulesize = 30; # Module identification using dynamic tree cut: dynamicmods = cutreedynamic(dendro = genetree, distm = disstom, deepsplit = 2, pamrespectsdendro = FALSE, minclustersize = minmodulesize); table(dynamicmods) The function returned 22 modules labeled 1 22 largest to smallest. Label 0 is reserved for unassigned genes. The above command lists the sizes of the modules. We now plot the module assignment under the gene dendrogram: # Convert numeric lables into colors dynamiccolors = labels2colors(dynamicmods) table(dynamiccolors) # Plot the dendrogram and colors underneath sizegrwindow(8,6) plotdendroandcolors(genetree, dynamiccolors, "Dynamic Tree Cut", dendrolabels = FALSE, hang = 0.03, addguide = TRUE, guidehang = 0.05, main = "Gene dendrogram and module colors") The result is shown in Fig. 3.
5 Gene dendrogram and module colors Height Dynamic Tree Cut as.dist(disstom) hclust (*, "average") Figure 3: Clustering dendrogram of genes, with dissimilarity based on topological overlap, together with assigned module colors. 2.b.5 Merging of modules whose expression profiles are very similar The Dynamic Tree Cut may identify modules whose expression profiles are very similar. It may be prudent to merge such modules since their genes are highly co-expressed. To quantify co-expression similarity of entire modules, we calculate their eigengenes and cluster them on their correlation: # Calculate eigengenes MEList = moduleeigengenes(datexpr, colors = dynamiccolors) MEs = MEList$eigengenes # Calculate dissimilarity of module eigengenes MEDiss = 1-cor(MEs); # Cluster module eigengenes METree = hclust(as.dist(mediss), method = "average"); # Plot the result sizegrwindow(7, 6) plot(metree, main = "Clustering of module eigengenes", xlab = "", sub = "") We choose a height cut of 0.25, corresponding to correlation of 0.75, to merge (see Fig. 4):
6 MEDissThres = 0.25 # Plot the cut line into the dendrogram abline(h=medissthres, col = "red") # Call an automatic merging function merge = mergeclosemodules(datexpr, dynamiccolors, cutheight = MEDissThres, verbose = 3) # The merged module colors mergedcolors = merge$colors; # Eigengenes of the new merged modules: mergedmes = merge$newmes; To see what the merging did to our module colors, we plot the gene dendrogram again, with the original and merged module colors underneath (Figure 5). sizegrwindow(12, 9) #pdf(file = "Plots/geneDendro-3.pdf", wi = 9, he = 6) plotdendroandcolors(genetree, cbind(dynamiccolors, mergedcolors), c("dynamic Tree Cut", "Merged dynamic"), dendrolabels = FALSE, hang = 0.03, addguide = TRUE, guidehang = 0.05) #dev.off() In the subsequent analysis, we will use the merged module colors in mergedcolors. We save the relevant variables for use in subsequent parts of the tutorial: # Rename to modulecolors modulecolors = mergedcolors # Construct numerical labels corresponding to the colors colororder = c("grey", standardcolors(50)); modulelabels = match(modulecolors, colororder)-1; MEs = mergedmes; # Save module colors and labels for use in subsequent parts save(mes, modulelabels, modulecolors, genetree, file = "FemaleLiver-02-networkConstruction-stepByStep.RData") References [1] B. Zhang and S. Horvath. A general framework for weighted gene co-expression network analysis. Statistical Applications in Genetics and Molecular Biology, 4(1):Article 17, 2005.
7 Consensus clustering of consensus module eigengenes Height MElightgreen MElightyellow MEblack MEturquoise MEbrown MEroyalblue MEgreen MEred MEpurple MEtan MEmagenta MEyellow MEblue MEmidnightblue MEdarkgreen MEsalmon MEgrey MEpink MEdarkred MEcyan MEgreenyellow MEgrey60 MElightcyan Figure 4: Clustering dendrogram of genes, with dissimilarity based on topological overlap, together with assigned merged module colors and the original module colors.
8 Cluster Dendrogram Height Dynamic Tree Cut Merged dynamic d hclust (*, "average") Figure 5: Clustering dendrogram of genes, with dissimilarity based on topological overlap, together with assigned merged module colors and the original module colors.
Tutorial for the WGCNA package for R: I. Network analysis of liver expression data in female mice
Tutorial for the WGCNA package for R: I. Network analysis of liver expression data in female mice 2.a Automatic network construction and module detection Peter Langfelder and Steve Horvath November 25,
Tutorial for the WGCNA package for R I. Network analysis of liver expression data in female mice
Tutorial for the WGCNA package for R I. Network analysis of liver expression data in female mice 2.c Dealing with large data sets: block-wise network construction and module detection Peter Langfelder
Methods for network visualization and gene enrichment analysis July 17, 2013. Jeremy Miller Scientist I [email protected]
Methods for network visualization and gene enrichment analysis July 17, 2013 Jeremy Miller Scientist I [email protected] Outline Visualizing networks using R Visualizing networks using outside
Statistical Applications in Genetics and Molecular Biology
Statistical Applications in Genetics and Molecular Biology Volume 4, Issue 1 2005 Article 17 A General Framework for Weighted Gene Co-Expression Network Analysis Bin Zhang Steve Horvath Departments of
A General Framework for Weighted Gene Co-expression Network Analysis
Please cite: Statistical Applications in Genetics and Molecular Biology (2005). A General Framework for Weighted Gene Co-expression Network Analysis Bin Zhang and Steve Horvath Departments of Human Genetics
Cluster Analysis using R
Cluster analysis or clustering is the task of assigning a set of objects into groups (called clusters) so that the objects in the same cluster are more similar (in some sense or another) to each other
Enron Network Analysis Tutorial r date()
Enron Network Analysis Tutorial r date() Enron Tutorial We provide this Enron Tutorial as an appendix to the paper in Journal of Statistical Education, Network Analysis with the Enron Email Corpus. The
COC131 Data Mining - Clustering
COC131 Data Mining - Clustering Martin D. Sykora [email protected] Tutorial 05, Friday 20th March 2009 1. Fire up Weka (Waikako Environment for Knowledge Analysis) software, launch the explorer window
Journal of Statistical Software
JSS Journal of Statistical Software October 2006, Volume 16, Code Snippet 3. http://www.jstatsoft.org/ LDheatmap: An R Function for Graphical Display of Pairwise Linkage Disequilibria between Single Nucleotide
Clustering. 15-381 Artificial Intelligence Henry Lin. Organizing data into clusters such that there is
Clustering 15-381 Artificial Intelligence Henry Lin Modified from excellent slides of Eamonn Keogh, Ziv Bar-Joseph, and Andrew Moore What is Clustering? Organizing data into clusters such that there is
Distances, Clustering, and Classification. Heatmaps
Distances, Clustering, and Classification Heatmaps 1 Distance Clustering organizes things that are close into groups What does it mean for two genes to be close? What does it mean for two samples to be
A Demonstration of Hierarchical Clustering
Recitation Supplement: Hierarchical Clustering and Principal Component Analysis in SAS November 18, 2002 The Methods In addition to K-means clustering, SAS provides several other types of unsupervised
Each function call carries out a single task associated with drawing the graph.
Chapter 3 Graphics with R 3.1 Low-Level Graphics R has extensive facilities for producing graphs. There are both low- and high-level graphics facilities. The low-level graphics facilities provide basic
7 Time series analysis
7 Time series analysis In Chapters 16, 17, 33 36 in Zuur, Ieno and Smith (2007), various time series techniques are discussed. Applying these methods in Brodgar is straightforward, and most choices are
Visualizing non-hierarchical and hierarchical cluster analyses with clustergrams
Visualizing non-hierarchical and hierarchical cluster analyses with clustergrams Matthias Schonlau RAND 7 Main Street Santa Monica, CA 947 USA Summary In hierarchical cluster analysis dendrogram graphs
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller
Tutorial 3: Graphics and Exploratory Data Analysis in R Jason Pienaar and Tom Miller Getting to know the data An important first step before performing any kind of statistical analysis is to familiarize
Designing forms for auto field detection in Adobe Acrobat
Adobe Acrobat 9 Technical White Paper Designing forms for auto field detection in Adobe Acrobat Create electronic forms more easily by using the right elements in your authoring program to take advantage
UNSUPERVISED MACHINE LEARNING TECHNIQUES IN GENOMICS
UNSUPERVISED MACHINE LEARNING TECHNIQUES IN GENOMICS Dwijesh C. Mishra I.A.S.R.I., Library Avenue, New Delhi-110 012 [email protected] What is Learning? "Learning denotes changes in a system that enable
Hierarchical Clustering Analysis
Hierarchical Clustering Analysis What is Hierarchical Clustering? Hierarchical clustering is used to group similar objects into clusters. In the beginning, each row and/or column is considered a cluster.
Basic Analysis of Microarray Data
Basic Analysis of Microarray Data A User Guide and Tutorial Scott A. Ness, Ph.D. Co-Director, Keck-UNM Genomics Resource and Dept. of Molecular Genetics and Microbiology University of New Mexico HSC Tel.
Time series clustering and the analysis of film style
Time series clustering and the analysis of film style Nick Redfern Introduction Time series clustering provides a simple solution to the problem of searching a database containing time series data such
Getting Started with Command Prompts
Getting Started with Command Prompts Updated March, 2013 Some courses such as TeenCoder : Java Programming will ask the student to perform tasks from a command prompt (Windows) or Terminal window (Mac
TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:
Table of contents: Access Data for Analysis Data file types Format assumptions Data from Excel Information links Add multiple data tables Create & Interpret Visualizations Table Pie Chart Cross Table Treemap
Table of Contents. Online backup Manager User s Guide
Table of Contents Backup / Restore VMware Virtual Machines... Error! Bookmark not defined. Backup virtual machines running on VMware ESXi / ESX Server with VDDK / non VDDK... 2 Requirements and recommendations...
Module 9. User Interface Design. Version 2 CSE IIT, Kharagpur
Module 9 User Interface Design Lesson 21 Types of User Interfaces Specific Instructional Objectives Classify user interfaces into three main types. What are the different ways in which menu items can be
Learning Module 4 - Thermal Fluid Analysis Note: LM4 is still in progress. This version contains only 3 tutorials.
Learning Module 4 - Thermal Fluid Analysis Note: LM4 is still in progress. This version contains only 3 tutorials. Attachment C1. SolidWorks-Specific FEM Tutorial 1... 2 Attachment C2. SolidWorks-Specific
Bioinformatics: Network Analysis
Bioinformatics: Network Analysis Graph-theoretic Properties of Biological Networks COMP 572 (BIOS 572 / BIOE 564) - Fall 2013 Luay Nakhleh, Rice University 1 Outline Architectural features Motifs, modules,
An Order-Invariant Time Series Distance Measure [Position on Recent Developments in Time Series Analysis]
An Order-Invariant Time Series Distance Measure [Position on Recent Developments in Time Series Analysis] Stephan Spiegel and Sahin Albayrak DAI-Lab, Technische Universität Berlin, Ernst-Reuter-Platz 7,
Cluster software and Java TreeView
Cluster software and Java TreeView To download the software: http://bonsai.hgc.jp/~mdehoon/software/cluster/software.htm http://bonsai.hgc.jp/~mdehoon/software/cluster/manual/treeview.html Cluster 3.0
Clustering UE 141 Spring 2013
Clustering UE 141 Spring 013 Jing Gao SUNY Buffalo 1 Definition of Clustering Finding groups of obects such that the obects in a group will be similar (or related) to one another and different from (or
Smart Connection 9 Element Labels
08 Smart Connection 9 Element Labels This document is part of the documentation for Smart Connection 9 and is an extract from the former Smart Connection 9 User Guide for InDesign. For more information
Text Analytics using High Performance SAS Text Miner
Text Analytics using High Performance SAS Text Miner Edward R. Jones, Ph.D. Exec. Vice Pres.; Texas A&M Statistical Services Abstract: The latest release of SAS Enterprise Miner, version 13.1, contains
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER JMP Genomics Step-by-Step Guide to Bi-Parental Linkage Mapping Introduction JMP Genomics offers several tools for the creation of linkage maps
PRODUCT INFORMATION. Insight+ Uses and Features
PRODUCT INFORMATION Insight+ Traditionally, CAE NVH data and results have been presented as plots, graphs and numbers. But, noise and vibration must be experienced to fully comprehend its effects on vehicle
This chapter will demonstrate how to perform multiple linear regression with IBM SPSS
CHAPTER 7B Multiple Regression: Statistical Methods Using IBM SPSS This chapter will demonstrate how to perform multiple linear regression with IBM SPSS first using the standard method and then using the
Mathematics (Project Maths Phase 1)
2012. M128 S Coimisiún na Scrúduithe Stáit State Examinations Commission Leaving Certificate Examination, 2012 Sample Paper Mathematics (Project Maths Phase 1) Paper 2 Ordinary Level Time: 2 hours, 30
Tutorial for proteome data analysis using the Perseus software platform
Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information
Cluster Analysis. Aims and Objectives. What is Cluster Analysis? How Does Cluster Analysis Work? Postgraduate Statistics: Cluster Analysis
Aims and Objectives By the end of this seminar you should: Cluster Analysis Have a working knowledge of the ways in which similarity between cases can be quantified (e.g. single linkage, complete linkage
Scientific Graphing in Excel 2010
Scientific Graphing in Excel 2010 When you start Excel, you will see the screen below. Various parts of the display are labelled in red, with arrows, to define the terms used in the remainder of this overview.
Visualization Plugin for ParaView
Alexey I. Baranov Visualization Plugin for ParaView version 1.3 Springer Contents 1 Visualization with ParaView..................................... 1 1.1 ParaView plugin installation.................................
Steven M. Ho!and. Department of Geology, University of Georgia, Athens, GA 30602-2501
CLUSTER ANALYSIS Steven M. Ho!and Department of Geology, University of Georgia, Athens, GA 30602-2501 January 2006 Introduction Cluster analysis includes a broad suite of techniques designed to find groups
There are six different windows that can be opened when using SPSS. The following will give a description of each of them.
SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet
Describe the Create Profile dialog box. Discuss the Update Profile dialog box.examine the Annotate Profile dialog box.
A profile represents the ground surface along a specified path. A profile of the horizontal alignment showing the existing surface ground line is required before creating the vertical alignment, also known
JustClust User Manual
JustClust User Manual Contents 1. Installing JustClust 2. Running JustClust 3. Basic Usage of JustClust 3.1. Creating a Network 3.2. Clustering a Network 3.3. Applying a Layout 3.4. Saving and Loading
Essay 5 Tutorial for a Three-Dimensional Heat Conduction Problem Using ANSYS Workbench
Essay 5 Tutorial for a Three-Dimensional Heat Conduction Problem Using ANSYS Workbench 5.1 Introduction The problem selected to illustrate the use of ANSYS software for a three-dimensional steadystate
DeCyder Extended Data Analysis module Version 1.0
GE Healthcare DeCyder Extended Data Analysis module Version 1.0 Module for DeCyder 2D version 6.5 User Manual Contents 1 Introduction 1.1 Introduction... 7 1.2 The DeCyder EDA User Manual... 9 1.3 Getting
Formulas, Functions and Charts
Formulas, Functions and Charts :: 167 8 Formulas, Functions and Charts 8.1 INTRODUCTION In this leson you can enter formula and functions and perform mathematical calcualtions. You will also be able to
LabVIEW Day 6: Saving Files and Making Sub vis
LabVIEW Day 6: Saving Files and Making Sub vis Vern Lindberg You have written various vis that do computations, make 1D and 2D arrays, and plot graphs. In practice we also want to save that data. We will
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP
Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP ABSTRACT In data mining modelling, data preparation
Quick Start. Creating a Scoring Application. RStat. Based on a Decision Tree Model
Creating a Scoring Application Based on a Decision Tree Model This Quick Start guides you through creating a credit-scoring application in eight easy steps. Quick Start Century Corp., an electronics retailer,
Pavement Management System Overview
TABLE OF CONTENTS The Home Screen... 3 The Gutter... 4 Icons... 4 Quick Links... 5 Miscellaneous... 6 PMS Menus... 7 Setup Menu... 7 Construction Setup... 7 1. Material Codes... 8 2. Standard Sections...
Visualization with OpenDX
Alexey I. Baranov Visualization with OpenDX User s Guide Springer Contents 1 Visualization with OpenDX..................................... 1 1.1 OpenDX module installation.................................
PURPOSE OF GRAPHS YOU ARE ABOUT TO BUILD. To explore for a relationship between the categories of two discrete variables
3 Stacked Bar Graph PURPOSE OF GRAPHS YOU ARE ABOUT TO BUILD To explore for a relationship between the categories of two discrete variables 3.1 Introduction to the Stacked Bar Graph «As with the simple
Package copa. R topics documented: August 9, 2016
Package August 9, 2016 Title Functions to perform cancer outlier profile analysis. Version 1.41.0 Date 2006-01-26 Author Maintainer COPA is a method to find genes that undergo
WEKA Explorer User Guide for Version 3-4-3
WEKA Explorer User Guide for Version 3-4-3 Richard Kirkby Eibe Frank November 9, 2004 c 2002, 2004 University of Waikato Contents 1 Launching WEKA 2 2 The WEKA Explorer 2 Section Tabs................................
Cluster Analysis. Isabel M. Rodrigues. Lisboa, 2014. Instituto Superior Técnico
Instituto Superior Técnico Lisboa, 2014 Introduction: Cluster analysis What is? Finding groups of objects such that the objects in a group will be similar (or related) to one another and different from
2 SYSTEM DESCRIPTION TECHNIQUES
2 SYSTEM DESCRIPTION TECHNIQUES 2.1 INTRODUCTION Graphical representation of any process is always better and more meaningful than its representation in words. Moreover, it is very difficult to arrange
SAS Analyst for Windows Tutorial
Updated: August 2012 Table of Contents Section 1: Introduction... 3 1.1 About this Document... 3 1.2 Introduction to Version 8 of SAS... 3 Section 2: An Overview of SAS V.8 for Windows... 3 2.1 Navigating
Tutorial: 3D Pipe Junction Using Hexa Meshing
Tutorial: 3D Pipe Junction Using Hexa Meshing Introduction In this tutorial, you will generate a mesh for a three-dimensional pipe junction. After checking the quality of the first mesh, you will create
Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data
Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data The Illumina TopHat Alignment and Cufflinks Assembly and Differential Expression apps make RNA data analysis accessible to any user, regardless
Package bigrf. February 19, 2015
Version 0.1-11 Date 2014-05-16 Package bigrf February 19, 2015 Title Big Random Forests: Classification and Regression Forests for Large Data Sets Maintainer Aloysius Lim OS_type
Protein Protein Interaction Networks
Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics
R software tutorial: Random Forest Clustering Applied to Renal Cell Carcinoma Steve Horvath and Tao Shi
R software tutorial: Random Forest Clustering Applied to Renal Cell Carcinoma Steve orvath and Tao Shi Correspondence: [email protected] Department of uman Genetics and Biostatistics University
Planning Terrestrial Radio Networks
Planning Terrestrial Radio Networks Lab Exercise Manual IK2500 - Radio Communication, Basic Course September 23, 2008 Short Summary The scope of this lab is to help students to develop basic skills in
Knowledge Discovery and Data Mining
Knowledge Discovery and Data Mining Lecture 15 - ROC, AUC & Lift Tom Kelsey School of Computer Science University of St Andrews http://tom.home.cs.st-andrews.ac.uk [email protected] Tom Kelsey ID5059-17-AUC
Practical Differential Gene Expression. Introduction
Practical Differential Gene Expression Introduction In this tutorial you will learn how to use R packages for analysis of differential expression. The dataset we use are the gene-summarized count data
ECONOMICS 351* -- Stata 10 Tutorial 2. Stata 10 Tutorial 2
Stata 10 Tutorial 2 TOPIC: Introduction to Selected Stata Commands DATA: auto1.dta (the Stata-format data file you created in Stata Tutorial 1) or auto1.raw (the original text-format data file) TASKS:
sample median Sample quartiles sample deciles sample quantiles sample percentiles Exercise 1 five number summary # Create and view a sorted
Sample uartiles We have seen that the sample median of a data set {x 1, x, x,, x n }, sorted in increasing order, is a value that divides it in such a way, that exactly half (i.e., 50%) of the sample observations
Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study
Analyzing the Effect of Treatment and Time on Gene Expression in Partek Genomics Suite (PGS) 6.6: A Breast Cancer Study The data for this study is taken from experiment GSE848 from the Gene Expression
SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard
p. 1 SPSS Tutorial, Feb. 7, 2003 Prof. Scott Allard The following tutorial is a guide to some basic procedures in SPSS that will be useful as you complete your data assignments for PPA 722. The purpose
They can be obtained in HQJHQH format directly from the home page at: http://www.engene.cnb.uam.es/downloads/kobayashi.dat
HQJHQH70 *XLGHG7RXU This document contains a Guided Tour through the HQJHQH platform and it was created for training purposes with respect to the system options and analysis possibilities. It is not intended
Step-by-Step Guide to Basic Expression Analysis and Normalization
Step-by-Step Guide to Basic Expression Analysis and Normalization Page 1 Introduction This document shows you how to perform a basic analysis and normalization of your data. A full review of this document
Arena Tutorial 1. Installation STUDENT 2. Overall Features of Arena
Arena Tutorial This Arena tutorial aims to provide a minimum but sufficient guide for a beginner to get started with Arena. For more details, the reader is referred to the Arena user s guide, which can
Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs
Types of Variables Chapter 1: Looking at Data Section 1.1: Displaying Distributions with Graphs Quantitative (numerical)variables: take numerical values for which arithmetic operations make sense (addition/averaging)
SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis
SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis Goal: This tutorial introduces several websites and tools useful for determining linkage disequilibrium
Exiqon Array Software Manual. Quick guide to data extraction from mircury LNA microrna Arrays
Exiqon Array Software Manual Quick guide to data extraction from mircury LNA microrna Arrays March 2010 Table of contents Introduction Overview...................................................... 3 ImaGene
MPLAB X + CCS C Compiler Tutorial
MPLAB X + CCS C Compiler Tutorial How to install the CCS C Compiler inside MPLAB X Before the CCS C Compiler can be used inside MPLAB X, the CCS C MPLAB X Plug-in must be installed. This process can be
Guide for Data Visualization and Analysis using ACSN
Guide for Data Visualization and Analysis using ACSN ACSN contains the NaviCell tool box, the intuitive and user- friendly environment for data visualization and analysis. The tool is accessible from the
Visual Tutorial Basic Edition 1. Visual. Basic Edition Tutorial. www.visuallightingsoftware.com
Visual Tutorial Basic Edition 1 Visual Basic Edition Tutorial www.visuallightingsoftware.com Visual Tutorial Basic Edition 2 Basic Edition Tutorial Introduction In this tutorial, you will use the Visual
How To Cluster Of Complex Systems
Entropy based Graph Clustering: Application to Biological and Social Networks Edward C Kenley Young-Rae Cho Department of Computer Science Baylor University Complex Systems Definition Dynamically evolving
Classifying Large Data Sets Using SVMs with Hierarchical Clusters. Presented by :Limou Wang
Classifying Large Data Sets Using SVMs with Hierarchical Clusters Presented by :Limou Wang Overview SVM Overview Motivation Hierarchical micro-clustering algorithm Clustering-Based SVM (CB-SVM) Experimental
LSA SAF products: files and formats
LSA SAF products: files and formats Carla Barroso, IPMA Application of Remote Sensing Data for Drought Monitoring Introduction to Eumetsat LANDSAF Products 11-15 November Slovenia OUTLINE Where to get
Create Charts in Excel
Create Charts in Excel Table of Contents OVERVIEW OF CHARTING... 1 AVAILABLE CHART TYPES... 2 PIE CHARTS... 2 BAR CHARTS... 3 CREATING CHARTS IN EXCEL... 3 CREATE A CHART... 3 HOW TO CHANGE THE LOCATION
Categorical Data Visualization and Clustering Using Subjective Factors
Categorical Data Visualization and Clustering Using Subjective Factors Chia-Hui Chang and Zhi-Kai Ding Department of Computer Science and Information Engineering, National Central University, Chung-Li,
How To Cluster
Data Clustering Dec 2nd, 2013 Kyrylo Bessonov Talk outline Introduction to clustering Types of clustering Supervised Unsupervised Similarity measures Main clustering algorithms k-means Hierarchical Main
QuantStudio 3D AnalysisSuite Software: Relative Quantification
HELP QuantStudio 3D AnalysisSuite Software: Relative Quantification for use with QuantStudio 3D Digital PCR System and QuantStudio 3D AnalysisSuite Server System Publication Number MAN0009636 Revision
Period04 User Guide. Abstract. 1 Introduction. P. Lenz, M. Breger
Comm. in Asteroseismology Vol. 146, 2005 Period04 User Guide P. Lenz, M. Breger Department of Astronomy, Univ. Vienna, Türkenschanzstrasse 17, 1180 Vienna, Austria Abstract Period04, an extended version
A Short Guide to R with RStudio
Short Guides to Microeconometrics Fall 2013 Prof. Dr. Kurt Schmidheiny Universität Basel A Short Guide to R with RStudio 1 Introduction 2 2 Installing R and RStudio 2 3 The RStudio Environment 2 4 Additions
Big Data: Rethinking Text Visualization
Big Data: Rethinking Text Visualization Dr. Anton Heijs [email protected] Treparel April 8, 2013 Abstract In this white paper we discuss text visualization approaches and how these are important
Lab 0: Preparing your laptop for the course OS X
Lab 0: Preparing your laptop for the course OS X Four pieces of software are needed to complete this course: 1. VMD Views and analyses molecular models. 2. NAMD Performs molecular dynamics simulations.
Two-Way ANOVA tests. I. Definition and Applications...2. II. Two-Way ANOVA prerequisites...2. III. How to use the Two-Way ANOVA tool?...
Two-Way ANOVA tests Contents at a glance I. Definition and Applications...2 II. Two-Way ANOVA prerequisites...2 III. How to use the Two-Way ANOVA tool?...3 A. Parametric test, assume variances equal....4
Strong and Weak Ties
Strong and Weak Ties Web Science (VU) (707.000) Elisabeth Lex KTI, TU Graz April 11, 2016 Elisabeth Lex (KTI, TU Graz) Networks April 11, 2016 1 / 66 Outline 1 Repetition 2 Strong and Weak Ties 3 General
Immunophenotyping Flow Cytometry Tutorial. Contents. Experimental Requirements...1. Data Storage...2. Voltage Adjustments...3. Compensation...
Immunophenotyping Flow Cytometry Tutorial Contents Experimental Requirements...1 Data Storage...2 Voltage Adjustments...3 Compensation...5 Experimental Requirements For immunophenotyping with FITC and
Neural Networks Lesson 5 - Cluster Analysis
Neural Networks Lesson 5 - Cluster Analysis Prof. Michele Scarpiniti INFOCOM Dpt. - Sapienza University of Rome http://ispac.ing.uniroma1.it/scarpiniti/index.htm [email protected] Rome, 29
2. Simple Linear Regression
Research methods - II 3 2. Simple Linear Regression Simple linear regression is a technique in parametric statistics that is commonly used for analyzing mean response of a variable Y which changes according
Exercise 1: How to Record and Present Your Data Graphically Using Excel Dr. Chris Paradise, edited by Steven J. Price
Biology 1 Exercise 1: How to Record and Present Your Data Graphically Using Excel Dr. Chris Paradise, edited by Steven J. Price Introduction In this world of high technology and information overload scientists
