LD-Plus. Visualizing SNP Statistics in the Context of Linkage. Disequilibrium



Similar documents
SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis

VISUAL INTEGRATION OF RESULTS FROM A LARGE DNA BIOBANK (BIOVU) USING SYNTHESIS-VIEW *

Statistical Analysis for Genetic Epidemiology (S.A.G.E.) Version 6.2 Graphical User Interface (GUI) Manual

Using SQL Developer. Copyright 2008, Oracle. All rights reserved.

LICENSE4J FLOATING LICENSE SERVER USER GUIDE

Tutorial on gplink. PLINK tutorial, December 2006; Shaun Purcell,

Installing Ruby on Windows XP

Importing and Exporting With SPSS for Windows 17 TUT 117

Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER

AES Crypt User Guide

CRSP MOVEit Cloud Getting Started Guide

IBM SPSS Statistics 20 Part 1: Descriptive Statistics

Ruby On Rails. CSCI 5449 Submitted by: Bhaskar Vaish

Integrate Rails into an Existing IIS Web infrastructure using Mongrel

Volume SYSLOG JUNCTION. User s Guide. User s Guide

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7

Help File. Version February, MetaDigger for PC

Backing Up TestTrack Native Project Databases

Code Estimation Tools Directions for a Services Engagement

RuleBender Tutorial

Introduction to Synoptic

Project 5 Twitter Analyzer Due: Fri :59:59 pm

Package snp.plotter. February 20, 2015

AmbrosiaMQ-MuleSource ESB Integration

This document was derived from simulation software created by Steve Robbins which was supported by NSF DUE

SAS Analyst for Windows Tutorial

Step by Step Guide to Importing Genetic Data into JMP Genomics

Computer Science and Engineering Linux Cisco VPN Client Installation and Setup Guide

This document presents the new features available in ngklast release 4.4 and KServer 4.2.

AWS Data Pipeline. Developer Guide API Version

a) Install the SDK into a directory of your choice (/opt/java/jdk1.5.0_11, /opt/java/jdk1.6.0_02, or YOUR_JAVA_HOME_DIR)

MySQL Quick Start Guide

CLC Server Command Line Tools USER MANUAL

WFU_DICOMtk v1.0. ANSIR Laboratory Wake Forest University School of Medicine

SPSS Introduction. Yi Li

OpenGeo Suite for Linux Release 3.0

Assignment 4 CPSC 217 L02 Purpose. Important Note. Data visualization

Scatter Plots with Error Bars

CASSI: Genome-Wide Interaction Analysis Software

Note: With v3.2, the DocuSign Fetch application was renamed DocuSign Retrieve.

1. Product Information

Hudson Continous Integration Server. Stefan Saasen,

Getting started manual

Encryption Wizard. You are ready to download the ewizard software. I've included the Quick Start below to guide you through the process.

CafePilot has 3 components: the Client, Server and Service Request Monitor (or SRM for short).

LAE Enterprise Server Installation Guide

Memory Management Simulation Interactive Lab

Assignment 2: More MapReduce with Hadoop

Online Backup Client User Manual Linux

Genomatic: an R package for DNA fragment analysis project management

NaviCell Data Visualization Python API

OLH: Oracle Loader for Hadoop OSCH: Oracle SQL Connector for Hadoop Distributed File System (HDFS)

Setting up PostgreSQL

SAS/GRAPH 9.2 ODS Graphics Editor. User s Guide

Summary Table for SolarWinds Web Help Desk

RIC 2007 SNAP: Symbolic Nuclear Analysis Package. Chester Gingrich USNRC/RES 3/13/07

RecoveryVault Express Client User Manual

OPTOFORCE DATA VISUALIZATION 3D

USER GUIDE. Snow Inventory Client for Unix Version Release date Document date

Beginner s Matlab Tutorial

Getting Started with Mplus Version 7.31 Demo for Mac OS X and Linux

SAIP 2012 Performance Engineering

Online Backup Linux Client User Manual

MySQL Quick Start Guide

Professional. SlickEdif. John Hurst IC..T...L. i 1 8 О 7» \ WILEY \ Wiley Publishing, Inc.

Web Document Clustering

CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler

Project 2 Database Design and ETL

Wavelink Avalanche Mobility Center Linux Reference Guide

MET 306. Activity 8a. Mechanism Design Creo 2.0 Level 7 POINT A GROUND LINK LINK 1 LINK 2 LINK 3 POINT B 10/15/2010 1

Online Backup Client User Manual

SNPbrowser Software v3.5

Network Probe User Guide

Entrust Certificate Services. Java Code Signing. User Guide. Date of Issue: December Document issue: 2.0

Building Ruby, Rails, LightTPD, and MySQL on Tiger

Summary Table for SolarWinds Web Help Desk

IUCLID 5 Guidance and support. Installation Guide Distributed Version. Linux - Apache Tomcat - PostgreSQL

ELECTRONIC DATA PROCESSOR (EDP) QUICKSTART FOR DATA PROVIDERS

Site Configuration SETUP GUIDE. Windows Hosts Single Workstation Installation. May08. May 08

INSTALLING MALTED 3.0 IN LINUX MALTED: INSTALLING THE SYSTEM IN LINUX. Installing Malted 3.0 in LINUX

NAS 253 Introduction to Backup Plan

EXERCISE 6: CREATING A FUSION PROJECT PART 1

User s Guide for Program CLAM (Classification Method)

Table of Contents. The RCS MINI HOWTO

Chapter 1: Getting Started

Configuring Microsoft IIS 5.0 With Pramati Server

A Sample OFBiz application implementing remote access via RMI and SOAP Table of contents

Kurz MODBUS Client User s Guide

Lab 3: Introduction to Data Acquisition Cards

Sterling Web. Localization Guide. Release 9.0. March 2010

Lab 1 Beginning C Program

Installing IBM Websphere Application Server 7 and 8 on OS4 Enterprise Linux

MS Data Analysis I: Importing Data into Genespring and Initial Quality Control

McAfee Firewall for Linux 8.0.0

Journal of Statistical Software

Structural Health Monitoring Tools (SHMTools)

Chapter 4 Displaying and Describing Categorical Data

FileNet System Manager Dashboard Help

Business Objects Enterprise version 4.1. Report Viewing

Transcription:

LD LD-Plus Visualizing SNP Statistics in the Context of Linkage Disequilibrium

Introduction LD-Plus is a data visualization script for the display of single SNP statistics in the context of linkage disequilibrium and haplotype structures. LD-Plus can display both continuous and categorical SNP statistics, and is designed to parse Haploview output to display both D' and r 2 LD plots, called or user-defined haplotype blocks, and haplotype frequencies. Example Output

Installation For Windows 1. Download and install Ruby. I used the one-click installer as the simplest option. http://rubyforge.org/frs/download.php/47082/ruby186-27_rc2.exe Enable RubyGems needs to be checked. Otherwise use defaults for installation. 2. Download RMagick windows gem from the rmagick-win32 section of http://rubyforge.org/frs/?group_id=12&release_id=35951 3. Extract to a temporary directory 4. Run ImageMagick-6.5.3-10-Q8-windows-dll.exe to install ImageMagick. Accept the default options. 5. Make sure RubyGems is up-to-date. If you are using an older version, update it now. Enter the command gem update --system **6. Installing the gem is very simple. Start a Command Prompt window. Use the cd command to make the temporary directory that contains the unzipped files the current directory. Enter this command: gem install rmagick --local You should see: Successfully installed rmagick-2.10.0-x86-mswin32 If instead you get the message Error installing gem rmagick[.gem]: buffer error then you didn't update RubyGems. Do so now and re-run the gem install command. 7. Download ld_plus.rb from http://chgr.mc.vanderbilt.edu/ritchielab/method.php?method=ldplus

For Linux 1. Download and install Ruby. See the following site for instructions on Linux and OS X. http://www.ruby-lang.org/en/downloads/ 2. Download and install ImageMagick. See the following site for instructions on Linux and OS X. http://rmagick.rubyforge.org/install2-linux.html http://rmagick.rubyforge.org/install-osx.html 3. Download ld_plus.rb from http://chgr.mc.vanderbilt.edu/ritchielab/method.php?method=ldplus Haploview Pre-processing Haploview is a popular software tool developed by the Broad Institute to process SNP data to assess linkage disequilibrium, haplotype structure, and basic association statistics in family-based and case-control data. Haploview is available at: http://www.broadinstitute.org/haploview/haploview Haploview can be executed as a command-line utility without a graphical user interface. In the Linux environment, this is accomplished by downloading the Java Archive (JAR) file for Haploview from the Broad website. The full list of command line options for Haploview is available here: http://www.broadinstitute.org/science/programs/medical-and-population-genetics/haploview/chapter-3- command-line-options If we have a PED file formatted collection of genotype data called MyData.ped, and a map file called MyData.map, we would execute Haploview like so to generate all necessary LD statistics and haplotype block calls for LD-Plus input.

java -jar Haploview.jar -n -pedfile MyData.ped -info MyData.map -skipcheck -blockoutput GAB -dprime This command will produce two files: MyData.ped.GABRIELblocks and MyData.ped.LD that contain haplotype block definitions (by the Gabriel et al. method) and D' and r 2 LD statistics, respectively. Continuous SNP Statistics Any continuous SNP statistic, such as p-values, Fsts, minor allele frequencies, and genotyping efficiency, can be displayed in LD-Plus as a line graph. Continuous statistics should be arranged in a tab-delimited file like so: SNP Fst -0.20.5 -Log(p-value) 0.016.2 rs13228494 0.1867 0.236862 rs2968855 0.0896 16.1181 rs2907948 0.1016 16.1181 rs11763311 0.0259 1.868209 rs2968853 0.0828 16.1181 The header for each column should contain the label for the statistic followed by space-delimited lower and upper bounds for the axis labels. In this example, Fst values range from -0.2 to 0.5 and the -Log(p-value) ranges from 0.0 to 16.2. It is important to note that statistic labels cannot contain spaces. Multiple Statistics can be shown in the same track and thus on the same axis. The following example would display three ethnic specific statistics on the same scale and track, with color and dash-patterns. The data series label within this track is specified after the axis ranges in the column header. SNP -Log(Exact) 0.016.2 AA/EA -Log(p-value) 0.016.2 AA/HC rs13228494 16.1181 0.236862 rs2968855 3.068303 16.1181 rs2907948 0.641694 16.1181 rs11763311 9.21034 1.868209 rs2968853 3.092243 16.1181

Categorical SNP Information LD-Plus can also display multiple binary SNP attributes to indicate inclusion of a SNP in a group. For example, exonic SNPs, SNPs in regulatory elements, or availability of SNPs on common genotyping platforms can be displayed. Categorical SNP attributes are specified in a tab-delimited text file like so: SNP Exonic Affymetrix_6.0 rs13228494 0 1 rs2968855 1 1 rs2907948 0 0 rs11763311 0 0 rs2968853 1 1 In this file, 1 indicates inclusion in the group and 0 indicates exclusion from the group. As with the continuous data file, column headers become titles for the SNP attribute track, and cannot contain spaces. Running LD-Plus If we have generated Haploview output files for MyData.ped as described above, and generated a continuous SNP information file, MyData.stats, and a categorical SNP information file, MyData.snpinfo, we would run From a command prompt, type: ruby ld_plus.rb -r -d -l MyData.ped.LD -i MyData.map -b MyData.ped.GABRIELblocks -a MyData.stats -s MyData.snpinfo This will generate the desired plot.