SOLiD Software Quick Facts



Similar documents
AccessData Corporation AD Lab System Specification Guide v1.1

Restricted Document. Pulsant Technical Specification

SBClient and Microsoft Windows Terminal Server (Including Citrix Server)

Readme File. Purpose. Introduction to Data Integration Management. Oracle s Hyperion Data Integration Management Release 9.2.

Hardware Requirements

risk2value System Requirements

SMART Product Drivers 11.3 for Windows and Mac computers

Improved Data Center Power Consumption and Streamlining Management in Windows Server 2008 R2 with SP1

How To Install Fcus Service Management Software On A Pc Or Macbook

Caching Software Performance Test: Microsoft SQL Server Acceleration with FlashSoft Software 3.8 for Windows Server

HP ExpertOne. HP2-T21: Administering HP Server Solutions. Table of Contents

Citrix XenServer from HP Getting Started Guide

Preparing to Deploy Reflection : A Guide for System Administrators. Version 14.1

FOCUS Service Management Software Version 8.5 for Passport Business Solutions Installation Instructions

HADOOP. Session 1: Introduction to Hadoop & Big Data. Session 2: Hadoop Distributed File Systems. Session 3: Administering Hadoop Cluster

Service Desk Self Service Overview

NETWRIX CHANGE NOTIFIER

Instant Chime for IBM Sametime Quick Start Guide

ATL: Atlas Transformation Language. ATL Installation Guide

1) Update the AccuBuild Program to the latest version Version or later.

FINRA Regulation Filing Application Batch Submissions

Microsoft has released Windows 8.1, a free upgrade to Windows 8. Follow the steps below to upgrade to Windows 8.1.

Application Note: 202

Identify Storage Technologies and Understand RAID

HP Point of Sale FAQ Warranty, Care Pack Service & Support. Limited warranty... 2 HP Care Pack Services... 3 Support... 3

KronoDesk Migration and Integration Guide Inflectra Corporation

FOCUS Service Management Software Version 8.5 for CounterPoint Installation Instructions

Avatier Identity Management Suite

An Oracle White Paper January Oracle WebLogic Server on Oracle Database Appliance

MaaS360 Cloud Extender

Installation Guide Marshal Reporting Console

Exercise 5 Server Configuration, Web and FTP Instructions and preparatory questions Administration of Computer Systems, Fall 2008

State of Wisconsin. File Server Service Service Offering Definition

Professional Training Courses

Creating automated reports using VBS AN 44

Helpdesk Support Tickets & Knowledgebase

Alexsys Team 2 Service Desk

Copyright 2013, SafeNet, Inc. All rights reserved. We have attempted to make these documents complete, accurate, and

Ten Steps for an Easy Install of the eg Enterprise Suite

AvePoint High Speed Migration Supplementary Tools

Safe PST Backup Enterprise Edition Administrator Guide

Uninstalling and Reinstalling on a Server Computer. Medical Director / PracSoft

How To Install An Orin Failver Engine On A Network With A Network Card (Orin) On A 2Gigbook (Orion) On An Ipad (Orina) Orin (Ornet) Ornet (Orn

Bitrix Intranet. Product Requirements

Customers FAQs for Webroot SecureAnywhere Identity Shield

Deployment Overview (Installation):

o How AD Query Works o Installation Requirements o Inserting your License Key o Selecting and Changing your Search Domain

ScaleIO Security Configuration Guide

Network Intrusion Detection

Outlook Plug-In. Send Conference Invites from Outlook. Downloading Outlook Plug-In CONFERENCING & COLLABORATION RESERVATIONLESS-PLUS

Table of Contents. This document is for informational purposes only. MICROSOFT MAKES NO WARRANTIES, EXPRESS OR IMPLIED, IN THIS SUMMARY.

xdb Configuration Guide

FAQs for Webroot SecureAnywhere Identity Shield

Identify Major Server Hardware Components

Fermilab Time & Labor Desktop Computer Requirements

TaskCentre v4.5 Send Message (SMTP) Tool White Paper

Telelink 6. Installation Manual

Using Sentry-go Enterprise/ASPX for Sentry-go Quick & Plus! monitors

New in this release. Sphere (October 2013)

DocAve 6 Supplementary Tools

SYSTEM MONITORING PLUG-IN FOR MICROSOFT SQL SERVER

Diagnosis and Troubleshooting

TaskCentre v4.5 SMTP Tool White Paper

Mobile Device Manager Admin Guide. Reports and Alerts

User Guide. Excel Data Management Pack (EDM-Pack) OnCommand Workflow Automation (WFA) Abstract PROFESSIONAL SERVICES. Date: December 2015

TaskCentre v4.5 File Transfer (FTP) Tool White Paper

ROSS RepliWeb Operations Suite for SharePoint. SSL User Guide

Software Update Notification

April 3, Release Notes

ACTIVITY MONITOR. Live view of remote desktops. You may easily have a look at any user s desktop.

Intransa VideoAppliance VA1020 Series Server/Storage Appliance V1.3

TaskCentre v4.5 MS SQL Server Trigger Tool White Paper

Readme File. Purpose. What is Translation Manager 9.3.1? Hyperion Translation Manager Release Readme

User Manual Brainloop Outlook Add-In. Version 3.4

Implementing SQL Manage Quick Guide

SaaS Listing CA Cloud Service Management

This guide is intended for administrators, who want to install, configure, and manage SAP Lumira, server for BI Platform

Datasheet. PV4E Management Software Features

LeadStreet Broker Guide

MapReduce Laboratory

Configuring, Monitoring and Deploying a Private Cloud with System Center 2012 Boot Camp

Introduction to Mindjet MindManager Server

ACTIVITY MONITOR Real Time Monitor Employee Activity Monitor

Diagnostic Manager Change Log

TaskCentre v4.5 Send Fax (Tobit) Tool White Paper

Best Practice - Pentaho BA for High Availability

1)What hardware is available for installing/configuring MOSS 2010?

Tech Notes Promise RAID

Product Documentation. New Features Guide. Version 9.7.5/XE6

The Relativity Appliance Installation Guide

Attunity RepliWeb SSL Guide

Software Distribution

Performance of an Infiniband cluster running MPI applications

BackupAssist SQL Add-on

Serv-U Distributed Architecture Guide

TERMS OF REFERENCE. Consultancy Services: The Development of a Cloud-Based Client Relationship Management Tool for CAIPA 1

Getting Started Guide

Getting started with Android

FUJITSU Software ServerView Suite ServerView PrimeCollect

Dreamweaver MX Templates

Transcription:

SOLiD Sftware Quick Facts 1. What s New in SOLiD Sftware Data Prcessing and Data Analysis... 2 2. SOLiD Data Prcessing Overview... 4 3. SOLiD 4 Result Size and On-Instrument Cluster... 5 4. SOLiD Offline Analysis Cluster Specificatin... 7 5. SOLiD BiScpe Sftware... 9 6. BiScpe v1.2 Sftware vs. Crna_Lite fr Offline Analysis... 12 7. Data Visualizatin... 13 1

1. What s New in SOLiD Sftware Data Prcessing and Data Analysis SOLiD Accuracy Enhancer Tl The SOLiD Accuracy Enhancer Tl (SAET) is a spectral alignment errr crrectin tl, which, when applied t raw data generated by the SOLiD platfrm, reduces the clr calling errr rate by factr f three t five withut having the reference genme. Decrease in errr rate imprves mapping, SNP calling, and de nv assembly results. Use the SAET t pre-prcess raw reads befre alignment. SAET reduces the clr calling errr rate by a factr f three t five withut having the reference genme. SAET is nt recmmended fr use with whle genme resequencing f large genmes where large > 600 Mbases. The decrease in errr rate imprves mapping, SNP calling, and de nv assembly results. Mapping becmes mre accurate and the number f mapped reads increases by 40-50%. SAET is available thrugh the BiScpe Sftware cmmand line nly. Histry tab Use the histry tab in the Web brwser t dwnlad r view files generated during a selected plug-in sessin. The histry feature is available nly frm the BiScpe Sftware web brwser. Barcde script The barcde script runs a given BiScpe Sftware data analysis n a set f barcde library read files in batch mde. Use the script t run simultaneus secndary r tertiary tests n barcded libraries. BAM frmat utput BiScpe Sftware secndary analysis (mapping and pairing) nw prduces a BAM file as the main alignment frmat. Mate pair and paired end analysis prduces a BAM file while a single file cnversin is needed fr fragment libraries. Depending n the utput filter selected, unmapped and secndary alignments can be included. HD-300 perfrmance HD-300 increases thrughput. BiScpe Sftware v1.2 can prcess larger files at increased thrughput, which results in mre reads while maintaining current speed and increasing density. 2

SNP detectin pst-errr file changes BiScpe Sftware nw prvides the ptin f pre-generated Prbe Errr files. The feature f prviding pre-generated files has the main advantage f time saved in bypassing the regeneratin f Prbe Errr files every time SNP detectin is executed. Using SOLiDBiScpe.cm Users f clud cmputing can wrk with BiScpe Sftware using SOLiDBiScpe.cm. This feature prvides the fllwing benefits: N up frnt (capital) cst and n cst cmmitments Pay as yu g; predictable peratinal csts Scale up/dwn as demand needs Uplad data via Internet r physical shipments f hard drives Fr mre infrmatin, please visit here. Fusin/Splicing A fusin junctin is a sectin f transcribed RNA that maps t an exn frm ne gene fllwed by an exn frm anther gene. It can ccur as the result f a translcatin, deletin, r chrmsmal inversin. It excludes exn-exn bundaries that arise frm alternative splicing fr a gene. ChIP-SEQ BiScpe Sftware v1.2 gives yu the ptin t perfrm ChIP-Seq resequencing thrugh the BiScpe Sftware brwser. Paired-end mapping Biscpe Sftware v1.2 added supprt fr paired end experiments in additin t mate pair. In a nrmal paired end experiment, the F5 and F3 tags matched t the genme n different strands and facing tward each ther, and satisfies a distance cnstraint determined by insert size. Exprt Cnfiguratin Yu can nw use the web brwser as well as the cmmand line t create the cnfiguratin file that is required t perfrm experiments n barcded libraries. 3

2. SOLiD System Data Prcessing Overview SOLiD Instrument Cntrl Sftware (ICS) Prvides autmated instrument peratin and submits data prcessing jbs fr primary analysis: Imaging Bead finding Image registratin Filtering Clr call SOLiD Experiment Tracking Systems (SETS) Web-based applicatin that enables users t view n-instrument real-time data and cmpleted run analysis reprts frm the SOLiD Analysis Tls. Secndary analysis (mapping) is enabled n-instrument, althugh it is recmmended t run secndary analysis ff-instrument using BiScpe Sftware. SOLiD BiScpe Sftware SOLiD BiScpe Sftware prvides a cmmand line and simple web interface that builds cnfiguratin files fr running applicatin-specific sequence analysis tls. The BiScpe Sftware framewrk enables the user t perfrm ff-instrument secndary and tertiary analyses, and it allws cnfigurable 4

biinfrmatics wrkflws fr resequencing (mapping, SNP finding (dibayes), cpy number variatins, inversins, small indels, large indels) and whle transcriptme analysis (mapping, splicing/fusin detectin, cunting, UCSC WIG Files creatin). Results can be exprted as BAM frmats. The resulting industrystandard files frm BiScpe Sftware can be used with third-party visualizatin and analysis sftware tls. 3. SOLiD 4 System Result Size and On-Instrument Cluster The SOLiD Sftware analysis pipeline generates results in BAM frmat cntaining base space sequence. Mapping results in BAM frmat cntain base space sequences, clr space sequences and quality values. The data sizes frm the SOLiD 4 System, as indicated in the table belw, are fr fully laded experiments and directly crrelate with the thrughput (ttal bases generated frm a run). Table 1: Result size generated n SOLiD 4 System sequencing (Assume 2 slides and depsitin densities f 300K beads/panel, 2357 panels) 50 nt tag/300k/panel,2357 panels Image data size Primary analysis Primary analysis data results size in.spch size (flatfile:.csfasta, frmat _QV.qual,.stats ) 1 slide 1 tag 1.84 TB 646 GB 170 GB 1 slide 2 tags 3.6 TB 1.29 TB 340 GB 2 slides 1 tag 3.6 TB 1.29 TB 340 GB 2 slides 2 tags 7.2 TB 2.58 TB 680 GB Nte: Image data is nt needed after analysis is cmplete and intensity files are n lnger required fr submissin int NCBI. 5

http://www.ncbi.nlm.nih.gv/traces/sra/static/sequence_read_archive_overvie w.pdf On-Instrument Cmpute Cluster Specificatin SOLiD 4 System Specificatin Cmputer Cmpnents 19-inch flat screen mnitr, muse and keybard Instrument cntrller Head nde Cmpute ndes (3) Shared strage Gigabit Switch Pwer distributin units (2) Pwer crds (2), attached Sftware Suite Instrument Cntrl Sftware v4.0 SOLiD Experimental Tracking System (SETS) v4.0 Instrument Cntrller Hardware: Intel Xen prcessrs Operating system: Micrsft Windws XP Prfessinal, Service Pack 2 Installed RAM:8 GB Hard disk strage: dual 250 GB SATA hard drives (RAID-1) Peripheral: CD-RW/DVD ROM, 19-inch flat screen mnitr, keybard, muse Head Nde Cmpute Ndes (each) Shared Strage Gigabit Switch Pwer Distributin Units Hardware: Intel Xen Quad Cre prcessrs (2) Operating system: 64-bit LINUX Installed RAM: 24 GB Hard disk strage: 6x 1 TB SATA hard drives (RAID-5) Hardware: Intel Xen Quad Cre prcessrs (2) Operating System: 64-bit LINUX Installed RAM: 24 GB Hard disk strage: 2 x1 TB SATA hard drives (RAID-0) Hard disk strage: 15x 1 TB SATA hard drives RAID-5 w/ ht spare 16 prt Gigabit Switch Rack PDU (2) with 16 utput cnnectr While full mapping is enabled n-instrument (cluster) fr small genmes (i.e. Bacteria), we d nt recmmend secndary analysis (mapping) n the instrument cluster. Instead we highly recmmend secndary analysis (mapping) using a separate ffline analysis cluster. Hwever, fr large genmes (i.e. 6

human), a sub-mapping can be perfrmed n-instrument fr quality assessment f the run nly. 4. Offline Analysis Cluster Specificatin The SOLiD 4 System allws cycle by cycle aut-exprt and manual exprt f primary analysis results t an ffline cluster where the secndary analysis can be perfrmed independently f the instrument. This feature enables the instrument t be utilized fr additinal experiments while secndary analyses are being perfrmed. Offline Analysis The ffline analysis cluster specificatin is shwn belw and represents the minimal and the recmmended (Penguin Cmputing) specificatin fr ffline data analysis. BiScpe Sftware Offline Cluster Specificatin Minimal Offline Cluster Specificatin Minimal Offline Cluster Specificatin Head Nde Cmpute Ndes (minimum 3 cmpute ndes) Gigabit Switch > 2 GHz prcessrs 16 GB RAM 100 GB strage lcal disk space fr OS+ sftware installatin > 2 GHz prcessrs > 2 GHz prcessrs,16+ GB RAM, 8+ cres per nde > 500 GB strage lcal disk space fr OS+ sftware installatin 1 GB Switch Operatin System Cents 4.x, 5.x RedHat 4.x, 5.x Strage >10TB 7

Penguin Entry Level Cluster fr SOLiD Offline Analysis 8

5. SOLiD BiScpe Sftware OVERVIEW SOLiD BiScpe Sftware is a framewrk fr biinfrmatics tls t perfrm ff-instrument secndary and tertiary analysis. BiScpe Sftware cnsists f a cllectin f biinfrmatics tls that are integrated int a single cmmand line shell. Additinally, BiScpe Sftware prvides a simple web interface t help build instructins (cnfiguratin files) t run these tls. BiScpe Sftware includes mapping, pairing, SNP finding, structural variatins and whle transcriptme analyses. FEATURES BiScpe Sftware 1.2, using a flexible pipeline architecture, enables maximum flexibility and ease f use fr perfrming high thrughput data analysis fr SOLiD instrument data. It cntains fllwing features: 9

Mapping BiScpe Sftware 1.2 features Applied Bisystems newly develped mapping algrithm MaxMapper, which drastically imprves the mapping rate and mapping speed ver the previus generatins f mapping techniques. It prduces mapped data that gives excellent sensitivity and specificity in SNP finding using Applied Bisystems dibayes SNP finding algrithm, a cmpnent in BiScpe Sftware. Pairing BiScpe Sftware 1.2 features Applied Bisystems updated pairing algrithm which handles mapped data frm MaxMapper. It prvides new pairing categries D and E and prvides reads file that d nt pair. Resequencing Applicatins BiScpe Sftware 1.2 features five resequencing applicatins: SNP Finding, Cpy Number Variatin fr Human, Small indel finding, Large indel finding and Inversin. SNP Finding BiScpe Sftware features Applied Bisystems latest SNP finding technlgy fr SOLiD System instrument data which allws sensitive and specific SNP detectin even at mderate t lw cverage. It allws varying levels f stringency and prvides full cntrl ver many filters fr custmizatin t fit particular experimental designs. CNV (Human) BiScpe Sftware features Applied Bisystems latest prgress n human cpy number variatin detectin technlgy which allws detecting variatins as small as 5KB and as large as the whle chrmsme in humans frm single sample sequences. It can detect such variatins at very lw cverage, even at 1x. Small Indel BiScpe Sftware features Applied Bisystems latest prgress n detecting small indel variatin. By using a nvel split read technique alngside a pwerful indel caller n pileups, is able t detect deletin up t 500bp and insertins up t 20bp. Large Indel BiScpe Sftware features Applied Bisystems latest prgress n identifying large insertins and deletins cmpared t a reference genme using SOLiD mate pair libraries. It is able t 10

identify large insertins and deletins (indels) frm 100bp t 100Kb with great cnfidence. It is able t accept multiple mate pair libraries t increase cverage. Inversin BiScpe Sftware features Applied Bisystems latest prgress n detecting genmic regins that are inverted with respect t reference. Taking advantage f lnger insert size and lwer inverted dimmer nise f SOLiD mate pair technlgy, it prduces a cnfident list f inversins Whle Transcriptme Analysis The Whle Transcriptme Analysis (WTA) in BiScpe Sftware aligns t a reference genme. With mapping results, it cunts the number f tags aligned with exns, and can cnvert the BAM file t WIG fr display f cverage n the UCSC genme brwser. WTA als supprts experiments in fusin detectin Create UCSC WIG file This plug-in takes the BAM file and cnverts it int WIG files cntaining cverage data. Cverage is the number f reads cvering a given genme stranded psitin. Cunt knwn exns Given a BAM file f mapped reads and predefined regins (such as exns) prvided by the user in a.gtf frmat file, this plugin generates tag cunts fr anntated regins. Fusin, splicing A fusin junctin is a sectin f transcribed RNA that maps t an exn frm ne gene fllwed by an exn frm anther gene. It can ccur as the result f a translcatin, deletin, r chrmsmal inversin. A fusin junctin excludes exn-exn bundaries that arise frm alternative splicing fr a gene. There are five mdels f alternative splicing: Exn skipping r cassette exn: in this case, an exn may be spliced ut f the primary transcript r retained. This is the mst cmmn mde in mammalian pre-mrnas Mutually exclusive exns: One f tw exns is retained in mrnas after splicing, but nt bth. Alternative dnr site: An alternative 5 splice junctin (dnr site) is used, changing the 3 bundary f the upstream exn. 11

Alternative acceptr site: An alternative 3 splice junctin (acceptr site) is used, changing the 5 bundary f the dwnstream exn. Intrn retentin: A sequence may be spliced ut as an intrn r simply retained. This is distinguished frm exn skipping because the retained sequence is nt flanked by intrns. If the retained intrn is in the cding regin, the intrn must encde amin acids in frame with the neighbring exns.. 6. B iscpe 1.2 Sftware vs. Crna_Lite fr Offline Analysis Offline Data Analysis BiScpe 1.2 Crna_Lite v4.2 Analysis executin Integrated cmmand line and Integrated cmmand. Simple web interface Can run batch mde. Prgramming Java Scripting languages language User Interface fr parameter setting and analysis GUI (Brwser interface f BiScpe); Cmmand line interface Expanded rich parameter setting thrugh cmmand-line interface Multiple run cmbinatin analysis Mapping Algrithm Max Mapper fr SOLiD 4 MapReads System Default Mapping Max Mapper: Anchr and Full length with fixed setting Extend number f mismatches Iterative Mapping User cnfigurable NO/Manual Multi-threading N SNP algrithm DiBayes SNP caller Integrated small indel analysis Integrated large indel analysis Integrated Human CNV analysis N N 12

Integrated Inversin analysis Integrated Whle N Transcriptme analysis Output results frmat SAM/BAM utput. Fasta-like matching utput (including unique match and all matches), paired mates, Optinal GFF v0.2. SNP list text file Stats File New frmat: add extensin Old Stats file infrmatin and quality value Speed Optimized cmpute perfrmance fr cmplex Supprt cmplex genme analysis genme analysis Warranty N N AB supprt t end users Supprted OS Linux CentOS v4.x, PBS (Trque), PBS pr and SGE Linux, PBS, LSF, SGE 7. Data Visualizatin The BiScpe Sftware pipeline will generate reads results in BAM frmat including Base Space Sequence. It can be visualized in brwsers such as UCSC and Brad Institute s Integrative Genmics Viewer (IGV): 13

Fr research use nly. Nt intended fr any animal r human therapeutic r diagnstic use. 2010 Life Technlgies Crpratin. All rights reserved. The trademarks mentined herein are the prperty f Life Technlgies Crpratin r their respective wners. 14