Practical Solutions for Big Data Analytics
Transcription
1 Practical Solutions for Big Data Analytics Ravi Madduri Computation Institute Paul Dave Dinanath Sulakhe Alex Rodriguez
2 Urban Science High energy physics Genomics Molecular biology Cosmology Metagenomics Economics Climate change Linguistics Visual arts
3 We are a non-profit organization of researchers, developers, and bioinformaticians, building solutions for the advancement of research in various fields
4 Our vision for a 21st century discovery infrastructure To provide more capability for more people at substantially lower cost
5 Agenda
12:30pm Challenges for biomedical analysis at scale
12:45pm Best practices and solution components
1:00pm Introduction to Globus Genomics, Globus Transfer
1:30pm Exercise: Transferring raw NGS datasets from sequencing centers and sharing
2:00pm Refreshment Break
2:15pm Demonstration of QC pipeline, Exome, RNA, Whole Genome analysis
2:30pm Exercise: Running example pipelines with sample data sets
2:45pm Running analysis at scale using Globus Genomics
3:15pm Exercise/Demonstration: Executing Exome, RNA, QC pipelines at scale
3:45pm Interactive Q&A and Session Wrap-up
6 All materials are available at: Please complete the sign-up form. Thank you!
7 Challenges in Biomedical analysis at scale
8 I need:
to get my sequence data from the NGS core to the lab for analysis.
to easily, quickly, and reliably move or mirror (some or all of) my data to other places: lab server, HPC cluster, desktop, public cloud server.
to easily and securely share my data with my colleagues at other institutions.
to make my data available for others to replicate my experiments.
a good place to back up and archive my research data, at a reasonable price.
9 Managing data should be easy Staging Store Ingest Store Registry Scale-out Store Analysis Store Community Store Archive Mirror
10 ...but it's hard and frustrating! Staging Store, Ingest Store, Registry, Scale-out Store, Analysis Store, Community Store, Archive, Mirror. Failure points: expired credentials, NATs, firewalls, permission denied, failed server / net.
11 Challenges in Sequencing Analysis
Data Movement and Access Challenges:
Data is distributed in different locations
Research labs need access to the data for analysis
Be able to share data with other researchers/collaborators
Inefficient ways of data movement
Data needs to be available on the local and distributed compute resources: local clusters, cloud, grid
Manual Data Analysis (once we have the sequence data, how do we analyze it?):
Manually move the data to the compute node
Install all the tools required for the analysis: BWA, Picard, GATK, filtering scripts, etc.
Shell scripts to sequentially execute the tools
Manually modify the scripts for any change
Error prone, difficult to keep track, messy
Difficult to maintain and transfer the knowledge
Diagram labels: Sequencing Centers, Seq Center, Public Data, Research Lab, Storage, Local Cluster/Cloud; Fastq, Ref Genome, Alignment, Picard, GATK, Variant Calling; Install, Modify, (Re)Run Script
12 Solutions for Biomedical analysis at scale
13 Research Data Management as a Service Reduced data Facility data acquisition Globus transfer service Globus data publication service Globus sharing service Analysis/Sharing
14 What is Globus? Big data transfer, sharing, publication and discovery simply, securely, and fast directly from your own storage systems
15 Reliable, secure, high-performance file transfer and synchronization Fire-and-forget transfers Automatic fault recovery Data Source 2 Globus moves and syncs files Data Destination Seamless security integration 1 User initiates transfer request 3 Globus notifies user
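The idea of a transfer that syncs files and recovers from faults on its own can be illustrated with a small local sketch. This is not how Globus is implemented and it does not use any Globus API; it only assumes two local directories and shows the general pattern of skipping files that already match, verifying each copy by checksum, and retrying on failure. The names `sha256_of`, `sync_tree`, and `max_retries` are hypothetical.

```python
import hashlib
import shutil
from pathlib import Path
from typing import List


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def sync_tree(src: Path, dst: Path, max_retries: int = 3) -> List[str]:
    """Copy files from src to dst when missing or different.

    Returns the relative paths that were actually copied (hypothetical helper).
    """
    copied = []
    for source_file in sorted(p for p in src.rglob("*") if p.is_file()):
        relative = source_file.relative_to(src)
        target = dst / relative
        # Files that already match are skipped, so re-running after a
        # failure only redoes the work that is still outstanding.
        if target.exists() and sha256_of(target) == sha256_of(source_file):
            continue
        target.parent.mkdir(parents=True, exist_ok=True)
        for attempt in range(1, max_retries + 1):
            try:
                shutil.copyfile(source_file, target)
                # Verify the copy before counting it as done.
                if sha256_of(target) != sha256_of(source_file):
                    raise OSError("checksum mismatch after copy")
                copied.append(relative.as_posix())
                break
            except OSError:
                if attempt == max_retries:
                    raise
    return copied
```

Calling `sync_tree` a second time on unchanged directories copies nothing, which is the property that makes a "fire-and-forget" request safe to repeat.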
16 Simple, secure sharing off existing storage systems Easily share large data with any user or group No cloud storage required 2 Globus tracks shared files; no need to move files to cloud storage! Data Source 1 User A selects file(s) to share, selects user or group, and sets permissions 3 User B logs in to Globus and accesses shared file
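The sharing steps on this slide (select files, select a user or group, set permissions, then the other user accesses the files in place) can be sketched as a small permission check. This is an illustrative model only, not the Globus sharing service or its API; the `Share` class, the `can_access` function, and the `"r"` / `"rw"` permission strings are assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Share:
    """One shared path with per-user and per-group permissions ("r" or "rw")."""
    path: str
    user_perms: dict = field(default_factory=dict)
    group_perms: dict = field(default_factory=dict)


def can_access(share, user, groups, requested_path, mode="r"):
    """Return True if the user may use requested_path in the given mode."""
    base = share.path.rstrip("/")
    # The request must be the shared path itself or something beneath it.
    inside = requested_path == base or requested_path.startswith(base + "/")
    if not inside:
        return False
    # Collect permissions granted directly and through group membership.
    granted = [share.user_perms.get(user, "")]
    granted += [share.group_perms.get(group, "") for group in groups]
    return any(mode in perm for perm in granted)


# User A shares a folder read-only with one user and read-write with a group.
share = Share(
    "/data/exome",
    user_perms={"userB": "r"},
    group_perms={"BioIT2015": "rw"},
)
print(can_access(share, "userB", [], "/data/exome/sample_1.fastq"))
```

The files never leave the source storage in this model; only the record of who may read or write them is kept, which mirrors the "no need to move files to cloud storage" point above.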
17 Globus is SaaS Web, command line, and REST interfaces Reduced IT operational costs New features automatically available Consolidated support & troubleshooting Easy to add your laptop, server, cluster, supercomputer, etc. with Globus Connect
18 globus genomics Flexible, scalable, affordable genomics analysis for all biologists
19 Globus Genomics: Galaxy-Based Workflow Management System
Data Management: Globus Online provides a high-performance, fault-tolerant, secure file transfer service between all data endpoints (Sequencing Centers, Public Data, Research Lab, Storage, Local Cluster/Cloud, Galaxy Data Libraries)
Data Analysis: Globus Online integrated within Galaxy; web-based UI; drag-and-drop workflow creation; easily modify workflows with new tools; analytical tools are automatically run on the scalable compute resources when possible; Galaxy on Cluster/Cloud
Workflow diagram: Fastq, Ref Genome, Alignment, Picard, GATK, Variant Calling
20 Globus Genomics Workflows can be easily defined and automated with integrated Galaxy Platform capabilities Data movement is streamlined with integrated Globus file-transfer functionality Resources can be provisioned on-demand with Amazon Web Services cloud based infrastructure
22 Exercise: Transferring raw NGS datasets from sequencing centers and sharing
23 Exercise 1: Account Signup
1. Go to: globus.org/signup
2. Create your Globus account
3. Validate your email address
4. Optional: Log in with your campus/InCommon identity
24 Exercise 2: Transfer, Sharing, Group Management
1. Install Globus Connect Personal
2. Move file(s) from esnet#anl-diskpt1 to your laptop
3. Check your email for a notification on successful transfer
4. Go to globus.org/groups and search for the group named BioIT2015. Click Join the Group
25 Exercise 3: Transferring data from a Sequencing center
1. Log in to
2. Click on the Browse and Get Data using Globus Online tool in the left-hand panel
3. Start typing the name of the endpoint sulakhe#sequencingcenter
4. Log in to the endpoint with username genomics, password globus
5. Select the forward Exome file under Exome-seq-sample data and click Execute
6. Repeat the above step for the reverse file
26 Break Resume at 2:15pm
28 Demonstration of QC pipeline, Exome, RNA, Whole Genome analysis
29 RNA-Seq Analysis Workflow Differential Expression Data Transfer Quality Control Alignment Read / Gene Count Data Transfer
30 RNA-Seq Analysis Workflow (Stage 1) Data Transfer Quality Control Quality Control Read Alignment Transcript Assemble / Transcript Count
31 RNA-Seq Analysis Workflow (Stage 2) Tuxedo Package CummeRBund Cuffdiff Cuffmerge DEXSeq DESeq Stage 1 Inputs
32 Exome / Whole Genome Analysis Stage 1 Inputs Alignment BAM Cleanup GATK Realignment Variant Calling
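The four stages named on this slide form a simple chain in which each stage consumes the previous stage's output. A minimal sketch of that chaining is shown here; it only plans file names and runs no real tools. The stage names come from the slide, while the function `plan_pipeline` and the file suffixes are hypothetical choices for illustration.

```python
# Stage names are taken from the slide; the suffixes are illustrative only.
STAGES = [
    ("Alignment", ".aligned.bam"),
    ("BAM Cleanup", ".cleaned.bam"),
    ("GATK Realignment", ".realigned.bam"),
    ("Variant Calling", ".vcf"),
]


def plan_pipeline(sample, forward_fastq, reverse_fastq):
    """Return an ordered list of steps, each feeding its output to the next."""
    steps = []
    current_inputs = [forward_fastq, reverse_fastq]
    for stage_name, suffix in STAGES:
        output = sample + suffix
        steps.append(
            {"stage": stage_name, "inputs": current_inputs, "output": output}
        )
        # The next stage reads what this stage wrote.
        current_inputs = [output]
    return steps


for step in plan_pipeline("sample1", "sample1_R1.fastq", "sample1_R2.fastq"):
    print(step["stage"], step["inputs"], "->", step["output"])
```

Writing the chain down as data rather than as a hand-edited shell script is the same motivation the earlier slides give for using a workflow system.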
34 Running example pipelines with sample data sets
35 Exercise 4: Exome Analysis Workflow
1. Log in to
2. Copy dbsnp and 1000G files from: Shared Data -> Data Libraries -> Reference Data Library (into the same history)
3. On the Main page, click on the Workflow for Illumina Exome-Seq and import the workflow
4. Run Workflow: Click the Workflow tab, select the imported Exome workflow, and click Run
5. Input Parameters: Select the appropriate input files (forward and reverse, as well as reference files) for each step
6. Click the Run Workflow button
36 Exercise 5: Exome Analysis Workflow Try the Exome Workflow with Transfer jobs as inputs (under Published Workflows)
37 Running Batch Job Create workflow Generate user API Key (if necessary) Download and fill out workflow table file Upload table file Submit
38 Running Batch Job (Workflow)
39 Running Batch Job (API Key)
40 Running Batch Job (Download)
41 Running Batch Job (Fill out)
42 Running Batch Job (Upload)
43 Running Batch Job (Submit)
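The "fill out workflow table file" step can be scripted when there are many samples. The sketch below builds a tab-separated table with one row per sample. The actual table format expected by Globus Genomics is not given in these slides, so the column names and the function `build_workflow_table` are assumptions; in practice the downloaded template would define the real columns.

```python
import csv
import io


def build_workflow_table(samples):
    """Return a tab-separated table with one row per (name, forward, reverse)."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
    # Hypothetical header; the real template defines its own columns.
    writer.writerow(["sample_name", "forward_fastq", "reverse_fastq"])
    for name, forward, reverse in samples:
        writer.writerow([name, forward, reverse])
    return buffer.getvalue()


table = build_workflow_table(
    [
        ("sample1", "sample1_R1.fastq", "sample1_R2.fastq"),
        ("sample2", "sample2_R1.fastq", "sample2_R2.fastq"),
    ]
)
print(table)
```

Generating the table from a sample list keeps the rows consistent across a large batch, which matters more as the number of exomes grows.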
44 Example Collaborations: Dobyns Lab
Background: Investigate the nature and causes of a wide range of human developmental brain disorders
Approach: Replaced manual analysis with Globus Genomics
Results: Achieved greater than 20X speed-up in analysis of exome data
Future Plans: Leverage the scale-out capability of Globus Genomics on a 150-exome data set and seek to achieve a 50X speed-up in analysis
45 Example Collaborations: Georgetown Medical Center
Background: The Innovation Center for Biomedical Informatics (ICBI) is an academic hub for innovative research in the field of biomedical informatics.
Approach: Augment the current team and tools with an NGS analysis platform to support standard and best-practice pipelines while leveraging elastic cloud-based resources.
Results: Pilot effort is complete, with improved quality and performance results on whole genome, exome, and RNA-Seq pipelines using Globus Genomics
Future Plans: Provide Globus Genomics as a well-managed platform-as-a-service for ICBI collaborators and users
46 Diversity of Collaborations Cox Lab Volchenboum Lab Olopade Lab
47 Typical Engagement
Proof of Concept (1-2 weeks): Limited scale; Existing or slightly modified pipeline; Setup; Training; Testing
Pilot (1-3 months): Multi-endpoint; Additional tools; Pipeline validation; Scale-out analysis; Optimization; Training; Staged transition to production
Production (Ongoing): Monthly, annual subscription; Startup help; Training; Support
48 Wrapping up
49 We are a non-profit service provider to various research communities
50 We are a non-profit service provider to various research communities We offer multiple subscription tiers to provide a cost-effective solution and ensure sustainability of our service
51 Subscription Pricing
Tiers: Starter, Standard, Large
Cumulative Analysis Workload* (over a 12-month subscription):
Starter: ~800 exomes, ~80 whole genomes, ~400 RNA-seqs
Standard: ~4000 exomes, ~400 whole genomes, ~2000 RNA-seqs
Large: ~ exomes, ~2000 whole genomes, ~ RNA-seqs
Technical Support: Starter: M-F, 9-5 CT, 2-business-day response; Standard: M-F, 9-5 CT, 1-business-day response; Large: M-F, 9-5 CT, 1-business-day response
Access to Enhanced Workbench: Starter: Yes; Standard: Yes; Large: Yes
Multi-sample submission: Starter: Yes; Standard: Yes; Large: Yes
Usage Dashboard: Starter: Yes; Standard: Yes; Large: Yes
Price/Performance Controls: Starter: Basic; Standard: Advanced; Large: Advanced
On-Demand Tool Wrapping: Starter: No; Standard: Limited; Large: Yes
HIPAA / optional BAA: Starter: Not Available; Standard: Available; Large: Available
Annual subscriptions start at $5,000 for individual PIs and $10,000 for core labs
* Representative workloads based on human genome, GATK variant calling pipeline (whole genome, exome), Tuxedo suite of tools (RNA-Seq), etc.
52 Thank you to our sponsors! U.S. DEPARTMENT OF ENERGY
Xythos on Demand Quick Start Guide For Xythos Drive
Xythos on Demand Quick Start Guide For Xythos Drive What is Xythos on Demand? Xythos on Demand is not your ordinary online storage or file sharing web site. Instead, it is an enterprise-class document
SAP HANA Cloud Platform Frequently Asked Questions - Business
SAP HANA Cloud Platform Frequently Asked Questions - Business SAP HANA Cloud Platform 1. What is SAP HANA Cloud Platform? SAP HANA Cloud Platform, the in-memory Platform-as-a-Service offering from SAP,
How To Use Hp Vertica Ondemand
Data sheet HP Vertica OnDemand Enterprise-class Big Data analytics in the cloud Enterprise-class Big Data analytics for any size organization Vertica OnDemand Organizations today are experiencing a greater
#jenkinsconf. Jenkins as a Scientific Data and Image Processing Platform. Jenkins User Conference Boston #jenkinsconf
Jenkins as a Scientific Data and Image Processing Platform Ioannis K. Moutsatsos, Ph.D., M.SE. Novartis Institutes for Biomedical Research www.novartis.com June 18, 2014 #jenkinsconf Life Sciences are
SafeCom G2 Enterprise Disaster Recovery Manual
SafeCom G2 Enterprise Disaster Recovery Manual D60612-06 September 2009 Trademarks: SafeCom, SafeCom Go, SafeCom P:Go, SafeCom OnLDAP, SafeCom epay and the SafeCom logo are trademarks of SafeCom a/s. Company
Sage Intelligence Financial Reporting for Sage ERP X3 Version 6.5 Installation Guide
Sage Intelligence Financial Reporting for Sage ERP X3 Version 6.5 Installation Guide Table of Contents TABLE OF CONTENTS... 3 1.0 INTRODUCTION... 1 1.1 HOW TO USE THIS GUIDE... 1 1.2 TOPIC SUMMARY...
Scientific and Technical Applications as a Service in the Cloud
Scientific and Technical Applications as a Service in the Cloud University of Bern, 28.11.2011 adapted version Wibke Sudholt CloudBroker GmbH Technoparkstrasse 1, CH-8005 Zurich, Switzerland Phone: +41
