Geospiza's Finch-Server: A Complete Data Management System for DNA Sequencing
Sandra Porter, Joe Slagel, and Todd Smith
Geospiza, Inc., Seattle, WA

Chapter 10 in DNA Sequencing: Optimizing the Process and Analysis, edited by Jan Kieleczawa. 2005, Jones and Bartlett Publishers.

Introduction

The increased demand for DNA sequencing services and the greater volume of data have created a need for software designed to support sequencing activities and applications. Commercial data management systems, such as the Finch-Server (Geospiza Inc., Seattle, WA), provide laboratories with robust data processing and quality analysis tools designed specifically to support DNA sequencing. Even laboratories new to sequencing can implement these systems quickly to analyze projects and identify problem areas.

Data management systems offer a variety of benefits. Two immediate benefits are a decrease in cost and a shorter time to complete sequencing projects. Small increases in quality or read length can have a large impact on the number of sequencing reactions required to complete a project. Because the overall cost of a sequencing project is determined primarily by the number of reads, project costs are lowered when data management tools are used to detect and correct problems in a timely fashion. A later benefit to researchers is the ability to link sequence data to sample history. It is important to know, for example, if a sequence was obtained from a cloned DNA fragment or from a clinical sample that might contain a heterogeneous mixture of sequences.

Finch-Servers were designed to solve common problems and meet the data management requirements shared by many laboratories. This approach benefits customers because multiple users participate in the design, review, and testing process. Additional benefits include a shorter time frame from purchase to use and a more robust and adaptable system,
unlike custom systems that require a long development process and a significant time investment on the part of the laboratory. This chapter describes the software requirements of different types of sequencing laboratories and discusses how Finch-Servers help those laboratories meet their needs.

The Geospiza Finch-Server

A Finch-Server is a Web application with selected components of the Finch-Suite and a relational database management system (RDBMS), either Oracle (Redwood Shores, CA) or Solid (Mountain View, CA), installed on a computer with a UNIX or Linux operating system. Technicians and researchers use Web browsers such as Internet Explorer or Netscape to interact with the Finch-Server through a local intranet or, in the case of geographically dispersed sites, over a secure Internet connection. Finch-Servers operate either as stand-alone systems or within a distributed computing environment.

Finch-Systems are Finch-Servers designed for different types of laboratories and different data processing requirements. Each Finch-System includes a Finch-Server along with selected components of the Finch-Suite, a set of integrated software modules designed to support data management and analysis. These include the Sequencing Request Manager, Instrument Manager, Chromatogram Manager, Assembly Manager, Data Repository Manager, and Basic Local Alignment Search Tool (BLAST) Manager. Components of the Finch-Suite are organized within the Finch Core DNA Sequencing System (Figure 10.1) and the Finch Assembly and BLAST Systems (Figure 10.2). The Complete Finch DNA Sequencing System contains all of the Finch-Suite components.

Many laboratories need to store large numbers of data files and sequence assemblies. Customers have reported storing well over half a million chromatogram files in the Finch-Server and generating sequence assemblies from over 60,000 reads.
These systems also provide a way for researchers to maintain their original, unprocessed data files, thus making it possible to apply new base-calling technology or other analysis tools to older data and data from varied sources. Storage media can be added to accommodate an increasing quantity of data over time. The Finch-Server, therefore, is able to provide a robust, scalable system for storing chromatogram files, sequence assemblies, sequence databases, and BLAST results.

Much of the data stored in the Finch-Server are presented in tables that can be sorted by column. For example, one can easily sort reads by the number of vector clones by selecting the appropriate column heading. Customized data views also can be generated using data browsers that
quickly extract selected information. A laboratory manager might use the data browser to view all the sequencing runs performed with a certain instrument. A researcher setting up an assembly might select all of the reads from a genomic library that match E. coli and use the move icon to transfer them to a trash folder, thus keeping E. coli sequences out of the assembly. Further, Finch-Servers store information in a relational database management system, allowing users to obtain selected information in response to an SQL (Structured Query Language) statement. The capacity to run customized SQL queries and obtain customized reports is a powerful tool for additional research, process management, and project oversight.

Data Management for Core Facilities

Managing Sequencing Requests

Core laboratories in academic institutions and biotechnology companies have a variety of general and customized requirements that software systems must be able to meet. First, the laboratory's customers need a convenient method for submitting work requests. This is accomplished in the Finch-Server through Web forms that allow customers to make requests using their own desktop computers. Core lab customers use the Finch-Sequencing Request Manager to enter experimental information for experiments performed on different scales. This information is stored in the database, allowing retrieval at a later date. Web forms, designed to accommodate tubes, 96- or 384-well plates, or batch modes, simplify data entry and sample naming (Figure 10.1a). A configurable, automated naming system assigns unique names to each sample. For example, samples that are used for sequence assembly must be named in a certain way to be compatible with the Phrap assembly program (6). The Sequencing Request Manager helps customers name samples appropriately with a minimal amount of extra work.
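The automated naming described above can be illustrated with a small sketch. It is not the Finch-Server's naming system: the function names, the request identifier, and the `request_template_well.direction` format are all hypothetical, chosen only to show how one unique name per well can be generated for 96- or 384-well plates. The suffix after the period stands in for whatever read-direction convention the assembly program expects.

```python
from __future__ import annotations

from string import ascii_uppercase


def plate_wells(rows: int = 8, cols: int = 12) -> list[str]:
    """Return well labels in row-major order (A01 ... H12 for a 96-well plate)."""
    return [
        f"{ascii_uppercase[r]}{c:02d}"
        for r in range(rows)
        for c in range(1, cols + 1)
    ]


def name_samples(
    request_id: str,
    template: str,
    direction: str = "f",
    rows: int = 8,
    cols: int = 12,
) -> dict[str, str]:
    """Map each well to a unique sample name.

    The name format is an assumption made for this example only.
    """
    return {
        well: f"{request_id}_{template}_{well}.{direction}"
        for well in plate_wells(rows, cols)
    }


if __name__ == "__main__":
    names = name_samples("REQ42", "libA")
    print(names["A01"], names["H12"], len(names))
```

Passing `rows=16, cols=24` yields the 384 labels A01 through P24, so the same scheme covers both plate formats.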
Sample Tracking

Service laboratories also require the ability to track samples through each stage of the sequencing process. Laboratory personnel can determine where samples are located in a queue, and the Finch-Server also allows customers to monitor the status of their own samples over the Internet, saving time spent on the phone. At the end of the sequencing process, the Finch-Server simplifies data delivery. Customers can either view (and store) their data in the Finch-Server or download the data to their desktop computer.
Figure 10.1. The Finch Core DNA Sequencing System.
(a) A web page from the Finch-Sequencing Request Manager. Part of the form for submitting a request to sequence samples from a 96-well plate is shown. Single wells can be selected, or multiple wells can be chosen by selecting column or row headings.
(b) The Finch-Chromatogram Manager Folder Report. This report summarizes quality and selected statistics for all the reads in an individual folder, including the number of sequences that match vector sequences, E. coli, or other selected sequences.
(c) The Chromatogram Details report, also from the Chromatogram Manager. Quality scores, from Phred, are shown for each base in the read, along with the sequence and other information. Links are available for viewing the trace file (d) and/or downloading the data.
(d) A chromatogram trace file together with a plot of Phred quality scores.
(e) The Sequencing Run Details report from the Finch-Instrument Manager. The read length (darker color) and sequence quality (light color) are shown on the y axis and the individual capillaries and/or lanes on the x axis. The regularly repeating blocks of poor quality samples (indicated by a dark color) result from a plugged fang in the sequencing instrument.
(f) The Instrument Capillary Usage report. Average sequence quality is represented by colors ranging from bright green (best) to black (poor). Run dates are shown at the top, from right to left. Problem capillaries (or samples) are shown by the black boxes in the first two columns.
(g) The Finch Core DNA Sequencing System overview. Laboratories submit requests using the Sequencing Request Manager. The lab uses the Instrument Manager to set up sample sheets and downloads them to the sequencing instrument. Data are loaded into the Chromatogram Manager and the customers get the results.
Security

Core facilities often have clients from different laboratories, making it important to control access to data. Customers, on the other hand, want to share data with collaborators and other members of their laboratory, yet simultaneously prevent data access by competitors. These requirements are further complicated by the needs of the core facility. Personnel in the core facility must be able to view all of the data in order to monitor the quality of the laboratory's work. This problem is solved in the Finch-Server with a user hierarchy that allows administrators and technicians wider access than the facility's customers and assigns researchers to specific lab groups. All researchers within a lab group are able to view data belonging to that group but are prevented from viewing data owned by others. Only the facility administrator can add researchers to, or delete them from, a lab group. As a result, researchers can share data through the Web in a protected manner with their collaborators and selected colleagues, while laboratory personnel can monitor data quality without compromising proprietary information.

Preparation of Sample Sheets and Managing Instrument-Related Data

After a customer has submitted a work request to a service laboratory, the laboratory needs to review requests and organize workflow. Technicians use the Finch Instrument Manager to create sample sheets in the Finch-Server by combining samples from different work requests to produce the optimum number of samples for each sequencing instrument. Finch-Servers support all of the widely used sequencing instruments, including ABI instruments (Applied Biosystems, Foster City, CA), the MegaBACE (Amersham, Piscataway, NJ), the Beckman CEQ2000 (Beckman, Fullerton, CA), and others. Laboratory technicians download sample sheets to their sequencing instrument and complete the sequencing run.
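Combining samples from several work requests into full sample sheets can be sketched as follows. This is a minimal illustration, not the Instrument Manager's logic: the `Sample` class, the `build_sample_sheets` function, and the default capacity of 96 are assumptions, and a real system would also account for plate layout, controls, and instrument-specific file formats.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    request_id: str  # work request the sample came from (hypothetical field)
    name: str


def build_sample_sheets(
    requests: dict[str, list[str]], capacity: int = 96
) -> list[list[Sample]]:
    """Pool samples from all requests in submission order, then cut the pool
    into sheets holding at most `capacity` samples each."""
    pooled = [
        Sample(request_id, name)
        for request_id, names in requests.items()
        for name in names
    ]
    return [pooled[i:i + capacity] for i in range(0, len(pooled), capacity)]


if __name__ == "__main__":
    requests = {
        "R1": [f"r1_s{i}" for i in range(60)],
        "R2": [f"r2_s{i}" for i in range(50)],
        "R3": [f"r3_s{i}" for i in range(10)],
    }
    for number, sheet in enumerate(build_sample_sheets(requests), start=1):
        sources = sorted({s.request_id for s in sheet})
        print(f"sheet {number}: {len(sheet)} samples from {sources}")
```

Pooling before splitting is what lets a 60-sample request and a 50-sample request share one full 96-sample run instead of occupying two partly empty ones.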
Instrument-related service information and serial numbers also can be stored in the Instrument Manager.

Management of Chromatogram Files and Quality Control

The Finch Chromatogram Manager provides a useful tool for storing sequences and chromatogram-related information. Chromatogram files from a completed sequencing run are uploaded and stored in the Chromatogram Manager. The Chromatogram Manager also can be used to store individual files or packages of related chromatogram files that have been downloaded from public or private databases through the Internet. Chromatogram file data are checked by the Finch-Server during the upload process to prevent accidental loading of duplicate
chromatograms. This feature allowed one customer to diagnose an unsuspected problem in their instrument software. The instrument software apparently created two files with different file names but identical chromatogram data. This bug was confirmed when the customer checked with the instrument vendor.

Once in the Finch-Server, the chromatogram files enter a data processing pipeline. Some of the steps in this pipeline include base-calling, quality measurement, and quality trimming with either Phred (3-6), TraceTuner (Paracel, Inc.), or the KB base caller (Applied Biosystems); identification of vector sequences with Cross Match (6); and the generation of graphical reports that summarize run information and statistics for sets of sequences (examples are shown in Figure 10.1b-d). Data quality analysis can be customized by using pipelines that select the base-caller according to the sequencing instrument or laboratory.

Sequencing Instrument Performance

Instrument reports provide feedback to laboratories about the status of each run and the performance of each sequencing instrument. Blocked liquid transfer devices or problem capillaries can be quickly identified by viewing graphical reports (Figure 10.1e, f). The ability to monitor capillary performance for changes in data quality over time helps laboratories determine when instruments require service. Service information and identification numbers are easily stored in the Finch-Server, providing a means to quickly view the service and repair history for any sequencing instrument.

Billing

Shared facilities need to monitor use and bill customers accordingly, in a timely fashion. Billing numbers can be stored in the Finch-Server when work requests are submitted.
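Because billing numbers are stored with each work request in a relational database, a usage report reduces to a grouped query. The sketch below uses Python's built-in sqlite3 module as a stand-in for the Oracle or Solid database; the `work_requests` and `samples` tables, their columns, and the sample data are hypothetical and do not reflect the Finch-Server schema.

```python
from __future__ import annotations

import sqlite3


def demo_connection() -> sqlite3.Connection:
    """Build an in-memory database with a made-up schema and a few rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE work_requests (
            request_id     INTEGER PRIMARY KEY,
            billing_number TEXT NOT NULL,
            requester      TEXT NOT NULL,
            request_date   TEXT NOT NULL   -- ISO date, e.g. 2004-05-01
        );
        CREATE TABLE samples (
            sample_id  INTEGER PRIMARY KEY,
            request_id INTEGER NOT NULL REFERENCES work_requests(request_id)
        );
        """
    )
    conn.executemany(
        "INSERT INTO work_requests VALUES (?, ?, ?, ?)",
        [
            (1, "BN-100", "alice", "2004-05-01"),
            (2, "BN-100", "alice", "2004-05-10"),
            (3, "BN-200", "bob", "2004-05-03"),
        ],
    )
    conn.executemany(
        "INSERT INTO samples (request_id) VALUES (?)",
        [(1,)] * 96 + [(2,)] * 48 + [(3,)] * 96,
    )
    return conn


def usage_report(conn: sqlite3.Connection) -> list[tuple]:
    """Summarize requests and samples per billing number and requester."""
    return conn.execute(
        """
        SELECT r.billing_number,
               r.requester,
               COUNT(DISTINCT r.request_id) AS n_requests,
               COUNT(s.sample_id)           AS n_samples,
               MIN(r.request_date)          AS first_request,
               MAX(r.request_date)          AS last_request
        FROM work_requests AS r
        JOIN samples AS s ON s.request_id = r.request_id
        GROUP BY r.billing_number, r.requester
        ORDER BY r.billing_number
        """
    ).fetchall()


if __name__ == "__main__":
    for row in usage_report(demo_connection()):
        print(row)
```

The report answers the questions a billing office typically asks: who requested the work, how much was done, and when.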
Structured Query Language (SQL) statements can be used in conjunction with billing numbers or other identifiers to generate reports that define how resources are used, the amount of work performed, who requested the work, and the request date. These reports can be generated automatically, through custom services, and delivered to accounting departments.

Data Management for Large-Scale Sequencing

Laboratories engaged in genomic and production sequencing need additional features in a data management system. Production sequencing,
expressed sequence tag (EST) clustering, single-nucleotide polymorphism (SNP) discovery, and genomic sequencing require the ability to assemble sequences; to set up, maintain, and update sequence databases; and to design pipelines specifically for automated data processing, filtering, and BLAST searches (1). These additional requirements can be met by stand-alone Finch systems, such as the Finch Assembly System and/or the Finch BLAST System, or with the Complete Finch DNA Sequencing System, which includes all of the components of the core system in addition to the Finch-Assembly Manager, the Finch-Data Repository Manager, and the Finch-BLAST Manager. An overview of these components is shown in Figure 10.2.

DNA Sequence Assembly

The Finch-Assembly Manager works with the Chromatogram Manager to assemble sets of reads into longer, contiguous sequences. Folders with specific sets of reads are created and organized in the Chromatogram Manager. To assemble sequences, one chooses the appropriate folder and assembly program, and presses a button to start the assembly. The Assembly Manager provides different options for sequence assembly, including Phrap (Green, International Human Genome Sequencing Consortium, 2001), a parallel version, SPS-Phrap (Southwest Parallel Software, Albuquerque, NM, USA), or both. The Finch-Server's Relational Database Management System (RDBMS) stores all the parameters and details of the assembly, making it easy to repeat the assembly later with new or additional data. Sequence assemblies also can be treated as a type of experiment and performed with sets of sequences and different parameters. Because all of the assembly results and conditions are stored in the RDBMS, the optimum parameters are easily located for later work.
Phrap assembles reads by comparing pairs of sequences, locating overlapping regions, and using quality information from the base-caller to help reconstruct the original sequence. The assembly can be improved by incorporating additional information about each read. For example, Phrap qualities (those used to build the final contig) incorporate information about the relative orientation of each read and the sequencing chemistry. If a base has been confirmed by an alternate method (sequencing the opposite strand or using a different type of chemistry), Phrap assigns a higher quality value to the base at that position. The highest quality bases are used to build a mosaic, which represents the sequence of the contig.

Information from each assembly is provided in tables and graphical reports. Tables include information about the number of reads in each contig, the location where each read aligns, and links to the read sequence, quality values, and trace file. Chimera reports are provided to help diagnose experimental artifacts. If further information is needed, such as the locations of potential deletion clones, the Phrap output file can be downloaded for viewing.

Figure 10.2b and c show two of the Assembly Manager's graphical reports. For this experiment, reads from several ESTs were obtained from the Washington University Genome Center (St. Louis, MO), stored in the Chromatogram Manager, and assembled with the Assembly Manager. A graph of the assembly results (Figure 10.2b) shows the number of reads per kb (y axis) versus the contig length (x axis). Because the reads used in this assembly were obtained from ESTs, sequences with a large number of reads/kb were likely to represent highly expressed genes, repetitive sequences, mitochondrial sequences, or ribosomal RNA. If these results were obtained from assembling reads from a genomic sequencing project, a large number of reads/kb might help flag problem contigs and find assembly problems due to repetitive sequences. Graphical reports also provide an overview of the Phrap quality values for each contig, the coverage depth, and positions with discrepant bases. High-quality sequence discrepancies are a strong indicator of potential polymorphisms and are valuable in SNP discovery (2).

Figure 10.2. The Finch-Assembly and BLAST Systems.
(a) The Finch-Assembly Manager uses read data, stored in the Chromatogram Manager, to set up sequence assemblies. The Finch-Data Repository Manager maintains current versions of public and private sequence databases. BLAST searches of selected databases are conducted with the Finch-BLAST Manager. Query sequences can be pasted in a form, uploaded from a file, or selected from either the Chromatogram Manager or the Assembly Manager. Data processing pipelines can be set up to perform queries in an automated fashion.
(b) The Assembly Set Details report from the Assembly Manager. This report provides information from the assembly along with a graph showing the number of reads per kilobase (y axis) for different length contigs (x axis).
(c) A contig report from the Assembly Manager shows the Phrap quality across the length of the contig (top), sequence discrepancies as a function of quality (middle), and the number of sequences mapping to each part of the contig (bottom).
(d) A screenshot from the BLAST Manager shows BLAST parameters and a table with the results.

Maintaining Updated Sequence Databases

Many companies and sequencing centers have found it helpful to maintain local copies of sequence databases. Searches can be faster when using local resources, and data are protected because the searches are performed in a secure environment. Researchers don't have to take risks with valuable information by sending proprietary sequences over the Internet. Further, local searches allow one to search proprietary or custom databases that aren't available over the Internet.
To meet this need, Geospiza developed the Finch BLAST System (Figure 10.2), which allows researchers to maintain up-to-date versions of any sequence database and use BLAST as a search tool. The content in public and private sequence databases changes on a daily basis, making it difficult for companies and research institutions to keep local versions current. The Finch-Data Repository Manager operates through a command line interface and updates local copies of databases on a regular schedule. Users specify which sequence data to retrieve from remote or local sites and designate where the files should be stored. When new sequences are retrieved, databases are automatically updated and log files are generated. Reports instantly notify users that updates have occurred. This system also enables researchers to maintain multiple databases and multiple versions of each database.
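The idea of keeping several dated versions of each database, updating only when new content arrives, and logging every update can be sketched in a few lines. This is an illustration under stated assumptions, not the Data Repository Manager: the `update_database` function, the `repo/<name>/<YYYY-MM-DD>/` directory layout, and the logger name are hypothetical, and the source file is assumed to have already been retrieved to local disk.

```python
from __future__ import annotations

import hashlib
import logging
import shutil
from datetime import date
from pathlib import Path

log = logging.getLogger("repository_updates")


def _digest(path: Path) -> str:
    """SHA-256 of a file's content (reads the whole file; fine for a sketch)."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def update_database(
    source: Path, repo: Path, name: str, today: date | None = None
) -> Path | None:
    """Store `source` as a new dated version of database `name`.

    Returns the new version directory, or None when the content is
    identical to the most recent stored version.
    """
    today = today or date.today()
    db_dir = repo / name
    db_dir.mkdir(parents=True, exist_ok=True)

    # ISO dates sort lexicographically, so the last entry is the newest.
    versions = sorted(p for p in db_dir.iterdir() if p.is_dir())
    if versions:
        latest = versions[-1] / source.name
        if latest.exists() and _digest(latest) == _digest(source):
            log.info("%s unchanged; keeping version %s", name, versions[-1].name)
            return None

    # A second changed update on the same day overwrites that day's copy.
    target = db_dir / today.isoformat()
    target.mkdir(exist_ok=True)
    shutil.copy2(source, target / source.name)
    log.info("%s updated; stored version %s", name, target.name)
    return target
```

A scheduler such as cron could call `update_database` for each configured database, and the log records could feed whatever notification mechanism a site prefers.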
The Finch-BLAST Manager

The Finch-BLAST Manager is used to perform BLAST searches of either nucleotide or protein sequence databases (Figure 10.2d). Researchers can search one or more databases concurrently, with single or multiple sequences. Several of the BLAST programs are available in the BLAST Manager, including blastn, blastp, blastx, tblastx, and tblastn. The BLAST Manager stores the search results and parameters, thus allowing comparisons to be made between database searches at different points in time. Users can employ data processing pipelines to assemble reads stored in the Chromatogram Manager and to run automated BLAST searches of selected databases with the assembled sequences or individual reads. For example, a BLAST search can be carried out whenever updates occur in a specific database.

BLAST also can be employed for quality control. In one example, a batch of clinical samples from polymerase chain reaction (PCR) assays was mixed up by a commercial sequencing lab. The problem was uncovered by constructing a custom database of the expected PCR products and comparing the reads with the expected products using BLAST.

Filters also are available in the BLAST Manager that allow users to mask low complexity sequences that might obscure search results. Repetitive sequences can be masked by adding RepeatMasker (Geospiza, Inc., Seattle, WA, USA) to the system. The addition of filters permits users to set up powerful screening systems designed to enhance data mining, genome sequencing, SNP discovery, or other applications.

Data Management Over the Internet: ifinch

ifinch is a Finch-Server designed for individual researchers and/or smaller laboratories with short-term sequencing projects. Subscriptions to ifinch allow small laboratories immediate access to integrated data management tools without making a long-term investment.
Unlike other Finch-Servers, ifinch is administered by Geospiza, which ensures, for example, that data are backed up on a regular basis, that hardware is kept up to date, and that the system runs smoothly. Remote access allows laboratories to store data off-site, securely, minimizing the risk of data loss.

References

1. Altschul, S.F., Madden, T.L., Schäffer, A.A., et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:
2. Clifford, R., Edmonson, M., Hu, Y., et al. Expression-based genetic/physical maps of single-nucleotide polymorphisms identified by the cancer genome anatomy project. Genome Res 8:
3. Ewing, B., and Green, P. Base-calling of automated sequencer traces using Phred. II. Error probabilities. Genome Res 8:
4. Ewing, B., Hillier, L., Wendl, M.C., and Green, P. Base-calling of automated sequencer traces using Phred. I. Accuracy assessment. Genome Res 8:
5. Green, E. Strategies for the systematic sequencing of complex genomes. Nature Rev Genet 2:
6. International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature 409:
Mascot Integra: Data management for Proteomics 1 Mascot Integra: Data management for proteomics What is Mascot Integra? What Mascot Integra isn t Instrument integration in Mascot Integra Designing and
WebEx. Remote Support. User s Guide
WebEx Remote Support User s Guide Version 6.5 Copyright WebEx Communications, Inc. reserves the right to make changes in the information contained in this publication without prior notice. The reader should
Amazing speed and easy to use designed for large-scale, complex litigation cases
Amazing speed and easy to use designed for large-scale, complex litigation cases LexisNexis is committed to developing new and better Concordance Evolution capabilities. All based on feedback from customers
Monitoring Replication
Monitoring Replication Article 1130112-02 Contents Summary... 3 Monitor Replicator Page... 3 Summary... 3 Status... 3 System Health... 4 Replicator Configuration... 5 Replicator Health... 6 Local Package
Pairwise Sequence Alignment
Pairwise Sequence Alignment [email protected] SS 2013 Outline Pairwise sequence alignment global - Needleman Wunsch Gotoh algorithm local - Smith Waterman algorithm BLAST - heuristics What
BUYER S GUIDE: PC INVENTORY AND SOFTWARE USAGE METERING TOOLS
BUYER S GUIDE: PC INVENTORY AND SOFTWARE USAGE METERING TOOLS A guide for identifying an IT/software asset management product that best meets the needs of your organization 200 West Mercer Street Suite
LifeScope Genomic Analysis Software 2.5
USER GUIDE LifeScope Genomic Analysis Software 2.5 Graphical User Interface DATA ANALYSIS METHODS AND INTERPRETATION Publication Part Number 4471877 Rev. A Revision Date November 2011 For Research Use
DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7
DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7 UNDER THE GUIDANCE Dr. N.P. DHAVALE, DGM, INFINET Department SUBMITTED TO INSTITUTE FOR DEVELOPMENT AND RESEARCH IN BANKING TECHNOLOGY
How Sequencing Experiments Fail
How Sequencing Experiments Fail v1.0 Simon Andrews [email protected] Classes of Failure Technical Tracking Library Contamination Biological Interpretation Something went wrong with a machine
A Tutorial in Genetic Sequence Classification Tools and Techniques
A Tutorial in Genetic Sequence Classification Tools and Techniques Jake Drew Data Mining CSE 8331 Southern Methodist University [email protected] www.jakemdrew.com Sequence Characters IUPAC nucleotide
User Guide for the Genetic Analysis Lab Information Management System (dnalims)
UNIVERSITY CORE DNA SERVICES University Core Genetic Analysis Laboratory Faculty of Medicine Health Sciences Centre, Rm. B104A Tel: (403) 220-4503, Fax: (403) 283-4907, Email: [email protected] www.ucalgary.ca/dnalab
SAP Data Services 4.X. An Enterprise Information management Solution
SAP Data Services 4.X An Enterprise Information management Solution Table of Contents I. SAP Data Services 4.X... 3 Highlights Training Objectives Audience Pre Requisites Keys to Success Certification
How is genome sequencing done?
How is genome sequencing done? Using 454 Sequencing on the Genome Sequencer FLX System, DNA from a genome is converted into sequence data through four primary steps: Step One DNA sample preparation; Step
Bioruptor NGS: Unbiased DNA shearing for Next-Generation Sequencing
STGAAC STGAACT GTGCACT GTGAACT STGAAC STGAACT GTGCACT GTGAACT STGAAC STGAAC GTGCAC GTGAAC Wouter Coppieters Head of the genomics core facility GIGA center, University of Liège Bioruptor NGS: Unbiased DNA
Introduction To Real Time Quantitative PCR (qpcr)
Introduction To Real Time Quantitative PCR (qpcr) SABiosciences, A QIAGEN Company www.sabiosciences.com The Seminar Topics The advantages of qpcr versus conventional PCR Work flow & applications Factors
Voluntary Product Accessibility Report
Voluntary Product Accessibility Report Compliance and Remediation Statement for Section 508 of the US Rehabilitation Act for OpenText Content Server 10.5 October 23, 2013 TOGETHER, WE ARE THE CONTENT EXPERTS
User's Manual. Intego Remote Management Console User's Manual Page 1
User's Manual Intego Remote Management Console User's Manual Page 1 Intego Remote Management Console for Macintosh 2007 Intego, Inc. All Rights Reserved Intego, Inc. www.intego.com This manual was written
Chapter 2: Getting Started
Chapter 2: Getting Started Once Partek Flow is installed, Chapter 2 will take the user to the next stage and describes the user interface and, of note, defines a number of terms required to understand
Information Server Documentation SIMATIC. Information Server V8.0 Update 1 Information Server Documentation. Introduction 1. Web application basics 2
Introduction 1 Web application basics 2 SIMATIC Information Server V8.0 Update 1 System Manual Office add-ins basics 3 Time specifications 4 Report templates 5 Working with the Web application 6 Working
Biorepository and Biobanking
Biorepository and Biobanking LabWare s solution for biorepositories and biobanks combines powerful specimen tracking and logistics capabilities with specimen processing and workflow management features.
Artisan Scientific is You~ Source for: Quality New and Certified-Used/Pre:-awned ECJuiflment
Looking for more information? Visit us on the web at http://www.artisan-scientific.com for more information: Price Quotations Drivers Technical Specifications. Manuals and Documentation Artisan Scientific
What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11
What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11 W elcome to North Face Software s software. With this software, you can accomplish
Oracle Data Integrator 11g: Integration and Administration
Oracle University Contact Us: Local: 1800 103 4775 Intl: +91 80 4108 4709 Oracle Data Integrator 11g: Integration and Administration Duration: 5 Days What you will learn Oracle Data Integrator is a comprehensive
Analysis of ChIP-seq data in Galaxy
Analysis of ChIP-seq data in Galaxy November, 2012 Local copy: https://galaxy.wi.mit.edu/ Joint project between BaRC and IT Main site: http://main.g2.bx.psu.edu/ 1 Font Conventions Bold and blue refers
TruSeq Custom Amplicon v1.5
Data Sheet: Targeted Resequencing TruSeq Custom Amplicon v1.5 A new and improved amplicon sequencing solution for interrogating custom regions of interest. Highlights Figure 1: TruSeq Custom Amplicon Workflow
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER
Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER JMP Genomics Step-by-Step Guide to Bi-Parental Linkage Mapping Introduction JMP Genomics offers several tools for the creation of linkage maps
Sentaurus Workbench Comprehensive Framework Environment
Data Sheet Comprehensive Framework Environment Overview is a complete graphical environment for creating, managing, executing, and analyzing TCAD simulations. Its intuitive graphical user interface allows
When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want
1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very
UGENE Quick Start Guide
Quick Start Guide This document contains a quick introduction to UGENE. For more detailed information, you can find the UGENE User Manual and other special manuals in project website: http://ugene.unipro.ru.
CA Workload Automation Agents for Mainframe-Hosted Implementations
PRODUCT SHEET CA Workload Automation Agents CA Workload Automation Agents for Mainframe-Hosted Operating Systems, ERP, Database, Application Services and Web Services CA Workload Automation Agents are
Oracle 11g Database Administration
Oracle 11g Database Administration Part 1: Oracle 11g Administration Workshop I A. Exploring the Oracle Database Architecture 1. Oracle Database Architecture Overview 2. Interacting with an Oracle Database
An Oracle White Paper November 2010. Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics
An Oracle White Paper November 2010 Leveraging Massively Parallel Processing in an Oracle Environment for Big Data Analytics 1 Introduction New applications such as web searches, recommendation engines,
System Integration Software
System Integration Software Release Notes for Version 6.0.5 1.0 Compatibility Currently, PC9000 is compatible with the following Radionics control/communicators: D7212, D7412, D7412G D9112, D9412 and D9412G
Chapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes
Chapter 2. imapper: A web server for the automated analysis and mapping of insertional mutagenesis sequence data against Ensembl genomes 2.1 Introduction Large-scale insertional mutagenesis screening in
Simplifying Data Interpretation with Nexus Copy Number
Simplifying Data Interpretation with Nexus Copy Number A WHITE PAPER FROM BIODISCOVERY, INC. Rapid technological advancements, such as high-density acgh and SNP arrays as well as next-generation sequencing
ANSYS EKM Overview. What is EKM?
ANSYS EKM Overview What is EKM? ANSYS EKM is a simulation process and data management (SPDM) software system that allows engineers at all levels of an organization to effectively manage the data and processes
BLAST. Anders Gorm Pedersen & Rasmus Wernersson
BLAST Anders Gorm Pedersen & Rasmus Wernersson Database searching Using pairwise alignments to search databases for similar sequences Query sequence Database Database searching Most common use of pairwise
Papermule Workflow. Workflow and Asset Management Software. Papermule Ltd
Papermule Workflow Papermule Workflow - the power to specify adaptive and responsive workflows that let the business manage production problems in a resilient way. Workflow and Asset Management Software
IAF Business Intelligence Solutions Make the Most of Your Business Intelligence. White Paper November 2002
IAF Business Intelligence Solutions Make the Most of Your Business Intelligence White Paper INTRODUCTION In recent years, the amount of data in companies has increased dramatically as enterprise resource
GenBank, Entrez, & FASTA
GenBank, Entrez, & FASTA Nucleotide Sequence Databases First generation GenBank is a representative example started as sort of a museum to preserve knowledge of a sequence from first discovery great repositories,
A Primer of Genome Science THIRD
A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:
Early Cloud Experiences with the Kepler Scientific Workflow System
Available online at www.sciencedirect.com Procedia Computer Science 9 (2012 ) 1630 1634 International Conference on Computational Science, ICCS 2012 Early Cloud Experiences with the Kepler Scientific Workflow
Getting Started Guide
Primer Express Software Version 3.0 Getting Started Guide Before You Begin Designing Primers and Probes for Quantification Assays Designing Primers and Probes for Allelic Discrimination Assays Ordering
Module 1. Sequence Formats and Retrieval. Charles Steward
The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.
Usage Analysis Tools in SharePoint Products and Technologies
Usage Analysis Tools in SharePoint Products and Technologies Date published: June 9, 2004 Summary: Usage analysis allows you to track how websites on your server are being used. The Internet Information
Go where the biology takes you. Genome Analyzer IIx Genome Analyzer IIe
Go where the biology takes you. Genome Analyzer IIx Genome Analyzer IIe Go where the biology takes you. To published results faster With proven scalability To the forefront of discovery To limitless applications
Searching Nucleotide Databases
Searching Nucleotide Databases 1 When we search a nucleic acid databases, Mascot always performs a 6 frame translation on the fly. That is, 3 reading frames from the forward strand and 3 reading frames
When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want
1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very
RS MDM. Integration Guide. Riversand
RS MDM 2009 Integration Guide This document provides the details about RS MDMCenter integration module and provides details about the overall architecture and principles of integration with the system.
Critical Care EEG Database Public Edition. User Manual
Critical Care EEG Database Public Edition User Manual v. 9/25/2015 Table of Contents Overview... 2 Installation... 2 Basic Structure 2 Installing the Files 3 Connecting to Data 4 Configuration... 4 System
What s new in TIBCO Spotfire 6.5
What s new in TIBCO Spotfire 6.5 Contents Introduction... 3 TIBCO Spotfire Analyst... 3 Location Analytics... 3 Support for adding new map layer from WMS Server... 3 Map projections systems support...
A RESOURCE GUIDE FOR NEW FINANCIAL SYSTEM PROFESSIONALS
WELCOME KIT A RESOURCE GUIDE FOR NEW FINANCIAL SYSTEM PROFESSIONALS *Disclaimer: In the following documentation, dates, screen captures and data are not necessarily reflective of the current year. Settings
Big Data: Rethinking Text Visualization
Big Data: Rethinking Text Visualization Dr. Anton Heijs [email protected] Treparel April 8, 2013 Abstract In this white paper we discuss text visualization approaches and how these are important
enetdnc Take Control of your Manufacturing Process and Improve Your Throughput Quickly, Easily and Cost Effectively with...
Take Control of your Manufacturing Process and Improve Your Throughput Quickly, Easily and Cost Effectively with... enetdnc DNC Through Your Ethernet Network Ethernet Based DNC Wireless DNC Machine Monitoring
Running Agilent GeneSpring MPP on the Cloud
Running Agilent GeneSpring MPP on the Cloud Technical Overview Authors Stephen Madden, Rick A. Fasani, and Michael Rosenberg Agilent Technologies, Inc. Santa Clara, California, USA Introduction Cloud computing
A Streamlined Workflow for Untargeted Metabolomics
A Streamlined Workflow for Untargeted Metabolomics Employing XCMS plus, a Simultaneous Data Processing and Metabolite Identification Software Package for Rapid Untargeted Metabolite Screening Baljit K.
QPR WorkFlow. Minimize Process Time, Maximize Process Outcome. QPR WorkFlow 1
QPR WorkFlow Minimize Process Time, Maximize Process Outcome QPR WorkFlow 1 QPR WorkFlow: Eliminate the Gap between Process Design and Process Automation Proper management and execution of your operational
