Information Integration
|
|
- Roger Gray
- 7 years ago
- Views:
Transcription
1 Information Integration Assignment 3: Consistency of Annotations (NCBI UniProtKB) Ulf Leser
2 Overview Reuse all existing data We want to integrate and compare functional annotation from another data source the UniProtKB UniProtKB can be accessed through several interfaces Your choice We don t want to produce a DDoS restrict comparison to a small region of a chromosome Therefore, we need to clean MAP_LOCATION information Finally :-)
3 Task 1: Data Cleansing MAP_LOCATION So far, we left MAP_LOCATION as it was, though we have seen many problems We go for a quick-and-dirty solution to keep things simple Set the field to NULL if the value is - Remove all parts after a Remove all parts after a ; Remove all parts after a - Remove all parts after a (blank) Perform the same changes with CHROMOSOME We leave the few inconsistencies between CHROMOSOME and MAP_LOCATION (for now) Produces 1:1 relationship between a gene and a chromosomal region In varying levels of granularity
4 Task 2: NCBI UniProtKB Mapping??? VERSION! We need a mapping from GENE_ID (NCBI) to UniProtKB-ID (EBI) This is a little complicated GENE_ID 1:N PROTEIN_ACCESSION 1:N UniProtKB-ID Step 1 Create a 1:N relationship between NCBI genes and NCBI PROTEIN_ACCESSION This should have been done already in assignment 2 Remove from PROTEIN_ACCESSION the version part Everything after a. Step 2 Download the File gene_refseq_uniprot_collab from the NCBI web site Upload data into a database table (2 columns) Check that many PROTEIN_ACCESSIONs appear in the first column of this table
5 Task 3: Access UniProtKB Query all proteins whose genes are located in 21q22.1 (exact match) Should be around 50 You may use either of three ways (chose now! We want diversity!) Option 1: Use HTTP and parse flatfile See Option 1: Use HTTP and parse XML See Option 3: Use a Java library and parse nothing See : See also : Download/access/extract the following information per protein Taxonomy ID(s) All functional annotations with GO terms Update your database schema to add this information
6 Where is the data? XML <dbreference type="ncbi Taxonomy" key="2" id="10116"/> <dbreference type="go" key="32" id="go: "> Flatfile API OX NCBI_TaxID=10116; DR GO; GO: ; F:endopeptidase inhibitor activity; getncbitaxonomyids() getgoterms()
7 Which method? Group Method 1 API 2 API 3 Flatfile 4 XML 5 Flatfile 6 API 7 XML 10 Flatfile 11 API 12 XML 13 API 14 Flatfile 15 XML 16 API
8 Task 4: Compare Functional Annotation Knowledge about the function of a gene or protein is often highly diverging among data sources Let s see
9 Task 5: Queries Answer the following queries 1. How many different MAP_LOCATIONS did remain? 2. How many genes are located on each chromosome? 3. Compute a frequency table: How many NCBI genes are connected to 1,2, UNIPROTKB-IDs? 4. How many NCBI genes have a TAX_ID attached that is different from one of the TAX_IDs of at least one associated UniProtKB-ID? 5. How many UniProtKB GO associations did you collect for how many UniProtKB-ID s for how many NCBI genes (three numbers)? 6. How many gene-go associations exist in the NCBI data that are not in the UniProtKB data? 7. How many protein-go associations exist in the UniProtKB data that are not in the NCBI data? 8. How many NCBI genes have at least one diverging annotation 1. Either a GO term in NCBI / not in UniProt or vice versa
10 Competition Accessing a data source over the web can be slow or fast Access the requested information as fast as possible This includes HTTP connection, parsing, writing in your database, etc. Write a program that Takes as input a MAP_LOCATION Computes all NCBI genes at this location Retrieves all GO and TAX data for all associated UniProtKB proteins The program must be executable as is on gruenau2
11 Deliverables By Monday 8.12., or Wednesday 10.12, 23:59 o clock Three weeks Send by mail as ASCII An updated schema graph with the new table(s) for UniProt data Queries and answers for 8 questions For the competition: Executable code as specified on last slide
Module 1. Sequence Formats and Retrieval. Charles Steward
The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.
More informationOn-line supplement to manuscript Galaxy for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly
On-line supplement to manuscript Galaxy for collaborative analysis of ENCODE data: Making large-scale analyses biologist-friendly DANIEL BLANKENBERG, JAMES TAYLOR, IAN SCHENCK, JIANBIN HE, YI ZHANG, MATTHEW
More informationInformation Retrieval Elasticsearch
Information Retrieval Elasticsearch IR Information retrieval (IR) is the activity of obtaining information resources relevant to an information need from a collection of information resources. Searches
More informationICE Trade Vault. Public User & Technology Guide June 6, 2014
ICE Trade Vault Public User & Technology Guide June 6, 2014 This material may not be reproduced or redistributed in whole or in part without the express, prior written consent of IntercontinentalExchange,
More informationReusable Data Access Patterns
Reusable Data Access Patterns Gary Helmling, Software Engineer @gario HBaseCon 2015 - May 7 Agenda A brief look at data storage challenges How these challenges have influenced our work at Cask Exploration
More informationINTERNET PROGRAMMING AND DEVELOPMENT AEC LEA.BN Course Descriptions & Outcome Competency
INTERNET PROGRAMMING AND DEVELOPMENT AEC LEA.BN Course Descriptions & Outcome Competency 1. 420-PA3-AB Introduction to Computers, the Internet, and the Web This course is an introduction to the computer,
More informationPDA DRIVEN WAREHOUSE INVENTORY MANAGEMENT SYSTEM Sebastian Albert Master of Science in Technology sebastianpremraj@yahoo.com
PDA DRIVEN WAREHOUSE INVENTORY MANAGEMENT SYSTEM Sebastian Albert Master of Science in Technology sebastianpremraj@yahoo.com Abstract In times of economic slow-down, cutting costs is the major strategy
More informationDatabase manager does something that sounds trivial. It makes it easy to setup a new database for searching with Mascot. It also makes it easy to
1 Database manager does something that sounds trivial. It makes it easy to setup a new database for searching with Mascot. It also makes it easy to automate regular updates of these databases. 2 However,
More informationProtein Protein Interactions (PPI) APID (Agile Protein Interaction DataAnalyzer)
APID (Agile Protein Interaction DataAnalyzer) 23 APID (Agile Protein Interaction DataAnalyzer) Integrates and unifies 7 DBs: BIND, DIP, HPRD, IntAct, MINT, BioGRID. Includes 51,873 proteins 241,204 interactions
More informationLibrary page. SRS first view. Different types of database in SRS. Standard query form
SRS & Entrez SRS Sequence Retrieval System Bengt Persson Whatis SRS? Sequence Retrieval System User-friendly interface to databases http://srs.ebi.ac.uk Developed by Thure Etzold and co-workers EMBL/EBI
More informationMarcum LLP MFT Guide
MFT Guide Contents 1. Logging In...3 2. Installing the Upload Wizard...4 3. Uploading Files Using the Upload Wizard...5 4. Downloading Files Using the Upload Wizard...8 5. Frequently Asked Questions...9
More informationSimplifying e Business Collaboration by providing a Semantic Mapping Platform
Simplifying e Business Collaboration by providing a Semantic Mapping Platform Abels, Sven 1 ; Sheikhhasan Hamzeh 1 ; Cranner, Paul 2 1 TIE Nederland BV, 1119 PS Amsterdam, Netherlands 2 University of Sunderland,
More informationKam D. Dahlquist Department of Biology. John David N. Dionisio Department of Electrical Engineering & Computer Science
http://xmlpipedb.cs.lmu.edu Kam D. Dahlquist Department of Biology John David N. Dionisio Department of Electrical Engineering & Computer Science Loyola Marymount University A Reusable, Open Source Tool
More informationorg.rn.eg.db December 16, 2015 org.rn.egaccnum is an R object that contains mappings between Entrez Gene identifiers and GenBank accession numbers.
org.rn.eg.db December 16, 2015 org.rn.egaccnum Map Entrez Gene identifiers to GenBank Accession Numbers org.rn.egaccnum is an R object that contains mappings between Entrez Gene identifiers and GenBank
More informationFundamentals of Web Programming a
Fundamentals of Web Programming a Universal Description, Discovery, and Integration Teodor Rus rus@cs.uiowa.edu The University of Iowa, Department of Computer Science a Copyright 2009 Teodor Rus. These
More informationTalend Component tgoogleanalyticsmanagement
Talend Component tgoogleanalyticsmanagement Purpose This component addresses the needs of gathering Google Analytics data for a large number of profiles and fine-grained detail data. To prepare the queries
More informationWhat is Data Virtualization?
What is Data Virtualization? Rick F. van der Lans Data virtualization is receiving more and more attention in the IT industry, especially from those interested in data management and business intelligence.
More informationVisualizing an OrientDB Graph Database with KeyLines
Visualizing an OrientDB Graph Database with KeyLines Visualizing an OrientDB Graph Database with KeyLines 1! Introduction 2! What is a graph database? 2! What is OrientDB? 2! Why visualize OrientDB? 3!
More informationID of alternative translational initiation events. Description of gene function Reference of NCBI database access and relative literatures
Data resource: In this database, 650 alternatively translated variants assigned to a total of 300 genes are contained. These database records of alternative translational initiation have been collected
More informationCurrent Order Tool Experiences Complaints
Current Order Tool Experiences Complaints Log in unadvertised case sensitivity for email address that is used as login id CERES Dataset Info pages are too crowded!! On the Data Products Catalog page, remove
More informationPractical Database Design and Tuning
Practical Database Design and Tuning 1. Physical Database Design in Relational Databases Factors that Influence Physical Database Design: A. Analyzing the database queries and transactions For each query,
More informationsql-schema-comparer: Support of Multi-Language Refactoring with Relational Databases
sql-schema-comparer: Support of Multi-Language Refactoring with Relational Databases Hagen Schink Institute of Technical and Business Information Systems Otto-von-Guericke-University Magdeburg, Germany
More informationNaviCell Data Visualization Python API
NaviCell Data Visualization Python API Tutorial - Version 1.0 The NaviCell Data Visualization Python API is a Python module that let computational biologists write programs to interact with the molecular
More informationTutorial. Reference Genome Tracks. Sample to Insight. November 27, 2015
Reference Genome Tracks November 27, 2015 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com support-clcbio@qiagen.com Reference
More informationFiles. Files. Files. Files. Files. File Organisation. What s it all about? What s in a file?
Files What s it all about? Information being stored about anything important to the business/individual keeping the files. The simple concepts used in the operation of manual files are often a good guide
More informationKatta & Hadoop. Katta - Distributed Lucene Index in Production. Stefan Groschupf Scale Unlimited, 101tec. sg{at}101tec.com
1 Katta & Hadoop Katta - Distributed Lucene Index in Production Stefan Groschupf Scale Unlimited, 101tec. sg{at}101tec.com foto by: belgianchocolate@flickr.com 2 Intro Business intelligence reports from
More informationPower Tools for Pivotal Tracker
Power Tools for Pivotal Tracker Pivotal Labs Dezmon Fernandez Victoria Kay Eric Dattore June 16th, 2015 Power Tools for Pivotal Tracker 1 Client Description Pivotal Labs is an agile software development
More informationNCBI resources III: GEO and ftp site. Yanbin Yin Spring 2013
NCBI resources III: GEO and ftp site Yanbin Yin Spring 2013 1 Homework assignment 2 Search colon cancer at GEO and find a data Series and perform a GEO2R analysis Write a report (in word or ppt) to include
More informationSettlers of Catan Phase 1
Settlers of Catan Phase 1 Objective In this phase you will design, implement, and test the following subsystems: 1. Catan Model 2. Server Proxy 3. Server Poller Catan Model The Catan Model will be at the
More informationAbstract. Description
Project title: Bloodhound: Dynamic client-side autocompletion features for the Apache Bloodhound ticket system Name: Sifa Sensay Student e-mail: sifasensay@gmail.com Student Major: Software Engineering
More informationDALHOUSIE NOTES ON PAYROLL EXPENSE DETAIL IN FINANCE SELF SERVICE. QUICK REFERENCE As of September 1, 2015
DALHOUSIE NOTES ON PAYROLL EXPENSE DETAIL IN FINANCE SELF SERVICE QUICK REFERENCE As of September 1, 2015 Quick reference document outlining the basic steps to access the payroll expense detail results
More informationProseminar Wissenschaftliches Arbeiten. Ulf Leser
Proseminar Wissenschaftliches Arbeiten Ulf Leser Proseminar We want to teach you how to approach a scientific topic to find scientific literature and discern relevant from irrelevant to systematically
More informationMA-WA1920: Enterprise iphone and ipad Programming
MA-WA1920: Enterprise iphone and ipad Programming Description This 5 day iphone training course teaches application development for the ios platform. It covers iphone, ipad and ipod Touch devices. This
More information14 Databases. Source: Foundations of Computer Science Cengage Learning. Objectives After studying this chapter, the student should be able to:
14 Databases 14.1 Source: Foundations of Computer Science Cengage Learning Objectives After studying this chapter, the student should be able to: Define a database and a database management system (DBMS)
More informationExtracting Your Company s Data with the New Audit Data Standard
Extracting Your Company s Data with the New Audit Data Standard Written by Kristine Hasenstab and Eric E. Cohen Have you ever been responsible for meeting an internal or external auditor's request for
More informationIP phone services setup
IP phone services setup This chapter provides information to configure IP phone services. About IP phone service setup, page 1 IP phone service deletion, page 2 IP phone service settings, page 3 IP phone
More information000-420. IBM InfoSphere MDM Server v9.0. Version: Demo. Page <<1/11>>
000-420 IBM InfoSphere MDM Server v9.0 Version: Demo Page 1. As part of a maintenance team for an InfoSphere MDM Server implementation, you are investigating the "EndDate must be after StartDate"
More informationSequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011
Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear
More informationJet Data Manager 2012 User Guide
Jet Data Manager 2012 User Guide Welcome This documentation provides descriptions of the concepts and features of the Jet Data Manager and how to use with them. With the Jet Data Manager you can transform
More informationMBARI Deep Sea Guide: Designing a web interface that represents information about the Monterey Bay deep-sea world.
MBARI Deep Sea Guide: Designing a web interface that represents information about the Monterey Bay deep-sea world. Pierre Venuat, University of Poitiers Mentors: Brian Schlining and Nancy Jacobsen Stout
More informationExam Name: IBM InfoSphere MDM Server v9.0
Vendor: IBM Exam Code: 000-420 Exam Name: IBM InfoSphere MDM Server v9.0 Version: DEMO 1. As part of a maintenance team for an InfoSphere MDM Server implementation, you are investigating the "EndDate must
More informationLecture 11 Data storage and LIMS solutions. Stéphane LE CROM lecrom@biologie.ens.fr
Lecture 11 Data storage and LIMS solutions Stéphane LE CROM lecrom@biologie.ens.fr Various steps of a DNA microarray experiment Experimental steps Data analysis Experimental design set up Chips on catalog
More informationBuilding Java Servlets with Oracle JDeveloper
Building Java Servlets with Oracle JDeveloper Chris Schalk Oracle Corporation Introduction Developers today face a formidable task. They need to create large, distributed business applications. The actual
More informationDrupal CMS for marketing sites
Drupal CMS for marketing sites Intro Sample sites: End to End flow Folder Structure Project setup Content Folder Data Store (Drupal CMS) Importing/Exporting Content Database Migrations Backend Config Unit
More informationMission-Critical Database with Real-Time Search for Big Data
Mission-Critical Database with Real-Time Search for Big Data February 17, 2012 Slide 1 Overview About MarkLogic Why MarkLogic Case Studies Technology and Features Slide 2 About MarkLogic 10 years in business
More informationDNA Sequence formats
DNA Sequence formats [Plain] [EMBL] [FASTA] [GCG] [GenBank] [IG] [IUPAC] [How Genomatix represents sequence annotation] Plain sequence format A sequence in plain format may contain only IUPAC characters
More informationUser Guide. Trade Finance Global. Reports Centre. October 2015. nordea.com/cm OR tradefinance Name of document 8/8 2015/V1
User Guide Trade Finance Global Reports Centre October 2015 nordea.com/cm OR tradefinance Name of document 2015/V1 8/8 Table of Contents 1 Trade Finance Global (TFG) Reports Centre Overview... 4 1.1 Key
More informationWhen you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want
1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very
More informationWhat is Data Virtualization? Rick F. van der Lans, R20/Consultancy
What is Data Virtualization? by Rick F. van der Lans, R20/Consultancy August 2011 Introduction Data virtualization is receiving more and more attention in the IT industry, especially from those interested
More informationRDS Building Centralized Monitoring and Control
RDS Building Centralized Monitoring and Control 1. Overview This document explains the concept and differing options for the monitoring and control of RDS replication over your network. The very basic
More informationEfficiently Identifying Inclusion Dependencies in RDBMS
Efficiently Identifying Inclusion Dependencies in RDBMS Jana Bauckmann Department for Computer Science, Humboldt-Universität zu Berlin Rudower Chaussee 25, 12489 Berlin, Germany bauckmann@informatik.hu-berlin.de
More informationIntroduction to the SIF 3.0 Infrastructure: An Environment for Educational Data Exchange
Introduction to the SIF 3.0 Infrastructure: An Environment for Educational Data Exchange SIF 3.0 Infrastructure Goals of the Release Environment types & types Registration & Security Basic Architectural
More informationERP guide and questions
Scandiweb ERP guide and questions Sometimes client asks you a short questions - how much would cost integration of customers ERP? And sometimes they provide some data on objects to be imported e.g. products,
More informationProSightPC 3.0 Quick Start Guide
ProSightPC 3.0 Quick Start Guide The Thermo ProSightPC 3.0 application is the only proteomics software suite that effectively supports high-mass-accuracy MS/MS experiments performed on LTQ FT and LTQ Orbitrap
More informationSemantic Stored Procedures Programming Environment and performance analysis
Semantic Stored Procedures Programming Environment and performance analysis Marjan Efremov 1, Vladimir Zdraveski 2, Petar Ristoski 2, Dimitar Trajanov 2 1 Open Mind Solutions Skopje, bul. Kliment Ohridski
More informationPC USER S GUIDE ECONOMIC DATA ST. LOUIS FED
FRED ADD-IN PC USER S GUIDE F OR EXCEL 2010 E C O N O M I C R E S E A R C H F E D E R A L R E S E R V E B A N K O F S T. L O U I S P. O. B O X 4 4 2 S A I N T L O U I S, M I S S O U R I, 6 3 1 6 6 R E
More informationPDQ-Wizard Prototype 1.0 Installation Guide
PDQ-Wizard Prototype 1.0 Installation Guide University of Edinburgh 2005 GTI and edikt 1. Introduction This document is for users who want set up the PDQ-Wizard system. It includes how to configure the
More informationManage Workflows. Workflows and Workflow Actions
On the Workflows tab of the Cisco Finesse administration console, you can create and manage workflows and workflow actions. Workflows and Workflow Actions, page 1 Add Browser Pop Workflow Action, page
More informationitunes Store Publisher User Guide Version 1.1
itunes Store Publisher User Guide Version 1.1 Version Date Author 1.1 10/09/13 William Goff Table of Contents Table of Contents... 2 Introduction... 3 itunes Console Advantages... 3 Getting Started...
More informationSave time, money and effort with a CRM platform designed for Local Authorities
Save time, money and effort with a CRM platform designed for Local Authorities Berni Simmons & Rachel Clinton Abavus Ltd. www.abavus.co.uk Frequently Asked Questions Is this session being recorded? Yes
More informationThe end. Carl Nettelblad 2015-06-04
The end Carl Nettelblad 2015-06-04 The exam and end of the course Don t forget the course evaluation! Closing tomorrow, Friday Project upload deadline tonight Book presentation appointments with Kalyan
More informationHow to Choose the Right Data Storage Format for Your Measurement System
1 How to Choose the Right Data Storage Format for Your Measurement System Overview For many new measurement systems, choosing the right data storage approach is an afterthought. Engineers often end up
More informationVisualizing Data: Scalable Interactivity
Visualizing Data: Scalable Interactivity The best data visualizations illustrate hidden information and structure contained in a data set. As access to large data sets has grown, so has the need for interactive
More informationComparison of Triple Stores
Comparison of Triple Stores Abstract In this report we present evaluation of triple stores. We present load times and discuss the inferencing capabilities of Jena SDB backed with MySQL, Sesame native,
More informationACCESS 2007. Importing and Exporting Data Files. Information Technology. MS Access 2007 Users Guide. IT Training & Development (818) 677-1700
Information Technology MS Access 2007 Users Guide ACCESS 2007 Importing and Exporting Data Files IT Training & Development (818) 677-1700 training@csun.edu TABLE OF CONTENTS Introduction... 1 Import Excel
More information1 File Processing Systems
COMP 378 Database Systems Notes for Chapter 1 of Database System Concepts Introduction A database management system (DBMS) is a collection of data and an integrated set of programs that access that data.
More informationHow to design a database for electronic data capture in
How to design a database for electronic data capture in 1 Outline REDCap background REDCap features - System features - Database design features - Data Entry(DE) & management features Real- Bme CRF creabon
More informationStructured Content: the Key to Agile. Web Experience Management. Introduction
Structured Content: the Key to Agile CONTENTS Introduction....................... 1 Structured Content Defined...2 Structured Content is Intelligent...2 Structured Content and Customer Experience...3 Structured
More informationMETHODS IN MEDICAL INFORMATICS
Chapman & Hall/CRC Mathematical and Computational Biology Series METHODS IN MEDICAL INFORMATICS Fundamentals of Healthcare Programming in Perln Pythoni and Ruby Jules J- Berman TECHNISCHE INFORMATION SBIBLIOTHEK
More informationSimplyCast emarketing Email User Guide
SimplyCast emarketing Email User Guide Email User Guide Page 1 Contents 1. Email Overview... 3 2. Features Overview... 3 3. Email Editor Features... 8 4. How to Create an Email Campaign?... 5 5. Additional
More informationLesson 4 Web Service Interface Definition (Part I)
Lesson 4 Web Service Interface Definition (Part I) Service Oriented Architectures Module 1 - Basic technologies Unit 3 WSDL Ernesto Damiani Università di Milano Interface Definition Languages (1) IDLs
More informationPHP Language Binding Guide For The Connection Cloud Web Services
PHP Language Binding Guide For The Connection Cloud Web Services Table Of Contents Overview... 3 Intended Audience... 3 Prerequisites... 3 Term Definitions... 3 Introduction... 4 What s Required... 5 Language
More informationMIB Explorer Feature Matrix
MIB Explorer Feature Matrix Lite Pro Android Standards and Protocols Supported SNMPv1 (RFC 1157), SNMPv2c (RFC 1901/1905), and SNMPv3 (RFC 3412-3417). Transport Protocols UDP, TCP, and. All transport protocols
More informationXML Processing and Web Services. Chapter 17
XML Processing and Web Services Chapter 17 Textbook to be published by Pearson Ed 2015 in early Pearson 2014 Fundamentals of http://www.funwebdev.com Web Development Objectives 1 XML Overview 2 XML Processing
More informationEnterprise Service Bus
We tested: Talend ESB 5.2.1 Enterprise Service Bus Dr. Götz Güttich Talend Enterprise Service Bus 5.2.1 is an open source, modular solution that allows enterprises to integrate existing or new applications
More informationMigrate from Exchange Public Folders to Business Productivity Online Standard Suite
Migrate from Exchange Public Folders to Business Productivity Online Standard Suite White Paper Microsoft Corporation Published: July 2009 Information in this document, including URL and other Internet
More informationThis document presents the new features available in ngklast release 4.4 and KServer 4.2.
This document presents the new features available in ngklast release 4.4 and KServer 4.2. 1) KLAST search engine optimization ngklast comes with an updated release of the KLAST sequence comparison tool.
More informationBlueJ Teamwork Tutorial
BlueJ Teamwork Tutorial Version 2.0 for BlueJ Version 2.5.0 (and 2.2.x) Bruce Quig, Davin McCall School of Engineering & IT, Deakin University Contents 1 OVERVIEW... 3 2 SETTING UP A REPOSITORY... 3 3
More informationManaging large sound databases using Mpeg7
Max Jacob 1 1 Institut de Recherche et Coordination Acoustique/Musique (IRCAM), place Igor Stravinsky 1, 75003, Paris, France Correspondence should be addressed to Max Jacob (max.jacob@ircam.fr) ABSTRACT
More informationICE econfirm. FAQs April 2012
ICE econfirm FAQs April 2012 This material may not be reproduced or redistributed in whole or in part without the express, prior written consent of IntercontinentalExchange, Inc. Copyright Intercontinental
More informationThe human gene encoding Glucose-6-phosphate dehydrogenase (G6PD) is located on chromosome X in cytogenetic band q28.
Tutorial Module 5 BioMart You will learn about BioMart, a joint project developed and maintained at EBI and OiCR www.biomart.org How to use BioMart to quickly obtain lists of gene information from Ensembl
More informationUsing Databases in R
Using Databases in R Marc Carlson Fred Hutchinson Cancer Research Center May 20, 2010 Introduction Example Databases: The GenomicFeatures Package Basic SQL Using SQL from within R Outline Introduction
More informationCity Data Pipeline. A System for Making Open Data Useful for Cities. stefan.bischof@tuwien.ac.at
City Data Pipeline A System for Making Open Data Useful for Cities Stefan Bischof 1,2, Axel Polleres 1, and Simon Sperl 1 1 Siemens AG Österreich, Siemensstraße 90, 1211 Vienna, Austria {bischof.stefan,axel.polleres,simon.sperl}@siemens.com
More informationA Tutorial in Genetic Sequence Classification Tools and Techniques
A Tutorial in Genetic Sequence Classification Tools and Techniques Jake Drew Data Mining CSE 8331 Southern Methodist University jakemdrew@gmail.com www.jakemdrew.com Sequence Characters IUPAC nucleotide
More informationA Complete Example of Next- Gen DNA Sequencing Read Alignment. Presentation Title Goes Here
A Complete Example of Next- Gen DNA Sequencing Read Alignment Presentation Title Goes Here 1 FASTQ Format: The de- facto file format for sharing sequence read data Sequence and a per- base quality score
More informationWhen to consider OLAP?
When to consider OLAP? Author: Prakash Kewalramani Organization: Evaltech, Inc. Evaltech Research Group, Data Warehousing Practice. Date: 03/10/08 Email: erg@evaltech.com Abstract: Do you need an OLAP
More informationPPInterFinder A Web Server for Mining Human Protein Protein Interaction
PPInterFinder A Web Server for Mining Human Protein Protein Interaction Kalpana Raja, Suresh Subramani, Jeyakumar Natarajan Data Mining and Text Mining Laboratory, Department of Bioinformatics, Bharathiar
More informationXML Programming with PHP and Ajax
http://www.db2mag.com/story/showarticle.jhtml;jsessionid=bgwvbccenyvw2qsndlpskh0cjunn2jvn?articleid=191600027 XML Programming with PHP and Ajax By Hardeep Singh Your knowledge of popular programming languages
More informationmetaengine DataConnect For SharePoint 2007 Configuration Guide
metaengine DataConnect For SharePoint 2007 Configuration Guide metaengine DataConnect for SharePoint 2007 Configuration Guide (2.4) Page 1 Contents Introduction... 5 Installation and deployment... 6 Installation...
More informationPronestor Room & Catering
Pronestor Room & Catering Module 2 Installation of additional modules Page 2.0 2.9 User import (AD integration) Page 2.1 2.4 o Service Accounts (hosted and on-premises) o Active Directory Structure o Installation
More informationCopyright 2012, Jeffrey S. Krause, Esq. and Lawtopia, LLC
Lawyers bill for time but sell words Communication is a lawyers stock in trade Printed documents, electronic documents, and emails are all communications/documents that you might want to find or reuse
More informationSAS MDM 4.2. User s Guide. SAS Documentation
SAS MDM 4.2 User s Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2015. SAS MDM 4.2: User's Guide. Cary, NC: SAS Institute Inc. SAS MDM 4.2:
More informationEPiServer and XForms - The Next Generation of Web Forms
EPiServer and XForms - The Next Generation of Web Forms How EPiServer's forms technology allows Web site editors to easily create forms, and developers to customize form behavior and appearance. WHITE
More informationHTTP and HTTPS Statistics Services
CHAPTER 9 This chapter describes the HTTP and HTTPS Statistics service, which returns HTTP and HTTPS connection information and statistics for individual WAEs, device groups, and for the WAAS network,
More informationQlik REST Connector Installation and User Guide
Qlik REST Connector Installation and User Guide Qlik REST Connector Version 1.0 Newton, Massachusetts, November 2015 Authored by QlikTech International AB Copyright QlikTech International AB 2015, All
More informationSAS MDM 4.1. User s Guide Second Edition. SAS Documentation
SAS MDM 4.1 User s Guide Second Edition SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2014. SAS MDM 4.1: User's Guide, Second Edition. Cary, NC:
More informationHow To Use The Assembly Database In A Microarray (Perl) With A Microarcode) (Perperl 2) (For Macrogenome) (Genome 2)
The Ensembl Core databases and API Useful links Installation instructions: http://www.ensembl.org/info/docs/api/api_installation.html Schema description: http://www.ensembl.org/info/docs/api/core/core_schema.html
More informationData Mailbox. support.ewon.biz. Reference Guide
Reference Guide RG 005-0-EN / Rev. 1.0 Data Mailbox The Data Mailbox is a Talk2M service that gathers ewon historical data and makes it available for third party applications in an easy way. support.ewon.biz
More informationTop 10 Things to Know about WRDS
Top 10 Things to Know about WRDS 1. Do I need special software to use WRDS? WRDS was built to allow users to use standard and popular software. There is no WRDSspecific software to install. For example,
More information