EMBL-EBI. 3D databases and data warehouse technology

Size: px
Start display at page:

Download "EMBL-EBI. 3D databases and data warehouse technology"

Transcription

1 3D databases and data warehouse technology

2 Overview Overall Strategy Terms and background Populating the databases Clean up processes How can I use the database? What next

3 What is a database? By the term database we refer to the system rather than the data Indexed file space Also used as a shorthand for a database management system (DBMS) Methods for accessing and changing data Controls for referential integrity

4 Normalisation Data fields in a normalised database appear only once CHAIN ID A attr 185 RESIDUE CHAIN ID A A SEQ COMP ID ASP COMPONENT Data fields in a denormalised database are repeated in different places 1 2 LYS ID ASP attr -1 LYS +1 CHAIN ID attr A 185 RESIDUE CHAIN ID A SEQ 1 COMP ID ASP A 2 LYS CHAINattr COMPattr COMPONENT ID attr ASP -1 LYS +1

5 Structural hierarchy assembly molecule (entity) chain residue

6 ASU and assemblies assembly ASU chain chain residues residues

7 The pipeline archive services edited cif archive DB data warehouse manual edit post-load processes distribution

8 The first steps archive services edited cif archive DB data warehouse manual edit post-load processes distribution

9 The first steps A series of scripts Parses non-standard header records Fills in chain identifiers Outputs a first cut clean file Manual editing ~1000 entries require manual editing The result is a format file that can be passed to the subsequent automatic steps

10 bizarre errors 1ew1 ATOM 47 N6 A A ATOM 59 1H6 A A ATOM 60 2H6 A A ATOM 47 N6 A A ATOM 59 1H6 A A ATOM 60 2H6 A A

11 automatic processing archive services edited cif archive DB data warehouse manual edit post-load processes distribution

12 process details Automatic cleanup (d2c) Incorporates quaternary structure information Runs a lot of checks and corrections Outputs mmcif file Loading Metadata-driven custom loader Load through views with insert triggers Many heuristics also applied to data within these triggers

13 Using reference data Variations in legacy data Hinders accurate searches Hinders links to other services Match data against controlled vocabularies Within scripts Within database during load Semi-automated Use string matching algorithms Effective when controlled vocabulary well maintained $COLI COLI E. COLI E.COLI ESCHERCHIA COLI ESCHERICHI $COLI ESCHERICHIA $ COLI ESCHERICHIA $COLI ESCHERICHIA COLI ESCHERICHIA COLI. EXCHERICHIA COLI EXPRESCHERICHIA COLI

14 Chemical Components More difficult to deal with Where coordinates and nomenclature do not agree, have to make a judgement on which, if either, are correct We maintain a curated database of compounds, against which legacy data is compared atom nomenclature ongoing; relatively easy to correct where the compound has been correctly identified Stereochemistry may indicate that the compound name is incorrect

15 Ligand nomenclature Ligands are often named inconsistently or even entirely incorrectly, e.g. -Dmannose (MAN) vs. -D-mannose (BMA) MAN Errors are detected using a graphbased structure comparison algorithm BMA

16 not all cases resolvable 1d7t DTY 4 in chain A, model 1 - is it D or L?? HEADER DE NOVO PROTEIN 19-OCT-99 1D7T TITLE NMR STRUCTURE OF AN ENGINEERED CONTRYPHAN CYCLIC PEPTIDE TITLE 2 (MOTIF CPXXPXC) MODRES 1D7T DTY A 4 TYR D-TYROSINE HET DTY A 4 21 HETNAM DTY D-TYROSINE FORMUL 1 DTY C9 H11 N1 O3

17 post-load processing archive services edited cif archive DB data warehouse manual edit post-load processes distribution

18 process details Involved in deriving data and building crosslinks to other services Geometric information Analysing non-polymer components and assembling full entities from individual components Links to taxonomy and sequence databases

19 transformation to DW archive services edited cif archive DB data warehouse manual edit post-load processes distribution

20 process details Set of SQL scripts Supports Oracle (routinely) and MySQL (development) Periodically undertake full transform takes a couple of weeks Provide weekly incremental patches much faster Supports transforms into different data marts

21 coming soon Continuing cleanup HET group curation Sequence cross-references Citations More choice on downloads Data marts (even single tables) Groups of entries Release of clean files (end 2006)

22 who did what archive services edited cif archive DB data warehouse manual edit post-load processes distribution

Data Warehouse Technology And The MSD Databases

Data Warehouse Technology And The MSD Databases Data Warehouse Technology And The MSD Databases Philip McNeil Data Warehouses The MSD Databases Populating & using the Search Database Data Warehouses What is a Data Warehouse? A subject-oriented, integrated,

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 See Markers-ORDER-DB Logically Related Tables Relational Approach: Physically Related Tables: The Relationship Screen

More information

CSPP 53017: Data Warehousing Winter 2013" Lecture 6" Svetlozar Nestorov" " Class News

CSPP 53017: Data Warehousing Winter 2013 Lecture 6 Svetlozar Nestorov  Class News CSPP 53017: Data Warehousing Winter 2013 Lecture 6 Svetlozar Nestorov Class News Homework 4 is online Due by Tuesday, Feb 26. Second 15 minute in-class quiz today at 6:30pm Open book/notes Last 15 minute

More information

Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011

Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011 Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear

More information

Databases and Information Management

Databases and Information Management Databases and Information Management Reading: Laudon & Laudon chapter 5 Additional Reading: Brien & Marakas chapter 3-4 COMP 5131 1 Outline Database Approach to Data Management Database Management Systems

More information

ETL Overview. Extract, Transform, Load (ETL) Refreshment Workflow. The ETL Process. General ETL issues. MS Integration Services

ETL Overview. Extract, Transform, Load (ETL) Refreshment Workflow. The ETL Process. General ETL issues. MS Integration Services ETL Overview Extract, Transform, Load (ETL) General ETL issues ETL/DW refreshment process Building dimensions Building fact tables Extract Transformations/cleansing Load MS Integration Services Original

More information

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya

Chapter 6 Basics of Data Integration. Fundamentals of Business Analytics RN Prasad and Seema Acharya Chapter 6 Basics of Data Integration Fundamentals of Business Analytics Learning Objectives and Learning Outcomes Learning Objectives 1. Concepts of data integration 2. Needs and advantages of using data

More information

Relational Databases for the Business Analyst

Relational Databases for the Business Analyst Relational Databases for the Business Analyst Mark Kurtz Sr. Systems Consulting Quest Software, Inc. mark.kurtz@quest.com 2010 Quest Software, Inc. ALL RIGHTS RESERVED Agenda The RDBMS and its role in

More information

The Data Warehouse ETL Toolkit

The Data Warehouse ETL Toolkit 2008 AGI-Information Management Consultants May be used for personal purporses only or by libraries associated to dandelon.com network. The Data Warehouse ETL Toolkit Practical Techniques for Extracting,

More information

THE DATA WAREHOUSE ETL TOOLKIT CDT803 Three Days

THE DATA WAREHOUSE ETL TOOLKIT CDT803 Three Days Three Days Prerequisites Students should have at least some experience with any relational database management system. Who Should Attend This course is targeted at technical staff, team leaders and project

More information

DBMS Project. COP5725 - Spring 2011. Final Submission Report

DBMS Project. COP5725 - Spring 2011. Final Submission Report DBMS Project COP5725 - Spring 2011 Final Submission Report Chandra Shekar # 6610-6717 Nitin Gujral # 4149-1481 Rajesh Sindhu # 4831-2035 Shrirama Tejasvi # 7521-6735 LINK TO PROJECT Project Website : www.cise.ufl.edu/~mallela

More information

PassTest. Bessere Qualität, bessere Dienstleistungen!

PassTest. Bessere Qualität, bessere Dienstleistungen! PassTest Bessere Qualität, bessere Dienstleistungen! Q&A Exam : 070-463 Title : Implementing a Data Warehouse with Microsoft SQL Server 2012 Version : DEMO 1 / 11 1.You are developing a project that contains

More information

Topics. Database Essential Concepts. What s s a Good Database System? Using Database Software. Using Database Software. Types of Database Programs

Topics. Database Essential Concepts. What s s a Good Database System? Using Database Software. Using Database Software. Types of Database Programs Topics Software V:. Database concepts: records, fields, data types. Relational and objectoriented databases. Computer maintenance and operation: storage health and utilities; back-up strategies; keeping

More information

Extraction Transformation Loading ETL Get data out of sources and load into the DW

Extraction Transformation Loading ETL Get data out of sources and load into the DW Lection 5 ETL Definition Extraction Transformation Loading ETL Get data out of sources and load into the DW Data is extracted from OLTP database, transformed to match the DW schema and loaded into the

More information

Service Oriented Data Management

Service Oriented Data Management Service Oriented Management Nabin Bilas Integration Architect Integration & SOA: Agenda Integration Overview 5 Reasons Why Is Critical to SOA Oracle Integration Solution Integration

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 Copyright 2011 Pearson Education, Inc. Student Learning Objectives How does a relational database organize data,

More information

Transaction Management Overview

Transaction Management Overview Transaction Management Overview Chapter 16 Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke 1 Transactions Concurrent execution of user programs is essential for good DBMS performance. Because

More information

Whitepaper. Data Warehouse/BI Testing Offering YOUR SUCCESS IS OUR FOCUS. Published on: January 2009 Author: BIBA PRACTICE

Whitepaper. Data Warehouse/BI Testing Offering YOUR SUCCESS IS OUR FOCUS. Published on: January 2009 Author: BIBA PRACTICE YOUR SUCCESS IS OUR FOCUS Whitepaper Published on: January 2009 Author: BIBA PRACTICE 2009 Hexaware Technologies. All rights reserved. Table of Contents 1. 2. Data Warehouse - Typical pain points 3. Hexaware

More information

Peptide bonds: resonance structure. Properties of proteins: Peptide bonds and side chains. Dihedral angles. Peptide bond. Protein physics, Lecture 5

Peptide bonds: resonance structure. Properties of proteins: Peptide bonds and side chains. Dihedral angles. Peptide bond. Protein physics, Lecture 5 Protein physics, Lecture 5 Peptide bonds: resonance structure Properties of proteins: Peptide bonds and side chains Proteins are linear polymers However, the peptide binds and side chains restrict conformational

More information

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2 Class Announcements TIM 50 - Business Information Systems Lecture 15 Database Assignment 2 posted Due Tuesday 5/26 UC Santa Cruz May 19, 2015 Database: Collection of related files containing records on

More information

Data Migration Strategy in ERP Introduction:

Data Migration Strategy in ERP Introduction: Introduction: Along with optimum design of chart of account structure, effective and efficient migration of data is one of the cornerstones for the success of an ERP implementation. Since a number of questions

More information

ER/Studio 8.0 New Features Guide

ER/Studio 8.0 New Features Guide ER/Studio 8.0 New Features Guide Copyright 1994-2008 Embarcadero Technologies, Inc. Embarcadero Technologies, Inc. 100 California Street, 12th Floor San Francisco, CA 94111 U.S.A. All rights reserved.

More information

Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK

Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK Advanced Medicinal & Pharmaceutical Chemistry CHEM 5412 Dept. of Chemistry, TAMUK Dai Lu, Ph.D. dlu@tamhsc.edu Tel: 361-221-0745 Office: RCOP, Room 307 Drug Discovery and Development Drug Molecules Medicinal

More information

HP Quality Center. Upgrade Preparation Guide

HP Quality Center. Upgrade Preparation Guide HP Quality Center Upgrade Preparation Guide Document Release Date: November 2008 Software Release Date: November 2008 Legal Notices Warranty The only warranties for HP products and services are set forth

More information

Benefits of Normalisation in a Data Base - Part 1

Benefits of Normalisation in a Data Base - Part 1 Denormalisation (But not hacking it) Denormalisation: Why, What, and How? Rodgers Oracle Performance Tuning Corrigan/Gurry Ch. 5, p69 Stephen Mc Kearney, 2001. 1 Overview Purpose of normalisation Methods

More information

How, What, and Where of Data Warehouses for MySQL

How, What, and Where of Data Warehouses for MySQL How, What, and Where of Data Warehouses for MySQL Robert Hodges CEO, Continuent. Introducing Continuent The leading provider of clustering and replication for open source DBMS Our Product: Continuent Tungsten

More information

An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives

An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives An EVIDENCE-ENHANCED HEALTHCARE ECOSYSTEM for Cancer: I/T perspectives Chalapathy Neti, Ph.D. Associate Director, Healthcare Transformation, Shahram Ebadollahi, Ph.D. Research Staff Memeber IBM Research,

More information

ORACLE ENTERPRISE DATA QUALITY PRODUCT FAMILY

ORACLE ENTERPRISE DATA QUALITY PRODUCT FAMILY ORACLE ENTERPRISE DATA QUALITY PRODUCT FAMILY The Oracle Enterprise Data Quality family of products helps organizations achieve maximum value from their business critical applications by delivering fit

More information

Vojtěch Kupča (vojtech@hafro.is), Peter Sandbeck (pes@dfu.min.dk) Marine Research Institute, Reykjavik, Iceland January 2003

Vojtěch Kupča (vojtech@hafro.is), Peter Sandbeck (pes@dfu.min.dk) Marine Research Institute, Reykjavik, Iceland January 2003 DST 2 data warehouse structure and data import Vojtěch Kupča (vojtech@hafro.is), Peter Sandbeck (pes@dfu.min.dk) Marine Research Institute, Reykjavik, Iceland January 2003 This paper covers description

More information

Data Hierarchy. Traditional File based Approach. Hierarchy of Data for a Computer-Based File

Data Hierarchy. Traditional File based Approach. Hierarchy of Data for a Computer-Based File Management Information Systems Data and Knowledge Management Dr. Shankar Sundaresan (Adapted from Introduction to IS, Rainer and Turban) LEARNING OBJECTIVES Recognize the importance of data, issues involved

More information

SQL Server. 2012 for developers. murach's TRAINING & REFERENCE. Bryan Syverson. Mike Murach & Associates, Inc. Joel Murach

SQL Server. 2012 for developers. murach's TRAINING & REFERENCE. Bryan Syverson. Mike Murach & Associates, Inc. Joel Murach TRAINING & REFERENCE murach's SQL Server 2012 for developers Bryan Syverson Joel Murach Mike Murach & Associates, Inc. 4340 N. Knoll Ave. Fresno, CA 93722 www.murach.com murachbooks@murach.com Expanded

More information

GeoKettle: A powerful open source spatial ETL tool

GeoKettle: A powerful open source spatial ETL tool GeoKettle: A powerful open source spatial ETL tool FOSS4G 2010 Dr. Thierry Badard, CTO Spatialytics inc. Quebec, Canada tbadard@spatialytics.com Barcelona, Spain Sept 9th, 2010 What is GeoKettle? It is

More information

D61830GC30. MySQL for Developers. Summary. Introduction. Prerequisites. At Course completion After completing this course, students will be able to:

D61830GC30. MySQL for Developers. Summary. Introduction. Prerequisites. At Course completion After completing this course, students will be able to: D61830GC30 for Developers Summary Duration Vendor Audience 5 Days Oracle Database Administrators, Developers, Web Administrators Level Technology Professional Oracle 5.6 Delivery Method Instructor-led

More information

Refinement of a pdb-structure and Convert

Refinement of a pdb-structure and Convert Refinement of a pdb-structure and Convert A. Search for a pdb with the closest sequence to your protein of interest. B. Choose the most suitable entry (or several entries). C. Convert and resolve errors

More information

Flatten from/to Relational

Flatten from/to Relational Flatten from/to Relational Jácome Cunha João Saraiva Joost Visser CIC 2007 22-23 October Universidade do Minho Software Improvement Group Jácome Cunha Flatten from/to Relational 1 / 21 Overview Jácome

More information

THE CAMBRIDGE CRYSTALLOGRAPHIC DATA CENTRE (CCDC)

THE CAMBRIDGE CRYSTALLOGRAPHIC DATA CENTRE (CCDC) ABOUT THE CAMBRIDGE CRYSTALLOGRAPHIC DATA CENTRE (CCDC) The CCDC is the trusted research institution responsible for the 50-year old Cambridge Structural Database (CSD) and its applications. Used by thousands

More information

Data Consistency Management Overview January 2014. Customer

Data Consistency Management Overview January 2014. Customer Data Consistency Management Overview January 2014 Customer Agenda Motivation SAP Solution Manager as Tool for Data Consistency Management Transactional Correctness (TC) Guided Self Service Data Consistency

More information

SQL Server 2012. Integration Services. Design Patterns. Andy Leonard. Matt Masson Tim Mitchell. Jessica M. Moss. Michelle Ufford

SQL Server 2012. Integration Services. Design Patterns. Andy Leonard. Matt Masson Tim Mitchell. Jessica M. Moss. Michelle Ufford SQL Server 2012 Integration Services Design Patterns Andy Leonard Matt Masson Tim Mitchell Jessica M. Moss Michelle Ufford Contents J Foreword About the Authors About the Technical Reviewers Acknowledgments

More information

SGI. High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems. January, 2012. Abstract. Haruna Cofer*, PhD

SGI. High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems. January, 2012. Abstract. Haruna Cofer*, PhD White Paper SGI High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems Haruna Cofer*, PhD January, 2012 Abstract The SGI High Throughput Computing (HTC) Wrapper

More information

Transitioning to a Data Driven Enterprise - What is A Data Strategy and Why Do You Need One?

Transitioning to a Data Driven Enterprise - What is A Data Strategy and Why Do You Need One? Transitioning to a Data Driven Enterprise - What is A Data Strategy and Why Do You Need One? Mike Ferguson Managing Director Intelligent Business Strategies Information Builders Data Strategy Workshop

More information

LearnFromGuru Polish your knowledge

LearnFromGuru Polish your knowledge SQL SERVER 2008 R2 /2012 (TSQL/SSIS/ SSRS/ SSAS BI Developer TRAINING) Module: I T-SQL Programming and Database Design An Overview of SQL Server 2008 R2 / 2012 Available Features and Tools New Capabilities

More information

The peptide bond is rigid and planar

The peptide bond is rigid and planar Level Description Bonds Primary Sequence of amino acids in proteins Covalent (peptide bonds) Secondary Structural motifs in proteins: α- helix and β-sheet Hydrogen bonds (between NH and CO groups in backbone)

More information

Application note: SQL@CHIP Connecting the IPC@CHIP to a Database

Application note: SQL@CHIP Connecting the IPC@CHIP to a Database Application note: SQL@CHIP Connecting the IPC@CHIP to a Database 1. Introduction This application note describes how to connect an IPC@CHIP to a database and exchange data between those. As there are no

More information

Demystified CONTENTS Acknowledgments xvii Introduction xix CHAPTER 1 Database Fundamentals CHAPTER 2 Exploring Relational Database Components

Demystified CONTENTS Acknowledgments xvii Introduction xix CHAPTER 1 Database Fundamentals CHAPTER 2 Exploring Relational Database Components Acknowledgments xvii Introduction xix CHAPTER 1 Database Fundamentals 1 Properties of a Database 1 The Database Management System (DBMS) 2 Layers of Data Abstraction 3 Physical Data Independence 5 Logical

More information

The Year-end close procedures for Analytical Accounting in Microsoft Dynamics GP

The Year-end close procedures for Analytical Accounting in Microsoft Dynamics GP The Year-end close procedures for Analytical Accounting in Microsoft Dynamics GP INTRODUCTION There is not a 'separate' year-end close procedure for Analytical Accounting. Balance Brought Forward (BBF)

More information

database abstraction layer database abstraction layers in PHP Lukas Smith BackendMedia smith@backendmedia.com

database abstraction layer database abstraction layers in PHP Lukas Smith BackendMedia smith@backendmedia.com Lukas Smith database abstraction layers in PHP BackendMedia 1 Overview Introduction Motivation PDO extension PEAR::MDB2 Client API SQL syntax SQL concepts Result sets Error handling High level features

More information

University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology

University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology University of Glasgow - Programme Structure Summary C1G5-5100 MSc Bioinformatics, Polyomics and Systems Biology Programme Structure - the MSc outcome will require 180 credits total (full-time only) - 60

More information

Geodatabase Programming with SQL

Geodatabase Programming with SQL DevSummit DC February 11, 2015 Washington, DC Geodatabase Programming with SQL Craig Gillgrass Assumptions Basic knowledge of SQL and relational databases Basic knowledge of the Geodatabase We ll hold

More information

Data Warehouse: Introduction

Data Warehouse: Introduction Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of Base and Mining Group of base and data mining group,

More information

Databases and BigData

Databases and BigData Eduardo Cunha de Almeida eduardo.almeida@uni.lu Outline of the course Introduction Database Systems (E. Almeida) Distributed Hash Tables and P2P (C. Cassagnes) NewSQL (D. Kim and J. Meira) NoSQL (D. Kim)

More information

HALOGEN. Technical Design Specification. Version 2.0

HALOGEN. Technical Design Specification. Version 2.0 HALOGEN Technical Design Specification Version 2.0 10th August 2010 1 Document Revision History Date Author Revision Description 27/7/09 D Carter, Mark Widdowson, Stuart Poulton, Lex Comber 1.1 First draft

More information

Driver for JDBC Implementation Guide

Driver for JDBC Implementation Guide www.novell.com/documentation Driver for JDBC Implementation Guide Identity Manager 4.0.2 January 2014 Legal Notices Novell, Inc. makes no representations or warranties with respect to the contents or use

More information

SIZE & ESTIMATION OF DATA WAREHOUSE SYSTEMS

SIZE & ESTIMATION OF DATA WAREHOUSE SYSTEMS SIZE & ESTIMATION OF DATA WAREHOUSE SYSTEMS Luca Santillo (luca.santillo@gmail.com) Abstract Data Warehouse Systems are a special context for the application of functional software metrics. The use of

More information

Ahsay Backup Software. Whitepaper Ahsay Backup Agent

Ahsay Backup Software. Whitepaper Ahsay Backup Agent Version 6 Oct 2011 Table of Content 1 Introduction...3 2 Ahsay Backup Agents...4 2.1 Microsoft Exchange...4 2.2 Microsoft SQL Server...4 2.3 Lotus Domino/s...5 2.4 Oracle Database...6 2.5 MySQL Database...7

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. Insert Information Protection Policy Classification from Slide 8 1 Copyright 2011, Oracle and/or its affiliates. All rights The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated

More information

Relational Data Analysis I

Relational Data Analysis I Relational Data Analysis I Relational Data Analysis Prepares Business data for representation using the relational model The relational model is implemented in a number of popular database systems Access

More information

Introduction to Database Systems. Chapter 1 Introduction. Chapter 1 Introduction

Introduction to Database Systems. Chapter 1 Introduction. Chapter 1 Introduction Introduction to Database Systems Winter term 2013/2014 Melanie Herschel melanie.herschel@lri.fr Université Paris Sud, LRI 1 Chapter 1 Introduction After completing this chapter, you should be able to:

More information

Whitepaper. Data Warehouse/BI Testing Offering. Published on: January 2010 Author: Sena Periasamy

Whitepaper. Data Warehouse/BI Testing Offering. Published on: January 2010 Author: Sena Periasamy Published on: January 2010 Author: Sena Periasamy Hexaware Technologies. All rights reserved. Table of Contents 1. 2. Data Warehouse - Typical pain points 3. Hexaware Solution 4. DWH Testing Why is it

More information

Recap. Lecture 2. Protein conformation. Proteins. 8 types of protein function 10/21/10. Proteins.. > 50% dry weight of a cell

Recap. Lecture 2. Protein conformation. Proteins. 8 types of protein function 10/21/10. Proteins.. > 50% dry weight of a cell Lecture 2 Protein conformation ecap Proteins.. > 50% dry weight of a cell ell s building blocks and molecular tools. More important than genes A large variety of functions http://www.tcd.ie/biochemistry/courses/jf_lectures.php

More information

EPA's National Emission Inventory Criteria Data System Functions for Streamlining Data Processing and QA

EPA's National Emission Inventory Criteria Data System Functions for Streamlining Data Processing and QA ABSTRACT EPA's National Emission Inventory Criteria Data System Functions for Streamlining Data Processing and QA Steven Boone Director of Information Technology and Donna McKenzie Database Analyst E.

More information

Whitepaper FailSafeSolutions Backup Agent

Whitepaper FailSafeSolutions Backup Agent Version 6 Oct 20122 Table of Content 1 Introduction... 3 2 FailSafeSolutions Backup Agents... 4 2.1 Microsoft Exchange... 4 2.2 Microsoft SQL Server... 5 2.3 Lotus Domino/s... 6 2.4 Oracle Database...

More information

Guide for Bioinformatics Project Module 3

Guide for Bioinformatics Project Module 3 Structure- Based Evidence and Multiple Sequence Alignment In this module we will revisit some topics we started to look at while performing our BLAST search and looking at the CDD database in the first

More information

HOW TO CONNECT TO CAC DATABASE SERVER USING MySQL

HOW TO CONNECT TO CAC DATABASE SERVER USING MySQL HOW TO CONNECT TO CAC DATABASE SERVER USING MySQL 22 mai 2013 In some projects, it is quite interesting to access to the CAC database for 3rd party apps integration purposes or just to check out the information

More information

Physical Database Design Process. Physical Database Design Process. Major Inputs to Physical Database. Components of Physical Database Design

Physical Database Design Process. Physical Database Design Process. Major Inputs to Physical Database. Components of Physical Database Design Physical Database Design Process Physical Database Design Process The last stage of the database design process. A process of mapping the logical database structure developed in previous stages into internal

More information

Commercial Database Software Development- A review.

Commercial Database Software Development- A review. Commercial Database Software Development- A review. A database software has wide applications. A database software is used in almost all the organizations. Over 15 years many tools have been developed

More information

How to resolve Root Certificate Expiry Issue for Enterprise Manager - Database Control (10.2.0.4)

How to resolve Root Certificate Expiry Issue for Enterprise Manager - Database Control (10.2.0.4) PURPOSE 14 th June, 2011 This paper demonstrates how to resolve the Oracle Enterprise Manager Database Control configuration errors in Oracle Database versions 10.2.0.4 or 10.2.0.5, arising due to the

More information

dependent independent claims

dependent independent claims Mechanics of claim drafting Karin Pramberger Belgrade 16/17 Nov 2006 1 dependent independent claims 1. all essential features in independent claims with at least 1 novelty conferring feature 2. fall back

More information

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH

IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria

More information

Structure Tools and Visualization

Structure Tools and Visualization Structure Tools and Visualization Gary Van Domselaar University of Alberta gary.vandomselaar@ualberta.ca Slides Adapted from Michel Dumontier, Blueprint Initiative 1 Visualization & Communication Visualization

More information

Introduction to NoSQL Databases and MapReduce. Tore Risch Information Technology Uppsala University 2014-05-12

Introduction to NoSQL Databases and MapReduce. Tore Risch Information Technology Uppsala University 2014-05-12 Introduction to NoSQL Databases and MapReduce Tore Risch Information Technology Uppsala University 2014-05-12 What is a NoSQL Database? 1. A key/value store Basic index manager, no complete query language

More information

Data Migration Service An Overview

Data Migration Service An Overview Metalogic Systems Pvt Ltd J 1/1, Block EP & GP, Sector V, Salt Lake Electronic Complex, Calcutta 700091 Phones: +91 33 2357-8991 to 8994 Fax: +91 33 2357-8989 Metalogic Systems: Data Migration Services

More information

The Advantages of Enterprise Historians vs. Relational Databases

The Advantages of Enterprise Historians vs. Relational Databases GE Intelligent Platforms The Advantages of Enterprise Historians vs. Relational Databases Comparing Two Approaches for Data Collection and Optimized Process Operations The Advantages of Enterprise Historians

More information

Quiz! Database Indexes. Index. Quiz! Disc and main memory. Quiz! How costly is this operation (naive solution)?

Quiz! Database Indexes. Index. Quiz! Disc and main memory. Quiz! How costly is this operation (naive solution)? Database Indexes How costly is this operation (naive solution)? course per weekday hour room TDA356 2 VR Monday 13:15 TDA356 2 VR Thursday 08:00 TDA356 4 HB1 Tuesday 08:00 TDA356 4 HB1 Friday 13:15 TIN090

More information

7. Databases and Database Management Systems

7. Databases and Database Management Systems 7. Databases and Database Management Systems 7.1 What is a File? A file is a collection of data or information that has a name, called the Filename. There are many different types of files: Data files

More information

ebusiness Web Hosting Alternatives Self hosting Internet Service Provider (ISP) hosting Commerce Service Provider (CSP) hosting

ebusiness Web Hosting Alternatives Self hosting Internet Service Provider (ISP) hosting Commerce Service Provider (CSP) hosting ebusiness Web Hosting and E-Business Software Web Hosting Alternatives Self hosting Internet Service Provider (ISP) hosting Commerce Service Provider (CSP) hosting Shared hosting Dedicated hosting 1 Considerations

More information

Data Analysis. Management Information Systems 13

Data Analysis. Management Information Systems 13 Data Analysis Management Information Systems 13 166137-01+02 Management Information Systems Spring 2014 Sync Sangwon Lee, Ph. D D. of Information & Electronic Commerce WONKWANG University Prof. Dr. SSL

More information

Frequently Asked Questions. Secure Log Manager. Last Update: 6/25/01. 6303 Barfield Road Atlanta, GA 30328 Tel: 404.236.2600 Fax: 404.236.

Frequently Asked Questions. Secure Log Manager. Last Update: 6/25/01. 6303 Barfield Road Atlanta, GA 30328 Tel: 404.236.2600 Fax: 404.236. Frequently Asked Questions Secure Log Manager Last Update: 6/25/01 6303 Barfield Road Atlanta, GA 30328 Tel: 404.236.2600 Fax: 404.236.2626 1. What is Secure Log Manager? Secure Log Manager (SLM) is designed

More information

Agile Development and Schema Evolution. Schema Evolution Agile Development & Databases Case Study on the Herschel Project

Agile Development and Schema Evolution. Schema Evolution Agile Development & Databases Case Study on the Herschel Project Agile Development and Schema Evolution Schema Evolution Agile Development & Databases Case Study on the Herschel Project Schema Evolution Adapt the logical/physical data model of a database Reasons Performance

More information

Chapter 3 - Data Replication and Materialized Integration

Chapter 3 - Data Replication and Materialized Integration Prof. Dr.-Ing. Stefan Deßloch AG Heterogene Informationssysteme Geb. 36, Raum 329 Tel. 0631/205 3275 dessloch@informatik.uni-kl.de Chapter 3 - Data Replication and Materialized Integration Motivation Replication:

More information

Visual FoxPro Accessing MySQL. presented to The Atlanta FoxPro Users Group June 19 th, 2007

Visual FoxPro Accessing MySQL. presented to The Atlanta FoxPro Users Group June 19 th, 2007 Visual FoxPro Accessing MySQL presented to The Atlanta FoxPro Users Group June 19 th, 2007 Comparing MySQL with the VFP DBC, SQL Server, PostgreSQL VFP DBC Pros: 1. Very fast 2. Free DBC multiuser access

More information

The Advantages of Plant-wide Historians vs. Relational Databases

The Advantages of Plant-wide Historians vs. Relational Databases GE Intelligent Platforms The Advantages of Plant-wide Historians vs. Relational Databases Comparing Two Approaches for Data Collection and Optimized Process Operations The Advantages of Plant-wide Historians

More information

ICAB4136B Use structured query language to create database structures and manipulate data

ICAB4136B Use structured query language to create database structures and manipulate data ICAB4136B Use structured query language to create database structures and manipulate data Release: 1 ICAB4136B Use structured query language to create database structures and manipulate data Modification

More information

QUESTION: 1 Which two features differ between the probabilistic and deterministic approach to Suspect Duplication Processing? (Choose two.

QUESTION: 1 Which two features differ between the probabilistic and deterministic approach to Suspect Duplication Processing? (Choose two. 1 IBM - 000-420 IBM InfoSphere MDM Server v9.0 QUESTION: 1 Which two features differ between the probabilistic and deterministic approach to Suspect Duplication Processing? (Choose two.) A. candidate list

More information

Cyber Security Challenge Australia 2014

Cyber Security Challenge Australia 2014 Cyber Security Challenge Australia 2014 www.cyberchallenge.com.au CySCA2014 Web Penetration Testing Writeup Background: Pentest the web server that is hosted in the environment at www.fortcerts.cysca Web

More information

ebusiness Web Hosting Alternatives Considerations Self hosting Internet Service Provider (ISP) hosting

ebusiness Web Hosting Alternatives Considerations Self hosting Internet Service Provider (ISP) hosting ebusiness Web Hosting and E-Business Software Web Hosting Alternatives Self hosting Internet Service Provider (ISP) hosting Commerce Service Provider (CSP) hosting Shared hosting Dedicated hosting Considerations

More information

Data. Data and database. Aniel Nieves-González. Fall 2015

Data. Data and database. Aniel Nieves-González. Fall 2015 Data and database Aniel Nieves-González Fall 2015 Data I In the context of information systems, the following definitions are important: 1 Data refers simply to raw facts, i.e., facts obtained by measuring

More information

SYLLABUS FOR CS340: INTRODUCTION TO DATABASES

SYLLABUS FOR CS340: INTRODUCTION TO DATABASES SYLLABUS FOR CS340: INTRODUCTION TO DATABASES COURSE INFORMATION CREDITS: 4 INSTRUCTOR: AMENEH SARBAZIAZAD EMAIL: SARBAZIA@ONID.ORST.EDU COURSE CATALOG DESCRIPTION: Design and implementation of relational

More information

Tivoli Storage Manager Explained

Tivoli Storage Manager Explained IBM Software Group Dave Cannon IBM Tivoli Storage Management Development Oxford University TSM Symposium 2003 Presentation Objectives Explain TSM behavior for selected operations Describe design goals

More information

Oracle BIEE and SOA Integration : Step by Step. Mark Rittman, Director, Rittman Mead Consulting

Oracle BIEE and SOA Integration : Step by Step. Mark Rittman, Director, Rittman Mead Consulting Oracle BIEE and SOA Integration : Step by Step Mark Rittman, Director, Rittman Mead Consulting What is Service-Orientated Architecture? Not a technology or product, more a design approach Exposure of product

More information

The Ramachandran Map of More Than. 6,500 Perfect Polypeptide Chains

The Ramachandran Map of More Than. 6,500 Perfect Polypeptide Chains The Ramachandran Map of More Than 1 6,500 Perfect Polypeptide Chains Zoltán Szabadka, Rafael Ördög, Vince Grolmusz manuscript received March 19, 2007 Z. Szabadka, R. Ördög and V. Grolmusz are with Eötvös

More information

project collects data from national events, both natural and manmade, to be stored and evaluated by

project collects data from national events, both natural and manmade, to be stored and evaluated by Joseph Sebastian CS 2994 Spring 2014 Undergraduate Research Final Paper GOALS The goal of my research was to assist the Integrated Digital Event Archive (IDEAL) team in transferring their Twitter data

More information

Database Setup. Coding, Understanding, & Executing the SQL Database Creation Script

Database Setup. Coding, Understanding, & Executing the SQL Database Creation Script Overview @author R.L. Martinez, Ph.D. We need a database to perform the data-related work in the subsequent tutorials. Begin by creating the falconnight database in phpmyadmin using the SQL script below.

More information

Data Visualization and Business Insights Using SAS Visual Analytics. University of Connecticut Dan Sokol Thulasi Kumar 1/13/2015

Data Visualization and Business Insights Using SAS Visual Analytics. University of Connecticut Dan Sokol Thulasi Kumar 1/13/2015 Data Visualization and Business Insights Using SAS Visual Analytics University of Connecticut Dan Sokol Thulasi Kumar 1/13/2015 New Mission The primary mission of the Office of Institutional Research and

More information

INFO 1400. Koffka Khan. Tutorial 6

INFO 1400. Koffka Khan. Tutorial 6 INFO 1400 Koffka Khan Tutorial 6 Running Case Assignment: Improving Decision Making: Redesigning the Customer Database Dirt Bikes U.S.A. sells primarily through its distributors. It maintains a small customer

More information

An Oracle White Paper June 2013. Migrating Applications and Databases with Oracle Database 12c

An Oracle White Paper June 2013. Migrating Applications and Databases with Oracle Database 12c An Oracle White Paper June 2013 Migrating Applications and Databases with Oracle Database 12c Disclaimer The following is intended to outline our general product direction. It is intended for information

More information

Data Ownership and Enterprise Data Management: Implementing a Data Management Strategy (Part 3)

Data Ownership and Enterprise Data Management: Implementing a Data Management Strategy (Part 3) A DataFlux White Paper Prepared by: Mike Ferguson Data Ownership and Enterprise Data Management: Implementing a Data Management Strategy (Part 3) Leader in Data Quality and Data Integration www.flux.com

More information

SQL Server to Oracle A Database Migration Roadmap

SQL Server to Oracle A Database Migration Roadmap SQL Server to Oracle A Database Migration Roadmap Louis Shih Superior Court of California County of Sacramento Oracle OpenWorld 2010 San Francisco, California Agenda Introduction Institutional Background

More information

Physical Data Organization

Physical Data Organization Physical Data Organization Database design using logical model of the database - appropriate level for users to focus on - user independence from implementation details Performance - other major factor

More information

Week 1 Part 1: An Introduction to Database Systems. Databases and DBMSs. Why Use a DBMS? Why Study Databases??

Week 1 Part 1: An Introduction to Database Systems. Databases and DBMSs. Why Use a DBMS? Why Study Databases?? Week 1 Part 1: An Introduction to Database Systems Databases and DBMSs Data Models and Data Independence Concurrency Control and Database Transactions Structure of a DBMS DBMS Languages Databases and DBMSs

More information

Error Tolerant Searching of Uninterpreted MS/MS Data

Error Tolerant Searching of Uninterpreted MS/MS Data Error Tolerant Searching of Uninterpreted MS/MS Data 1 In any search of a large LC-MS/MS dataset 2 There are always a number of spectra which get poor scores, or even no match at all. 3 Sometimes, this

More information