ANALYSIS OF ENTITY-ATTRIBUTE-VALUE MODEL APPLICATIONS IN FREELY AVAILABLE DATABASE MANAGEMENT SYSTEMS FOR DNA MICROARRAY DATA PROCESSING 1.
|
|
|
- Clare O’Neal’
- 9 years ago
- Views:
Transcription
1 JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 20/2012, ISSN entity-attribute-value model, relational database management system, DNA microarray Tomasz WALLER 1, Damian ZAPART 1, Magdalena TKACZ 2, Zygmunt WRÓBEL 1 ANALYSIS OF ENTITY-ATTRIBUTE-VALUE MODEL APPLICATIONS IN FREELY AVAILABLE DATABASE MANAGEMENT SYSTEMS FOR DNA MICROARRAY DATA PROCESSING Large volumes of data are generated during DNA microarrays experiments. Database management systems (DBMS) are increasingly applied to these data, providing optimum processing and management from multiple microarray experiments. In this study, freely accessible DBMS software versions were compared (Microsoft SQL Server 2008 Express Edition, Oracle Database 10g Express Edition, DB2 Express-C 9.7.2, MySQL 5.1, and PostgreSQL 9.0). We examined them in the context of possible Entity-Attribute-Value (EAV) application as an optimal organization method for microarray data. It was confirmed in the comparative analysis of component data processing methods, consistent with the EAV model, that efficient methods for microarray data analysis are available in Microsoft SQL Server 2008 Express Edition and PostgreSQL 9.0 systems. Also, DNA microarray data processing was confirmed to be more efficient with Microsoft SQL Server 2008 Express Edition as compared with PostgreSQL 9.0. The EAV method was also shown to be suitable for use with open-source versions of DBMS software as an optimum storage model for DNA microarray data. In terms of data processing methods and performance, the Microsoft SQL Server 2008 Express Edition proved to be the best among compared database systems. 1. BACKGROUND DNA microarray is a commonly used technology to measure the transcription activity (expression) for tens of thousands of genes. Microarrays are widely used in genotyping, medical diagnosis, and pharmacy [8], and they can be constructed by the direct synthesis of oligonucleotide or cdna spotting. Microarray data were developed and specified based on the properties of data obtained with the oligonucleotide method developed by Affymetrix (HG-U133A microarray model). DNA microarray data are standard numerical data obtained by processing the microarray images used in the experiment. Each image reflects a gene activity profile for a genetic material tested with a specific microarray. Processed image data are converted into a table of numerical data, where each row corresponds to a specific gene and each column to a tested sample. Each column presents a gene expression profile for a specific sample, and each row presents an expression level for each gene for all analyzed genetic materials. The table size depends on the microarray model and the number of tested samples. The type of microarray determines the number of rows, which are constant for the applied model (for example, for HG-U133A by Affymetrix, is 22283), and the number of used microarrays corresponds to the number of columns. The data are the basis for all tests and analyses. A well-designed microarray database can provide valuable information on gene expression levels [1, 2, 4, 5]. The migration of data from a microarray experiment to a relational database is a simple process, generally involving the design of a suitable table within the DBMS. However if data from several experiments are to be stored, a problem will occur with in determining a suitable number of columns for the relational database. For each experiment, a different number of microarrays can be used, and the general number of columns cannot be determined in advance. Admittedly, one can assume that the experiment will include, for example 256 samples. However this fixed allocation of samples, is not a perfect solution, since it limits the possibility of using the database in experiments where the number of 1 Institute of Computer Science, Division of Biomedical Computer Systems, University of Silesia, Katowice, PL 2 Institute of Computer Science, Division of Information Systems, University of Silesia, Katowice, PL
2 samples exceeds the number assumed for the database design. Additionally, for experiments with smaller numbers of microarrays, the table will include unused columns and NULL values, which represent a superfluous load at the data processing stage. Fig. 1. An example of microarray data. Table section with experimental data - activity of 22,283 genes in 10 samples. 10 HG-U133A microarrays by Affymetrix were used. The data were pre-processed. Author's own data from DNA Microarray Laboratory of the Department of Molecular Biology of the Medical University of Silesia. An interesting alternative to a relational model is the EAV model promoted by the authors [2, 7]. Similar to the relational model, the data are also stored in tables. However, with the EAV model, the tables include 3 columns (entity ID, attribute name and attribute value) [6, 7]. With specific reference to microarray data, EAV model tables [15, 16, 17] will include the following columns: sample/microarray ID, gene ID and expression value (Fig. 2). 60 Fig. 2. Entity-relationship diagram. Simple database design based on EAV model. Author's own data. EAV model application in database design allows storage of data from several microarray experiments in a single table, where the data are diversified by the microarray type and the number of samples. An advantage of this approach is that it does not limit the number of attributes, and it is not burdened with NULL values.
3 At the same time, the EAV data structure leads to an undesirable increase in the number of rows in the table [6]. Numerous rows are a significant impediment in creating complex SQL queries, since applied microarray data analysis methods (e.g., Significance Analysis of Microarrays [14] or cluster analysis [14]) require a standard data layout, as illustrated in Fig. 1. Therefore, a main criterion for usefulness of EAV model in a specific database system is the availability of a cross table feature. With this feature, data columns and rows can be selected and rotated to change the structure of data, creating a typical microarray experiment table, having the structure compatible with the EAV model. 2. METHODS Two comparative analyses were conducted to investigate whether the EAV model can be used with an open-source database system for efficient DNA microarray data storage. The first analysis compared the most popular database systems in terms of accessibility of cross table development functionality. Systems offering no such feature were excluded from further analysis. Another criterion subject in conduced analysis concerns the processing of performance of DNA microarray data (consistent with the EAV model in selected DMBS systems). These analyses were conducted on a PC (CPU: Intel Core 2 Duo CPU, RAM 4 GB) with Windows 7 Professional 32 bit. 3. RESULTS 3.1. COMPARATIVE ANALYSIS OF AVAILABLE CROSS TABLE FEATURE IN DBMS Several database management systems are currently available. The most popular versions, based on Google search engine ratings (see Figure 3) were included in the study. The comparative analysis covers only the free DBMS versions. Fig. 3. Google search engine ratings of the DBMS. DBMS marked green were included in the comparative analysis. Author's own data based on [ The following systems were included: Microsoft SQL Server 2008 Express Edition, Oracle Database 10g Express Edition, DB2 Express-C 9.7.2, MySQL 5.1 and PostgreSQL 9.0. The analysis demonstrates that a dedicated and convenient method of creating cross tables is only provided by Microsoft SQL Server 2008 Express Edition and PostgreSQL 9.0 (see Table 1). 61
4 Table 1. Benchmarks. List of selected database management systems regarding cross table feature [9, 10, 11, 12, 13]. Author's own data. Database management system Microsoft SQL Server 2008 Express Edition Oracle Database 10g Express Edition DB2 Express-C MySQL 5.1 PostgreSQL 9.0 Cross table feature YES, PIVOT relational operator NO NO NO YES, CROSSTAB function Thus, Oracle Database 10g Express Edition, DB2 Express-C 9.7.2, and MySQL 5.1 systems were eliminated from further study, since no cross table feature is available COMPARATIVE ANALYSIS OF AVAILABLE CROSS TABLE FEATURE IN DBMS The DBMS usability analysis included a comparison of the time required to create a cross table based on the EAV table (see Fig 1) for 25, 50, 100 and 200 DNA microarrays. Fig. 4 provides the test results. Significantly better results were obtained using Microsoft SQL Server 2008 Express Edition, where the cross table creating method is based on an embedded PIVOT relational operator. Fig. 4. Comparison of DBMS efficiency. Comparison of efficiency of cross table creation depending on the data quantity. Author's own data from DNA Microarray Laboratory of the Department of Molecular Biology of the Medical University of Silesia. In the case of PostgreSQL 9.0, the cross table is generated with a standard function written in C programming language, which is much less effective. 4. CONCLUSIONS The EAV method was shown to be eligible for use in freely available versions of DBMS software as an optimum storage model for DNA microarray data. In view of the data processing methods offered by developers (and in particular, the availability of a simple method for the creation of cross tables), Microsoft SQL Server 2008 Express Edition and PostgreSQL 9.0 are recommended as the preferred methods. Microsoft SQL Server 2008 Express Edition proved to be the most efficient in the tests; however, it cannot be considered as the best choice due to its limited maximum database capacity of 4 GB (10 GB in the R2 version). 62
5 5. ACKNOWLEDGEMENTS We thank the Department of Molecular Biology of the Medical University of Silesia for the DNA microarray data. We also thank the reviewers for their critical comments which helped us improve our article. Tomasz Waller and Damian Zapart received a scholarship under the project "DoktoRIS - Scholarship Program for Innovative Silesia" co-financed by the European Union under the European Social Fund. BIBLIOGRAPHY [1] TSOI L.C. ZHENG W.J.: A Method of Microarray Data Storage Using Array Data Type, 2007, Comput Biol Chem, Vol. 31, No. 2, pp [2] MASYS D.: Database designs for microarray data, 2001, The Pharmacogenomics Journal, Vol. 1, pp [3] SPLENDIANI A., BRANDIZI M., EVEN G., OTTAVIO B., Pavelka N., PELIZZOLA M., MAYHAUS M., FOTI M., MAURI G., RICCIARDI-CASTAGNOLI P.: The Genopolis Microarray Database. BMC Bioinformatics 2007, 8(Suppl 1):S21. [4] KILLION P., SHERLOCK G., IYER V.: The Longhorn Array Database (LAD): An Freely-accessible, MIAME compliant implementation of the Stanford Microarray Database (SMD), 2003, BMC Bioinformatics, pp [5] SHAH S. P., HUANG Y., XU T., YUEN M., LING J., OUELLETTE F.: Atlas a data warehouse for integrative bioinformatics, 2005, BMC Bioinformatics, pp [6] GRAMACKI A., GRAMACKI J.: Modelowanie typu Entity-Attribute-Value w bazach danych, 2007, Pomiary Automatyka Kontrola, No. 5, pp [7] HONG-HAI D., TORALF K., ERHARD R.: Comparative Evaluation of Microarray-based Gene Expression Databases, BTW 2003, In Proc. 10. Fachtagung Datenbanksysteme für Business, Technologie und Web, Leipzig. [8] LENOIR T., GIANNELLA E.: The emergence and diffusion of DNA microarray technology, 2006, Journal of Biomedical Discovery and Collaboration, 1:1. [9] PostgreSQL Documentation - [10] Microsoft SQL Server 2008 Books Online (October 2009) - [11] MySQL 5.1 Reference Manual - [12] Oracle Database 10g Documentation Library - [13] DB2 database product documentation - [14] RUSSELL S., MEADOWS L., RUSSELL R.: Microarray Technology in Practice, 2009, Elsevier Inc. [15] THALLINGER G.G. et al., TAMEE: data management and analysis for tissue microarrays, 2007, BMC Bioinformatics, pp [16] JOHNSON S.B.: Generic data modeling for clinical repositories, 1996, J Am Med Inform Assoc, No. 3, pp [17] NADKARNI P.M. et al.: Organization of heterogeneous scientific data using the EAV/CR representation, 1999, J Am Med Inform Assoc, Vol. 6, pp
6 64 MEDICAL DATA CLASSIFICATION METHODS
Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM [email protected]
Lecture 11 Data storage and LIMS solutions Stéphane LE CROM [email protected] Various steps of a DNA microarray experiment Experimental steps Data analysis Experimental design set up Chips on catalog
5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2
Class Announcements TIM 50 - Business Information Systems Lecture 15 Database Assignment 2 posted Due Tuesday 5/26 UC Santa Cruz May 19, 2015 Database: Collection of related files containing records on
Supported Platforms and Software Requirements Effective on 7 May.2014. HULFT-DataMagic for Windows Ver.2.2.0
Supported Platforms and Software Requirements Effective on 7 May.2014 for Ver.2.2.0 for Ver.2 English Edition Code conversion option for English Edition Database connectivity option (DB2) for English Edition
Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey
Molecular Genetics: Challenges for Statistical Practice J.K. Lindsey 1. What is a Microarray? 2. Design Questions 3. Modelling Questions 4. Longitudinal Data 5. Conclusions 1. What is a microarray? A microarray
Comparative Performance Analysis of MySQL and SQL Server Relational Database Management Systems in Windows Environment
ISSN (Online) 2278-121 ISSN (Print) 2319-594 Vol. 4, Issue 3, March 215 Comparative Performance Analysis of and Relational Database Management Systems in Windows Environment Amlanjyoti Saikia 1, Sherin
Database Management System Choices. Introduction To Database Systems CSE 373 Spring 2013
Database Management System Choices Introduction To Database Systems CSE 373 Spring 2013 Outline Introduction PostgreSQL MySQL Microsoft SQL Server Choosing A DBMS NoSQL Introduction There a lot of options
Foundations of Business Intelligence: Databases and Information Management
Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 Copyright 2011 Pearson Education, Inc. Student Learning Objectives How does a relational database organize data,
TIM 50 - Business Information Systems
TIM 50 - Business Information Systems Lecture 15 UC Santa Cruz March 1, 2015 The Database Approach to Data Management Database: Collection of related files containing records on people, places, or things.
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - [email protected]. CMSC 601 - Presentation
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM Aniket Bochare - [email protected] CMSC 601 - Presentation Date-04/25/2011 AGENDA Introduction and Background Framework Heterogeneous
CREATING MINIMIZED DATA SETS BY USING HORIZONTAL AGGREGATIONS IN SQL FOR DATA MINING ANALYSIS
CREATING MINIMIZED DATA SETS BY USING HORIZONTAL AGGREGATIONS IN SQL FOR DATA MINING ANALYSIS Subbarao Jasti #1, Dr.D.Vasumathi *2 1 Student & Department of CS & JNTU, AP, India 2 Professor & Department
Data. Data and database. Aniel Nieves-González. Fall 2015
Data and database Aniel Nieves-González Fall 2015 Data I In the context of information systems, the following definitions are important: 1 Data refers simply to raw facts, i.e., facts obtained by measuring
Data Migration System in Heterogeneous Database Shinde Anita Vitthal 1, Thite Vaishali Baban 2, Roshni Warade 3, Krupali Chaudhari 4
Data Migration System in Heterogeneous Shinde Anita Vitthal 1, Thite Vaishali Baban 2, Roshni Warade 3, Krupali Chaudhari 4 Abstract:-With information becoming an increasingly valuable corporate asset,
Row Quantile Normalisation of Microarrays
Row Quantile Normalisation of Microarrays W. B. Langdon Departments of Mathematical Sciences and Biological Sciences University of Essex, CO4 3SQ Technical Report CES-484 ISSN: 1744-8050 23 June 2008 Abstract
Databases and Information Management
Databases and Information Management Reading: Laudon & Laudon chapter 5 Additional Reading: Brien & Marakas chapter 3-4 COMP 5131 1 Outline Database Approach to Data Management Database Management Systems
Foundations of Business Intelligence: Databases and Information Management
Foundations of Business Intelligence: Databases and Information Management Wienand Omta Fabiano Dalpiaz 1 drs. ing. Wienand Omta Learning Objectives Describe how the problems of managing data resources
Real-time Data Replication
Real-time Data Replication from Oracle to other databases using DataCurrents WHITEPAPER Contents Data Replication Concepts... 2 Real time Data Replication... 3 Heterogeneous Data Replication... 4 Different
Basics on Geodatabases
Basics on Geodatabases 1 GIS Data Management 2 File and Folder System A storage system which uses the default file and folder structure found in operating systems. Uses the non-db formats we mentioned
Chapter 6. Foundations of Business Intelligence: Databases and Information Management
Chapter 6 Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
Data Migration In Heterogeneous Databases (ETL)
Data Migration In Heterogeneous Databases (ETL) Tribhuvan Shweta M. 1, Rasane Pratiksha V. 2, Prof. M. R. Bendre 3 Abstract Development of economic systems presents rapid growth trend and requires establishing
Very Large Enterprise Network, Deployment, 25000+ Users
Very Large Enterprise Network, Deployment, 25000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering
Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
Foundations of Business Intelligence: Databases and Information Management
Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 See Markers-ORDER-DB Logically Related Tables Relational Approach: Physically Related Tables: The Relationship Screen
David M. Kroenke and David J. Auer Database Processing 11 th Edition Fundamentals, Design, and Implementation. Chapter Objectives
David M. Kroenke and David J. Auer Database Processing 11 th Edition Fundamentals, Design, and Implementation Chapter One: Introduction 1-1 Chapter Objectives To understand the nature and characteristics
Information Management
Information Management Dr Marilyn Rose McGee-Lennon [email protected] What is Information Management about Aim: to understand the ways in which databases contribute to the management of large amounts
Big Data Use Case. How Rackspace is using Private Cloud for Big Data. Bryan Thompson. May 8th, 2013
Big Data Use Case How Rackspace is using Private Cloud for Big Data Bryan Thompson May 8th, 2013 Our Big Data Problem Consolidate all monitoring data for reporting and analytical purposes. Every device
Distributed Bioinformatics Computing System for DNA Sequence Analysis
Global Journal of Computer Science and Technology: A Hardware & Computation Volume 14 Issue 1 Version 1.0 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc.
ABSTRACT 1. INTRODUCTION. Kamil Bajda-Pawlikowski [email protected]
Kamil Bajda-Pawlikowski [email protected] Querying RDF data stored in DBMS: SPARQL to SQL Conversion Yale University technical report #1409 ABSTRACT This paper discusses the design and implementation
Basic Analysis of Microarray Data
Basic Analysis of Microarray Data A User Guide and Tutorial Scott A. Ness, Ph.D. Co-Director, Keck-UNM Genomics Resource and Dept. of Molecular Genetics and Microbiology University of New Mexico HSC Tel.
Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner
24 Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner Rekha S. Nyaykhor M. Tech, Dept. Of CSE, Priyadarshini Bhagwati College of Engineering, Nagpur, India
Very Large Enterprise Network Deployment, 25,000+ Users
Very Large Enterprise Network Deployment, 25,000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering
DBMS / Business Intelligence, SQL Server
DBMS / Business Intelligence, SQL Server Orsys, with 30 years of experience, is providing high quality, independant State of the Art seminars and hands-on courses corresponding to the needs of IT professionals.
VBA and Databases (see Chapter 14 )
VBA and Databases (see Chapter 14 ) Kipp Martin February 29, 2012 Lecture Files Files for this module: retailersql.m retailer.accdb Outline 3 Motivation Modern Database Systems SQL Bringing Data Into MATLAB/Excel
Sisense. Product Highlights. www.sisense.com
Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze
Efficient Integration of Data Mining Techniques in Database Management Systems
Efficient Integration of Data Mining Techniques in Database Management Systems Fadila Bentayeb Jérôme Darmont Cédric Udréa ERIC, University of Lyon 2 5 avenue Pierre Mendès-France 69676 Bron Cedex France
Release: 1. ICADBS601A Build a data warehouse
Release: 1 ICADBS601A Build a data warehouse ICADBS601A Build a data warehouse Modification History Release Release 1 Comments This Unit first released with ICA11 Information and Communications Technology
How good can databases deal with Netflow data
How good can databases deal with Netflow data Bachelorarbeit Supervisor: bernhard [email protected] Inteligent Networks Group (INET) Ernesto Abarca Ortiz [email protected] OVERVIEW
Karl Lum Partner, LabKey Software [email protected]. Evolution of Connectivity in LabKey Server
Karl Lum Partner, LabKey Software [email protected] Evolution of Connectivity in LabKey Server Connecting Data to LabKey Server Lowering the barrier to connect scientific data to LabKey Server Increased
Analysis of gene expression data. Ulf Leser and Philippe Thomas
Analysis of gene expression data Ulf Leser and Philippe Thomas This Lecture Protein synthesis Microarray Idea Technologies Applications Problems Quality control Normalization Analysis next week! Ulf Leser:
Discovering SQL. Wiley Publishing, Inc. A HANDS-ON GUIDE FOR BEGINNERS. Alex Kriegel WILEY
Discovering SQL A HANDS-ON GUIDE FOR BEGINNERS Alex Kriegel WILEY Wiley Publishing, Inc. INTRODUCTION xxv CHAPTER 1: DROWNING IN DATA, DYING OF THIRST FOR KNOWLEDGE 1 Data Deluge and Informational Overload
Accessing Your Database with JMP 10 JMP Discovery Conference 2012 Brian Corcoran SAS Institute
Accessing Your Database with JMP 10 JMP Discovery Conference 2012 Brian Corcoran SAS Institute JMP provides a variety of mechanisms for interfacing to other products and getting data into JMP. The connection
Virtuoso and Database Scalability
Virtuoso and Database Scalability By Orri Erling Table of Contents Abstract Metrics Results Transaction Throughput Initializing 40 warehouses Serial Read Test Conditions Analysis Working Set Effect of
Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives
Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives Describe how the problems of managing data resources in a traditional file environment are solved
Chapter 9 Joining Data from Multiple Tables. Oracle 10g: SQL
Chapter 9 Joining Data from Multiple Tables Oracle 10g: SQL Objectives Identify a Cartesian join Create an equality join using the WHERE clause Create an equality join using the JOIN keyword Create a non-equality
Use of the Research Patient Data Registry at Partners Healthcare, Boston
Use of the Research Patient Data Registry at Partners Healthcare, Boston Advancing Clinical Research with Hospital Clinical Records Shawn Murphy MD, Ph.D. Massachusetts General Hospital Outline of presentation
Relational Databases for the Business Analyst
Relational Databases for the Business Analyst Mark Kurtz Sr. Systems Consulting Quest Software, Inc. [email protected] 2010 Quest Software, Inc. ALL RIGHTS RESERVED Agenda The RDBMS and its role in
A Web Based Software for Synonymous Codon Usage Indices
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 3 (2013), pp. 147-152 International Research Publications House http://www. irphouse.com /ijict.htm A Web
Contents RELATIONAL DATABASES
Preface xvii Chapter 1 Introduction 1.1 Database-System Applications 1 1.2 Purpose of Database Systems 3 1.3 View of Data 5 1.4 Database Languages 9 1.5 Relational Databases 11 1.6 Database Design 14 1.7
How many of you have checked out the web site on protein-dna interactions?
How many of you have checked out the web site on protein-dna interactions? Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail. Find and be ready to discuss
Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data
INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are
Preparing Data Sets for the Data Mining Analysis using the Most Efficient Horizontal Aggregation Method in SQL
Preparing Data Sets for the Data Mining Analysis using the Most Efficient Horizontal Aggregation Method in SQL Jasna S MTech Student TKM College of engineering Kollam Manu J Pillai Assistant Professor
Electronic Laboratory Notebook in the Graduate Level Laboratory Informatics Program
Electronic Laboratory Notebook in the Graduate Level Laboratory Informatics Program Mahesh Merchant, Paresh Sanghani*, Sonal Sanghani* * Department of Biochemistry and Molecular Biology Mahesh Merchant
Syllabus. HMI 7437: Data Warehousing and Data/Text Mining for Healthcare
Syllabus HMI 7437: Data Warehousing and Data/Text Mining for Healthcare 1. Instructor Illhoi Yoo, Ph.D Office: 404 Clark Hall Email: [email protected] Office hours: TBA Classroom: TBA Class hours: TBA
Using Web Services for Customised Data Entry
Using Web Services for Customised Data Entry A thesis submitted in partial fulfilment of the requirements for the Degree of Master of Applied Computing at Lincoln University by Yanbo Deng Lincoln University
CMS Query Suite. CS4440 Project Proposal. Chris Baker Michael Cook Soumo Gorai
CMS Query Suite CS4440 Project Proposal Chris Baker Michael Cook Soumo Gorai I) Motivation Relational databases are great places to efficiently store large amounts of information. However, information
32-bit and 64-bit BarTender. How to Select the Right Version for Your Needs WHITE PAPER
32-bit and 64-bit BarTender How to Select the Right Version for Your Needs WHITE PAPER Contents Overview 3 The Difference Between 32-bit and 64-bit 3 Find Out if Your Computer is Capable of Running 64-bit
How to Design and Create Your Own Custom Ext Rep
Combinatorial Block Designs 2009-04-15 Outline Project Intro External Representation Design Database System Deployment System Overview Conclusions 1. Since the project is a specific application in Combinatorial
edoc Document Generation Suite
e Doc Suite is a set of Microsoft Office add-ins for Word, Excel & PowerPoint that lets you use your data in MS Office with ease. Creating simple flat tables from data sources is possible in MS Office,
International Journal of Advanced Research in Computer Science and Software Engineering
Volume, Issue, March 201 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com An Efficient Approach
Database Optimizing Services
Database Systems Journal vol. I, no. 2/2010 55 Database Optimizing Services Adrian GHENCEA 1, Immo GIEGER 2 1 University Titu Maiorescu Bucharest, Romania 2 Bodenstedt-Wilhelmschule Peine, Deutschland
Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:
Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Chapter 6 Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova
Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel
On a Hadoop-based Analytics Service System
Int. J. Advance Soft Compu. Appl, Vol. 7, No. 1, March 2015 ISSN 2074-8523 On a Hadoop-based Analytics Service System Mikyoung Lee, Hanmin Jung, and Minhee Cho Korea Institute of Science and Technology
Pivot Charting in SharePoint with Nevron Chart for SharePoint
Pivot Charting in SharePoint Page 1 of 10 Pivot Charting in SharePoint with Nevron Chart for SharePoint The need for Pivot Charting in SharePoint... 1 Pivot Data Analysis... 2 Functional Division of Pivot
BUILDING OLAP TOOLS OVER LARGE DATABASES
BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,
A Generic Model for Querying Multiple Databases in a Distributed Environment Using JDBC and an Uniform Interface
A Generic Model for Querying Multiple Databases in a Distributed Environment Using JDBC and an Uniform Interface Barkha Bhagwant Keni, M.Madiajagan, B.Vijayakumar Abstract - This paper discusses about
Full-text Search in Intermediate Data Storage of FCART
Full-text Search in Intermediate Data Storage of FCART Alexey Neznanov, Andrey Parinov National Research University Higher School of Economics, 20 Myasnitskaya Ulitsa, Moscow, 101000, Russia [email protected],
Technical. Overview. ~ a ~ irods version 4.x
Technical Overview ~ a ~ irods version 4.x The integrated Ru e-oriented DATA System irods is open-source, data management software that lets users: access, manage, and share data across any type or number
Foundations of Business Intelligence: Databases and Information Management
Foundations of Business Intelligence: Databases and Information Management Content Problems of managing data resources in a traditional file environment Capabilities and value of a database management
This guide specifies the required and supported system elements for the application.
System Requirements Contents System Requirements... 2 Supported Operating Systems and Databases...2 Features with Additional Software Requirements... 2 Hardware Requirements... 4 Database Prerequisites...
Technology Foundations. Conan C. Albrecht, Ph.D.
Technology Foundations Conan C. Albrecht, Ph.D. Overview 9. Human Analysis Reports 8. Create Reports 6. Import Data 7. Primary Analysis Data Warehouse 5. Transfer Data as CSV, TSV, or XML 1. Extract Data
Intro to Databases. ACM Webmonkeys 2011
Intro to Databases ACM Webmonkeys 2011 Motivation Computer programs that deal with the real world often need to store a large amount of data. E.g.: Weather in US cities by month for the past 10 years List
AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE
ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS
XTM Web 2.0 Enterprise Architecture Hardware Implementation Guidelines. A.Zydroń 18 April 2009. Page 1 of 12
XTM Web 2.0 Enterprise Architecture Hardware Implementation Guidelines A.Zydroń 18 April 2009 Page 1 of 12 1. Introduction...3 2. XTM Database...4 3. JVM and Tomcat considerations...5 4. XTM Engine...5
inforouter V8.0 Server & Client Requirements
inforouter V8.0 Server & Client Requirements Please review this document thoroughly before proceeding with the installation of inforouter Version 8. This document describes the minimum and recommended
A Data Grid Model for Combining Teleradiology and PACS Operations
MEDICAL IMAGING TECHNOLOGY Vol.25 No.1 January 2007 7 特 集 論 文 / 遠 隔 医 療 と 画 像 通 信 A Data Grid Model for Combining Teleradiology and Operations H.K. HUANG *, Brent J. LIU *, Zheng ZHOU *, Jorge DOCUMET
PERFORMANCE EVALUATION OF DATABASE MANAGEMENT SYSTEMS BY THE ANALYSIS OF DBMS TIME AND CAPACITY
Vol.2, Issue.2, Mar-Apr 2012 pp-067-072 ISSN: 2249-6645 PERFORMANCE EVALUATION OF DATABASE MANAGEMENT SYSTEMS BY THE ANALYSIS OF DBMS TIME AND CAPACITY Aparna Kaladi 1 and Priya Ponnusamy 2 1 M.E Computer
Clinical Database Information System for Gbagada General Hospital
International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 2, Issue 9, September 2015, PP 29-37 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org
Exploring Microsoft Office Access 2007. Chapter 2: Relational Databases and Multi-Table Queries
Exploring Microsoft Office Access 2007 Chapter 2: Relational Databases and Multi-Table Queries 1 Objectives Design data Create tables Understand table relationships Share data with Excel Establish table
Software and Methods for the Analysis of Affymetrix GeneChip Data. Rafael A Irizarry Department of Biostatistics Johns Hopkins University
Software and Methods for the Analysis of Affymetrix GeneChip Data Rafael A Irizarry Department of Biostatistics Johns Hopkins University Outline Overview Bioconductor Project Examples 1: Gene Annotation
Enterprise Network Deployment, 10,000 25,000 Users
Enterprise Network Deployment, 10,000 25,000 Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering
Data Migration between Document-Oriented and Relational Databases
Abstract Current tools for data migration between documentoriented and relational databases have several disadvantages. We propose a new approach for data migration between documentoriented and relational
Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community
Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community Ntinos Krampis Asst. Professor J. Craig Venter Institute [email protected] http://www.jcvi.org/cms/about/bios/kkrampis/
CALIFORNIA STATE UNIVERSITY CHANNEL ISLANDS
CALIFORNIA STATE UNIVERSITY CHANNEL ISLANDS PROGRAM MODIFICATION DATE: 12.06.06 PROGRAM AREA: BIOLOGY AND BUSINESS AND ECONOMICS SEMESTER /YEAR FIRST EFFECTED: FALL 2007 Please use the following format
Delegation of the European Union to Bosnia and Herzegovina
EUROPEAN UNION Delegation of the European Union to Bosnia and Herzegovina Clarification No.2 to the Tender Dossier Consolidation and further development of the Judicial communication and Information System
GeoKettle: A powerful open source spatial ETL tool
GeoKettle: A powerful open source spatial ETL tool FOSS4G 2010 Dr. Thierry Badard, CTO Spatialytics inc. Quebec, Canada [email protected] Barcelona, Spain Sept 9th, 2010 What is GeoKettle? It is
Data Mining Solutions for the Business Environment
Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania [email protected] Over
David M. Kroenke and David J. Auer Database Processing 12 th Edition
David M. Kroenke and David J. Auer Database Processing 12 th Edition Fundamentals, Design, and Implementation ti Chapter One: Introduction Modified & translated by Walter Chen Dept. of Civil Engineering
