HALOGEN. Technical Design Specification. Version 2.0

Size: px
Start display at page:

Download "HALOGEN. Technical Design Specification. Version 2.0"

Transcription

1 HALOGEN Technical Design Specification Version th August

2 Document Revision History Date Author Revision Description 27/7/09 D Carter, Mark Widdowson, Stuart Poulton, Lex Comber 1.1 First draft following review by IT Services team. 10/8/09 D Carter 2.0 Issued to Project Board. Approvals Date Name Title Embedded signature/ 2

3 1. Purpose of Document The purpose of this paper is to document the design of the IT related infrastructure that will be used to support users of the HALOGEN system. This paper begins by describing the general approach and principles that will be applied to the implementation phase of the project. Subsequent sections describe a generic model that can be used to support the design of database management systems and their related infrastructure; reviews some of the key data quality issues associated with the pilot datasets and finally identifies specific software tools and hardware that will be deployed by the University of Leicester for HALOGEN. 2. Implementation Approach and Design Principles 2.1 Implementation Approach The high level user and service requirements have been agreed with the research users and documented in the HALOGEN Service Specification ( The implementation approach for the delivering a system to support these requirements will be to develop two prototypes during the implementation phase of the pilot project. These are briefly described below. Prototype 1 The first prototype will be structured around using the ArcGIS software package to directly process and analyse the pilot datasets. As part of this phase of work procedures to clean and format the raw data in the pilot datasets will be developed, and ArcGIS specific processing and visualisation requirements will be defined and delivered. This will give the research team a capability to begin to analyse new sources of research data and further formulate their thinking on the research questions they wish to address and the techniques that will be of most value to them. There is no lead time for providing the infrastructure to support the development of the initial prototype. The data storage requirements can be met through use of currently spare capacity on the existing Research Storage infrastructure managed by IT Services; the University already has the required ArcGIS software licenses and standard desktop workstations can support the processing associated with the visualisation and analyses of data. The specialist ArcGIS resource required to undertake the majority of the work for this phase has been secured and is available from mid-august. Prototype 2 The second prototype will build on the initial system and introduce a central database which will be used as a repository to hold research data in a structured and standardised way. This will provide a database management system that will support the longer term aims of HALOGEN researchers to increase the number and size of the external data sources input into the system and to be able to interrogate and extract information for analyses using a variety of different analyses tools not just ArcGIS. There is no implied dependency between the two prototypes and, resource permitting, the development of the second prototype can progress in parallel to the first. 3

4 There will be a need to purchase dedicated server equipment to host the central database. In order not to introduce any delay it is proposed to use existing server capacity to provide a virtual server development environment for the early phases of the implementation project. At this stage it is proposed to use database and data management tools that are available free to the research community. The resource required to complete the development of this prototype will be predominantly IT Services staff with input from specialist GIS resource around database design issues as necessary. 2.2 Design Principles The following high level design principles have been assumed when making decisions on the database and infrastructure for HALOGEN. As this is currently a non-business critical service for the University and, from a research perspective, a pilot project, the level of resilience and redundancy built into the design is low. In order to limit cost the team have selected database and software products which are either free to use for academic staff of where the University already has site license agreements in place. The working assumption has been that the pilot will be successful and that additional funds will be found to extend the service. The infrastructure design proposed can be scaled up at extra cost to support both the storage and processing capacity growth projections in the Service Specification. Any scripts to clean, load or extract data will be developed in such a way that they can be operated without the aid of specialist IT staff by research users. IT Services will be responsible for the support and maintenance of all infrastructure components deployed to support the project. 4

5 3. Database Management Model The generic database management model on the following page identifies that for database requirements like those of HALOGEN there are five stages or processes that need to be considered. These are described briefly below. Source Data - This involves obtaining and storing the source data that will be used as input to the research database. Included within this are procedures to refresh data as new versions become available. Extract, Transform and Load (ETL) This involves procedures to extract from the source data those items which are of relevance to the research group. In many cases the raw data may need to be cleaned and formatted in some way in order to improve its integrity or make it compatible with other source data sets. For example, it may be useful to introduce common codes for regions or check that the entries in specific fields are complete and accurate. Finally, there needs to be a way of loading the required data into some type of database. Data Storage This involves procedures to store and manage the core data required by researchers. For example, procedures and policies will need to be defined for governing the backup, recovery and access of data. Data Analyses This involves applying various tools and techniques to the data to produce information. In some cases it may involve extracting selected information for analyses using tools like SAS, SPSS or R. Visualisation This is one specific form of data analyses. From a HALOGEN perspective the geographical visualisation of data is the key user requirement and so it is appropriate to consider it as a separate stage. The diagram also identifies some of the many software tools that could be used to support different stages of the model. Each institution needs to choose those products which best suit their requirements and the skill sets of those involved in any project. 5

6 6

7 Database Management Software A key design decision is which database software to use. From a University of Leicester perspective three database products were considered as suitable for a database with the potential capacity and complexity requirements of HALOGEN. Initial discussions with researchers suggest requirements for spatial querying, data mining and full text search/manipulation. The three options considered were Oracle, SQL Server and MySQL. The key issues relating to each are summarised below. Oracle Oracle 9i (or higher) Enterprise Edition required with both spatial querying and data mining options added. Licensing costs for Oracle are high and combined with the limited skills and resources available for Oracle make this option prohibitive for the University of Leicester given the alternatives available. SQL Server SQL Server 2008 Standard Edition is required. Only the 2008 version of the software offers the spatial data querying option. Data mining and full text indexing options are also included. The Research Computer Services team do not currently have the relevant skills to support this software and, whilst the relevant skills do exist elsewhere in IT Services, these staff are already over committed to project work. MySQL MySQL offers support for both spatial data querying and full text indexing. Data mining is offered by 3 rd party open source provider Pentaho. There are open source versions of the software which are available at no costs and deemed suitable for the pilot phase of the project. The relevant skills and resource is available in the Research Computer Services team. Based on the above the team has chosen MySQL as the database software for HALOGEN. The software tools that we currently plan to use to support each stage of the database management model are summarised below for both of the prototypes. Prototype 1 Source Data & Extract, Transform, Load Excel, Access, PERL scripts Data Storage ArcGIS is based on a database and has utilities to assist with metadata management. Data Analyses ArcGIS has the ability to conduct many types of standard statistical analyses and to output files in a format that can be used by packages like SPSS or R. Visualisation - ArcGIS Prototype 2 Source Data & Extract, Transform, Load Excel, Access, PERL and/or Python scripts Data Storage MySQL (for some types of data, e.g. large images, it may be more appropriate to store these outside of the database via a Filesystem and store relevant metadata in MySQL) Data Analyses Database extracts/queries will provide output for use in tools favoured by researchers. For example, SPSS, R and ArcGIS. Visualisation ArcGIS 7

8 4. Data Quality Issues with Source Data Sets As part of the design phase of the project an initial review of the quality of the pilot datasets was performed by GIS specialists in Geography. The intention was to highlight any major issues or problems that could impact the use of ArcGIS. The findings are presented below for the Portable Antiquities Scheme (PAS) which now includes the Fitzwilliam coins database, the Key to English Place Names (KEN) and University of Leicester Genetics data. All the sample datasets have some form of georeference PAS: easting and northing fields although 10% are empty Genetics: population field which generally describes the county of the record English place names: gridref fields with components of a an OS grid PAS dataset is generally fine with good georeferencing, except for the 10% of the records with empty easting and northing fields. Suggested improvements: - fill in any blank values (e.g. with a 0, etc) - assess if all the fields are needed in the file to be imported into ArcGIS Genetics data is in a strange format and will take extensive manipulation to get in a format to be imported into ArcGIS and there will still be problems. This data can be mapped as it stands with some manipulations although only to a county area centroid. Suggested improvements: - Have only one header row - Export the file as a csv from the dbf export > text file myfile.csv access should do the rest, including management of blank fields. If this cannot be done some simple grep / replace could be run on the data. - Assess how to manage not typed this is a numeric field and having non-numeric characters entered will confuse ArcGIS - Typed descriptions need to be avoided so develop and use a key code to ensure consistency - Introduce separate fields for different things: e.g. Father from Australia vs. Father from Australia Mother from New Zealand. The latter needs to in two separate columns for ease of analyses. There is a requirement to add an alternative geographical location to each record in the Genetic database. This will reference the centre of gravity of the surname of the male whose Y chromosome was analysed. The geographical coding for these locations will be systematic and should not cause problems, but some surnames (e.g. common ones like 'Smith') will lack a location. KEN data English place names provided in.mdb format with some.csv files. This was generally of reasonable quality, not too much formatting will be needed to get the data into ArcGIS. However the spatial referencing is coarse and some manipulation is needed to concatenate three fields in order to provide a 1km OS grid reference (e.g. SK, 75, 48 SK7548) which could be linked to an easting and northing via data 8

9 downloaded from Edina.ac.uk. This data can be mapped after the concatenated grid reference elements have been linked to OS1km grid data to give the easting and northing values. Suggested improvements: - Provide data in csv format - Provide complete 6 digit referencing not 4 or 5 digit - Consider more detailed referencing - Concatenate current referencing if possible, although this is not essential (Note that only 191 / 313 records have this georeference i.e. ~40% do not) - Link to gazetteer to improve data quality. This can be done but it will need other data to identify the correct gazetteer record. - Avoid control characters in the free text (etymology) as this can create problems when the data is read into ArcGIS There are some general guidelines for importing data to ArcGIS which will be followed to simplify data preparation activities. These are: -.csv or excel (.xls) format -.mdb format but not.accdb - no non-alphanumeric characters in field names, although underscores are ok - do not start the field names with a number - field names should be <12 characters in length - have one type of data in each field e.g. characters or numbers (characters can include numbers but they cannot be manipulated numerically) - do not have too many (i.e. 100s) spurious fields some of the data may not load - do not have sparse fields ArcGIS will make a guess (often a bad one) at what type of field it is (e.g. text, etc) so complete with 0, N/A, etc - avoid labelling identifiers / referents as ID myid is better - use a standard set of descriptors in any field if possible rather than free text. If a predefinable set of free text descriptions are to be entered and this is being done manually then I suggest a code is used to avoid typos. In summary, the most problematic data set is the Genetics data. Considerable processing will be needed to clean and standardise the data prior to analyses. As this data is internal to the University of Leicester the team are comfortable that this is achievable. Overall the team believe that all the issues identified to date can be overcome as part of the pilot project. 5. Infrastructure & Implementation Resource To support the pilot phase of the project IT Services will need to purchase, install and configure appropriate database server and storage. These items are outlined below. Database Server The intention is to buy a production server with the following specification: 12 core x86_64 64GB System Ram Redundant PSU RAID 1 System disks RAID 0+1 DB Storage 800GB 3yr On-site next business day support 9

10 The operating system will be Linux The cost is estimated as 15,000 inclusive of VAT. Storage To support data storage and back up requirements it is proposed to buy additional capacity on the Research Data Storage system managed by IT Services. HALOGEN will be provided with 2 Terabytes of usable primary storage capacity and a further 2 Terabytes of backup storage capacity. Data will be backed up to a different data centre than that which hosts the database server on a daily basis. The recovery of data will be managed by the Research Computer Services team on receipt of a support request. The cost of this will be 2,400 inclusive of VAT. Software The free version of the open source MySQL database software will be used for the pilot project. The use of ArcGIS by HALOGEN is covered by a site license and is therefore free. It is possible that additional software products may be identified later in the project. User Access Users will be able to access the infrastructure from Windows (CIFS), Linux (NFS) and MAC (CFS) workstations. 10

11 Resource Estimates To develop the two prototypes will require the following resource. Geography ArcGIS expertise, data clean up and visualisation: 40 days Research Computer Services infrastructure build and database configuration: days Database Management Services database design and consultancy: days 11

The cross-disciplinary Roots of the British collaboration between scholars in humanities and

The cross-disciplinary Roots of the British collaboration between scholars in humanities and HALOGEN RESEARCH DATA MANAGEMENT BENEFITS CASE STUDY 1. BACKGROUND The cross-disciplinary Roots of the British collaboration between scholars in humanities and genetics at the University of Leicester (Wellcome

More information

Basics on Geodatabases

Basics on Geodatabases Basics on Geodatabases 1 GIS Data Management 2 File and Folder System A storage system which uses the default file and folder structure found in operating systems. Uses the non-db formats we mentioned

More information

Spanish examples IPR: Up to Date & Zones

Spanish examples IPR: Up to Date & Zones Spanish examples IPR: Up to Date & Zones 1 Spanish IPR examples We have chosen the open source option: Python Why Python? Easy to learn and understand for not it people and code can be freely used and

More information

Sisense. Product Highlights. www.sisense.com

Sisense. Product Highlights. www.sisense.com Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze

More information

TheraDoc v4.6.1 Hardware and Software Requirements

TheraDoc v4.6.1 Hardware and Software Requirements TheraDoc v4.6.1 Hardware and Software Requirements In preparation for the release of TheraDoc v4.6.1, we have the following important information to communicate. Client Workstation Browser Requirements

More information

Personal Geodatabase 101

Personal Geodatabase 101 Personal Geodatabase 101 There are a variety of file formats that can be used within the ArcGIS software. Two file formats, the shape file and the personal geodatabase were designed to hold geographic

More information

GIS III: GIS Analysis Module 2a: Introduction to Network Analyst

GIS III: GIS Analysis Module 2a: Introduction to Network Analyst *** Files needed for exercise: nc_cty.shp; target_stores_infousa.dbf; streets.sdc (provided by street map usa); NC_tracts_2000sf1.shp Goals: To learn how to use the Network analyst tools to perform network

More information

ICAS4108B Complete database back-up and recovery

ICAS4108B Complete database back-up and recovery ICAS4108B Complete database back-up and recovery Release: 1 ICAS4108B Complete database back-up and recovery Modification History Not Applicable Unit Descriptor Unit descriptor This unit defines the competency

More information

DATABASE ANALYST I DATABASE ANALYST II

DATABASE ANALYST I DATABASE ANALYST II CITY OF ROSEVILLE DATABASE ANALYST I DATABASE ANALYST II DEFINITION To perform professional level work in designing, installing, managing, updating, and securing a variety of database systems, including

More information

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros David Moses January 2014 Paper on Cloud Computing I Background on Tools and Technologies in Amazon Web Services (AWS) In this paper I will highlight the technologies from the AWS cloud which enable you

More information

Datzilla. Error Reporting and Tracking for NOAA Data

Datzilla. Error Reporting and Tracking for NOAA Data Datzilla Error Reporting and Tracking for NOAA Data Overview Datzilla is a web based system used to report and track errors in NOAA datasets and Data Products. It is an adaptation of the software bug tracking

More information

Sawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices

Sawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices Sawmill Log Analyzer Best Practices!! Page 1 of 6 Sawmill Log Analyzer Best Practices! Sawmill Log Analyzer Best Practices!! Page 2 of 6 This document describes best practices for the Sawmill universal

More information

DBMS / Business Intelligence, SQL Server

DBMS / Business Intelligence, SQL Server DBMS / Business Intelligence, SQL Server Orsys, with 30 years of experience, is providing high quality, independant State of the Art seminars and hands-on courses corresponding to the needs of IT professionals.

More information

G-Cloud Service Definition Cadcorp Web Map Layers

G-Cloud Service Definition Cadcorp Web Map Layers G-Cloud Service Definition Cadcorp Web Map Layers Ref: RM1557/iii Government Procurement Services G-Cloud III Contents 1. Introduction... 3 2. Service Overview... 4 2.1 Web Map Layers... 4 2.2 Initial

More information

College of Engineering, Technology, and Computer Science

College of Engineering, Technology, and Computer Science College of Engineering, Technology, and Computer Science Design and Implementation of Cloud-based Data Warehousing In partial fulfillment of the requirements for the Degree of Master of Science in Technology

More information

Data Management Implementation Plan

Data Management Implementation Plan Appendix 8.H Data Management Implementation Plan Prepared by Vikram Vyas CRESP-Amchitka Data Management Component 1. INTRODUCTION... 2 1.1. OBJECTIVES AND SCOPE... 2 2. DATA REPORTING CONVENTIONS... 2

More information

How To Write A File System On A Microsoft Office 2.2.2 (Windows) (Windows 2.3) (For Windows 2) (Minorode) (Orchestra) (Powerpoint) (Xls) (

How To Write A File System On A Microsoft Office 2.2.2 (Windows) (Windows 2.3) (For Windows 2) (Minorode) (Orchestra) (Powerpoint) (Xls) ( Remark Office OMR 8 Supported File Formats User s Guide Addendum Remark Products Group 301 Lindenwood Drive, Suite 100 Malvern, PA 19355-1772 USA www.gravic.com Disclaimer The information contained in

More information

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:

Chapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem: Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Chapter 6 Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:

More information

User Guide. Analytics Desktop Document Number: 09619414

User Guide. Analytics Desktop Document Number: 09619414 User Guide Analytics Desktop Document Number: 09619414 CONTENTS Guide Overview Description of this guide... ix What s new in this guide...x 1. Getting Started with Analytics Desktop Introduction... 1

More information

Release 8.2 Hardware and Software Requirements. PowerSchool Student Information System

Release 8.2 Hardware and Software Requirements. PowerSchool Student Information System Release 8.2 Hardware and Software Requirements PowerSchool Student Information System Released January 2015 Document Owner: Documentation Services This edition applies to Release 8.2 of the PowerSchool

More information

AWS Schema Conversion Tool. User Guide Version 1.0

AWS Schema Conversion Tool. User Guide Version 1.0 AWS Schema Conversion Tool User Guide AWS Schema Conversion Tool: User Guide Copyright 2016 Amazon Web Services, Inc. and/or its affiliates. All rights reserved. Amazon's trademarks and trade dress may

More information

Database as a Service (DaaS) Version 1.02

Database as a Service (DaaS) Version 1.02 Database as a Service (DaaS) Version 1.02 Table of Contents Database as a Service (DaaS) Overview... 4 Database as a Service (DaaS) Benefit... 4 Feature Description... 4 Database Types / Supported Versions...

More information

TARRANT COUNTY PURCHASING DEPARTMENT

TARRANT COUNTY PURCHASING DEPARTMENT JACK BEACHAM, C.P.M., A.P.P. PURCHASING AGENT TARRANT COUNTY PURCHASING DEPARTMENT AUGUST 4, 2010 RFP NO. 2010-103 ROB COX, C.P.M., A.P.P. ASSISTANT PURCHASING AGENT RFP FOR DIGITAL ASSET MANAGEMENT SYSTEM

More information

Seamless Web Data Entry for SAS Applications D.J. Penix, Pinnacle Solutions, Indianapolis, IN

Seamless Web Data Entry for SAS Applications D.J. Penix, Pinnacle Solutions, Indianapolis, IN Seamless Web Data Entry for SAS Applications D.J. Penix, Pinnacle Solutions, Indianapolis, IN ABSTRACT For organizations that need to implement a robust data entry solution, options are somewhat limited

More information

Gladstone Health & Leisure Technical Services

Gladstone Health & Leisure Technical Services Gladstone Health & Leisure Technical Services Plus2 Environment Server Recommendations Commercial in Confidence Database Server Specifications Database server specifications are based on sizes in use on

More information

Resources You can find more resources for Sync & Save at our support site: http://www.doforms.com/support.

Resources You can find more resources for Sync & Save at our support site: http://www.doforms.com/support. Sync & Save Introduction Sync & Save allows you to connect the DoForms service (www.doforms.com) with your accounting or management software. If your system can import a comma delimited, tab delimited

More information

Job Description. Working Hours Standard 35 hours per week Normally working Mon Fri 9am to 5pm with additional hours as required

Job Description. Working Hours Standard 35 hours per week Normally working Mon Fri 9am to 5pm with additional hours as required Job Description Job Title Oracle Support Technical Developer Function IT Services Applications Reporting to Applications Manager Direct Reports None Working Hours Standard 35 hours per week Normally working

More information

Stellar Phoenix. SQL Database Repair 6.0. Installation Guide

Stellar Phoenix. SQL Database Repair 6.0. Installation Guide Stellar Phoenix SQL Database Repair 6.0 Installation Guide Overview Stellar Phoenix SQL Database Repair software is an easy to use application designed to repair corrupt or damaged Microsoft SQL Server

More information

From Firebird 1.5 to 2.5

From Firebird 1.5 to 2.5 From Firebird 1.5 to 2.5 How to migrate 75Gb database, with 564 tables, 5000+ stored procedures, 813 triggers, which is working 24x7, with ~400 users in less than 4 months About IBSurgeon Tools and consulting

More information

CLIDATA In Ostrava 18/06/2013

CLIDATA In Ostrava 18/06/2013 CLIDATA In Ostrava 18/06/2013 Content Introduction...3 Structure of Clidata Application...4 Clidata Database...5 Rich Java Client...6 Oracle Discoverer...7 Web Client...8 Map Viewer...9 Clidata GIS and

More information

Real-time Data Replication

Real-time Data Replication Real-time Data Replication from Oracle to other databases using DataCurrents WHITEPAPER Contents Data Replication Concepts... 2 Real time Data Replication... 3 Heterogeneous Data Replication... 4 Different

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Foundations of Business Intelligence: Databases and Information Management Wienand Omta Fabiano Dalpiaz 1 drs. ing. Wienand Omta Learning Objectives Describe how the problems of managing data resources

More information

Using Database Metadata and its Semantics to Generate Automatic and Dynamic Web Entry Forms

Using Database Metadata and its Semantics to Generate Automatic and Dynamic Web Entry Forms Using Database Metadata and its Semantics to Generate Automatic and Dynamic Web Entry Forms Mohammed M. Elsheh and Mick J. Ridley Abstract Automatic and dynamic generation of Web applications is the future

More information

How To Use Gfi Mailarchiver On A Pc Or Macbook With Gfi Email From A Windows 7.5 (Windows 7) On A Microsoft Mail Server On A Gfi Server On An Ipod Or Gfi.Org (

How To Use Gfi Mailarchiver On A Pc Or Macbook With Gfi Email From A Windows 7.5 (Windows 7) On A Microsoft Mail Server On A Gfi Server On An Ipod Or Gfi.Org ( GFI MailArchiver for Exchange 4 Manual By GFI Software http://www.gfi.com Email: info@gfi.com Information in this document is subject to change without notice. Companies, names, and data used in examples

More information

High Availability Databases based on Oracle 10g RAC on Linux

High Availability Databases based on Oracle 10g RAC on Linux High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN, June 2006 Luca Canali, CERN IT Outline Goals Architecture of an HA DB Service Deployment at the CERN Physics Database

More information

SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box)

SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box) SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box) SQL Server White Paper Published: January 2012 Applies to: SQL Server 2012 Summary: This paper explains the different ways in which databases

More information

Oracle Database 11g Comparison Chart

Oracle Database 11g Comparison Chart Key Feature Summary Express 10g Standard One Standard Enterprise Maximum 1 CPU 2 Sockets 4 Sockets No Limit RAM 1GB OS Max OS Max OS Max Database Size 4GB No Limit No Limit No Limit Windows Linux Unix

More information

Foundations of Business Intelligence: Databases and Information Management

Foundations of Business Intelligence: Databases and Information Management Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 See Markers-ORDER-DB Logically Related Tables Relational Approach: Physically Related Tables: The Relationship Screen

More information

Description of Application

Description of Application Description of Application Operating Organization: Coeur d Alene Tribe, Plummer, Idaho Community of Interest: U.S. Indian tribes and their governments; rural governments OS and software requirements: Microsoft

More information

GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project

GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project Description of Application The Spatial Data Warehouse project at the USGS/EROS distributes services and data in support of The National

More information

Transitioning from a Physical to Virtual Production Environment. Ryan Miller Middle Tennessee Electric Membership Corp

Transitioning from a Physical to Virtual Production Environment. Ryan Miller Middle Tennessee Electric Membership Corp Transitioning from a Physical to Virtual Production Environment Ryan Miller Middle Tennessee Electric Membership Corp Introduction MTEMC Distribute electricity to ~200,000 residential & business members

More information

Geodatabase Programming with SQL

Geodatabase Programming with SQL DevSummit DC February 11, 2015 Washington, DC Geodatabase Programming with SQL Craig Gillgrass Assumptions Basic knowledge of SQL and relational databases Basic knowledge of the Geodatabase We ll hold

More information

MicroStrategy Desktop

MicroStrategy Desktop MicroStrategy Desktop Quick Start Guide MicroStrategy Desktop is designed to enable business professionals like you to explore data, simply and without needing direct support from IT. 1 Import data from

More information

Setting up a database for multi-user access

Setting up a database for multi-user access BioNumerics Tutorial: Setting up a database for multi-user access 1 Aims There are several situations in which multiple users in the same local area network (LAN) may wish to work with a shared BioNumerics

More information

Oracle Net Service Name Resolution

Oracle Net Service Name Resolution Oracle Net Service Name Resolution Getting Rid of the TNSNAMES.ORA File! Simon Pane Oracle Database Principal Consultant March 19, 2015 ABOUT ME Working with the Oracle DB since version 6 Oracle Certified

More information

Parallels Plesk Automation

Parallels Plesk Automation Parallels Plesk Automation Contents Compact Configuration: Linux Shared Hosting 3 Compact Configuration: Mixed Linux and Windows Shared Hosting 4 Medium Size Configuration: Mixed Linux and Windows Shared

More information

Relational Databases for the Business Analyst

Relational Databases for the Business Analyst Relational Databases for the Business Analyst Mark Kurtz Sr. Systems Consulting Quest Software, Inc. mark.kurtz@quest.com 2010 Quest Software, Inc. ALL RIGHTS RESERVED Agenda The RDBMS and its role in

More information

DBMS / Business Intelligence, Business Intelligence / DBMS

DBMS / Business Intelligence, Business Intelligence / DBMS DBMS / Business Intelligence, Business Intelligence / DBMS Orsys, with 30 years of experience, is providing high quality, independant State of the Art seminars and hands-on courses corresponding to the

More information

ICADBS402A Complete database backup and restore

ICADBS402A Complete database backup and restore ICADBS402A Complete database backup and restore Release: 1 ICADBS402A Complete database backup and restore Modification History Version ICADBS402A Comments This version first released with ICA11 Information

More information

itop: the open-source ITSM solution

itop: the open-source ITSM solution itop: the open-source ITSM solution itop is a multi-client web portal designed for service providers and businesses. Simple and easy to use, it allows all configuration items and their relationships to

More information

Data Management Nuts and Bolts. Don Johnson Scientific Computing and Visualization

Data Management Nuts and Bolts. Don Johnson Scientific Computing and Visualization Data Management Nuts and Bolts Don Johnson Scientific Computing and Visualization Overview Data Management Storing data Sharing data Moving data Tracking data (Client responsibility) Where can you obtain

More information

Geospatial Server Performance Colin Bertram UK User Group Meeting 23-Sep-2014

Geospatial Server Performance Colin Bertram UK User Group Meeting 23-Sep-2014 Geospatial Server Performance Colin Bertram UK User Group Meeting 23-Sep-2014 Topics Auditing a Geospatial Server Solution Web Server Strategies and Configuration Database Server Strategy and Configuration

More information

Very Large Enterprise Network, Deployment, 25000+ Users

Very Large Enterprise Network, Deployment, 25000+ Users Very Large Enterprise Network, Deployment, 25000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering

More information

POPI Cloud Backups. Overview. The Challenges

POPI Cloud Backups. Overview. The Challenges POPI Cloud Backups Overview POPI Online Backup offers a simple, Secure, rapid deployment solution that is customizable and cost effective to implement a Backup strategy throughout your organisation. POPI

More information

W16 Data Mining Workshop

W16 Data Mining Workshop W16 Data Mining Workshop Brantley Synco, Director, Internal Audit and Compliance, Baptist Health System Jim Donaldson, Director of Compliance/Privacy and Security Officer, Baptist Health Care Corporation

More information

University of Arkansas Libraries ArcGIS Desktop Tutorial. Section 4: Preparing Data for Analysis

University of Arkansas Libraries ArcGIS Desktop Tutorial. Section 4: Preparing Data for Analysis : Preparing Data for Analysis When a user acquires a particular data set of interest, it is rarely in the exact form that is needed during analysis. This tutorial describes how to change the data to make

More information

5-Bay Raid Sub-System Smart Removable 3.5" SATA Multiple Bay Data Storage Device User's Manual

5-Bay Raid Sub-System Smart Removable 3.5 SATA Multiple Bay Data Storage Device User's Manual 5-Bay Raid Sub-System Smart Removable 3.5" SATA Multiple Bay Data Storage Device User's Manual www.vipower.com Table of Contents 1. How the SteelVine (VPMP-75511R/VPMA-75511R) Operates... 1 1-1 SteelVine

More information

How To Monitor A Document Management System On A Web Browser On A Linux Computer (For Free)

How To Monitor A Document Management System On A Web Browser On A Linux Computer (For Free) Whitepaper: Monitoring the Interwoven Worksite DMS By S. Bondy Abstract The document management system is crucial to the daily operation of the modern law firm. Yet frequently, the document management

More information

Leveraging Public Clouds to Ensure Data Availability

Leveraging Public Clouds to Ensure Data Availability Systems Engineering at MITRE CLOUD COMPUTING SERIES Leveraging Public Clouds to Ensure Data Availability Toby Cabot Lawrence Pizette The MITRE Corporation manages federally funded research and development

More information

Installing The SysAidTM Server Locally

Installing The SysAidTM Server Locally Installing The SysAidTM Server Locally Document Updated: 17 October 2010 Introduction SysAid is available in two editions: a fully on-demand ASP solution and an installed, in-house solution for your server.

More information

Chapter 24: Creating Reports and Extracting Data

Chapter 24: Creating Reports and Extracting Data Chapter 24: Creating Reports and Extracting Data SEER*DMS includes an integrated reporting and extract module to create pre-defined system reports and extracts. Ad hoc listings and extracts can be generated

More information

Inge Os Sales Consulting Manager Oracle Norway

Inge Os Sales Consulting Manager Oracle Norway Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database

More information

Data Lab Operations Concepts

Data Lab Operations Concepts Data Lab Operations Concepts 1 Introduction This talk will provide an overview of Data Lab components to be implemented Core infrastructure User applications Science Capabilities User Interfaces The scope

More information

Enterprise GIS Solutions to GIS Data Dissemination

Enterprise GIS Solutions to GIS Data Dissemination Enterprise GIS Solutions to GIS Data Dissemination ESRI International User Conference July 13 17, 2009 Wendy M. Turner Senior GIS Engineer & Program Manager Freedom Consulting Group, LLC Building the Enterprise

More information

Easy Data Centralization with Webster. User Guide

Easy Data Centralization with Webster. User Guide Easy Data Centralization with Webster User Guide CONTENTS 3-4 1 Introducing Webster Webster - An Introduction 5-14 2 Installing & Configuring Webster Installing the System Configuring Webster 15-18 3 Managing

More information

Oracle SQL Developer Migration

Oracle SQL Developer Migration An Oracle White Paper May 2010 Oracle SQL Developer Migration Overview... 3 Introduction... 3 Oracle SQL Developer: Architecture and Supported Platforms... 3 Supported Platforms... 4 Supported Databases...

More information

Groundwater Chemistry

Groundwater Chemistry Mapping and Modeling Groundwater Chemistry By importing Excel spreadsheets into ArcGIS 9.2 By Mike Price, Entrada/San Juan, Inc. In ArcGIS 9.2, Microsoft Excel spreadsheet data can be imported and used

More information

Spectrum Technology Platform. Version 9.0. Spectrum Spatial Administration Guide

Spectrum Technology Platform. Version 9.0. Spectrum Spatial Administration Guide Spectrum Technology Platform Version 9.0 Spectrum Spatial Administration Guide Contents Chapter 1: Introduction...7 Welcome and Overview...8 Chapter 2: Configuring Your System...9 Changing the Default

More information

WalesHER GAT User Manual

WalesHER GAT User Manual WalesHER GAT User Manual Automated Data Upload (User Levels 0 & 1) This document provides guidance on uploading datasets in csv format to WalesHER using the Load Data tool and migration SQLs in phpmyadmin

More information

Hardware and Software Requirements for Installing California.pro

Hardware and Software Requirements for Installing California.pro Hardware and Requirements for Installing California.pro This document lists the hardware and software requirements to install and run California.pro. Workstation with SQL Server Recommended: 64-Bit Windows

More information

Web Hosting. E-Mail Hosting. Cloud File Hosting. The Genio Group (214) 732-7411 info@thegeniogroup.com www.thegeniogroup.com

Web Hosting. E-Mail Hosting. Cloud File Hosting. The Genio Group (214) 732-7411 info@thegeniogroup.com www.thegeniogroup.com Web Hosting E-Mail Hosting Cloud File Hosting Genio Hosting Servers All of Genio s Hosting Servers run on Apple hardware running Mac OS X Server. Mac OS X Server leverages the computing power of 64-bit

More information

Troubleshooting SQL Server Enterprise Geodatabase Performance Issues. Matthew Ziebarth and Ben Lin

Troubleshooting SQL Server Enterprise Geodatabase Performance Issues. Matthew Ziebarth and Ben Lin Troubleshooting SQL Server Enterprise Geodatabase Performance Issues Matthew Ziebarth and Ben Lin Troubleshooting SQL Server Enterprise Geodatabase Performance Issues AGENDA General configuration recommendations

More information

Abstract. Introduction

Abstract. Introduction Data Replication and Data Sharing Integrating Heterogeneous Spatial Databases Mark Stoakes and Katherine Irwin Professional Services, Safe Software Inc. Abstract Spatial data warehouses are becoming more

More information

Enterprise Network Deployment, 10,000 25,000 Users

Enterprise Network Deployment, 10,000 25,000 Users Enterprise Network Deployment, 10,000 25,000 Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering

More information

HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - aniketb1@umbc.edu. CMSC 601 - Presentation

HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - aniketb1@umbc.edu. CMSC 601 - Presentation HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM Aniket Bochare - aniketb1@umbc.edu CMSC 601 - Presentation Date-04/25/2011 AGENDA Introduction and Background Framework Heterogeneous

More information

Advanced analytics at your hands

Advanced analytics at your hands 2.3 Advanced analytics at your hands Neural Designer is the most powerful predictive analytics software. It uses innovative neural networks techniques to provide data scientists with results in a way previously

More information

EMC DOCUMENTUM xplore 1.1 DISASTER RECOVERY USING EMC NETWORKER

EMC DOCUMENTUM xplore 1.1 DISASTER RECOVERY USING EMC NETWORKER White Paper EMC DOCUMENTUM xplore 1.1 DISASTER RECOVERY USING EMC NETWORKER Abstract The objective of this white paper is to describe the architecture of and procedure for configuring EMC Documentum xplore

More information

BENEFITS OF AUTOMATING DATA WAREHOUSING

BENEFITS OF AUTOMATING DATA WAREHOUSING BENEFITS OF AUTOMATING DATA WAREHOUSING Introduction...2 The Process...2 The Problem...2 The Solution...2 Benefits...2 Background...3 Automating the Data Warehouse with UC4 Workload Automation Suite...3

More information

We look beyond IT. Cloud Offerings

We look beyond IT. Cloud Offerings Cloud Offerings cstor Cloud Offerings As today s fast-moving businesses deal with increasing demands for IT services and decreasing IT budgets, the onset of cloud-ready solutions has provided a forward-thinking

More information

Hardware and Software Requirements for Installing California.pro

Hardware and Software Requirements for Installing California.pro Hardware and Requirements for Installing California.pro This document lists the hardware and software requirements to install and run California.pro. Workstation with SQL Server type: Pentium IV-compatible

More information

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7 DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7 UNDER THE GUIDANCE Dr. N.P. DHAVALE, DGM, INFINET Department SUBMITTED TO INSTITUTE FOR DEVELOPMENT AND RESEARCH IN BANKING TECHNOLOGY

More information

Toolbox 4.3. System Requirements

Toolbox 4.3. System Requirements Toolbox 4.3 February 2015 Contents Introduction... 2 Requirements for Toolbox 4.3... 3 Toolbox Applications... 3 Installing on Multiple Computers... 3 Concurrent Loading, Importing, Processing... 4 Client...

More information

Unicenter Patch Management

Unicenter Patch Management Unicenter Patch Management Best Practices for Managing Security Updates R11 This documentation (the Documentation ) and related computer software program (the Software ) (hereinafter collectively referred

More information

Data Warehouse Center Administration Guide

Data Warehouse Center Administration Guide IBM DB2 Universal Database Data Warehouse Center Administration Guide Version 8 SC27-1123-00 IBM DB2 Universal Database Data Warehouse Center Administration Guide Version 8 SC27-1123-00 Before using this

More information

How to recover a failed Storage Spaces

How to recover a failed Storage Spaces www.storage-spaces-recovery.com How to recover a failed Storage Spaces ReclaiMe Storage Spaces Recovery User Manual 2013 www.storage-spaces-recovery.com Contents Overview... 4 Storage Spaces concepts and

More information

ER/Studio 8.0 New Features Guide

ER/Studio 8.0 New Features Guide ER/Studio 8.0 New Features Guide Copyright 1994-2008 Embarcadero Technologies, Inc. Embarcadero Technologies, Inc. 100 California Street, 12th Floor San Francisco, CA 94111 U.S.A. All rights reserved.

More information

What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11

What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11 What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11 W elcome to North Face Software s software. With this software, you can accomplish

More information

EnterpriseLink Benefits

EnterpriseLink Benefits EnterpriseLink Benefits GGY AXIS 5001 Yonge Street Suite 1300 Toronto, ON M2N 6P6 Phone: 416-250-6777 Toll free: 1-877-GGY-AXIS Fax: 416-250-6776 Email: axis@ggy.com Web: www.ggy.com Table of Contents

More information

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data

Alexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are

More information

MIS0094 SuccessNet Upgrade 1.1.0.5 Guide. Individuals Implementing SuccessNet. To assist in the upgrade of the software to version 1.1.0.

MIS0094 SuccessNet Upgrade 1.1.0.5 Guide. Individuals Implementing SuccessNet. To assist in the upgrade of the software to version 1.1.0. Title: Audience: Maintained by: Purpose: MIS0094 SuccessNet Upgrade 1.1.0.5 Guide Individuals Implementing SuccessNet. Technical Services To assist in the upgrade of the software to version 1.1.0.5 Requirements

More information

hmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau

hmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau Powered by Vertica Solution Series in conjunction with: hmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau The cost of healthcare in the US continues to escalate. Consumers, employers,

More information

GUIDE TO REDCAP EXPORTED FILES

GUIDE TO REDCAP EXPORTED FILES GUIDE TO REDCAP EXPORTED FILES UNDERSTANDING DATA FORMATS AND LOADING DATA INTO ANALYSIS SOFTWARE INTRODUCTION At some point in time in the course of your REDCap project, you will need to export your data

More information

INFORMATION MANAGERS ROUNDTABLE SHELLEY COOKE, WHITNEY WEBER MONDAY APRIL 23, 2012 1:30 3:00 PM PST

INFORMATION MANAGERS ROUNDTABLE SHELLEY COOKE, WHITNEY WEBER MONDAY APRIL 23, 2012 1:30 3:00 PM PST INFORMATION MANAGERS ROUNDTABLE SHELLEY COOKE, WHITNEY WEBER MONDAY APRIL 23, 2012 1:30 3:00 PM PST I. Biotics 4 Session Topics Compatibility issues, other common challenges, support for Biotics 4 (Informational,

More information

Table of Contents SQL Server Option

Table of Contents SQL Server Option Table of Contents SQL Server Option STEP 1 Install BPMS 1 STEP 2a New Customers with SQL Server Database 2 STEP 2b Restore SQL DB Upsized by BPMS Support 6 STEP 2c - Run the "Check Dates" Utility 7 STEP

More information

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web

More information

MySQL for Beginners Ed 3

MySQL for Beginners Ed 3 Oracle University Contact Us: 1.800.529.0165 MySQL for Beginners Ed 3 Duration: 4 Days What you will learn The MySQL for Beginners course helps you learn about the world's most popular open source database.

More information

Solution Brief: Creating Avid Project Archives

Solution Brief: Creating Avid Project Archives Solution Brief: Creating Avid Project Archives Marquis Project Parking running on a XenData Archive Server provides Fast and Reliable Archiving to LTO or Sony Optical Disc Archive Cartridges Summary Avid

More information

Results CRM 2012 User Manual

Results CRM 2012 User Manual Results CRM 2012 User Manual A Guide to Using Results CRM Standard, Results CRM Plus, & Results CRM Business Suite Table of Contents Installation Instructions... 1 Single User & Evaluation Installation

More information