HALOGEN. Technical Design Specification. Version 2.0
|
|
- Jodie Osborne
- 8 years ago
- Views:
Transcription
1 HALOGEN Technical Design Specification Version th August
2 Document Revision History Date Author Revision Description 27/7/09 D Carter, Mark Widdowson, Stuart Poulton, Lex Comber 1.1 First draft following review by IT Services team. 10/8/09 D Carter 2.0 Issued to Project Board. Approvals Date Name Title Embedded signature/ 2
3 1. Purpose of Document The purpose of this paper is to document the design of the IT related infrastructure that will be used to support users of the HALOGEN system. This paper begins by describing the general approach and principles that will be applied to the implementation phase of the project. Subsequent sections describe a generic model that can be used to support the design of database management systems and their related infrastructure; reviews some of the key data quality issues associated with the pilot datasets and finally identifies specific software tools and hardware that will be deployed by the University of Leicester for HALOGEN. 2. Implementation Approach and Design Principles 2.1 Implementation Approach The high level user and service requirements have been agreed with the research users and documented in the HALOGEN Service Specification ( The implementation approach for the delivering a system to support these requirements will be to develop two prototypes during the implementation phase of the pilot project. These are briefly described below. Prototype 1 The first prototype will be structured around using the ArcGIS software package to directly process and analyse the pilot datasets. As part of this phase of work procedures to clean and format the raw data in the pilot datasets will be developed, and ArcGIS specific processing and visualisation requirements will be defined and delivered. This will give the research team a capability to begin to analyse new sources of research data and further formulate their thinking on the research questions they wish to address and the techniques that will be of most value to them. There is no lead time for providing the infrastructure to support the development of the initial prototype. The data storage requirements can be met through use of currently spare capacity on the existing Research Storage infrastructure managed by IT Services; the University already has the required ArcGIS software licenses and standard desktop workstations can support the processing associated with the visualisation and analyses of data. The specialist ArcGIS resource required to undertake the majority of the work for this phase has been secured and is available from mid-august. Prototype 2 The second prototype will build on the initial system and introduce a central database which will be used as a repository to hold research data in a structured and standardised way. This will provide a database management system that will support the longer term aims of HALOGEN researchers to increase the number and size of the external data sources input into the system and to be able to interrogate and extract information for analyses using a variety of different analyses tools not just ArcGIS. There is no implied dependency between the two prototypes and, resource permitting, the development of the second prototype can progress in parallel to the first. 3
4 There will be a need to purchase dedicated server equipment to host the central database. In order not to introduce any delay it is proposed to use existing server capacity to provide a virtual server development environment for the early phases of the implementation project. At this stage it is proposed to use database and data management tools that are available free to the research community. The resource required to complete the development of this prototype will be predominantly IT Services staff with input from specialist GIS resource around database design issues as necessary. 2.2 Design Principles The following high level design principles have been assumed when making decisions on the database and infrastructure for HALOGEN. As this is currently a non-business critical service for the University and, from a research perspective, a pilot project, the level of resilience and redundancy built into the design is low. In order to limit cost the team have selected database and software products which are either free to use for academic staff of where the University already has site license agreements in place. The working assumption has been that the pilot will be successful and that additional funds will be found to extend the service. The infrastructure design proposed can be scaled up at extra cost to support both the storage and processing capacity growth projections in the Service Specification. Any scripts to clean, load or extract data will be developed in such a way that they can be operated without the aid of specialist IT staff by research users. IT Services will be responsible for the support and maintenance of all infrastructure components deployed to support the project. 4
5 3. Database Management Model The generic database management model on the following page identifies that for database requirements like those of HALOGEN there are five stages or processes that need to be considered. These are described briefly below. Source Data - This involves obtaining and storing the source data that will be used as input to the research database. Included within this are procedures to refresh data as new versions become available. Extract, Transform and Load (ETL) This involves procedures to extract from the source data those items which are of relevance to the research group. In many cases the raw data may need to be cleaned and formatted in some way in order to improve its integrity or make it compatible with other source data sets. For example, it may be useful to introduce common codes for regions or check that the entries in specific fields are complete and accurate. Finally, there needs to be a way of loading the required data into some type of database. Data Storage This involves procedures to store and manage the core data required by researchers. For example, procedures and policies will need to be defined for governing the backup, recovery and access of data. Data Analyses This involves applying various tools and techniques to the data to produce information. In some cases it may involve extracting selected information for analyses using tools like SAS, SPSS or R. Visualisation This is one specific form of data analyses. From a HALOGEN perspective the geographical visualisation of data is the key user requirement and so it is appropriate to consider it as a separate stage. The diagram also identifies some of the many software tools that could be used to support different stages of the model. Each institution needs to choose those products which best suit their requirements and the skill sets of those involved in any project. 5
6 6
7 Database Management Software A key design decision is which database software to use. From a University of Leicester perspective three database products were considered as suitable for a database with the potential capacity and complexity requirements of HALOGEN. Initial discussions with researchers suggest requirements for spatial querying, data mining and full text search/manipulation. The three options considered were Oracle, SQL Server and MySQL. The key issues relating to each are summarised below. Oracle Oracle 9i (or higher) Enterprise Edition required with both spatial querying and data mining options added. Licensing costs for Oracle are high and combined with the limited skills and resources available for Oracle make this option prohibitive for the University of Leicester given the alternatives available. SQL Server SQL Server 2008 Standard Edition is required. Only the 2008 version of the software offers the spatial data querying option. Data mining and full text indexing options are also included. The Research Computer Services team do not currently have the relevant skills to support this software and, whilst the relevant skills do exist elsewhere in IT Services, these staff are already over committed to project work. MySQL MySQL offers support for both spatial data querying and full text indexing. Data mining is offered by 3 rd party open source provider Pentaho. There are open source versions of the software which are available at no costs and deemed suitable for the pilot phase of the project. The relevant skills and resource is available in the Research Computer Services team. Based on the above the team has chosen MySQL as the database software for HALOGEN. The software tools that we currently plan to use to support each stage of the database management model are summarised below for both of the prototypes. Prototype 1 Source Data & Extract, Transform, Load Excel, Access, PERL scripts Data Storage ArcGIS is based on a database and has utilities to assist with metadata management. Data Analyses ArcGIS has the ability to conduct many types of standard statistical analyses and to output files in a format that can be used by packages like SPSS or R. Visualisation - ArcGIS Prototype 2 Source Data & Extract, Transform, Load Excel, Access, PERL and/or Python scripts Data Storage MySQL (for some types of data, e.g. large images, it may be more appropriate to store these outside of the database via a Filesystem and store relevant metadata in MySQL) Data Analyses Database extracts/queries will provide output for use in tools favoured by researchers. For example, SPSS, R and ArcGIS. Visualisation ArcGIS 7
8 4. Data Quality Issues with Source Data Sets As part of the design phase of the project an initial review of the quality of the pilot datasets was performed by GIS specialists in Geography. The intention was to highlight any major issues or problems that could impact the use of ArcGIS. The findings are presented below for the Portable Antiquities Scheme (PAS) which now includes the Fitzwilliam coins database, the Key to English Place Names (KEN) and University of Leicester Genetics data. All the sample datasets have some form of georeference PAS: easting and northing fields although 10% are empty Genetics: population field which generally describes the county of the record English place names: gridref fields with components of a an OS grid PAS dataset is generally fine with good georeferencing, except for the 10% of the records with empty easting and northing fields. Suggested improvements: - fill in any blank values (e.g. with a 0, etc) - assess if all the fields are needed in the file to be imported into ArcGIS Genetics data is in a strange format and will take extensive manipulation to get in a format to be imported into ArcGIS and there will still be problems. This data can be mapped as it stands with some manipulations although only to a county area centroid. Suggested improvements: - Have only one header row - Export the file as a csv from the dbf export > text file myfile.csv access should do the rest, including management of blank fields. If this cannot be done some simple grep / replace could be run on the data. - Assess how to manage not typed this is a numeric field and having non-numeric characters entered will confuse ArcGIS - Typed descriptions need to be avoided so develop and use a key code to ensure consistency - Introduce separate fields for different things: e.g. Father from Australia vs. Father from Australia Mother from New Zealand. The latter needs to in two separate columns for ease of analyses. There is a requirement to add an alternative geographical location to each record in the Genetic database. This will reference the centre of gravity of the surname of the male whose Y chromosome was analysed. The geographical coding for these locations will be systematic and should not cause problems, but some surnames (e.g. common ones like 'Smith') will lack a location. KEN data English place names provided in.mdb format with some.csv files. This was generally of reasonable quality, not too much formatting will be needed to get the data into ArcGIS. However the spatial referencing is coarse and some manipulation is needed to concatenate three fields in order to provide a 1km OS grid reference (e.g. SK, 75, 48 SK7548) which could be linked to an easting and northing via data 8
9 downloaded from Edina.ac.uk. This data can be mapped after the concatenated grid reference elements have been linked to OS1km grid data to give the easting and northing values. Suggested improvements: - Provide data in csv format - Provide complete 6 digit referencing not 4 or 5 digit - Consider more detailed referencing - Concatenate current referencing if possible, although this is not essential (Note that only 191 / 313 records have this georeference i.e. ~40% do not) - Link to gazetteer to improve data quality. This can be done but it will need other data to identify the correct gazetteer record. - Avoid control characters in the free text (etymology) as this can create problems when the data is read into ArcGIS There are some general guidelines for importing data to ArcGIS which will be followed to simplify data preparation activities. These are: -.csv or excel (.xls) format -.mdb format but not.accdb - no non-alphanumeric characters in field names, although underscores are ok - do not start the field names with a number - field names should be <12 characters in length - have one type of data in each field e.g. characters or numbers (characters can include numbers but they cannot be manipulated numerically) - do not have too many (i.e. 100s) spurious fields some of the data may not load - do not have sparse fields ArcGIS will make a guess (often a bad one) at what type of field it is (e.g. text, etc) so complete with 0, N/A, etc - avoid labelling identifiers / referents as ID myid is better - use a standard set of descriptors in any field if possible rather than free text. If a predefinable set of free text descriptions are to be entered and this is being done manually then I suggest a code is used to avoid typos. In summary, the most problematic data set is the Genetics data. Considerable processing will be needed to clean and standardise the data prior to analyses. As this data is internal to the University of Leicester the team are comfortable that this is achievable. Overall the team believe that all the issues identified to date can be overcome as part of the pilot project. 5. Infrastructure & Implementation Resource To support the pilot phase of the project IT Services will need to purchase, install and configure appropriate database server and storage. These items are outlined below. Database Server The intention is to buy a production server with the following specification: 12 core x86_64 64GB System Ram Redundant PSU RAID 1 System disks RAID 0+1 DB Storage 800GB 3yr On-site next business day support 9
10 The operating system will be Linux The cost is estimated as 15,000 inclusive of VAT. Storage To support data storage and back up requirements it is proposed to buy additional capacity on the Research Data Storage system managed by IT Services. HALOGEN will be provided with 2 Terabytes of usable primary storage capacity and a further 2 Terabytes of backup storage capacity. Data will be backed up to a different data centre than that which hosts the database server on a daily basis. The recovery of data will be managed by the Research Computer Services team on receipt of a support request. The cost of this will be 2,400 inclusive of VAT. Software The free version of the open source MySQL database software will be used for the pilot project. The use of ArcGIS by HALOGEN is covered by a site license and is therefore free. It is possible that additional software products may be identified later in the project. User Access Users will be able to access the infrastructure from Windows (CIFS), Linux (NFS) and MAC (CFS) workstations. 10
11 Resource Estimates To develop the two prototypes will require the following resource. Geography ArcGIS expertise, data clean up and visualisation: 40 days Research Computer Services infrastructure build and database configuration: days Database Management Services database design and consultancy: days 11
The cross-disciplinary Roots of the British collaboration between scholars in humanities and
HALOGEN RESEARCH DATA MANAGEMENT BENEFITS CASE STUDY 1. BACKGROUND The cross-disciplinary Roots of the British collaboration between scholars in humanities and genetics at the University of Leicester (Wellcome
More informationBasics on Geodatabases
Basics on Geodatabases 1 GIS Data Management 2 File and Folder System A storage system which uses the default file and folder structure found in operating systems. Uses the non-db formats we mentioned
More informationSpanish examples IPR: Up to Date & Zones
Spanish examples IPR: Up to Date & Zones 1 Spanish IPR examples We have chosen the open source option: Python Why Python? Easy to learn and understand for not it people and code can be freely used and
More informationSisense. Product Highlights. www.sisense.com
Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze
More informationTheraDoc v4.6.1 Hardware and Software Requirements
TheraDoc v4.6.1 Hardware and Software Requirements In preparation for the release of TheraDoc v4.6.1, we have the following important information to communicate. Client Workstation Browser Requirements
More informationPersonal Geodatabase 101
Personal Geodatabase 101 There are a variety of file formats that can be used within the ArcGIS software. Two file formats, the shape file and the personal geodatabase were designed to hold geographic
More informationGIS III: GIS Analysis Module 2a: Introduction to Network Analyst
*** Files needed for exercise: nc_cty.shp; target_stores_infousa.dbf; streets.sdc (provided by street map usa); NC_tracts_2000sf1.shp Goals: To learn how to use the Network analyst tools to perform network
More informationICAS4108B Complete database back-up and recovery
ICAS4108B Complete database back-up and recovery Release: 1 ICAS4108B Complete database back-up and recovery Modification History Not Applicable Unit Descriptor Unit descriptor This unit defines the competency
More informationDATABASE ANALYST I DATABASE ANALYST II
CITY OF ROSEVILLE DATABASE ANALYST I DATABASE ANALYST II DEFINITION To perform professional level work in designing, installing, managing, updating, and securing a variety of database systems, including
More informationBackground on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros
David Moses January 2014 Paper on Cloud Computing I Background on Tools and Technologies in Amazon Web Services (AWS) In this paper I will highlight the technologies from the AWS cloud which enable you
More informationDatzilla. Error Reporting and Tracking for NOAA Data
Datzilla Error Reporting and Tracking for NOAA Data Overview Datzilla is a web based system used to report and track errors in NOAA datasets and Data Products. It is an adaptation of the software bug tracking
More informationSawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices
Sawmill Log Analyzer Best Practices!! Page 1 of 6 Sawmill Log Analyzer Best Practices! Sawmill Log Analyzer Best Practices!! Page 2 of 6 This document describes best practices for the Sawmill universal
More informationDBMS / Business Intelligence, SQL Server
DBMS / Business Intelligence, SQL Server Orsys, with 30 years of experience, is providing high quality, independant State of the Art seminars and hands-on courses corresponding to the needs of IT professionals.
More informationG-Cloud Service Definition Cadcorp Web Map Layers
G-Cloud Service Definition Cadcorp Web Map Layers Ref: RM1557/iii Government Procurement Services G-Cloud III Contents 1. Introduction... 3 2. Service Overview... 4 2.1 Web Map Layers... 4 2.2 Initial
More informationCollege of Engineering, Technology, and Computer Science
College of Engineering, Technology, and Computer Science Design and Implementation of Cloud-based Data Warehousing In partial fulfillment of the requirements for the Degree of Master of Science in Technology
More informationData Management Implementation Plan
Appendix 8.H Data Management Implementation Plan Prepared by Vikram Vyas CRESP-Amchitka Data Management Component 1. INTRODUCTION... 2 1.1. OBJECTIVES AND SCOPE... 2 2. DATA REPORTING CONVENTIONS... 2
More informationHow To Write A File System On A Microsoft Office 2.2.2 (Windows) (Windows 2.3) (For Windows 2) (Minorode) (Orchestra) (Powerpoint) (Xls) (
Remark Office OMR 8 Supported File Formats User s Guide Addendum Remark Products Group 301 Lindenwood Drive, Suite 100 Malvern, PA 19355-1772 USA www.gravic.com Disclaimer The information contained in
More informationChapter 6 8/12/2015. Foundations of Business Intelligence: Databases and Information Management. Problem:
Foundations of Business Intelligence: Databases and Information Management VIDEO CASES Chapter 6 Case 1a: City of Dubuque Uses Cloud Computing and Sensors to Build a Smarter, Sustainable City Case 1b:
More informationUser Guide. Analytics Desktop Document Number: 09619414
User Guide Analytics Desktop Document Number: 09619414 CONTENTS Guide Overview Description of this guide... ix What s new in this guide...x 1. Getting Started with Analytics Desktop Introduction... 1
More informationRelease 8.2 Hardware and Software Requirements. PowerSchool Student Information System
Release 8.2 Hardware and Software Requirements PowerSchool Student Information System Released January 2015 Document Owner: Documentation Services This edition applies to Release 8.2 of the PowerSchool
More informationAWS Schema Conversion Tool. User Guide Version 1.0
AWS Schema Conversion Tool User Guide AWS Schema Conversion Tool: User Guide Copyright 2016 Amazon Web Services, Inc. and/or its affiliates. All rights reserved. Amazon's trademarks and trade dress may
More informationDatabase as a Service (DaaS) Version 1.02
Database as a Service (DaaS) Version 1.02 Table of Contents Database as a Service (DaaS) Overview... 4 Database as a Service (DaaS) Benefit... 4 Feature Description... 4 Database Types / Supported Versions...
More informationTARRANT COUNTY PURCHASING DEPARTMENT
JACK BEACHAM, C.P.M., A.P.P. PURCHASING AGENT TARRANT COUNTY PURCHASING DEPARTMENT AUGUST 4, 2010 RFP NO. 2010-103 ROB COX, C.P.M., A.P.P. ASSISTANT PURCHASING AGENT RFP FOR DIGITAL ASSET MANAGEMENT SYSTEM
More informationSeamless Web Data Entry for SAS Applications D.J. Penix, Pinnacle Solutions, Indianapolis, IN
Seamless Web Data Entry for SAS Applications D.J. Penix, Pinnacle Solutions, Indianapolis, IN ABSTRACT For organizations that need to implement a robust data entry solution, options are somewhat limited
More informationGladstone Health & Leisure Technical Services
Gladstone Health & Leisure Technical Services Plus2 Environment Server Recommendations Commercial in Confidence Database Server Specifications Database server specifications are based on sizes in use on
More informationResources You can find more resources for Sync & Save at our support site: http://www.doforms.com/support.
Sync & Save Introduction Sync & Save allows you to connect the DoForms service (www.doforms.com) with your accounting or management software. If your system can import a comma delimited, tab delimited
More informationJob Description. Working Hours Standard 35 hours per week Normally working Mon Fri 9am to 5pm with additional hours as required
Job Description Job Title Oracle Support Technical Developer Function IT Services Applications Reporting to Applications Manager Direct Reports None Working Hours Standard 35 hours per week Normally working
More informationStellar Phoenix. SQL Database Repair 6.0. Installation Guide
Stellar Phoenix SQL Database Repair 6.0 Installation Guide Overview Stellar Phoenix SQL Database Repair software is an easy to use application designed to repair corrupt or damaged Microsoft SQL Server
More informationFrom Firebird 1.5 to 2.5
From Firebird 1.5 to 2.5 How to migrate 75Gb database, with 564 tables, 5000+ stored procedures, 813 triggers, which is working 24x7, with ~400 users in less than 4 months About IBSurgeon Tools and consulting
More informationCLIDATA In Ostrava 18/06/2013
CLIDATA In Ostrava 18/06/2013 Content Introduction...3 Structure of Clidata Application...4 Clidata Database...5 Rich Java Client...6 Oracle Discoverer...7 Web Client...8 Map Viewer...9 Clidata GIS and
More informationReal-time Data Replication
Real-time Data Replication from Oracle to other databases using DataCurrents WHITEPAPER Contents Data Replication Concepts... 2 Real time Data Replication... 3 Heterogeneous Data Replication... 4 Different
More informationFoundations of Business Intelligence: Databases and Information Management
Foundations of Business Intelligence: Databases and Information Management Wienand Omta Fabiano Dalpiaz 1 drs. ing. Wienand Omta Learning Objectives Describe how the problems of managing data resources
More informationUsing Database Metadata and its Semantics to Generate Automatic and Dynamic Web Entry Forms
Using Database Metadata and its Semantics to Generate Automatic and Dynamic Web Entry Forms Mohammed M. Elsheh and Mick J. Ridley Abstract Automatic and dynamic generation of Web applications is the future
More informationHow To Use Gfi Mailarchiver On A Pc Or Macbook With Gfi Email From A Windows 7.5 (Windows 7) On A Microsoft Mail Server On A Gfi Server On An Ipod Or Gfi.Org (
GFI MailArchiver for Exchange 4 Manual By GFI Software http://www.gfi.com Email: info@gfi.com Information in this document is subject to change without notice. Companies, names, and data used in examples
More informationHigh Availability Databases based on Oracle 10g RAC on Linux
High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN, June 2006 Luca Canali, CERN IT Outline Goals Architecture of an HA DB Service Deployment at the CERN Physics Database
More informationSQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box)
SQL Server 2012 Gives You More Advanced Features (Out-Of-The-Box) SQL Server White Paper Published: January 2012 Applies to: SQL Server 2012 Summary: This paper explains the different ways in which databases
More informationOracle Database 11g Comparison Chart
Key Feature Summary Express 10g Standard One Standard Enterprise Maximum 1 CPU 2 Sockets 4 Sockets No Limit RAM 1GB OS Max OS Max OS Max Database Size 4GB No Limit No Limit No Limit Windows Linux Unix
More informationFoundations of Business Intelligence: Databases and Information Management
Chapter 5 Foundations of Business Intelligence: Databases and Information Management 5.1 See Markers-ORDER-DB Logically Related Tables Relational Approach: Physically Related Tables: The Relationship Screen
More informationDescription of Application
Description of Application Operating Organization: Coeur d Alene Tribe, Plummer, Idaho Community of Interest: U.S. Indian tribes and their governments; rural governments OS and software requirements: Microsoft
More informationGeoCloud Project Report USGS/EROS Spatial Data Warehouse Project
GeoCloud Project Report USGS/EROS Spatial Data Warehouse Project Description of Application The Spatial Data Warehouse project at the USGS/EROS distributes services and data in support of The National
More informationTransitioning from a Physical to Virtual Production Environment. Ryan Miller Middle Tennessee Electric Membership Corp
Transitioning from a Physical to Virtual Production Environment Ryan Miller Middle Tennessee Electric Membership Corp Introduction MTEMC Distribute electricity to ~200,000 residential & business members
More informationGeodatabase Programming with SQL
DevSummit DC February 11, 2015 Washington, DC Geodatabase Programming with SQL Craig Gillgrass Assumptions Basic knowledge of SQL and relational databases Basic knowledge of the Geodatabase We ll hold
More informationMicroStrategy Desktop
MicroStrategy Desktop Quick Start Guide MicroStrategy Desktop is designed to enable business professionals like you to explore data, simply and without needing direct support from IT. 1 Import data from
More informationSetting up a database for multi-user access
BioNumerics Tutorial: Setting up a database for multi-user access 1 Aims There are several situations in which multiple users in the same local area network (LAN) may wish to work with a shared BioNumerics
More informationOracle Net Service Name Resolution
Oracle Net Service Name Resolution Getting Rid of the TNSNAMES.ORA File! Simon Pane Oracle Database Principal Consultant March 19, 2015 ABOUT ME Working with the Oracle DB since version 6 Oracle Certified
More informationParallels Plesk Automation
Parallels Plesk Automation Contents Compact Configuration: Linux Shared Hosting 3 Compact Configuration: Mixed Linux and Windows Shared Hosting 4 Medium Size Configuration: Mixed Linux and Windows Shared
More informationRelational Databases for the Business Analyst
Relational Databases for the Business Analyst Mark Kurtz Sr. Systems Consulting Quest Software, Inc. mark.kurtz@quest.com 2010 Quest Software, Inc. ALL RIGHTS RESERVED Agenda The RDBMS and its role in
More informationDBMS / Business Intelligence, Business Intelligence / DBMS
DBMS / Business Intelligence, Business Intelligence / DBMS Orsys, with 30 years of experience, is providing high quality, independant State of the Art seminars and hands-on courses corresponding to the
More informationICADBS402A Complete database backup and restore
ICADBS402A Complete database backup and restore Release: 1 ICADBS402A Complete database backup and restore Modification History Version ICADBS402A Comments This version first released with ICA11 Information
More informationitop: the open-source ITSM solution
itop: the open-source ITSM solution itop is a multi-client web portal designed for service providers and businesses. Simple and easy to use, it allows all configuration items and their relationships to
More informationData Management Nuts and Bolts. Don Johnson Scientific Computing and Visualization
Data Management Nuts and Bolts Don Johnson Scientific Computing and Visualization Overview Data Management Storing data Sharing data Moving data Tracking data (Client responsibility) Where can you obtain
More informationGeospatial Server Performance Colin Bertram UK User Group Meeting 23-Sep-2014
Geospatial Server Performance Colin Bertram UK User Group Meeting 23-Sep-2014 Topics Auditing a Geospatial Server Solution Web Server Strategies and Configuration Database Server Strategy and Configuration
More informationVery Large Enterprise Network, Deployment, 25000+ Users
Very Large Enterprise Network, Deployment, 25000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering
More informationPOPI Cloud Backups. Overview. The Challenges
POPI Cloud Backups Overview POPI Online Backup offers a simple, Secure, rapid deployment solution that is customizable and cost effective to implement a Backup strategy throughout your organisation. POPI
More informationW16 Data Mining Workshop
W16 Data Mining Workshop Brantley Synco, Director, Internal Audit and Compliance, Baptist Health System Jim Donaldson, Director of Compliance/Privacy and Security Officer, Baptist Health Care Corporation
More informationUniversity of Arkansas Libraries ArcGIS Desktop Tutorial. Section 4: Preparing Data for Analysis
: Preparing Data for Analysis When a user acquires a particular data set of interest, it is rarely in the exact form that is needed during analysis. This tutorial describes how to change the data to make
More information5-Bay Raid Sub-System Smart Removable 3.5" SATA Multiple Bay Data Storage Device User's Manual
5-Bay Raid Sub-System Smart Removable 3.5" SATA Multiple Bay Data Storage Device User's Manual www.vipower.com Table of Contents 1. How the SteelVine (VPMP-75511R/VPMA-75511R) Operates... 1 1-1 SteelVine
More informationHow To Monitor A Document Management System On A Web Browser On A Linux Computer (For Free)
Whitepaper: Monitoring the Interwoven Worksite DMS By S. Bondy Abstract The document management system is crucial to the daily operation of the modern law firm. Yet frequently, the document management
More informationLeveraging Public Clouds to Ensure Data Availability
Systems Engineering at MITRE CLOUD COMPUTING SERIES Leveraging Public Clouds to Ensure Data Availability Toby Cabot Lawrence Pizette The MITRE Corporation manages federally funded research and development
More informationInstalling The SysAidTM Server Locally
Installing The SysAidTM Server Locally Document Updated: 17 October 2010 Introduction SysAid is available in two editions: a fully on-demand ASP solution and an installed, in-house solution for your server.
More informationChapter 24: Creating Reports and Extracting Data
Chapter 24: Creating Reports and Extracting Data SEER*DMS includes an integrated reporting and extract module to create pre-defined system reports and extracts. Ad hoc listings and extracts can be generated
More informationInge Os Sales Consulting Manager Oracle Norway
Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database
More informationData Lab Operations Concepts
Data Lab Operations Concepts 1 Introduction This talk will provide an overview of Data Lab components to be implemented Core infrastructure User applications Science Capabilities User Interfaces The scope
More informationEnterprise GIS Solutions to GIS Data Dissemination
Enterprise GIS Solutions to GIS Data Dissemination ESRI International User Conference July 13 17, 2009 Wendy M. Turner Senior GIS Engineer & Program Manager Freedom Consulting Group, LLC Building the Enterprise
More informationEasy Data Centralization with Webster. User Guide
Easy Data Centralization with Webster User Guide CONTENTS 3-4 1 Introducing Webster Webster - An Introduction 5-14 2 Installing & Configuring Webster Installing the System Configuring Webster 15-18 3 Managing
More informationOracle SQL Developer Migration
An Oracle White Paper May 2010 Oracle SQL Developer Migration Overview... 3 Introduction... 3 Oracle SQL Developer: Architecture and Supported Platforms... 3 Supported Platforms... 4 Supported Databases...
More informationGroundwater Chemistry
Mapping and Modeling Groundwater Chemistry By importing Excel spreadsheets into ArcGIS 9.2 By Mike Price, Entrada/San Juan, Inc. In ArcGIS 9.2, Microsoft Excel spreadsheet data can be imported and used
More informationSpectrum Technology Platform. Version 9.0. Spectrum Spatial Administration Guide
Spectrum Technology Platform Version 9.0 Spectrum Spatial Administration Guide Contents Chapter 1: Introduction...7 Welcome and Overview...8 Chapter 2: Configuring Your System...9 Changing the Default
More informationWalesHER GAT User Manual
WalesHER GAT User Manual Automated Data Upload (User Levels 0 & 1) This document provides guidance on uploading datasets in csv format to WalesHER using the Load Data tool and migration SQLs in phpmyadmin
More informationHardware and Software Requirements for Installing California.pro
Hardware and Requirements for Installing California.pro This document lists the hardware and software requirements to install and run California.pro. Workstation with SQL Server Recommended: 64-Bit Windows
More informationWeb Hosting. E-Mail Hosting. Cloud File Hosting. The Genio Group (214) 732-7411 info@thegeniogroup.com www.thegeniogroup.com
Web Hosting E-Mail Hosting Cloud File Hosting Genio Hosting Servers All of Genio s Hosting Servers run on Apple hardware running Mac OS X Server. Mac OS X Server leverages the computing power of 64-bit
More informationTroubleshooting SQL Server Enterprise Geodatabase Performance Issues. Matthew Ziebarth and Ben Lin
Troubleshooting SQL Server Enterprise Geodatabase Performance Issues Matthew Ziebarth and Ben Lin Troubleshooting SQL Server Enterprise Geodatabase Performance Issues AGENDA General configuration recommendations
More informationAbstract. Introduction
Data Replication and Data Sharing Integrating Heterogeneous Spatial Databases Mark Stoakes and Katherine Irwin Professional Services, Safe Software Inc. Abstract Spatial data warehouses are becoming more
More informationEnterprise Network Deployment, 10,000 25,000 Users
Enterprise Network Deployment, 10,000 25,000 Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering
More informationHETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM. Aniket Bochare - aniketb1@umbc.edu. CMSC 601 - Presentation
HETEROGENEOUS DATA INTEGRATION FOR CLINICAL DECISION SUPPORT SYSTEM Aniket Bochare - aniketb1@umbc.edu CMSC 601 - Presentation Date-04/25/2011 AGENDA Introduction and Background Framework Heterogeneous
More informationAdvanced analytics at your hands
2.3 Advanced analytics at your hands Neural Designer is the most powerful predictive analytics software. It uses innovative neural networks techniques to provide data scientists with results in a way previously
More informationEMC DOCUMENTUM xplore 1.1 DISASTER RECOVERY USING EMC NETWORKER
White Paper EMC DOCUMENTUM xplore 1.1 DISASTER RECOVERY USING EMC NETWORKER Abstract The objective of this white paper is to describe the architecture of and procedure for configuring EMC Documentum xplore
More informationBENEFITS OF AUTOMATING DATA WAREHOUSING
BENEFITS OF AUTOMATING DATA WAREHOUSING Introduction...2 The Process...2 The Problem...2 The Solution...2 Benefits...2 Background...3 Automating the Data Warehouse with UC4 Workload Automation Suite...3
More informationWe look beyond IT. Cloud Offerings
Cloud Offerings cstor Cloud Offerings As today s fast-moving businesses deal with increasing demands for IT services and decreasing IT budgets, the onset of cloud-ready solutions has provided a forward-thinking
More informationHardware and Software Requirements for Installing California.pro
Hardware and Requirements for Installing California.pro This document lists the hardware and software requirements to install and run California.pro. Workstation with SQL Server type: Pentium IV-compatible
More informationDATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7
DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7 UNDER THE GUIDANCE Dr. N.P. DHAVALE, DGM, INFINET Department SUBMITTED TO INSTITUTE FOR DEVELOPMENT AND RESEARCH IN BANKING TECHNOLOGY
More informationToolbox 4.3. System Requirements
Toolbox 4.3 February 2015 Contents Introduction... 2 Requirements for Toolbox 4.3... 3 Toolbox Applications... 3 Installing on Multiple Computers... 3 Concurrent Loading, Importing, Processing... 4 Client...
More informationUnicenter Patch Management
Unicenter Patch Management Best Practices for Managing Security Updates R11 This documentation (the Documentation ) and related computer software program (the Software ) (hereinafter collectively referred
More informationData Warehouse Center Administration Guide
IBM DB2 Universal Database Data Warehouse Center Administration Guide Version 8 SC27-1123-00 IBM DB2 Universal Database Data Warehouse Center Administration Guide Version 8 SC27-1123-00 Before using this
More informationHow to recover a failed Storage Spaces
www.storage-spaces-recovery.com How to recover a failed Storage Spaces ReclaiMe Storage Spaces Recovery User Manual 2013 www.storage-spaces-recovery.com Contents Overview... 4 Storage Spaces concepts and
More informationER/Studio 8.0 New Features Guide
ER/Studio 8.0 New Features Guide Copyright 1994-2008 Embarcadero Technologies, Inc. Embarcadero Technologies, Inc. 100 California Street, 12th Floor San Francisco, CA 94111 U.S.A. All rights reserved.
More informationWhat you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11
What you can do:...3 Data Entry:...3 Drillhole Sample Data:...5 Cross Sections and Level Plans...8 3D Visualization...11 W elcome to North Face Software s software. With this software, you can accomplish
More informationEnterpriseLink Benefits
EnterpriseLink Benefits GGY AXIS 5001 Yonge Street Suite 1300 Toronto, ON M2N 6P6 Phone: 416-250-6777 Toll free: 1-877-GGY-AXIS Fax: 416-250-6776 Email: axis@ggy.com Web: www.ggy.com Table of Contents
More informationAlexander Nikov. 5. Database Systems and Managing Data Resources. Learning Objectives. RR Donnelley Tries to Master Its Data
INFO 1500 Introduction to IT Fundamentals 5. Database Systems and Managing Data Resources Learning Objectives 1. Describe how the problems of managing data resources in a traditional file environment are
More informationMIS0094 SuccessNet Upgrade 1.1.0.5 Guide. Individuals Implementing SuccessNet. To assist in the upgrade of the software to version 1.1.0.
Title: Audience: Maintained by: Purpose: MIS0094 SuccessNet Upgrade 1.1.0.5 Guide Individuals Implementing SuccessNet. Technical Services To assist in the upgrade of the software to version 1.1.0.5 Requirements
More informationhmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau
Powered by Vertica Solution Series in conjunction with: hmetrix Revolutionizing Healthcare Analytics with Vertica & Tableau The cost of healthcare in the US continues to escalate. Consumers, employers,
More informationGUIDE TO REDCAP EXPORTED FILES
GUIDE TO REDCAP EXPORTED FILES UNDERSTANDING DATA FORMATS AND LOADING DATA INTO ANALYSIS SOFTWARE INTRODUCTION At some point in time in the course of your REDCap project, you will need to export your data
More informationINFORMATION MANAGERS ROUNDTABLE SHELLEY COOKE, WHITNEY WEBER MONDAY APRIL 23, 2012 1:30 3:00 PM PST
INFORMATION MANAGERS ROUNDTABLE SHELLEY COOKE, WHITNEY WEBER MONDAY APRIL 23, 2012 1:30 3:00 PM PST I. Biotics 4 Session Topics Compatibility issues, other common challenges, support for Biotics 4 (Informational,
More informationTable of Contents SQL Server Option
Table of Contents SQL Server Option STEP 1 Install BPMS 1 STEP 2a New Customers with SQL Server Database 2 STEP 2b Restore SQL DB Upsized by BPMS Support 6 STEP 2c - Run the "Check Dates" Utility 7 STEP
More informationESS event: Big Data in Official Statistics. Antonino Virgillito, Istat
ESS event: Big Data in Official Statistics Antonino Virgillito, Istat v erbi v is 1 About me Head of Unit Web and BI Technologies, IT Directorate of Istat Project manager and technical coordinator of Web
More informationMySQL for Beginners Ed 3
Oracle University Contact Us: 1.800.529.0165 MySQL for Beginners Ed 3 Duration: 4 Days What you will learn The MySQL for Beginners course helps you learn about the world's most popular open source database.
More informationSolution Brief: Creating Avid Project Archives
Solution Brief: Creating Avid Project Archives Marquis Project Parking running on a XenData Archive Server provides Fast and Reliable Archiving to LTO or Sony Optical Disc Archive Cartridges Summary Avid
More informationResults CRM 2012 User Manual
Results CRM 2012 User Manual A Guide to Using Results CRM Standard, Results CRM Plus, & Results CRM Business Suite Table of Contents Installation Instructions... 1 Single User & Evaluation Installation
More information