Database Development. Richard Bruskiewich, Ph.D. Bioversity International. Principal Scientist Agricultural Biodiversity Informatics Theme

Global Timber Tracking Network: Database Development
Richard Bruskiewich, Ph.D.
Principal Scientist, Agricultural Biodiversity Informatics Theme
Conservation & Availability
Bioversity International
26 March 2012

Overview
- Database objectives
- Design requirements
- Current implementation strategy:
  - Initial design decisions
  - Domain model semantics (first iteration)
- Progress to date:
  - Status of implementation & deployment
  - Brief tour of early prototype
- Next steps

Database Objectives
- Will aggregate reference DNA and isotope datasets
- Will provide scientifically (statistically) sound analytical tools to support or refute claims of timber provenance
- Target users/actors:
  - DNA and isotope testing laboratories
  - DNA and isotope reference standard providers
  - National legal enforcement agencies (e.g. customs, courts)
  - International multilateral agencies (e.g. FAO?)
- Other interested parties:
  - Forestry companies?
  - Conservation NGOs?
  - Any others?

Design Requirements
- Credible forensic resource:
  - Quality assurance workflows for reference data
  - Embodies and accurately expresses objective science
  - Provides an audit trail for forensic purposes
- Globally accessible system:
  - Internet (web) based application, online 24 x 7
  - Support for international languages by locale
- Secure system:
  - Reference data treated as confidential
  - Web access as secure as a bank's
- Scalable:
  - Expands with available reference data
  - Extendable to embody new science/algorithms
- Highly sustainable informatics implementation
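The audit trail requirement above can be illustrated with a small sketch. This is not the GTTN implementation, only one common way to make a log tamper-evident: each entry stores a hash of its own content chained to the hash of the previous entry. The class name AuditTrail, the field names and the sample actors are all hypothetical.

```python
import hashlib
import json


def entry_hash(prev_hash, record):
    """Hash a record together with the hash of the previous entry."""
    payload = json.dumps(record, sort_keys=True).encode("utf-8")
    return hashlib.sha256(prev_hash.encode("utf-8") + payload).hexdigest()


class AuditTrail:
    """Append-only log in which each entry is chained to its predecessor."""

    GENESIS = "0" * 64  # placeholder "previous hash" for the first entry

    def __init__(self):
        self.entries = []

    def append(self, actor, action, target):
        """Record who did what to which item (all names are illustrative)."""
        prev = self.entries[-1]["hash"] if self.entries else self.GENESIS
        record = {"actor": actor, "action": action, "target": target}
        self.entries.append(
            {"record": record, "prev": prev, "hash": entry_hash(prev, record)}
        )

    def verify(self):
        """Return True only if no entry has been altered or reordered."""
        prev = self.GENESIS
        for entry in self.entries:
            expected = entry_hash(prev, entry["record"])
            if entry["prev"] != prev or entry["hash"] != expected:
                return False
            prev = entry["hash"]
        return True


trail = AuditTrail()
trail.append("lab_a", "upload", "reference_set_1")
trail.append("curator_b", "approve", "reference_set_1")
print(trail.verify())  # True for an untouched trail

# Modifying an earlier entry breaks the chain and is detected.
trail.entries[0]["record"]["actor"] = "someone_else"
print(trail.verify())  # False
```

A production system would also need timestamps, authenticated actors and durable storage; the sketch only shows why a chained log lets later edits be detected.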

Current Implementation Strategy
- Initial design decisions
- Domain model semantics (first iteration)

Initial Design Decision 1a
After reviewing programming language options for web development, we decided to use Python, since it:
- Is a mature, dynamically typed language with a coherent, clean syntax and powerful programming constructs.
- Is seen as more robust, less confusing and more versatile than PHP: Python supports back-end systems computing, not just web interfaces (PHP is very web-centric).
- Is generally less tedious than Java(*) for prototyping, but still has native object-oriented support.
(*) A Java implementation of Python (Jython) is available, if we change our minds.

Initial Design Decision 1b
Python has broad, excellent software library support for scientific computing and data visualization, e.g.:
- NumPy/SciPy for scientific computations
- RPy2 library for interfacing with R statistical software
- PyMC for Monte Carlo analysis
- PyMix for mixed models
- PyBel interface to OpenBabel cheminformatics
- Python for ArcGIS; MayaVi for scientific data visualization
- And many others

Initial Design Decision 2
Given Python, it was decided to embrace Django, one of the most widely recognized industry-standard frameworks for web site development, which:
- Has a mature, solid design, well supported by a global community.
- Has a very loosely coupled, modular architectural design, which encourages very rapid, iterative development.
- Has a very flexible HTML template-driven presentation layer:
  - Web page formatting of the HTML will be done with Cascading Style Sheets, following best practices.
  - JavaScript will be used for dynamic content.

Initial Design Decisions 3
- Using PostgreSQL in the back end (Django supports other back ends, so this could be changed).
- We are targeting CentOS Linux for server deployment (although the prototype does run well under Windows).
- Using NGINX + uWSGI as the web application container (faster and less onerous than Apache + FastCGI).
- The staging web server is deployed as an HTTPS/SSL-secured web site.
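As a rough illustration of how these decisions might surface in a Django project, the fragment below sketches the relevant part of a settings module. It is an assumption-laden example, not the project's actual configuration: the database name, user, host and the GTTN_* environment variable names are hypothetical, and the setting names follow current Django releases (older releases named the PostgreSQL engine "django.db.backends.postgresql_psycopg2").

```python
import os

# Hypothetical connection details; real values would come from the
# deployment environment rather than from source control.
DATABASES = {
    "default": {
        # Django's PostgreSQL backend. Switching to another supported
        # back end mainly means changing this ENGINE string.
        "ENGINE": "django.db.backends.postgresql",
        "NAME": os.environ.get("GTTN_DB_NAME", "gttn"),
        "USER": os.environ.get("GTTN_DB_USER", "gttn"),
        "PASSWORD": os.environ.get("GTTN_DB_PASSWORD", ""),
        "HOST": os.environ.get("GTTN_DB_HOST", "localhost"),
        "PORT": os.environ.get("GTTN_DB_PORT", "5432"),
    }
}

# Redirect plain HTTP requests to HTTPS and only send cookies over HTTPS.
SECURE_SSL_REDIRECT = True
SESSION_COOKIE_SECURE = True
CSRF_COOKIE_SECURE = True

# Enable Django's translation machinery, with English as the default language.
USE_I18N = True
LANGUAGE_CODE = "en"
```

Because the back end is selected by a single setting, the remark that the database could be changed corresponds to editing the ENGINE entry, provided the application avoids PostgreSQL-specific features.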

Domain model semantics (first iteration)

Progress to Date
Project meeting at beginning of November 2012
Status of implementation & deployment:
- HTTPS/SSL-protected staging web server commissioned on a RackSpace-hosted server.
- Baseline project development environment established: e.g. source code repository, project wiki, etc.
- Core informatics implementation technologies were reviewed, are partially selected and are being applied.
- A preliminary data model was specified and implemented.
- Iterative software development work started on the prototype.

Brief Demonstration of Prototype

Next Steps
- Towards a leaner and more agile development strategy?
  - A rapid, iterative lean/agile design/implementation cycle was promised at the November 2012 meeting but has not yet been put into place.
  - Perhaps this process can start now, given initial progress on putting the prototype in place.
- Resourcing and collaboration?
  - It's a 15 billion dollar a year problem. If the GTTN informatics system is to be properly implemented soon, would additional targeted resources (and dedicated personnel) help?
  - Except for this presenter, the Bioversity team lacks the technical expertise, and certainly the human resources, to meet the challenge.
  - What kind of community involvement is desired?

Thank you.