CU9 Science Enabling Applications Development Work Package Software Requirements Specification (WP970)
|
|
|
- Paul Hunter
- 10 years ago
- Views:
Transcription
1 Science Enabling Applications Development Work Package Software Requirements Specification (WP970) prepared by: approved by: reference: issue: revision: 1 X. Luri, P.M. Marrese, F.Julbe, H. Enke, N. Walton, G. Gracia, G. Comoretto X. Luri, P.M. Marresse date: status: Draft. Pending of formal approval in DPAC CU9
2 Abstract This document provides the list of requirements applicable to the Gaia science enabling applications work package (WP970). It also covers deliverable D4.1 for the GENIUS project. Software Requirement Specifications 2
3 Document History Issue Revision Date Author Comment D XL Code document change D FJL First version of the CU9 science enabling applications SRS D PM Separate table for requirements with no parents added Software Requirement Specifications 3
4 Contents 1 Introduction Objectives Scope Assumptions Applicable Documents Requirement Definition List of requirements General requirements Advanced data access tools Data Mining Cross-Matching Science Alerts Documentation Help Desk Public Outreach Missing Parent requirements 14 References 16 A Requirements traceability 17 Software Requirement Specifications 4
5 Acronym List The following table has been generated from the on-line Gaia acronym list: Acronym ASDC AUT BP CU DPAC ESA ESAC GAIA GENIUS GPDB HDFS MAN PM RP RVS SAT SRS SSS TAP TBD VO WP Description ASI Science Data Centre AUTomated Blue Photometer Coordination Unit (in DPAC) Data Processing and Analysis Consortium European Space Agency European Space Astronomy Centre (VilSpa) Global Astrometric Interferometer for Astrophysics (obsolete; now spelled as Gaia) Gaia European Network for Improved User Services Gaia Parameter DataBase Hadoop Distributed File System MANual Polarisation Maintaining Red Photometer Radial Velocity Spectrometer Satellite Archive Team Software Requirements Specification System Software Specification Table Access Protocol To Be Defined (Determined) Virtual Observatory Work Package Software Requirement Specifications 5
6 1 Introduction This document sets out the software requirements pertaining to the Gaia science enabling applications. 1.1 Objectives The objective of this document is to define a set of requirements for those applications developed for the GAIA catalogue scientific exploitation. Some of the listed requirements can be divided into smaller ones in order to improve its development and monitoring. The requirements will cover all funtional and technological aspects of the science enabling applications in order to produce fully functional state-of-the-art products for the GAIA catalogue scientific explotation. 1.2 Scope This work package (WP970) includes 5 sub-work packages that cover different functional domains and together they cover all aspects of the software applications to a full exploitation of the GAIA catalogue. 1.3 Assumptions Some requirements have higher level CU9 requirements. These dependencies are recorded but not repeated in this document; the relevant SRS (see next Section) should be referred to for further details. The top level requirements specification should be considered as level 0 requirements that are unlikely to change during the development and implementation iterations, while the derived and more detailed requirements herein may subject to significant revision consistent with a pragmatic and agile development process. 1.4 Applicable Documents When applicable documents change a change may be required in this document. The applicable documents are listed here for clarity; a full reference list is provided at the end of the document. (WOM-086) (WOM-033) (AB-026) Software Development Plan for CU9 Gaia Catalogue and Archive SRS Gaia data access scenarios summary For convenience only the structure of this document follows that of the level 0 SRS (WOM- 033). Software Requirement Specifications 6
7 1.5 Requirement Definition The requirements set out in this SRS follow the labelling scheme: Where : CU9-WP97x-X-SCOPE-xxx WP97x is the (sub )WP number as follows: WP971: Management WP972: Advanced data access tools WP973: Data Mining WP974: Cross-Matching WP975: Science Alerts X is either S (for Scientific), T (for Technical), Q (for Quality Assurance), or M (for Management) SCOPE is a four letter scope specification of the requirement following the identified list of possible values as shown in the list below xxx is a monotonically increasing counter for every unique combination SCOPE is a 4-letter scope specification of the requirement. In this document the following scopes have been used: GLOB: for top-level global requirements; ALGO: for requirements on the scientific algorithm to be applied in the data processing (this should be for the detail on how the functionality is to be achieved); CODE: for requirements on the coding activities (this should just be for how the code is written e.g. the GPDB should be used); COOR: for requirements on the coordination activities (this should not be used for functionality requested by other WPs, but that you coordinate with them in the design etc.); DATA: for requirements on data stream handling (this can include descriptions of input and output data); FUNC: concerning functional requirements (this describes the functionality that is required by the system); HARD: concerning hardware requirements; PERF: for requirements on performances; PLAN: for requirements on the planning activities; Software Requirement Specifications 7
8 QUAL: for requirements on the quality assurance (both scientific and software, robustness and quality of data could go here); RESO: for requirements on the resource management. DOCU: for requirements on documentation. Each requirement is presented with its own unique label and a number of attributes in the following form: CU9-WP97x-X-SCOPE-020 C.v Verification Status Description Parent: Parent CU9-WP97x-X- SCOPE-xxx C.v Verification Status Parent The unique identifier of the requirement (see above). Version number of the requirement composed of major part (C) corresponding to the cycle (1, 2 and 3 corresponding to A, B and C respectively in WOM-086) in which the requirement was created and minor part (v) corresponding to the version of the requirement. Envisaged validation method of requirement - this will be either AUT for automated or MAN for Manual. Status identifier. Higher level requirement or requirements in a comma separated list. Software Requirement Specifications 8
9 2 List of requirements 2.1 General requirements CU9-WP971-M-PLAN MAN Draft CU9 WP970 shall operate within the agreed management structures of the CU (which in turn operates within the existing DPAC management structure). Parent: CU9-ARC-M-PLN-020 CU9-WP971-M-PLAN MAN Draft CU9 WP970 shall develop all code and documentation within the DPAC code repository where it shall be visible to all DPAC members but modifiable only by CU9 members. Parent: CU9-ARC-M-PLN-040 CU9-WP971-M-PLAN MAN Draft CU9 WP970 shall follow the engineering guidelines laid down in WOM-086. Parent: CU9-ARC-M-PLN-060 CU9-WP971-M-PLAN MAN Draft CU9 WP970 developments shall be overseen by a System Engineering coordination group. Parent: CU9-ARC-M-PLN-060 CU9-WP971-M-COOR MAN Draft CU9 WP970 shall coordinate with other CU9 WPs in particular WP930 and WP950. Parent: 2.2 Advanced data access tools CU9-WP972-T-FUNC MAN Draft The Gaia Archive shall provide a TAP access to the Gaia Catalogue. Parent: CU9-ITG-T-FUN-140, CU9-ITG-T-FUN-020 CU9-WP972-T-FUNC MAN Draft The Gaia Archive shall provide a SSAP access delivering BP, RP and RVS spectra calibrated in wavelength and in flux. The delivered spectra shall be compliant with the Spectrum datamodel (VO standard). The spectra resolution shall be specified for each spectrum and standard units shall be used. Parent: CU9-ITG-T-FUN-140 Software Requirement Specifications 9
10 CU9-WP972-T-FUNC MAN Draft WP972 shall adapt at least one VO-tool able to display the data provided through the TAP access of the Gaia Archive. Parent: CU9-WP972-T-FUNC MAN Draft WP972 shall adapt at least one VO-tool in order to select and display Gaia spectra. Parent: CU9-WP972-T-FUNC MAN Draft WP972 shall adapt at least one VO-tool in order to visualise, at least in 2D, positionnal/astrometric Gaia data. Parent: CU9-ADV-T-FUN-040 CU9-WP972-T-FUNC MAN Draft All VO-tools adapted by WP972 shall be able to access the Gaia Catalogue through the Gaia Archive using one of the implemented VO protocols. Parent: CU9-ITG-T-FUN-140 CU9-WP972-T-FUNC MAN Draft All VO-tools adapted by WP972 shall be able to interact each other using SAMP, in order to process different types of Gaia data in the adequate tools. Parent: CU9-ADV-T-FUN-060 For instance: selecting a Gaia object inside Aladin and sending the position to SPLAT in order to display its spectra. CU9-WP972-T-FUNC MAN Draft WP972 shall deliver at least one spectra matching tool/service able to work on the spectra provided by the Gaia Archive. Parent: CU9-ITG-T-FUN-140, CU9-ITG-T-FUN-040 CU9-WP972-T-FUNC MAN Draft All spectra matching tools/services provided by WP972 shall be able to match Gaia spectra to users spectra. A users spectrum may be provided in different formats, but at least a Spectrum datamodel (VO standard) compliant input must be supported. Parent: CU9-ITG-T-FUN-140, CU9-ITG-T-FUN-040 CU9-WP972-T-FUNC MAN Draft All spectra matching tools/services provided by WP972 shall be able to access Gaia spectra through the Gaia Archive. Parent: CU9-ITG-T-FUN-140, CU9-ITG-T-FUN-040 Software Requirement Specifications 10
11 2.3 Data Mining CU9-WP973-T-FUNC MAN Draft Data mining framework will allow the community to perform complex queries through an easy to use interface. This User Interface must be based on a web technology preferably fully integrated or into the rest of the Gaia archive access and querying tools or compatible with them. Parent: CU9-CIF-T-MAN-020 CU9-WP973-T-FUNC MAN Draft Data mining framework must implement a basic set of MlLib (Machine Learning Libs) as a basic building blocks to build more complex use cases on top of them. These complex and more common use cases will also be implemented in the framework. Parent: CU9-CIF-T-MAN-020 CU9-WP973-T-FUNC MAN Draft The framework must integrate a job management platform to handle task execution. This job management system must implement the job submitting policies necessary such as: synchronized execution or batch execution in a prioritized queue policy, with the necessary cluster resource management. The usage of the cluster and its resources should be monitored by the appropriate tools. Parent: CU9-CIF-T-MAN-140 CU9-WP973-T-FUNC MAN Draft Advanced users will also be allowed to submit their own custom applications to the cluster though a proper entry point to the framework. The submitted job must also be integrated into the job management system for its execution. Parent: CU9-CIF-T-MAN-040, CU9-ITG-T-FUNC-200 CU9-WP973-T-FUNC MAN Draft The data mining framework must implement the security policies necessary according to the SAT (Science Archive team) and ESA security standards. Also it should be integrated into the rest of the security framework (single sign on service) with the rest of the Gaia Archive web services and interfaces. Parent: CU9-CIF-T-FUN-080 Software Requirement Specifications 11
12 CU9-WP973-T-FUNC MAN Draft Data mining task results must be displayed through the client using advanced data displaying tools (including graphical features -2D/3D-). Interaction with the visualization Work Package (WP980) must be established. Parent: CU9-ADV-T-FUN-020, CU9-ADV-T-FUN-040 CU9-WP973-T-DATA MAN Draft Gaia archive data must be provided into a HDFS (Hadoop Distributed File System) o similar distributed file system together with its relational version and both have to be in synch. Also, Archive metadata must be provided in order to be user by the data miming framework and its query system. A preferential file format to be used by the data mining framework must be evaluated and proposed by this work package. Parent: CU9-CAT-M-PLN Cross-Matching In the following by External Catalogues we intend the catalogues for which the cross-match will be pre-computed and will be part of the Gaia releases. Those will be catalogues with N stars greater than a few hundred millions, observed in the optical/near infrared and public available. The External Catalogues, the cross-match algorithm and the output will be release specific, so the following requirements will need to be fullfilled for each release. In order to calculate the cross-match of the Gaia Catalogue with External surveys, the latter must be homogenized in order for a single algorithm to run on several different catalogues. It is thus necessary to have both original catalogues (either full or a sub-set of the fields needed by cross-match and validation) and a cross-match specific version of the same catalogues. CU9-WP974-S-DATA MAN Draft Define External Catalogues (original catalogues and cross-match specific catalogues) input through a definition of the data models. CU9-WP974-S-DATA MAN Draft Define cross-match output through a definition of the data models (full output will be used for the cross-match validation, sub-set will be made available to final users). CU9-WP974-T-DATA MAN Draft Prepare and deliver to ESAC the gbin files of the original External Catalogues using the CU9 data models. Software Requirement Specifications 12
13 CU9-WP974-T-DATA MAN Draft Prepare and deliver to ESAC the gbin files of the cross-match specific External Catalogues using the CU9 data models. CU9-WP974-S-ALGO MAN Draft Detailed description and specifications of the procedure that calculates the cross-match of the Gaia Catalogue with the External Catalogues starting from the defined input and obtaining the defined output. CU9-WP974-T-FUNC MAN Draft A cross-match software compliant with the specified algorithm must be available in ASDC for cross-match validation or directly to calculate the cross-match. CU9-WP974-S-ALGO MAN Draft Definition of the validation process of the cross-match algorithm. CU9-WP974-T-FUNC MAN Draft The cross-match validation must be performed following the cross-match validation requirements. CU9-WP974-T-DATA MAN Draft Definition of the data set needed for the cross-match validation. CU9-WP974-T-DOCU MAN Draft Provide documentation on original External Catalogues and a detailed description of the corresponding data models. Parent: CU9-DOC-S-FUN-040 CU9-WP974-T-DOCU MAN Draft Provide documentation on cross-match specific External Catalogues and generate a detailed description of the corresponding data models. Parent: CU9-DOC-S-FUN-040 Software Requirement Specifications 13
14 CU9-WP974-T-DOCU MAN Draft Provide documentation on cross-match tables (output of cross-match activities) and generate a detailed description of the corresponding data models. Parent: CU9-DOC-S-FUN-040 CU9-WP974-T-DOCU MAN Draft Provide documentation on cross-match algorithm and on the tests performed to develop it. Parent: CU9-DOC-S-FUN-040 CU9-WP974-T-DOCU MAN Draft Provide documentation on the validation of the cross-match algorithm. Parent: CU9-DOC-S-FUN Science Alerts CU9-WP975-T-COOR MAN Draft...TBD Documentation CU9-WP975-T-DOCU MAN Draft...TBD... Parent: CU9-DOC-S-FUN-040,CU9-DOC-S-PLN-060,?? This section may be deleted, generation of the documentation should be added as requirements in each sub-wp section, in addition a requirement on the publication of the documentation should be sent to WP920 for inclusion in their SRS document. 2.7 Help Desk See elsewhere for the SRS of the relevant workpackage (WP953) 2.8 Public Outreach See elsewhere for the SRS of the relevant workpackage (WP960) 3 Missing Parent requirements Each requirement in this document must have a parent requirement at higher level. However for some of the requirements defined in this document no parent requirement could be found in the CU9 SRS WOM-033. We consider that WOM-033 needs to be updated to include the following top level CU9 requirements Software Requirement Specifications 14
15 WP970 Requirement CU9-WP971-M-COO-020 Missing Top level CU9 requirement No requirement on coordination between CU9 WPs exists in the CU9 SRS. Software Requirement Specifications 15
16 References [AB-026], Brown, A., Arenou, F., Hambly, N., et al., 2012, Gaia data access scenarios summary, GAIA-C9-TN-LEI-AB-026, URL [WOM-033], O Mullane, W., 2009, Gaia Catalogue and Archive Software Requirements and Specification, GAIA-C9-SD-ESAC-WOM-033, URL [WOM-086], O Mullane, W., Luri, X., Gracia, G., 2014, CU9 Software Development Plan, GAIA-C9-PL-ESAC-WOM-086, URL Software Requirement Specifications 16
17 A Requirements traceability The traceability between this SRS and parent requirements such as the SSS should be given here. A script makerequirementstraceparents.rb is provided in CU1/docs/common/scripts to create this from the higher level requirements optionally specified in the req TeX macro i.e. the PARENT requirement. If the requirements contain tex labels which start with req: then they will become clickable links in the table. If you do not have labels you may use the script addreqlabels.rb which attempts to add labels to all requirements. The following table provides traceability for derived requirements within this requirements specification, and also to level 0 requirements in WOM-033. Parent Requirement Requirements in this document CU9-ADV-T-FUN-020 CU9-WP973-T-DATA-020 CU9-ADV-T-FUN-040 CU9-WP973-T-DATA-020 CU9-ADV-T-FUN-080 CU9-WP974-S-DATA-020, CU9-WP974-S-DATA-040, CU9-WP974-T-DATA-020, CU9-WP974-T-DATA-040, CU9-WP974-S-ALGO-020, CU9-WP974-T-FUNC-020, CU9-WP974-S-ALGO-020, CU9-WP974-T-FUNC-020, CU9-WP974-T-DATA-020, CU9-WP975-T-COOR-020 CU9-ARC-M-PLN-020 CU9-WP971-M-PLAN-020 CU9-CAT-M-PLN-080 CU9-WP973-T-DATA-020 CU9-CIF-T-FUN-080 CU9-WP973-T-DATA-020 CU9-CIF-T-MAN-020 CU9-WP972-T-FUNC-200, CU9-WP973-T-DATA-020, CU9-WP973-T-DATA-020 CU9-CIF-T-MAN-040 CU9-WP973-T-DATA-020 CU9-CIF-T-MAN-140 CU9-WP973-T-DATA-020 CU9-DOC-S-FUN-040 CU9-WP974-T-DOCU-020, CU9-WP974-T-DOCU- 040, CU9-WP974-T-DOCU-060, CU9-WP974-T- DOCU-080, CU9-WP974-T-DOCU-100, CU9-WP975- T-DOCU-020 CU9-DOC-S-PLN-060 CU9-WP975-T-DOCU-020 CU9-ITG-T-FUNC-200 CU9-WP973-T-DATA-020?? CU9-WP975-T-DOCU-020 Software Requirement Specifications 17
How To Set Up A Rov-Dfd (Rov Zero Point) Du)
DU640 Radial Velocity Zero-Point Software Requirement Specifications prepared by: approved by: reference: issue: 4 revision: 1 date: 28-03-2008 status: Issued G. Jasniewicz, F. Crifo, D. Hestroffer, A.
on the establishment of a Brazilian Science Data Center (BSDC) General Guidelines
on the establishment of a Brazilian Science Data Center (BSDC) General Guidelines 1 Introduction Since the entrance of Brazil in ICRANet a variety of projects have been started to be developed a) in the
The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory
The Virtual Observatory: What is it and how can it help me? Enrique Solano LAEFF / INTA Spanish Virtual Observatory Astronomy in the XXI century The Internet revolution (the dot com boom ) has transformed
Exploring Gaia data with TOPCAT and the Virtual Observatory
Exploring Gaia data with TOPCAT and the Virtual Observatory Mark Taylor (University of Bristol) Gaia and the Unseen Brown Dwarf Question GREAT-ESF Workshop Torino University 26 March 2014 $Id: tcvo.tex,v
Software Project Management Plan
Sciamachy Data Centre (NL-SCIA-DC) Software Project Management Plan Version 1.1 (NL-SCIA-DC-SPMP-1.1) 3730 AE, De Bilt page 2 Abstract This Software Project Management Plan (SPMP) describes the planning,
UniGR Workshop: Big Data «The challenge of visualizing big data»
Dept. ISC Informatics, Systems & Collaboration UniGR Workshop: Big Data «The challenge of visualizing big data» Dr Ir Benoît Otjacques Deputy Scientific Director ISC The Future is Data-based Can we help?
CHAPTER 20 TESING WEB APPLICATIONS. Overview
CHAPTER 20 TESING WEB APPLICATIONS Overview The chapter describes the Web testing. Web testing is a collection of activities whose purpose is to uncover errors in WebApp content, function, usability, navigability,
The Scientific Data Mining Process
Chapter 4 The Scientific Data Mining Process When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean neither more nor less. Lewis Carroll [87, p. 214] In
Open source framework for data-flow visual analytic tools for large databases
Open source framework for data-flow visual analytic tools for large databases D5.6 v1.0 WP5 Visual Analytics: D5.6 Open source framework for data flow visual analytic tools for large databases Dissemination
Data Validation and Data Management Solutions
FRONTIER TECHNOLOGY, INC. Advanced Technology for Superior Solutions. and Solutions Abstract Within the performance evaluation and calibration communities, test programs are driven by requirements, test
CRITEO INTERNSHIP PROGRAM 2015/2016
CRITEO INTERNSHIP PROGRAM 2015/2016 A. List of topics PLATFORM Topic 1: Build an API and a web interface on top of it to manage the back-end of our third party demand component. Challenge(s): Working with
Test Automation Process
A white Success The performance testing helped the client identify and resolve performance bottlenecks which otherwise crippled the business. The ability to support 500 concurrent users Test Automation
STATEMENT OF WORK. NETL Cooperative Agreement DE-FC26-02NT41476
STATEMENT OF WORK NETL Cooperative Agreement DE-FC26-02NT41476 Database and Analytical Tool for the Management of Data Derived from U. S. DOE (NETL) Funded Fine Particulate (PM 2.5 ) Research PROJECT SCOPE
The PACS Software System. (A high level overview) Prepared by : E. Wieprecht, J.Schreiber, U.Klaas November,5 2007 Issue 1.
The PACS Software System (A high level overview) Prepared by : E. Wieprecht, J.Schreiber, U.Klaas November,5 2007 Issue 1.0 PICC-ME-DS-003 1. Introduction The PCSS, the PACS ICC Software System, is the
ASKAP Science Data Archive: Users and Requirements CSIRO ASTRONOMY AND SPACE SCIENCE (CASS)
ASKAP Science Data Archive: Users and Requirements CSIRO ASTRONOMY AND SPACE SCIENCE (CASS) Jessica Chapman, Data Workshop March 2013 ASKAP Science Data Archive Talk outline Data flow in brief Some radio
Concept and Project Objectives
3.1 Publishable summary Concept and Project Objectives Proactive and dynamic QoS management, network intrusion detection and early detection of network congestion problems among other applications in the
How To Understand And Understand The Science Of Astronomy
Introduction to the VO [email protected] ESAVO ESA/ESAC Madrid, Spain The way Astronomy works Telescopes (ground- and space-based, covering the full electromagnetic spectrum) Observatories Instruments
How to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning
How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume
Organization of VizieR's Catalogs Archival
Organization of VizieR's Catalogs Archival Organization of VizieR's Catalogs Archival Table of Contents Foreword...2 Environment applied to VizieR archives...3 The archive... 3 The producer...3 The user...3
Lecture 32 Big Data. 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop
Lecture 32 Big Data 1. Big Data problem 2. Why the excitement about big data 3. What is MapReduce 4. What is Hadoop 5. Get started with Hadoop 1 2 Big Data Problems Data explosion Data from users on social
Copyrighted www.eh1infotech.com +919780265007, 0172-5098107 Address :- EH1-Infotech, SCF 69, Top Floor, Phase 3B-2, Sector 60, Mohali (Chandigarh),
Content of 6 Months Software Testing Training at EH1-Infotech Module 1: Introduction to Software Testing Basics of S/W testing Module 2: SQA Basics Testing introduction and terminology Verification and
Solution White Paper Connect Hadoop to the Enterprise
Solution White Paper Connect Hadoop to the Enterprise Streamline workflow automation with BMC Control-M Application Integrator Table of Contents 1 EXECUTIVE SUMMARY 2 INTRODUCTION THE UNDERLYING CONCEPT
NASA s Big Data Challenges in Climate Science
NASA s Big Data Challenges in Climate Science Tsengdar Lee, Ph.D. High-end Computing Program Manager NASA Headquarters Presented at IEEE Big Data 2014 Workshop October 29, 2014 1 2 7-km GEOS-5 Nature Run
Overview of the involvement of local Research. Organisations, Enterprises, Universities in. national and international projects on Earth
Overview of the involvement of local Research Organisations, Enterprises, Universities in national and international projects on Earth Observation applications and services. ( Earth Observation, Satellite
3SL. Requirements Definition and Management Using Cradle
3SL Requirements Definition and Management Using Cradle November 2014 1 1 Introduction This white paper describes Requirements Definition and Management activities for system/product development and modification
DATA ITEM DESCRIPTION
DATA ITEM DESCRIPTION Form Approved OMB NO.0704-0188 Public reporting burden for collection of this information is estimated to average 110 hours per response, including the time for reviewing instructions,
The Project Management Plan will be used to guide, communicate and coordinate project efforts.
F.1 General Implementation Contractor Deliverables include critical system planning and development components. Sufficient deliverables have been identified at key steps in the project to guide the project
MEDICAL DATA MINING. Timothy Hays, PhD. Health IT Strategy Executive Dynamics Research Corporation (DRC) December 13, 2012
MEDICAL DATA MINING Timothy Hays, PhD Health IT Strategy Executive Dynamics Research Corporation (DRC) December 13, 2012 2 Healthcare in America Is a VERY Large Domain with Enormous Opportunities for Data
Not Relational Models For The Management of Large Amount of Astronomical Data. Bruno Martino (IASI/CNR), Memmo Federici (IAPS/INAF)
Not Relational Models For The Management of Large Amount of Astronomical Data Bruno Martino (IASI/CNR), Memmo Federici (IAPS/INAF) What is a DBMS A Data Base Management System is a software infrastructure
Luncheon Webinar Series May 13, 2013
Luncheon Webinar Series May 13, 2013 InfoSphere DataStage is Big Data Integration Sponsored By: Presented by : Tony Curcio, InfoSphere Product Management 0 InfoSphere DataStage is Big Data Integration
Digital Collections as Big Data. Leslie Johnston, Library of Congress Digital Preservation 2012
Digital Collections as Big Data Leslie Johnston, Library of Congress Digital Preservation 2012 Data is not just generated by satellites, identified during experiments, or collected during surveys. Datasets
Reference Architecture, Requirements, Gaps, Roles
Reference Architecture, Requirements, Gaps, Roles The contents of this document are an excerpt from the brainstorming document M0014. The purpose is to show how a detailed Big Data Reference Architecture
Hadoop and Map-Reduce. Swati Gore
Hadoop and Map-Reduce Swati Gore Contents Why Hadoop? Hadoop Overview Hadoop Architecture Working Description Fault Tolerance Limitations Why Map-Reduce not MPI Distributed sort Why Hadoop? Existing Data
Analysis of Web Archives. Vinay Goel Senior Data Engineer
Analysis of Web Archives Vinay Goel Senior Data Engineer Internet Archive Established in 1996 501(c)(3) non profit organization 20+ PB (compressed) of publicly accessible archival material Technology partner
Data Mining Challenges and Opportunities in Astronomy
Data Mining Challenges and Opportunities in Astronomy S. G. Djorgovski (Caltech) With special thanks to R. Brunner, A. Szalay, A. Mahabal, et al. The Punchline: Astronomy has become an immensely datarich
Proposal template (technical annex) Health, demographic change and wellbeing Two-stage Research and Innovation actions Innovation actions
Proposal template (technical annex) Health, demographic change and wellbeing Two-stage Research and Innovation actions Innovation actions Note: This is for information only. The definitive templates to
MAST: The Mikulski Archive for Space Telescopes
MAST: The Mikulski Archive for Space Telescopes Richard L. White Space Telescope Science Institute 2015 April 1, NRC Space Science Week/CBPSS A model for open access The NASA astrophysics data archives
Rotorcraft Health Management System (RHMS)
AIAC-11 Eleventh Australian International Aerospace Congress Rotorcraft Health Management System (RHMS) Robab Safa-Bakhsh 1, Dmitry Cherkassky 2 1 The Boeing Company, Phantom Works Philadelphia Center
Software Quality Assurance Plan
For Database Applications Document ID: Version: 2.1a Planning Installation & Acceptance Integration & Test Requirements Definition Design Development 1 / 54 Copyright 2000-2006 Digital Publications LLC.
Functional Requirements for Digital Asset Management Project version 3.0 11/30/2006
/30/2006 2 3 4 5 6 7 8 9 0 2 3 4 5 6 7 8 9 20 2 22 23 24 25 26 27 28 29 30 3 32 33 34 35 36 37 38 39 = required; 2 = optional; 3 = not required functional requirements Discovery tools available to end-users:
for Big Data and Analytics
Organizational Models for Big Data and Analytics Robert L. Grossman Kevin P. Siegel Abstract: In this article, we introduce a framework for determining how analytics capability should be distributed within
IT Service Level Management 2.1 User s Guide SAS
IT Service Level Management 2.1 User s Guide SAS The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2006. SAS IT Service Level Management 2.1: User s Guide. Cary, NC:
Software Requirement Specifications V1.0
V1.0 1. Introduction 1.1 Purpose... 1 1.2 Document Conventions... 1 1.3 Intended Audience and Reading Suggestions... 1 1.4 Project Scope... 1 1.5 References... 1 2. Overall 2.1 Product Perspective... 2
Project Lifecycle Management (PLM)
Project Lifecycle Management (PLM) Process or Tool? Why PLM? Project Definition Project Management NEW REQUEST/ INITIATIVES SUPPORT (Quick fixes) PROJECT (Start Finish) ONGOING WORK (Continuous) ENHANCEMENTS
QUALITY CONTROL OF THE IUE FINAL ARCHIVE
QUALITY CONTROL OF THE IUE FINAL ARCHIVE N. Loiseau 1, E. Solano 1,M.Barylak 2 1 INSA/ESA IUE Observatory, Apdo. 50727, Villafranca del Castillo, 28080 Madrid (Spain). 2 ESA IUE Observatory, Apdo. 50727,
Work Process Management
GE Intelligent Platforms Work Process Management Achieving Operational Excellence through Consistent and Repeatable Plant Operations With Work Process Management, organizations can drive the right actions
IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems
IBM InfoSphere Guardium Data Activity Monitor for Hadoop-based systems Proactively address regulatory compliance requirements and protect sensitive data in real time Highlights Monitor and audit data activity
IBM Solution Framework for Lifecycle Management of Research Data. 2008 IBM Corporation
IBM Solution Framework for Lifecycle Management of Research Data Aspects of Lifecycle Management Research Utilization of research paper Usage history Metadata enrichment Usage Pattern / Citation Collaboration
4.13 System Testing. Section 4 Bidder's Products, Methodology, and Approach to the Project. 4.14 System Training
Section 4 Bidder's Products, Methodology, and Approach to the Project 4.1 FACTS II Requirements Summary 4.11 Interfaces 4.2 Functional Requirements 4.12 System Development 4.3 Technical Requirements 4.13
Visual Analysis for Extremely Large Scale Scientific Computing
Visual Analysis for Extremely Large Scale Scientific Computing D2.5 Big data interfaces for data acquisition Deliverable Information Grant Agreement no 619439 Web Site Related WP & Task: WP2, T2.4 http://www.velassco.eu/
D5.3.2b Automatic Rigorous Testing Components
ICT Seventh Framework Programme (ICT FP7) Grant Agreement No: 318497 Data Intensive Techniques to Boost the Real Time Performance of Global Agricultural Data Infrastructures D5.3.2b Automatic Rigorous
NIRCal Software data sheet
NIRCal Software data sheet NIRCal is an optional software package for NIRFlex N-500 and NIRMaster, that allows the development of qualitative and quantitative calibrations. It offers numerous chemometric
Multidimensional Data in the Virtual Observatory
IX Reunión Científica de la SEA Madrid- 15/09/2010 Red Temática SVO Multidimensional Data in the Virtual Observatory José Enrique Ruiz Grupo AMIGA Instituto de Astrofísica de Andalucía CSIC Contextual
How To Write An Inspire Directive
INSPIRE Infrastructure for Spatial Information in Europe Detailed definitions on the INSPIRE Network Services Title Detailed definitions on the INSPIRE Network Services Creator Date 2005-07-22 Subject
Data Management, Analysis Tools, and Analysis Mechanics
Chapter 2 Data Management, Analysis Tools, and Analysis Mechanics This chapter explores different tools and techniques for handling data for research purposes. This chapter assumes that a research problem
Response to Invitation to Tender: requirements and feasibility study on preservation of e-prints
Response to Invitation to Tender: requirements and feasibility study on preservation of e-prints A proposal to the JISC from the Arts and Humanities Data Service and the University of Nottingham, Project
Working version of the IT Platform and ITbased support tools for scenario development
Foresight Security Scenarios Mapping Research to a Comprehensive Approach to Exogenous EU Roles Working version of the IT Platform and ITbased support tools for scenario development Deliverable 2.2 BOC
Data Lab System Architecture
Data Lab System Architecture Data Lab Context Data Lab Architecture Astronomer s Desktop Web Page Cmdline Tools Legacy Apps User Code User Mgmt Data Lab Ops Monitoring Presentation Layer Authentication
EUR-Lex 2012 Data Extraction using Web Services
DOCUMENT HISTORY DOCUMENT HISTORY Version Release Date Description 0.01 24/01/2013 Initial draft 0.02 01/02/2013 Review 1.00 07/08/2013 Version 1.00 -v1.00.doc Page 2 of 17 TABLE OF CONTENTS 1 Introduction...
BMC Control-M Workload Automation
solution overview BMC Control-M Workload Automation Accelerating Delivery of Digital Services with Workload Management Table of Contents 1 SUMMARY 2 FASTER AND CHEAPER DYNAMIC WORKLOAD MANAGEMENT Minimize
Service Road Map for ANDS Core Infrastructure and Applications Programs
Service Road Map for ANDS Core and Applications Programs Version 1.0 public exposure draft 31-March 2010 Document Target Audience This is a high level reference guide designed to communicate to ANDS external
What is meant by the term, Lean Software Development? November 2014
What is meant by the term, Lean Software Development? Scope of this Report November 2014 This report provides a definition of Lean Software Development and explains some key characteristics. It explores
Data Refinery with Big Data Aspects
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 655-662 International Research Publications House http://www. irphouse.com /ijict.htm Data
USGS EOS SYSTEMS ENGINEERING MANAGEMENT PLAN (SEMP)
Department of the Interior U.S. Geological Survey USGS EOS SYSTEMS ENGINEERING MANAGEMENT PLAN (SEMP) September 2013 Executive Summary This Systems Engineering Management Plan (SEMP) outlines the engineering
Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop
Role of Cloud Computing in Big Data Analytics Using MapReduce Component of Hadoop Kanchan A. Khedikar Department of Computer Science & Engineering Walchand Institute of Technoloy, Solapur, Maharashtra,
How To Improve The Performance Of Anatm
EXPLORATORY RESEARCH IN ATM David Bowen Chief ATM 4 th May 2015 1 ATM Research in Europe HORIZON Transport Challenges smart, green and integrated transport FlightPath 2050 five challenges to aviation beyond
Conquering the Astronomical Data Flood through Machine
Conquering the Astronomical Data Flood through Machine Learning and Citizen Science Kirk Borne George Mason University School of Physics, Astronomy, & Computational Sciences http://spacs.gmu.edu/ The Problem:
Deliverable D1.1. Building data bridges between biological and medical infrastructures in Europe. Grant agreement no.: 284209
Deliverable D1.1 Project Title: Building data bridges between biological and medical infrastructures in Europe Project Acronym: BioMedBridges Grant agreement no.: 284209 Research Infrastructures, FP7 Capacities
Sentaurus Workbench Comprehensive Framework Environment
Data Sheet Comprehensive Framework Environment Overview is a complete graphical environment for creating, managing, executing, and analyzing TCAD simulations. Its intuitive graphical user interface allows
Certification Report
Certification Report Symantec Network Access Control Version 12.1.2 Issued by: Communications Security Establishment Canada Certification Body Canadian Common Criteria Evaluation and Certification Scheme
How To Use The Correlog With The Cpl Powerpoint Powerpoint Cpl.Org Powerpoint.Org (Powerpoint) Powerpoint (Powerplst) And Powerpoint 2 (Powerstation) (Powerpoints) (Operations
orrelog SQL Table Monitor Adapter Users Manual http://www.correlog.com mailto:[email protected] CorreLog, SQL Table Monitor Users Manual Copyright 2008-2015, CorreLog, Inc. All rights reserved. No part
Deliverable 1.1 Description of Quality Management and Risk Processing Responsible partner:
Deliverable 1.1 Description of Quality Management and Risk Processing Responsible partner: BOKU - University of Natural Resources and Life Sciences, Vienna Institute for Transport Studies List of abbreviations
Canadian Astronomy Data Centre. Séverin Gaudet David Schade Canadian Astronomy Data Centre
Canadian Astronomy Data Centre Séverin Gaudet David Schade Canadian Astronomy Data Centre Data Activities in Astronomy Features of the astronomy data landscape Multi-wavelength datasets are increasingly
Software Requirements Specification for POS_Connect Page 1. Software Requirements Specification. for. POS_Connect. Version 1.0
Page 1 Software Requirements Specification for POS_Connect Version 1.0 1/9/2013 Page 2 Table of Contents Table of Contents Revision History 1. Introduction 1.1 Purpose 1.2 Document Conventions 1.3 Intended
Meta-Model specification V2 D602.012
PROPRIETARY RIGHTS STATEMENT THIS DOCUMENT CONTAINS INFORMATION, WHICH IS PROPRIETARY TO THE CRYSTAL CONSORTIUM. NEITHER THIS DOCUMENT NOR THE INFORMATION CONTAINED HEREIN SHALL BE USED, DUPLICATED OR
European Data Infrastructure - EUDAT Data Services & Tools
European Data Infrastructure - EUDAT Data Services & Tools Dr. Ing. Morris Riedel Research Group Leader, Juelich Supercomputing Centre Adjunct Associated Professor, University of iceland BDEC2015, 2015-01-28
