Notes on the architecture, design, and data processes in openfda. processing, data harmonization, and website technologies.
|
|
|
- Claribel Lambert
- 10 years ago
- Views:
Transcription
1 Notes on the architecture, design, and data processes in openfda OpenFDA uses cutting edge technologies and is a pilot for how FDA can develop and deploy novel applications in the public cloud securely and efficiently in the future. In these notes, we provide a high- level plain language description of the logical architecture, data sources, data processing, data harmonization, and website technologies. Logical Architecture The architecture and technology were chosen to make openfda scalable; quickly responsive; transferable to new technologies as they mature; easily accessible by application developers, researchers, and the general public; and transparent. The data are on the cloud that has been approved for federal use (Amazon Web Services East). 1 The figure shows openfda is built using modern, open standards and leveraging open source and cloud technologies. The system uses modules that work together or in sequence and can be transparently replaced as technology improves. The target consumers are other applications that use openfda; applications can query openfda in the form of Uniform Resource Locators (URLs). Of course, researchers and the general public can create and run queries, as well.
2 Figure. openfda logical architecture. OpenFDA is hosted in one of the secure cloud environments approved for federal use (Amazon Web Services US- East) 1 and uses Amazon Elastic Compute Cloud (EC2). 2 All of the content and data are encrypted to be read- only by the public. The platform is transportable to different cloud environments since it is deployed within a Docker container, 3 a complete file system that contains everything it needs to run the software. In addition, the data is portable to other software. Node.js is used within Docker as the open source, cross- platform runtime environment for server- side and networking applications. Node.js, using JavaScript, enables the
3 creation of highly scalable fast webservers, has a simple and elegant programmer interface, and has a large library of open source modules. 4 Git, a source control system, is used during the development and modification of any of the openfda content (data processing steps, analysis code, encryption protocols, and settings for the other modules). All of the code is copied into the independent GitHub Source Code Repository for openfda. 5 GitHub is a popular repository for open source code. The public data are regularly drawn from public FDA files. Luigi Python (the open source version created by Spotify to handle massive volumes of digitized music) 6 and Elasticsearch 7 are both used to prepare and load the data. Python manages the workflow of data and software modules. Elasticsearch is a fast, scalable, full text JSON (see next sentence) database built upon the open source Lucene project that has an easy to use RESTful API (see next paragraph). The final form of each dataset is JavaScript Object Notation (JSON), an open standard data format that is independent of programming language and supported by many programming languages, including Python, JavaScript, and R. 8-9 All of the openfda content is stored in Amazon Simple Storage Service (S3). 10 The Application Programming Interfaces (APIs) contain the automation for accessing and using the data. 11 They are Representational State Transfer (REST) type, to take advantage of modularity of the code, the ability to cache queries and responses, the ability to layer services, and scalability. 12
4 Queries are written in Lucene query syntax 13. All queries begin with and go on to specify the database API name and then the search or count specifications. The path for queries is represented with solid lines in the figure. API Umbrella 14 is an open source tool that keeps track of query statistics, and administers the free user- specific API keys that allow heavy use. API Umbrealla receives the incoming query and fetches the results. If the query is a duplicate, the response is found in API Umbrella s historical query cache. Otherwise, Node.js and Elasticsearch are used to search and analyze the specified JSON dataset in cloud storage using real- time distributed methods. The load balancer balances all the jobs. 15 OpenFDA has been able to handle over 100 requests per second across millions of records, which are all built in an open source environment and can be adopted for other public health big data challenges. StackExchange hosts questions, answers, and discussions related to openfda. 16 Data Source Details Four main data sources are currently available in openfda: adverse event reports for drugs and devices, recall reports for all products, and drug labeling. Drug adverse event reports includes approximately five million publicly available drug adverse event and medication error reports, of which almost 1.2 million reports are from Since 2004, FDA has published online quarterly drug adverse reaction reports from the FDA Adverse Event Reporting System (FAERS) The files listed on this webpage
5 contain raw data extracts for the indicated time ranges, are not cumulative, and require reconstruction into a relational database. The historical files are in Standardized General Markup (SGM) format and the current ones are in Extensible Markup Language (XML). Part of processing the files requires preserving only the most recent record of a particular reported incident. LevelDB software 20 is used to do the filtering; LevelDB (with Snappy 21 compression, a fast data compression and decompression library) is compatible with Node.js, C++, and Python. Safety report data contain semi- structured information about recalls, market withdrawals and safety alerts of FDA- regulated products archived in the Recall Enterprise System (RES) 22 since Recalling defective or dangerous products, by removing them from the market or correcting the problem, is one of the ways of protecting the public. 23 FDA provides various ways to access the recalls data, including an RSS feed, a Flickr stream, a search interface, weekly downloadable XML files (the ones used by openfda), and weekly downloadable CSV files. The openfda API provides a new option for easy and fast access. 24 Labeling data are composed of the updated Structured Product Labeling (SPL) data for 68,000 currently approved drugs. The labeling contains information necessary to inform healthcare providers about the safe and effective use of the drug for its approved use(s). 25 The FDA SPL staff make the updated SPL files in XML format available to the openfda staff in parallel to sending them to the National Library of Medicine, where they are available to the public. 26 Medical device adverse event reports include over 4 million reports of serious injuries, deaths, and device malfunctions, from 1991 to the present. In recent years, the
6 Manufacturer and User Facility Device Experience (MAUDE) has been receiving several hundred thousand reports per year. The source is the public downloadable version of MAUDE, which is in multiple zip files composed of pipe- delimited text files that require reconstruction into a relational database. 27 Processing of the Source Data The public data sources are converted to flat JSON files. Both of the adverse event report source databases are in relational flat tables that are processed in openfda to each form a large flat file with long records. The labelling source data are in XML format, with varying levels of hierarchy used for different records; the new flat table of records had to be designed after exploration of the extent of hierarchy in different sections of the individual records. The recall reports source files are also in XML format that the openfda process converts to a flat file. Relational databases were popular for the last several decades because storage was relatively expensive. However, the complexity of relational databases raises the risk of inaccurate analysis strategies. Software designed for big data has made it feasible to quickly execute search and analysis commands on very large flat files. Harmonization Process
7 To address issues related to differences in the structure of the three drug databases (adverse event reports, recalls, and labeling), openfda features harmonization on drug identifiers (generic name, brand name, etc), to make it easier to both search for and understand the drug products returned by API queries. The additional harmonization, or openfda, fields are created from the following four databases: NDC Directory. 28 OpenFDA uses application number, brand name, dosage form, generic name, manufacturer name, original packager indicator, NDC, type of drug product, route of administration, and active ingredients. SPL- Pharmacological Class Mappings. 29 OpenFDA uses all four types of pharmacologic class: mechanism of action, chemical structure, physiological effect, and approved indication class. SPL- RxNorm Mappings. 30 Synonym drug names are grouped into RxNorm concepts, and connected NDC, other drug names, ingredients, manufacturer, and pill attributes. OpenFDA uses the RxNorm Concept Unique Identifier that incorporates the drug concept, ingredients, strength, and dosage forms. Substance Registration System. 31 OpenFDA uses the Unique Ingredient Identifier. The harmonized openfda fields are then added to any record in the recalls, drug adverse event reports, and SPL flat files that match a field in the harmonization database. For recalls of drugs, the names of drugs and manufacturers, as well as NDC or UPC codes, were generally provided in free- text fields with other text. Regular expression- based extractors were built to identify this information for harmonization.
8 Website Technology The design of the open.fda.gov website draws on best practices in agile development, intuitive user experience, and data visualization. Its aim is to provide a unified, consistent presentation for all datasets to facilitate ease of learning both about the APIs and the datasets themselves. The site is organized thematically around broad data types (drugs, devices, and foods), rather than around datasets or FDA s internal organization. This scheme is deliberate, in order to more closely align with website users' mental models of FDA data. The website is characterized by a combination of interactive programmer- oriented example queries, visualizations, and examples that explain the nature of the data and how to use the query syntax and JSON results. Unlike most API websites, it recognizes that non- technical members of the public have an interest in these data, and employs the design principle of progressive disclosure to provide multiple layers of information depth. Interactive data visualizations and examples, with plain language annotation, are oriented towards members of the public but usable by both programmer and non- programmer users of the website. Plain, straightforward language is used throughout, including in field- by- field documentation of the JSON results for each API endpoint. The website was designed in an iterative fashion, incorporating feedback from both internal and external stakeholders. Since its launch, changes have been made to clarify documentation in response to feedback from the open source community.
9 Publicly available data provided through openfda are in the public domain with a CC0 Public Domain Dedication 32. The website was built with open source software: Jekyll for overall structure, 33 Bootstrap for responsive design including mobile compatibility, 34 Grunt for optimizing JavaScript, 35 and LESS/CSS 36 and D3 37 and C3 38 for data visualization. Conclusion OpenFDA brings a new model of big data search and analytics across disparate and complex sources by simplifying dataset structures and using modular open source technology. By: Taha A. Kass- Hout, MD, Roselie A. Bright, ScD, Adam Baker
10 References 1. FedRAMP Compliant Systems. FedRAMP, US General Services Administration. systems/. 2. Amazon EC2. Amazon er=bing&sc_medium=ec2_b&sc_content=ec2_bmm&sc_detail=+amazon%20+ec2&sc_c ategory=ec2&sc_segment= &sc_matchtype=p&sc_country=us&s_kwcid=al! 4422!10! ! &ef_id=VLAuEwAABfo1- Hwu: :s. 3. Build, Ship, Run. Docker, Inc Node.js. Node.js Foundation FDA/openFDA. GitHub, Inc Accessed in July Luigi. Python Software Foundation Accessed in July Elasticsearch: Search & Analyze Data in Real Time. Elasticsearch Introducing JSON. JSON The R Project for Statistical Computing. The R Foundation. project.org/.
11 10. Amazon Simple Storage Service Developer Guide. AWS Documentation Accessed in July Orenstein D. Application Programming Interface. Computerworld. January 10, development/application- programming- interface.html. 12. Kay R. Representational State Transfer (REST). Computer World. August 6, state- transfer- - rest-.html. 13. Lucene. Apache Software Foundation. June 21, API Umbrella Elastic Load Balancing. Amazon Web Services. Aws.amazon.com/elasticloadbalancing/. 16. StackExchange Reports Received and Reports Entered into FAERS by Year. Food and Drug Administration. August 6, versedrugeffects/ucm htm in July The Adverse Event Reporting System (AERS): Older Quarterly Data Files. Food and Drug Administration. August 15, 2013.
12 versedrugeffects/ucm htm. 19. FDA Adverse Event Reporting System (FAERS): Latest Quarterly Data Files. Food and Drug Administration. June 16, versedrugeffects/ucm htm LevelDB. LevelDB Snappy: a fast compressor/decompressor. Google Project Hosting Enforcement Reports. Food and Drug Administration. July 15, Accessed in July FDA 101: Product Recalls From First Alert to Effectiveness Checks. Food and Drug Administration. Updated April 29, Accessed in July Kass- Hout T. OpenFDA provides ready access to recall data. Food and Drug Administration. August 8, provides- ready- access- to- recall- data. 25. Kass- Hout T. Providing easy public access to prescription drug, over- the- counter drug, and biological product labeling. Food and Drug Administration. August 18, product- labeling/.
13 26. DailyMed. National Library of Medicine, US National Institutes of Health Manufacturer and User Facility Device Experience Database (MAUDE). Food and Drug Administration. May 7, ments/reportingadverseevents/ucm htm. 28. National Drug Code Database Background Information. Food and Drug Administration. June 14, Accessed in July SPL Resources: Download all mapping files. National Library of Medicine, US National Institutes of Health. dailymed.nlm.nih.gov/dailymed/spl- resources- all- mapping- files.cfm. 30. RxNorm Overview. National Library of Medicine, US National Institutes of Health. January 5, UNII List Download. Substance Registration System Unique Ingredient Identifier (UNII). National Library of Medicine, US National Institutes of Health. Updated March
14 32. Creative Commons Corp., CCO 1.0 Universal Preston- Werner T. Transform your plain text into static websites and blogs. Jeckyll Jekyllrb.com/. 34. Bootstrap. Bootstrap. Getbootstrap.com/. 35. GRUNT The JavaScript Task Runner. Gruntjs. Gruntjs.com/. 36. Getting started. LESS/CSS. Lesscss.org/#. 37. D3: Data- Driven Documents. D3js. D3js.org/. 38. Tanaka M. C3.js: D3- based reusable chart library. C3js C3js.org/. Accessed in July 2015.
FINAL REPORT. August 26, 2014 HHSF223201310098C. openfda: A Pilot Research Project To Evaluate How Best To Make Datasets Available Via a Web Portal
FINAL REPORT August 26, 2014 HHSF223201310098C openfda: A Pilot Research Project To Evaluate How Best To Make Datasets Available Via a Web Portal Thomas Goetz Iodine Inc. 34 Clyde Street San Francisco,
Sisense. Product Highlights. www.sisense.com
Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze
API Architecture. for the Data Interoperability at OSU initiative
API Architecture for the Data Interoperability at OSU initiative Introduction Principles and Standards OSU s current approach to data interoperability consists of low level access and custom data models
Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL. May 2015
Lambda Architecture for Batch and Real- Time Processing on AWS with Spark Streaming and Spark SQL May 2015 2015, Amazon Web Services, Inc. or its affiliates. All rights reserved. Notices This document
How To Set Up Wiremock In Anhtml.Com On A Testnet On A Linux Server On A Microsoft Powerbook 2.5 (Powerbook) On A Powerbook 1.5 On A Macbook 2 (Powerbooks)
The Journey of Testing with Stubs and Proxies in AWS Lucy Chang [email protected] Abstract Intuit, a leader in small business and accountants software, is a strong AWS(Amazon Web Services) partner
Real-Time Analytics on Large Datasets: Predictive Models for Online Targeted Advertising
Real-Time Analytics on Large Datasets: Predictive Models for Online Targeted Advertising Open Data Partners and AdReady April 2012 1 Executive Summary AdReady is working to develop and deploy sophisticated
OpenText Information Hub (ihub) 3.1 and 3.1.1
OpenText Information Hub (ihub) 3.1 and 3.1.1 OpenText Information Hub (ihub) 3.1.1 meets the growing demand for analytics-powered applications that deliver data and empower employees and customers to
Web project proposal. European e-skills Association
Web project proposal European e-skills Association LUCISMEDIA WEB DESIGN PROPOSAL CONTENTS Lucismedia... 3 Building enterprise social communities... 3 project objective... 4 Project scope... 6 Interface
Interoperable Cloud Storage with the CDMI Standard
Interoperable Cloud Storage with the CDMI Standard Storage and Data Management in a post-filesystem World Mark Carlson, SNIA TC and Oracle Co-Chair, SNIA Cloud Storage TWG and Initiative Author: Mark Carlson,
Team Members: Christopher Copper Philip Eittreim Jeremiah Jekich Andrew Reisdorph. Client: Brian Krzys
Team Members: Christopher Copper Philip Eittreim Jeremiah Jekich Andrew Reisdorph Client: Brian Krzys June 17, 2014 Introduction Newmont Mining is a resource extraction company with a research and development
Introduction to DevOps on AWS
Introduction to DevOps on AWS David Chapman December 2014 Contents Contents Abstract Introduction Agile Evolution to DevOps Infrastructure as Code AWS CloudFormation AWS AMI Continuous Deployment AWS CodeDeploy
IBM Digital Experience. Using Modern Web Development Tools and Technology with IBM Digital Experience
IBM Digital Experience Using Modern Web Development Tools and Technology with IBM Digital Experience Agenda The 2015 web development landscape and IBM Digital Experience Modern web applications and frameworks
Build Your Mobile Strategy Not Just Your Mobile Apps
Mobile Cloud Service Build Your Mobile Strategy Not Just Your Mobile Apps Copyright 2015 Oracle Corporation. All Rights Reserved. What is is it? Oracle Mobile Cloud Service provides everything you need
The Virtualization Practice
The Virtualization Practice White Paper: Managing Applications in Docker Containers Bernd Harzog Analyst Virtualization and Cloud Performance Management October 2014 Abstract Docker has captured the attention
WHITE PAPER Redefining Monitoring for Today s Modern IT Infrastructures
WHITE PAPER Redefining Monitoring for Today s Modern IT Infrastructures Modern technologies in Zenoss Service Dynamics v5 enable IT organizations to scale out monitoring and scale back costs, avoid service
AWS CodePipeline. User Guide API Version 2015-07-09
AWS CodePipeline User Guide AWS CodePipeline: User Guide Copyright 2015 Amazon Web Services, Inc. and/or its affiliates. All rights reserved. Amazon's trademarks and trade dress may not be used in connection
CiteSeer x in the Cloud
Published in the 2nd USENIX Workshop on Hot Topics in Cloud Computing 2010 CiteSeer x in the Cloud Pradeep B. Teregowda Pennsylvania State University C. Lee Giles Pennsylvania State University Bhuvan Urgaonkar
SOA, case Google. Faculty of technology management 07.12.2009 Information Technology Service Oriented Communications CT30A8901.
Faculty of technology management 07.12.2009 Information Technology Service Oriented Communications CT30A8901 SOA, case Google Written by: Sampo Syrjäläinen, 0337918 Jukka Hilvonen, 0337840 1 Contents 1.
Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros
David Moses January 2014 Paper on Cloud Computing I Background on Tools and Technologies in Amazon Web Services (AWS) In this paper I will highlight the technologies from the AWS cloud which enable you
Cymon.io. Open Threat Intelligence. 29 October 2015 Copyright 2015 esentire, Inc. 1
Cymon.io Open Threat Intelligence 29 October 2015 Copyright 2015 esentire, Inc. 1 #> whoami» Roy Firestein» Senior Consultant» Doing Research & Development» Other work include:» docping.me» threatlab.io
Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole
Paper BB-01 Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole ABSTRACT Stephen Overton, Overton Technologies, LLC, Raleigh, NC Business information can be consumed many
Visualize your World. Democratization i of Geographic Data
Visualize your World Democratization i of Geographic Data Session Agenda Google GEO Solutions - More than just a Map Enabling our Government Customers- Examples Summary & Invite to Learn More About Google
Cloud Data Management Interface (CDMI) The Cloud Storage Standard. Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG
Cloud Data Management Interface (CDMI) The Cloud Storage Standard Mark Carlson, SNIA TC and Oracle Chair, SNIA Cloud Storage TWG SNIA Legal Notice The material contained in this tutorial is copyrighted
Collaborative Open Market to Place Objects at your Service
Collaborative Open Market to Place Objects at your Service D6.2.1 Developer SDK First Version D6.2.2 Developer IDE First Version D6.3.1 Cross-platform GUI for end-user Fist Version Project Acronym Project
SavvyDox Publishing Augmenting SharePoint and Office 365 Document Content Management Systems
SavvyDox Publishing Augmenting SharePoint and Office 365 Document Content Management Systems Executive Summary This white paper examines the challenges of obtaining timely review feedback and managing
Hadoop & Spark Using Amazon EMR
Hadoop & Spark Using Amazon EMR Michael Hanisch, AWS Solutions Architecture 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda Why did we build Amazon EMR? What is Amazon EMR?
Best Practices for Sharing Imagery using Amazon Web Services. Peter Becker
Best Practices for Sharing Imagery using Amazon Web Services Peter Becker Objectives Making Imagery Accessible Store massive volumes of imagery on inexpensive cloud storage Use elastic compute for image
Four Reasons Your Technical Team Will Love Acquia Cloud Site Factory
Four Reasons Your Technical Team Will Love Acquia Cloud Site Factory Table of Contents The Journey You ve Accepted.... 3 The Freedom of Open Source and Power of Drupal.... 4 Manage and Deploy Tens, Hundreds,
Open is as Open Does: Lessons from Running a Professional Open Source Company
Open is as Open Does: Lessons from Running a Professional Open Source Company Leon Rozenblit, JD, PhD Founder and CEO at Prometheus Research, LLC email: [email protected] twitter: @leon_rozenblit
Data processing goes big
Test report: Integration Big Data Edition Data processing goes big Dr. Götz Güttich Integration is a powerful set of tools to access, transform, move and synchronize data. With more than 450 connectors,
Copyright 2013 Splunk Inc. Introducing Splunk 6
Copyright 2013 Splunk Inc. Introducing Splunk 6 Safe Harbor Statement During the course of this presentation, we may make forward looking statements regarding future events or the expected performance
Architecture Workshop
TIE-13100 / TIE-13106 Tietotekniikan projektityö / Project Work on Pervasive Systems Architecture Workshop Hadaytullah Marko Leppänen 21.10.2014 Workshop Plan Start Technologies Table (Collaboration) Workshop
Pivotal CRM 6.0. Benefit for your organization : a solution that can support your business needs
Pivotal CRM 6.0 Whatever the trend in market growth, have your customers drive your success with greater proficiency and greater flexibility and lower cost of ownership Benefit for your organization :
Visualizing a Neo4j Graph Database with KeyLines
Visualizing a Neo4j Graph Database with KeyLines Introduction 2! What is a graph database? 2! What is Neo4j? 2! Why visualize Neo4j? 3! Visualization Architecture 4! Benefits of the KeyLines/Neo4j architecture
Teradata Marketing Operations. Reduce Costs and Increase Marketing Efficiency
Teradata Marketing Operations Reduce Costs and Increase Marketing Efficiency Product Insight Brochure What Would You Do If You Knew? TM What would you do if you knew your marketing efforts could be freed
How To Manage A Multi Site In Drupal
http://platform.sh [email protected] MODERNISING DRUPAL MULTI-SITE IMPLEMENTATIONS Drupal multi-site is easily re-architected to run each site in its own containerised environment. It s better and it costs
NIH Commons Overview, Framework & Pilots - Version 1. The NIH Commons
The NIH Commons Summary The Commons is a shared virtual space where scientists can work with the digital objects of biomedical research, i.e. it is a system that will allow investigators to find, manage,
MassTransit vs. FTP Comparison
MassTransit vs. Comparison If you think is an optimal solution for delivering digital files and assets important to the strategic business process, think again. is designed to be a simple utility for remote
SIF 3: A NEW BEGINNING
SIF 3: A NEW BEGINNING The SIF Implementation Specification Defines common data formats and rules of interaction and architecture, and is made up of two parts: SIF Infrastructure Implementation Specification
Five Steps to Integrate SalesForce.com with 3 rd -Party Systems and Avoid Most Common Mistakes
Five Steps to Integrate SalesForce.com with 3 rd -Party Systems and Avoid Most Common Mistakes This white paper will help you learn how to integrate your SalesForce.com data with 3 rd -party on-demand,
Database Management System Choices. Introduction To Database Systems CSE 373 Spring 2013
Database Management System Choices Introduction To Database Systems CSE 373 Spring 2013 Outline Introduction PostgreSQL MySQL Microsoft SQL Server Choosing A DBMS NoSQL Introduction There a lot of options
Apigee Edge API Services Manage, scale, secure, and build APIs and apps
Manage, scale, secure, and build APIs and apps Hex #FC4C02 Hex #54585A Manage, scale, secure, and build APIs and Apps with is designed to unite the best of Internet and enterprise technologies to provide
JAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights. 2013 Copyright Metric insights, Inc.
JAVASCRIPT CHARTING Scaling for the Enterprise with Metric Insights 2013 Copyright Metric insights, Inc. A REVOLUTION IS HAPPENING... 3! Challenges... 3! Borrowing From The Enterprise BI Stack... 4! Visualization
Why Big Data in the Cloud?
Have 40 Why Big Data in the Cloud? Colin White, BI Research January 2014 Sponsored by Treasure Data TABLE OF CONTENTS Introduction The Importance of Big Data The Role of Cloud Computing Using Big Data
Getting started with API testing
Technical white paper Getting started with API testing Test all layers of your composite applications, not just the GUI Table of contents Executive summary... 3 Introduction... 3 Who should read this document?...
Monitis Project Proposals for AUA. September 2014, Yerevan, Armenia
Monitis Project Proposals for AUA September 2014, Yerevan, Armenia Distributed Log Collecting and Analysing Platform Project Specifications Category: Big Data and NoSQL Software Requirements: Apache Hadoop
HYBRID CLOUD SUPPORT FOR LARGE SCALE ANALYTICS AND WEB PROCESSING. Navraj Chohan, Anand Gupta, Chris Bunch, Kowshik Prakasam, and Chandra Krintz
HYBRID CLOUD SUPPORT FOR LARGE SCALE ANALYTICS AND WEB PROCESSING Navraj Chohan, Anand Gupta, Chris Bunch, Kowshik Prakasam, and Chandra Krintz Overview Google App Engine (GAE) GAE Analytics Libraries
Software Development In the Cloud Cloud management and ALM
Software Development In the Cloud Cloud management and ALM First published in Dr. Dobb's Journal, February 2009: http://www.ddj.com/development-tools/212900736 Nick Gulrajani is a Senior Solutions Architect
PROPOSAL To Develop an Enterprise Scale Disease Modeling Web Portal For Ascel Bio Updated March 2015
Enterprise Scale Disease Modeling Web Portal PROPOSAL To Develop an Enterprise Scale Disease Modeling Web Portal For Ascel Bio Updated March 2015 i Last Updated: 5/8/2015 4:13 PM3/5/2015 10:00 AM Enterprise
Test Data Management Concepts
Test Data Management Concepts BIZDATAX IS AN EKOBIT BRAND Executive Summary Test Data Management (TDM), as a part of the quality assurance (QA) process is more than ever in the focus among IT organizations
The Purview Solution Integration With Splunk
The Purview Solution Integration With Splunk Integrating Application Management and Business Analytics With Other IT Management Systems A SOLUTION WHITE PAPER WHITE PAPER Introduction Purview Integration
Middleware- Driven Mobile Applications
Middleware- Driven Mobile Applications A motwin White Paper When Launching New Mobile Services, Middleware Offers the Fastest, Most Flexible Development Path for Sophisticated Apps 1 Executive Summary
WHITEPAPER. Why Dependency Mapping is Critical for the Modern Data Center
WHITEPAPER Why Dependency Mapping is Critical for the Modern Data Center OVERVIEW The last decade has seen a profound shift in the way IT is delivered and consumed by organizations, triggered by new technologies
What is a CMS? Why Node.js? Joel Barna. Professor Mike Gildersleeve IT 704 10/28/14. Content Management Systems: Comparison of Tools
Joel Barna Professor Mike Gildersleeve IT 704 10/28/14 Content Management Systems: Comparison of Tools What is a CMS? A content management system (CMS) is a system that provides a central interface for
MarkLogic Semantics in Healthcare and Life Sciences for LIDER COPYRIGHT 2015 MARKLOGIC CORPORATION. ALL RIGHTS RESERVED.
MarkLogic Semantics in Healthcare and Life Sciences for LIDER The Only Enterprise NoSQL Database Search & Query ACID Transactions High Availability / Disaster Recovery Replication Government-grade Security
I am not a prospect I am a partner
IntelliRx, Transforming Prospect to Partner I am not a prospect I am a partner Scalable Systems Life Science & Healthcare Practices Improve Your DNA Data, Numbers & Analytics Intelli Rx Scalable Systems
InRule. The Premier BRMS for the Microsoft Platform. Benefits THE POWER OF INRULE. Key Capabilities
InRule The Premier BRMS for the Microsoft Platform THE POWER OF INRULE InRule empowers technical and business users to change rules and calculations in applications with less effort, cost, and risk than
MENDIX FOR MOBILE APP DEVELOPMENT WHITE PAPER
MENDIX FOR MOBILE APP DEVELOPMENT WHITE PAPER TABLE OF CONTENTS Market Demand for Enterprise Mobile Mobile App Development Approaches Native Apps Mobile Web Apps Hybrid Apps Mendix Vision for Mobile App
WHITE PAPER. iet ITSM Enables Enhanced Service Management
iet ITSM Enables Enhanced Service Management iet ITSM Enables Enhanced Service Management Need for IT Service Management The focus within the vast majority of large and medium-size companies has shifted
HDFS Cluster Installation Automation for TupleWare
HDFS Cluster Installation Automation for TupleWare Xinyi Lu Department of Computer Science Brown University Providence, RI 02912 [email protected] March 26, 2014 Abstract TupleWare[1] is a C++ Framework
Search and Information Retrieval
Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search
4/25/2016 C. M. Boyd, [email protected] Practical Data Visualization with JavaScript Talk Handout
Practical Data Visualization with JavaScript Talk Handout Use the Workflow Methodology to Compare Options Name Type Data sources End to end Workflow Support Data transformers Data visualizers General Data
branddocs Technology edocument Solutions V.1.0.2013 V.11.0.2013
branddocs Technology V.1.0.2013 V.11.0.2013 edocument Solutions Contents 1.- Branddocs' Development Technology 03 2.- Development Technology Features 04 3.- Technical Architecture 05 4.- Description of
Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture
Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture Apps and data source extensions with APIs Future white label, embed or integrate Power BI Deploy Intelligent
Petroleum Web Applications to Support your Business. David Jacob & Vanessa Ramirez Esri Natural Resources Team
Petroleum Web Applications to Support your Business David Jacob & Vanessa Ramirez Esri Natural Resources Team Agenda Petroleum Web Apps to Support your Business The ArcGIS Location Platform Introduction
Log Analysis: Overall Issues p. 1 Introduction p. 2 IT Budgets and Results: Leveraging OSS Solutions at Little Cost p. 2 Reporting Security
Foreword p. xvii Log Analysis: Overall Issues p. 1 Introduction p. 2 IT Budgets and Results: Leveraging OSS Solutions at Little Cost p. 2 Reporting Security Information to Management p. 5 Example of an
Power Tools for Pivotal Tracker
Power Tools for Pivotal Tracker Pivotal Labs Dezmon Fernandez Victoria Kay Eric Dattore June 16th, 2015 Power Tools for Pivotal Tracker 1 Client Description Pivotal Labs is an agile software development
An enterprise- grade cloud management platform that enables on- demand, self- service IT operating models for Global 2000 enterprises
agility PLATFORM Product Whitepaper An enterprise- grade cloud management platform that enables on- demand, self- service IT operating models for Global 2000 enterprises ServiceMesh 233 Wilshire Blvd,
White Paper Take Control of Datacenter Infrastructure
Take Control of Datacenter Infrastructure Uniting the Governance of a Single System of Record with Powerful Automation Tools Take Control of Datacenter Infrastructure A new breed of infrastructure automation
Cloud Service Brokerage Case Study. Health Insurance Association Launches a Security and Integration Cloud Service Brokerage
Cloud Service Brokerage Case Study Health Insurance Association Launches a Security and Integration Cloud Service Brokerage Cloud Service Brokerage Case Study Health Insurance Association Launches a Security
Business Process Management with @enterprise
Business Process Management with @enterprise March 2014 Groiss Informatics GmbH 1 Introduction Process orientation enables modern organizations to focus on the valueadding core processes and increase
owncloud Architecture Overview
owncloud Architecture Overview owncloud, Inc. 57 Bedford Street, Suite 102 Lexington, MA 02420 United States phone: +1 (877) 394-2030 www.owncloud.com/contact owncloud GmbH Schloßäckerstraße 26a 90443
Proposal for a Vehicle Tracking System (VTS)
Proposal for a Vehicle Tracking System (VTS) 2 Executive Summary Intelligent Instructions is an IT product development and consulting company. At Intelligent Instructions, we focus on the needs of the
Homework: Visual Search and Interaction with NSF and NASA Polar Datasets Due: May 2nd, 2015, 12pm PT
Homework: Visual Search and Interaction with NSF and NASA Polar Datasets Due: May 2nd, 2015, 12pm PT 1. Overview In this assignment you will take your Apache Solr index constructed from Polar data that
Why NoSQL? Your database options in the new non- relational world. 2015 IBM Cloudant 1
Why NoSQL? Your database options in the new non- relational world 2015 IBM Cloudant 1 Table of Contents New types of apps are generating new types of data... 3 A brief history on NoSQL... 3 NoSQL s roots
Exploring and Understanding Adverse Drug Reactions by Integrative Mining of Clinical Records and Biomedical Knowledge
Exploring and Understanding Adverse Drug Reactions by Integrative Mining of Clinical Records and Biomedical Knowledge http://euadr-project.org PEDRO LOPES [email protected] University of Manchester October
Medications Shortages Dashboard
Medications Shortages Dashboard Project Plan Spring 2014 Spectrum Health Contact Jeff McConnell Team Members Alex Lockwood Alex Seling Cameron Keif 1 Table of Contents 1. Project Overview 3 2. Functional
Actuate Business Intelligence and Reporting Tools (BIRT)
Product Datasheet Actuate Business Intelligence and Reporting Tools (BIRT) Eclipse s BIRT project is a flexible, open source, and 100% pure Java reporting tool for building and publishing reports against
Interactive Application Security Testing (IAST)
WHITEPAPER Interactive Application Security Testing (IAST) The World s Fastest Application Security Software Software affects virtually every aspect of an individual s finances, safety, government, communication,
Elastic Private Clouds
White Paper Elastic Private Clouds Agile, Efficient and Under Your Control 1 Introduction Most businesses want to spend less time and money building and managing IT infrastructure to focus resources on
Alfresco Enterprise on AWS: Reference Architecture
Alfresco Enterprise on AWS: Reference Architecture October 2013 (Please consult http://aws.amazon.com/whitepapers/ for the latest version of this paper) Page 1 of 13 Abstract Amazon Web Services (AWS)
TERMS OF REFERENCE. Revamping of GSS Website. GSS Information Technology Directorate Application and Database Section
TERMS OF REFERENCE Revamping of GSS Website GSS Information Technology Directorate Application and Database Section Tel: Accra 0302 682656 Cables: GHANASTATS In case of reply the number and date of this
Oracle Identity Analytics Architecture. An Oracle White Paper July 2010
Oracle Identity Analytics Architecture An Oracle White Paper July 2010 Disclaimer The following is intended to outline our general product direction. It is intended for information purposes only, and may
IMPLEMENTING HEALTHCARE DASHBOARDS FOR OPERATIONAL SUCCESS
idashboards for Healthcare IMPLEMENTING HEALTHCARE DASHBOARDS FOR OPERATIONAL SUCCESS idashboards gives me access to real-time actionable data from all areas of the hospital. Internally, the adoption rate
Tableau Online. Understanding Data Updates
Tableau Online Understanding Data Updates Author: Francois Ajenstat July 2013 p2 Whether your data is in an on-premise database, a database, a data warehouse, a cloud application or an Excel file, you
A Close Look at Drupal 7
smart. uncommon. ideas. A Close Look at Drupal 7 Is it good for your bottom line? {WEB} MEADIGITAL.COM {TWITTER} @MEADIGITAL {BLOG} MEADIGITAL.COM/CLICKOSITY {EMAIL} [email protected] Table of Contents
SOA and Cloud in practice - An Example Case Study
SOA and Cloud in practice - An Example Case Study 2 nd RECOCAPE Event "Emerging Software Technologies: Trends & Challenges Nov. 14 th 2012 ITIDA, Smart Village, Giza, Egypt Agenda What is SOA? What is
THE EIGHT ADVANTAGES OF BEST- OF-BREED APPLICATIONS
WHITE PAPER THE EIGHT ADVANTAGES OF BEST- OF-BREED APPLICATIONS INTRODUCTION Until recently, field service organizations seeking to take advantage of today s mobile environment often found they had to
Towards a common definition and taxonomy of the Internet of Things. Towards a common definition and taxonomy of the Internet of Things...
Towards a common definition and taxonomy of the Internet of Things Contents Towards a common definition and taxonomy of the Internet of Things... 1 Introduction... 2 Common characteristics of Internet
Kaseya Traverse. Kaseya Product Brief. Predictive SLA Management and Monitoring. Kaseya Traverse. Service Containers and Views
Kaseya Product Brief Kaseya Traverse Predictive SLA Management and Monitoring Kaseya Traverse Traverse is a breakthrough cloud and service-level monitoring solution that provides real time visibility into
How To Make Sense Of Data With Altilia
HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to
