Envirofacts API Cory Wagner, US EPA



Similar documents
Please contact Cyber and Technology Training at for registration and pricing information.

Microsoft SQL Server Review

Deploy. Friction-free self-service BI solutions for everyone Scalable analytics on a modern architecture

Using IBM dashdb With IBM Embeddable Reporting Service

Petroleum Web Applications to Support your Business. David Jacob & Vanessa Ramirez Esri Natural Resources Team

Chapter 24: Creating Reports and Extracting Data

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Cloud Cruiser and Microsoft Azure Usage API Integration

There are various ways to find data using the Hennepin County GIS Open Data site:

Cloud Powered Mobile Apps with Azure

How is it helping? PragmatiQa XOData : Overview with an Example. P a g e Doc Version : 1.3

ACEYUS REPORTING. Aceyus Intelligence Executive Summary

Developing Microsoft Azure Solutions

Sisense. Product Highlights.

Developer Tutorial Version 1. 0 February 2015

Developing Microsoft Azure Solutions 20532A; 5 days

Bring your data to life with Microsoft Power BI. Peter Myers Bitwise Solutions

Toxics Release Inventory Data and Tools

CRGroup Whitepaper: Digging through the Data. Reporting Options in Microsoft Dynamics GP

Course MS20467C Designing Self-Service Business Intelligence and Big Data Solutions

4/25/2016 C. M. Boyd, Practical Data Visualization with JavaScript Talk Handout

Designing Self-Service Business Intelligence and Big Data Solutions

Trial version of GADD Dashboards Builder

Hacettepe University Department Of Computer Engineering BBM 471 Database Management Systems Experiment

Developing Fleet and Asset Tracking Solutions with Web Maps

Database and Analytical Tool Development for the Management of Data Derived from US DOE (NETL) Funded Fine Particulate (PM 2.

MS 10978A Introduction to Azure for Developers

Qlik REST Connector Installation and User Guide

Open is as Open Does: Lessons from Running a Professional Open Source Company

Why NoSQL? Your database options in the new non- relational world IBM Cloudant 1

idigbio Technology, Cloud and Appliances

Quick Start Guide to. ArcGISSM. Online

Reporting Services. White Paper. Published: August 2007 Updated: July 2008

The full setup includes the server itself, the server control panel, Firebird Database Server, and three sample applications with source code.

Tagetik Extends Customer Value with SQL Server 2012

How To Extend An Enterprise Bio Solution

Course 10978A Introduction to Azure for Developers

SAAS BASED INVENTORY MANAGEMENT SYSTEM WHITE PAPER

Aspire Systems - Experience in Digital Marketing and Social Media

Introducing the Reimagined Power BI Platform. Jen Underwood, Microsoft

Analyze Your Data. Salesforce, Winter

Building HTML5 and hybrid mobile apps using cloud services. Andrei Glazunov

Using Microsoft Business Intelligence Dashboards and Reports in the Federal Government

Short notes on webpage programming languages

Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros

Developing ASP.NET MVC 4 Web Applications

ORACLE BUSINESS INTELLIGENCE WORKSHOP

Grow your Business with our advanced Call Tracking services

Microsoft Power BI. Nov 21, 2015

Application for Splunk Enterprise

What s new in Access 2013

Design and Functional Specification

GeneSys. Unit Six.Two: Administering a 360 Project. genesysonline.net. psytech.com

IT Exam Training online / Bootcamp

Collaborative Open Market to Place Objects at your Service

Microsoft Azure Data Technologies: An Overview

Wave Analytics Data Integration

Project Zip Code. Version CUNA s Powerful Grassroots Program. User Manual. Copyright 2012, All Rights Reserved

Create Reports Utilizing SQL Server Reporting Services and PI OLEDB. Tutorial

JAVASCRIPT CHARTING. Scaling for the Enterprise with Metric Insights Copyright Metric insights, Inc.

2015 Exchange Network National Meeting Supporting the Business of Environmental Protection

SQL Reporting Services: A Peek at the Power & Potential

70-243: Administering and Deploying System Center 2012 Configuration Manager : Monitoring and Operating a Private Cloud with System Center 2012

API Architecture. for the Data Interoperability at OSU initiative

Team Members: Christopher Copper Philip Eittreim Jeremiah Jekich Andrew Reisdorph. Client: Brian Krzys

Additional information >>> HERE <<<

Qualifying Microsoft Training for Software Assurance Training Vouchers (SATVs)

Developing Business Intelligence and Data Visualization Applications with Web Maps

Microsoft Introduction to Azure for Developers

GSA2013: The Great SharePoint Adventure 2013

Building an ASP.NET MVC Application Using Azure DocumentDB

XML Processing and Web Services. Chapter 17

Open Data, Open Innovation.

McAfee Web Reporter Turning volumes of data into actionable intelligence

Crystal Converter User Guide

Discover the best keywords for your online marketing campaign

Leveraging Cloud-Based Mapping Solutions

ORACLE APPLICATION EXPRESS 5.0

Visualize your World. Democratization i of Geographic Data

Creating a universe on Hive with Hortonworks HDP 2.0

PBI365: Data Analytics and Reporting with Power BI

STATGRAPHICS Online. Statistical Analysis and Data Visualization System. Revised 6/21/2012. Copyright 2012 by StatPoint Technologies, Inc.

Web-Based Spatial Data Infrastructure (SDI) to Implement and Monitor Nonpoint Source (NPS) Pollution Management in the Imperial Valley, California

Technical White Paper The Excel Reporting Solution for Java

WHITE PAPER. Domo Advanced Architecture

Perl & NoSQL Focus on MongoDB. Jean-Marie Gouarné jmgdoc@cpan.org

System Requirements for Microsoft Dynamics NAV 2016

Actian Vortex Express 3.0

OpenText Information Hub (ihub) 3.1 and 3.1.1

SQL Server 2005 Reporting Services (SSRS)

Developing ASP.NET MVC 4 Web Applications Course 20486A; 5 Days, Instructor-led

SUGI 29 Systems Architecture. Paper

Business Intelligence Using SharePoint 2013 and Office365

Step by Step: Big Data Technology. Assoc. Prof. Dr. Thanachart Numnonda Executive Director IMC Institute 25 August 2015

Cortex Release 8.4. Cortex Release Notes. Jocelyn Brittain

Microsoft Training and Certification Guide. Current as of March 16, 2015

Course 20467: Designing Self-Service Business Intelligence and Big Data Solutions

INTRODUCING AZURE SEARCH

Database Linker Tutorial

Transcription:

Envirofacts API Cory Wagner, US EPA 2015 Exchange Network National Meeting Supporting the Business of Environmental Protection September 29 October 1, 2015 Sheraton Philadelphia Society Hill Hotel Philadelphia, Pennsylvania http://www.exchangenetwork.net/en2015

ABSTRACT Envirofacts is the EPAs One-stop shop for environmental data. This presentation will discuss the current implementation of the Envirofacts API along with the future plans to modernize it 2

What is Envirofacts? Warehouse of Integrated Environmental Data Suite of Web Tools Enables users to find and map environmental information Helps analyze data to meet business and information needs Provides data and information for EPA and non-epa (Public) use 3

Envirofacts Facts EF Database Stats 3488 Tables (program office databases, user tables, and applicationspecific tables) 500 Gigs Usage Stats 12 Million Hits Annually against the EF Website 15 Million API Hits Annually against the EF REST API Services supporting external data utilization Users 12 GIS Applications 30+ EPA customers (18 relying on EF for public data access) 10 EF data streams with other (non-epa Agencies) 4 Mobile Applications supported by EF 3 EPA Registries supported by EF

Envirofacts API Envirofacts has a RESTful data service API to all of its internal data holdings This functionality provides a user of Envirofacts with the ability to query any table through the use of a URL The default output is in XML, which can be utilized in other applications, as well as tools such as Microsoft Excel or Access. Output options of JavaScript Object Notation (JSON), CSV, and Excel can be requested in the URL Services are documented in the Environmental Dataset Gateway (EDG) Services will be made available through the E-Enterprise Portal via the Shared Services Catalog and through Developer Central 5

Example of the Envirofacts API Point out that the current API is SQL based and that we 1. Table Name At least one table name is required. Up to three table names can be entered. 2. Column Name This is an optional entry. Enter a column name and value to limit the results. 3. Operator This is an optional entry. This parameter allows users to pass in an operator with the query. Default output is "=" and does not require an operator, but users can enter "<", " >", "!=", "BEGINNING", "CONTAINING", operators as well via the URL. 4. Column Value This is an optional entry. The column value is queried against the database without modification. 5. Rows This is an optional entry. Specify the rows to display by entering 'rows/ <first_row> : <last_row>' Results numbering starts at 0. So to get the first five hundred rows enter rows/0:499 6. Output Format This is an optional entry. The default output is in XML; however, output options of JSON, CSV or Excel can be requested in the URL. 6

EF API Strengths and Limitations Strengths: Simple and well-documented Access multiple data sources without the need of user database connections. Output can be easily used with desktop applications such as Excel or Access, or used in creating Web mash-ups. Limitations: Can only query up to three tables (out of hundreds) Output is limited to 10000 rows of data at a time Joins can be very complex 7

Proposed Improvements to the EF API The row and join restrictions on the EF API limit its utility As part of a modernization effort, OEI is looking at the services with an eye on how to improve them Proposal: Create an Advanced Query Builder interface for Envirofacts The AQB: Provides an easy web-based access to the latest and greatest Envirofacts data dumps. Gives users the ability to bulk download data via CSV format and an API endpoint to access the query results. Allows testing, running, editing, sharing, forking and permalinking to public queries against Envirofact s corpus of data with a simple, syntax-highlighted web user interface. Can be used as a permanently linkable tool for teaching general SQL and relational database concepts. Would be hosted in a cloud environment so that it can be speedy, scalable, and always available. 8

AQB Details The AQB is based on the Stack Exchange Data Explorer (http://data.stackexchange. com/) Open Source tool built on an ASP.NET MVC software Stack in a Windows Azure environment 9

AQB Features Ability to feature select queries and show popular queries Ability for anonymous users to like/star a query Ability to choose between various systems (e.g. BR, RCRAINFO, CERCLIS, ECHO [+ Core FRS and Location Information tables] etc.) Ability to assign plain English titles, descriptions and tags to queries Robust search functionality allowing users to search for titles, descriptions, tables, columns and tags Comprehensive help and tutorial sections incorporated into the UI. Ability to fork queries so that users can re-purpose existing queries. Ability to view results and download results in CSV format Ability to reference results in JSON format through a REST API endpoint Auto detection lat/long fields and displays the data on a map when applicable Incorporates existing business rules to prevent incorrect table joins when possible. 10

AQB Features Auto detection lat/long fields and displays the data on a map when applicable Utilize EPA s Geoplatform 11

Visual SQL Query Builder to allow users to: drag tables create an alias order/sort/aggregate support grouping allow for the addition of criteria allow for quick creation of table joins. AQB Features 12

API Management Tool Also looking at API Management Tools Likely incorporated into first release of the AQB May require registration Provides standardized documentation and more robust analytics This will provide us with a better assessment of user needs 13

AQB Debut? Plan to have a version of the AQB available in 2016 Along with other enhancements to Envirofacts Unstructured data inclusion Natural Language Querying allowing a refined Google bar interface for the basic interface Refined look and feel to incorporate a more Red Fin like facility view as well as data visualizations to replace some of the tabular results And maybe more! 14

15

Envirofacts Contacts General: Bill Muldrow muldrow.bill@epa.gov 202.566.0692 AQB/API: Andrew Yuen yuen.andrew@epa.gov 202.566.1308 Cory Wagner wagner.cory@epa.gov 202.566.1555 www.epa.gov/enviro 16