Semantic and Data Mining Technologies. Simon See, Ph.D.,

Size: px
Start display at page:

Download "Semantic and Data Mining Technologies. Simon See, Ph.D.,"

Transcription

1 Semantic and Data Mining Technologies Simon See, Ph.D.,

2 Introduction to Semantic Web and Business Use Cases 2

3 Lots of Scientific Resources NAR 2009 over 1170 databases

4 Reuse, Recycling, Repurposing Paul writes workflows for identifying biological pathways implicated in resistance to Trypanosomiasis in cattle Paul meets Jo. Jo is investigating Whipworm in mouse. Jo reuses one of Paul s workflow without change. Jo identifies the biological pathways involved in sex dependence in the mouse model, believed to be involved in the ability of mice to expel the parasite. Previously a manual two year study by Jo had failed to do this.

5 Workflows Access to distributed and local resources Automation of data flow Iteration over data sets Interactive Agile software development Experimental protocols Declarative mashups?

6 The Data Playground 1) The Playground Panel with A) Data Objects that feed into BioMoby objects, B) BioMoby Services which can be executed through a right-click option and C) the record button for recording user activity. 2) The Taverna tree of Web Services which can be dragged into the Playground Panel and connected to Data Objects. 3) Semantic Discovery panel that shows compatible services based on input or output. 5) The Data Viewer panel for viewing data that features the Taverna data rendering capabilities. Andrew Gibson 4) The Data Editor panel for inputting Data.

7 Semantic Technology Stack Basic Technologies URI Uniform Resource Identifier RDF Resource Description Framework RDFS RDF Schema OWL Web ontology language SPARQL Protocol and Query Language 7 layercake.svg

8 Extraction, Modeling, Reasoning & Discovery Workflow Transaction Systems Unstructured Content Transform & Edit Load, Query Applications & Tools & Inference Analysis Tools Entity Extraction & Transform RDF/OWL Data Management Ontology Engineering SQL & SPARQL Query RSS, Categorization Other Data Formats Custom Scripting Inferencing BI Analytics Graph Visualization Social Network Analysis Semantic Rules Metadata Registry Security Faceted Search Semantic Indexing SPARQL Endpoint Versioning Data Sources Partner Tools Partner/Oracle Tools

9 Why Organizations use Oracle RDF Database Key Capabilities: Oracle database is the leading commercial database with native RDF/ OWL data management Load / Storage Scalable & secure platform for widerange of semantic applications Readily scales to ultra-large repositories (10s billions of triples) Choice of SQL or SPARQL query Leverages Oracle Partitioning. RAC supported Growing ecosystem of 3 rd party tools partners Query Native RDF graph data store Manages billions of triples Fast batch, bulk and incremental load SQL: SEM_Match SPARQL: via Jena plug-in Ontology assisted query of RDBMS data Forward chaining model Reasoning OWLprime, OWL 2 RL, RDFS User defined rule base 9

10 Architectural Overview User-def. OWLsubsets Incr. DML RDF/S Versioning: Workspaces Rulebases: OWL, RDF/S, user-defined Inferred RDF/OWL data SQLdev. PL/SQL Semantic Indexes RDF/OWL Oracle DB Security: fine-grained QUERY (SQL-based SPARQL) Query RDF/ OntologyOWL data assisted and Query of ontologies Enterprise Data Enterprise (Relational) data 3rd-Party Callouts INFER Tools NLP Info. Extractor: Calais, GATE Bulk-Load LO AD JDBC SQL Interface SQLplus Java API support Java Programs Topbraid Composer Visualizer (cytoscope) Programming Core Interface functionality SPARQL: Jena / Sesame RDF/OWL data and ontologies 3rd Party Tools Joseki / Sesame Reasoners: Pellet SPARQL Endpoints

11 Core Entities in Oracle Database Semantic Store Sem. Network Dictionary and scott herman scott data tables. New entities: models, rule bases, and rules indexes (entailments). OWL and RDFS rule bases are preloaded. SDO_RDF_TRIPLE_S A new Application object type for RDF. Tables Application Table Contains col R A1 of object type sdo_rdf_triple_s to allow loading and accessing RDF R triples, and storing ancillary values. Model A model holds an RDF A2 graph (set of S-P-O triples) and is associated with an sdo_rdf_triple_s column in an application table. R Rulebase A rulebase is a set of rules used for inferencing. An Entailments An entailment stores triples derived via inferencing. Oracle Database Rulebases & Vocabularies RDF / RDFS OWL subset Rule base m Models M1 X1 X2 Entailments M2 Xp Values Mn Triples Semantic Network (MDSYS)

12 Load / Query / Inference Oracle Database Load Bulk load Incremental load Application Tables scott R A1 A2 Rule base m Models M1 X1 X2 Entailments M2 scott Inference using OWL 2 RL, RDFS, etc. using User-defined rules RDF / RDFS OWL subset R herman Query SPARQL (direct or SQL-based) simple data access Rulebases & Vocabularies Xp R Values An Mn Triples Semantic Network (MDSYS)

13 Data Integration Platform in Health Informatics Enterprise Information Consumers (EICs) Access Patient Care Workforce Management Business Intelligence Clinical Analytics Design-Time Metadata Run-Time Metadata Model Virtual Deploy Relate Integration Server Model Physical (Semantic Knowledge base) Access LIS CIS HTB HIS 13

14 Vocabulary Management BACK Intelligent Topic Manager (ITM) Manage Edit Maintain Search Control Import Export Audit ITM Features Web User Interfaces Multilingual Connectors to text mining Collaborative maintenance Import / export Scalability API & Web Services Java J2EE LDAP Ontology model (OWL) FRONT Oracle Text 11g For text based search Terminology Taxonomy (RDF SKOS) Knowledge representation Catalog - Yellow pages (RDF) Oracle RDF 11g For graph based search Oracle Database 11g Ontology repository Semantic Portal

15 National Intelligence: Text Mining 2. Model 3. Structured Data Entity Extraction Engine Feature/ term/relation Extraction, categorization (Insight, Lymba, Calais, Gate) organization founder person Oracle founder 1. Unstructured Data (Text) Larry Ellison Oracle s founder Larry Ellison wins 2010 America s Cup Race Blogs, open source, newsfeed XML/OWL/N3 Ontologies + Rules Knowledge Base (RDF Store) Signal Intelligence, message traffic 10 s of billions of triples SPARQL/SQL Analyst Reports (Content Mgmt) Triple Structure: Subj Pred - Obj 4. Mining & Discovery Explore Browsing, Presentation, Reporting, Visualization, Query Tools (e.g. i2, Centrifuge, Visual Analytics) Analyst

16 Oracle Data Mining Algorithms Problem Classification Regression Anomaly Detection Attribute Importance Association Rules Clustering Feature Extraction Algorithm Applicability Logistic Regression (GLM) Decision Trees Naïve Bayes Support Vector Machine Multiple Regression (GLM) Support Vector Machine Classical statistical technique Popular / Rules / transparency Embedded app Wide / narrow data / text One Class SVM Minimum Description Length (MDL) A1 A2 A3 A4 A5 A6 A7 Apriori Hierarchical K-Means Hierarchical O-Cluster NMF F1 F2 F3 F4 Copyright 2010 Oracle Corporation Classical statistical technique Wide / narrow data / text Lack examples Attribute reduction Identify useful data Reduce data noise Market basket analysis Link analysis Product grouping Text mining Gene and protein analysis Text analysis Feature reduction

17 In-Database Data Mining Oracle Data Mining Traditional Analytics Data Import Results Data Mining Model Scoring Data Preparation and Transformation Savings Data Mining Model Building Model Scoring Data remains in the Database Embedded data preparation Data Prep & Transformation Data Extraction Model Scoring Embedded Data Prep Model Building Data Preparation Secs, Mins or Hours Hours, Days or Weeks Source Data SAS Work Area S A S SAS Process ing S A S Process Output Faster time for Data to Insights Lower TCO Eliminates Data Movement Data Duplication Maintains Security Target S A S Copyright 2010 Oracle Corporation Cutting edge machine learning algorithms inside the SQL kernel of Database SQL Most powerful language for data preparation and Data remains in the Database

18 11g Statistics & SQL Analytics Ranking functions Descriptive Statistics rank, dense_rank, cume_dist, percent_rank, ntile Window Aggregate functions (moving and cumulative) Avg, sum, min, max, count, variance, stddev, first_value, last_value Correlation, linear regression family, covariance Fitting of an ordinary-least-squares regression line to a set of number pairs. Frequently combined with the COVAR_POP, COVAR_SAMP, and CORR functions Enhanced with % statistics: chi squared, phi coefficient, Cramer's V, contingency coefficient, Cohen's kappa Hypothesis Testing Linear regression Pearson s correlation coefficients, Spearman's and Kendall's (both nonparametric). Cross Tabs Sum, avg, min, max, variance, stddev, count, ratio_to_report Statistical Aggregates Direct inter-row reference using offsets DBMS_STAT_FUNCS: summarizes numerical columns of a table and returns count, min, max, range, mean, median, stats_mode, variance, standard deviation, quantile values, +/- n sigma values, top/bottom 5 values Correlations Reporting Aggregate functions LAG/LEAD functions Statistics Student t-test, F-test, Binomial test, Wilcoxon Signed Ranks test, Chi-square, Mann Whitney test, Kolmogorov-Smirnov test, One-way ANOVA Distribution Fitting Kolmogorov-Smirnov Test, Anderson-Darling Test, Chi-Squared Test, Normal, Uniform, Weibull, Exponential Note: Statistics and SQL Analytics are included in Oracle Database Standard Edition Copyright 2010 Oracle Corporation

19 Integration with Oracle BI EE Copyright 2010 Oracle Corporation ODM provides likelihood of expense reporting fraud.and other important questions.

20 Copyright 2010 Oracle Corporation

21 Oracle s Partners for Semantic Technologies Integrated Tools and Solution Providers: Ontology Engineering Query Tool Interfaces Reasoners Applications Standards Sesame Joseki NLP Entity Extractors SI / Consulting 21

22 Some Oracle Database Semantics Customers Life Sciences Defense/ Intelligence Education Telecomm & Networking Hutchinson 3G Austria Clinical Medicine & Research Publishing Thomson Reuters 22

23 Summary: Oracle Database 11g Release 2 Oracle database is the leading commercial database with native support for W3C standards compliant RDF/OWL data store w/ comprehensive capabilities for. Reasoning and Discovery supporting std. ontologies persistent, native & 3rd party inference, and user-defined rules Scalability to evolve schemas dynamically and grow to 10 s billions of triples, incremental & parallel inference Data Integration to link structured & unstructured content, Loosely couple business silos Security to protect data on a need to know basis Integrated querying & manageability SPARQL & SQL for RDF/OWL, relational, XML, text, location, & multimedia data 23

24 An Analytical Database Changes Everything! Less data movement = faster analytics, and faster analytics = better BI throughout enterprise OLAP Statistical Functions Data Mining Copyright 2010 Oracle Corporation Predictive Analytics?x Text Mining

25 For More Information search.oracle.com Semantic Technologies or oracle.com

26 Additional Information Preview of the new Oracle Data Miner 11g R2 work flow New GUI Oracle Data Mining 11gR2 presentation at Oracle Open World 2009 Oracle Data Mining Blog Funny YouTube video that features Oracle Data Mining Oracle Data Mining on the Amazon Cloud Oracle Data Mining 11gR2 data sheet Oracle Data Mining 11gR2 white paper New TechCast (audio and video recording): ODM overview and several demos Fraud and Anomaly Detection using Oracle Data Mining 11g presentation Algorithm technical summary with links to Documentation Getting Started w/ ODM page w/ instructions to download Oracle Data Miner graphical user interface (GUI), ODM Step-by-Step Tutorial Demo datasets ODM Discussion Forum on OTN (great for posting questions/answers) ODM 11g Sample Code (examples of ODM SQL and Java APIs applied in several use cases; great for developers) Oracle s 50+ SQL based statistical functions (t-test, ANOVA, Pearson s, etc.) Oracle Data Mining Copyright 2010 Oracle Corporation

27 Copyright 2010 Oracle Corporation

28 Copyright purposes 2010 Oracle Corporation This presentation is for informational only and may not be incorporated into a contract or agreement.

Exadata V2 + Oracle Data Mining 11g Release 2 Importing 3 rd Party (SAS) dm models

Exadata V2 + Oracle Data Mining 11g Release 2 Importing 3 rd Party (SAS) dm models Exadata V2 + Oracle Data Mining 11g Release 2 Importing 3 rd Party (SAS) dm models Charlie Berger Sr. Director Product Management, Data Mining Technologies Oracle Corporation [email protected]

More information

The Oracle Data Mining Machine Bundle: Zero to Predictive Analytics in Two Weeks Collaborate 15 IOUG

The Oracle Data Mining Machine Bundle: Zero to Predictive Analytics in Two Weeks Collaborate 15 IOUG The Oracle Data Mining Machine Bundle: Zero to Predictive Analytics in Two Weeks Collaborate 15 IOUG Presentation #730 Tim Vlamis and Dan Vlamis Vlamis Software Solutions 816-781-2880 www.vlamis.com Presentation

More information

Anomaly and Fraud Detection with Oracle Data Mining 11g Release 2

Anomaly and Fraud Detection with Oracle Data Mining 11g Release 2 Oracle 11g DB Data Warehousing ETL OLAP Statistics Anomaly and Fraud Detection with Oracle Data Mining 11g Release 2 Data Mining Charlie Berger Sr. Director Product Management, Data

More information

Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies

Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies Application of OASIS Integrated Collaboration Object Model (ICOM) with Oracle Database 11g Semantic Technologies Zhe Wu Ramesh Vasudevan Eric S. Chan Oracle Deirdre Lee, Laura Dragan DERI A Presentation

More information

Oracle Data Mining In-Database Data Mining Made Easy!

Oracle Data Mining In-Database Data Mining Made Easy! Oracle Data Mining In-Database Data Mining Made Easy! Charlie Berger Sr. Director Product Management, Data Mining and Advanced Analytics Oracle Corporation [email protected] www.twitter.com/charliedatamine

More information

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies

Semantic Data Management. Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies Semantic Data Management Xavier Lopez, Ph.D., Director, Spatial & Semantic Technologies 1 Enterprise Information Challenge Source: Oracle customer 2 Vision of Semantically Linked Data The Network of Collaborative

More information

Statistical Analysis of Gene Expression Data With Oracle & R (- data mining)

Statistical Analysis of Gene Expression Data With Oracle & R (- data mining) Statistical Analysis of Gene Expression Data With Oracle & R (- data mining) Patrick E. Hoffman Sc.D. Senior Principal Analytical Consultant [email protected] Agenda (Oracle & R Analysis) Tools Loading

More information

OLSUG Workshop Oracle Data Mining

OLSUG Workshop Oracle Data Mining OLSUG Workshop Oracle Data Mining Charlie Berger Sr. Director of Product Mgmt, Life Sciences and Data Mining Oracle Corporation [email protected] Dr. Lutz Hamel Asst. Professor, Computer Science

More information

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved.

1 Copyright 2011, Oracle and/or its affiliates. All rights reserved. 1 Copyright 2011, Oracle and/or its affiliates. FPO In-Database Analytics: Predictive Analytics, Data Mining, Exadata & Business Intelligence Charlie Berger Sr. Director Product Management, Data Mining

More information

Blazing BI: the Analytic Options to the Oracle Database. ODTUG Kscope 2013

Blazing BI: the Analytic Options to the Oracle Database. ODTUG Kscope 2013 Blazing BI: the Analytic Options to the Oracle Database ODTUG Kscope 2013 Dan Vlamis Tim Vlamis Vlamis Software Solutions 816-781-2880 http://www.vlamis.com Copyright 2013, Vlamis Software Solutions, Inc.

More information

SQL - the best analysis language for Big Data!

SQL - the best analysis language for Big Data! SQL - the best analysis language for Big Data! NoCOUG Winter Conference 2014 Hermann Bär, [email protected] Data Warehousing Product Management, Oracle 1 The On-Going Evolution of SQL Introduction

More information

Graph Database Performance: An Oracle Perspective

Graph Database Performance: An Oracle Perspective Graph Database Performance: An Oracle Perspective Xavier Lopez, Ph.D. Senior Director, Product Management 1 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Program Agenda Broad Perspective

More information

Seamless Access from Oracle Database to Your Big Data

Seamless Access from Oracle Database to Your Big Data Seamless Access from Oracle Database to Your Big Data Brian Macdonald Big Data and Analytics Specialist Oracle Enterprise Architect September 24, 2015 Agenda Hadoop and SQL access methods What is Oracle

More information

Predictive Analytics for Better Business Intelligence

Predictive Analytics for Better Business Intelligence Oracle 11g DB Data Warehousing ETL OLAP Statistics Predictive Analytics for Better Business Intelligence Data Mining Charlie Berger Sr. Director Product Management, Data Mining Technologies

More information

Big Data Analytics with Oracle Advanced Analytics In-Database Option

Big Data Analytics with Oracle Advanced Analytics In-Database Option Big Data Analytics with Oracle Advanced Analytics In-Database Option Charlie Berger Sr. Director Product Management, Data Mining and Advanced Analytics [email protected] www.twitter.com/charliedatamine

More information

Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features

Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features Charlie Berger, MS Eng, MBA Sr. Director Product Management, Data Mining and Advanced Analytics [email protected] www.twitter.com/charliedatamine

More information

Put SPARQL in Your Code: Building Applications with Oracle Semantic Technologies. Xavier Lopez, Ph.D. Zhe Wu, Ph.D. Souripriya Das, Ph.D.

Put SPARQL in Your Code: Building Applications with Oracle Semantic Technologies. Xavier Lopez, Ph.D. Zhe Wu, Ph.D. Souripriya Das, Ph.D. Put SPARQL in Your Code: Building Applications with Oracle Semantic Technologies Xavier Lopez, Ph.D. Zhe Wu, Ph.D. Souripriya Das, Ph.D. Semantics at OOW 2009 - Sessions Date/Time Title Location Sunday,

More information

Mining Big Data with RDF Graph Technology:

Mining Big Data with RDF Graph Technology: Mining Big Data with RDF Graph Technology: Xavier Lopez, Ph.D. Director, Product Mgmt. Zhe Wu, Ph.D. Consulting Member Technical Staff 1 Copyright 2012, Oracle and/or its affiliates. All rights reserved.

More information

Oracle Advanced Analytics Oracle R Enterprise & Oracle Data Mining

Oracle Advanced Analytics Oracle R Enterprise & Oracle Data Mining Oracle Advanced Analytics Oracle R Enterprise & Oracle Data Mining R The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated

More information

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat Information Builders enables agile information solutions with business intelligence (BI) and integration technologies. WebFOCUS the most widely utilized business intelligence platform connects to any enterprise

More information

Oracle Spatial and Graph

Oracle Spatial and Graph Oracle Spatial and Graph Overview of New Graph Features "THE FOLLOWING IS INTENDED TO OUTLINE OUR GENERAL PRODUCT DIRECTION. IT IS INTENDED FOR INFORMATION PURPOSES ONLY, AND MAY NOT BE INCORPORATED INTO

More information

How To Use An Orgode Database With A Graph Graph (Robert Kramer)

How To Use An Orgode Database With A Graph Graph (Robert Kramer) RDF Graph Database per Linked Data Next Generation Open Data, come sfruttare l innovazione tecnologica per creare nuovi scenari e nuove opportunità. [email protected] 1 Copyright 2011, Oracle

More information

Predictive Analytics Powered by SAP HANA. Cary Bourgeois Principal Solution Advisor Platform and Analytics

Predictive Analytics Powered by SAP HANA. Cary Bourgeois Principal Solution Advisor Platform and Analytics Predictive Analytics Powered by SAP HANA Cary Bourgeois Principal Solution Advisor Platform and Analytics Agenda Introduction to Predictive Analytics Key capabilities of SAP HANA for in-memory predictive

More information

Oracle's In-Database Statistical Functions

Oracle's In-Database Statistical Functions Oracle 11g DB Data Warehousing Oracle's In-Database Statistical Functions OLAP Statistics Data Mining Charlie Berger Sr. Director Product Management, Data Mining Technologies

More information

IBM SPSS Modeler 15 In-Database Mining Guide

IBM SPSS Modeler 15 In-Database Mining Guide IBM SPSS Modeler 15 In-Database Mining Guide Note: Before using this information and the product it supports, read the general information under Notices on p. 217. This edition applies to IBM SPSS Modeler

More information

The Data Mining Process

The Data Mining Process Sequence for Determining Necessary Data. Wrong: Catalog everything you have, and decide what data is important. Right: Work backward from the solution, define the problem explicitly, and map out the data

More information

Starting Smart with Oracle Advanced Analytics

Starting Smart with Oracle Advanced Analytics Starting Smart with Oracle Advanced Analytics Great Lakes Oracle Conference Tim Vlamis Thursday, May 19, 2016 Vlamis Software Solutions Vlamis Software founded in 1992 in Kansas City, Missouri Developed

More information

Oracle Data Miner (Extension of SQL Developer 4.0)

Oracle Data Miner (Extension of SQL Developer 4.0) An Oracle White Paper September 2013 Oracle Data Miner (Extension of SQL Developer 4.0) Integrate Oracle R Enterprise Mining Algorithms into a workflow using the SQL Query node Denny Wong Oracle Data Mining

More information

extreme Datamining mit Oracle R Enterprise

extreme Datamining mit Oracle R Enterprise extreme Datamining mit Oracle R Enterprise Oliver Bracht Managing Director eoda Matthias Fuchs Senior Consultant ISE Information Systems Engineering GmbH extreme Datamining with Oracle R Enterprise About

More information

Data Analysis with Various Oracle Business Intelligence and Analytic Tools

Data Analysis with Various Oracle Business Intelligence and Analytic Tools Data Analysis with Various Oracle Business Intelligence and Analytic Tools Session ID: 108680 Prepared by: Tim and Dan Vlamis Vlamis Software Solutions www.vlamis.com @TimVlamis Agenda What we will talk

More information

Oracle Data Miner (Extension of SQL Developer 4.0)

Oracle Data Miner (Extension of SQL Developer 4.0) An Oracle White Paper October 2013 Oracle Data Miner (Extension of SQL Developer 4.0) Generate a PL/SQL script for workflow deployment Denny Wong Oracle Data Mining Technologies 10 Van de Graff Drive Burlington,

More information

Semantic Stored Procedures Programming Environment and performance analysis

Semantic Stored Procedures Programming Environment and performance analysis Semantic Stored Procedures Programming Environment and performance analysis Marjan Efremov 1, Vladimir Zdraveski 2, Petar Ristoski 2, Dimitar Trajanov 2 1 Open Mind Solutions Skopje, bul. Kliment Ohridski

More information

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors...

Smart Cities require Geospatial Data Providing services to citizens, enterprises, visitors... Cloud-based Spatial Data Infrastructures for Smart Cities Geospatial World Forum 2015 Hans Viehmann Product Manager EMEA ORACLE Corporation Smart Cities require Geospatial Data Providing services to citizens,

More information

K@ A collaborative platform for knowledge management

K@ A collaborative platform for knowledge management White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index

More information

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies

Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data, Cloud Computing, Spatial Databases Steven Hagan Vice President Server Technologies Big Data: Global Digital Data Growth Growing leaps and bounds by 40+% Year over Year! 2009 =.8 Zetabytes =.08

More information

An In-Depth Look at In-Memory Predictive Analytics for Developers

An In-Depth Look at In-Memory Predictive Analytics for Developers September 9 11, 2013 Anaheim, California An In-Depth Look at In-Memory Predictive Analytics for Developers Philip Mugglestone SAP Learning Points Understand the SAP HANA Predictive Analysis library (PAL)

More information

Oracle Big Data SQL Architectural Deep Dive

Oracle Big Data SQL Architectural Deep Dive Oracle Big Data SQL Architectural Deep Dive Dan McClary, Ph.D. Big Data Product Management Oracle Safe Harbor Statement The following is intended to outline our general product direction. It is intended

More information

The Semantic Web for Application Developers. Oracle New England Development Center Zhe Wu, Ph.D. [email protected] 1

The Semantic Web for Application Developers. Oracle New England Development Center Zhe Wu, Ph.D. alan.wu@oracle.com 1 The Semantic Web for Application Developers Oracle New England Development Center Zhe Wu, Ph.D. [email protected] 1 Agenda Background 10gR2 RDF 11g RDF/OWL New 11g features Bulk loader Semantic operators

More information

Data Mining with Oracle Database 11g Release 2

Data Mining with Oracle Database 11g Release 2 An Oracle White Paper September 2009 Data Mining with Oracle Database 11g Release 2 Competing on In-Database Analytics Executive Overview... 1 In-Database Data Mining... 1 Key Benefits of Oracle Data Mining...

More information

Getting Started with Oracle Data Miner 11g R2. Brendan Tierney

Getting Started with Oracle Data Miner 11g R2. Brendan Tierney Getting Started with Oracle Data Miner 11g R2 Brendan Tierney Scene Setting This is not about DB log mining This is an introduction to ODM And how ODM can be included in OBIEE (next presentation) Domain

More information

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process

ORACLE OLAP. Oracle OLAP is embedded in the Oracle Database kernel and runs in the same database process ORACLE OLAP KEY FEATURES AND BENEFITS FAST ANSWERS TO TOUGH QUESTIONS EASILY KEY FEATURES & BENEFITS World class analytic engine Superior query performance Simple SQL access to advanced analytics Enhanced

More information

Fraud and Anomaly Detection Using Oracle Advanced Analytic Option 12c

Fraud and Anomaly Detection Using Oracle Advanced Analytic Option 12c Fraud and Anomaly Detection Using Oracle Advanced Analytic Option 12c Charlie Berger Sr. Director Product Management, Data Mining and Advanced Analytics [email protected] www.twitter.com/charliedatamine

More information

Safe Harbor Statement

Safe Harbor Statement Safe Harbor Statement The following is intended to outline our general product direction. It is intended for information purposes only, and may not be incorporated into any contract. It is not a commitment

More information

Data Integration Checklist

Data Integration Checklist The need for data integration tools exists in every company, small to large. Whether it is extracting data that exists in spreadsheets, packaged applications, databases, sensor networks or social media

More information

How to Build MicroStrategy Projects on Top of Big Data Sources in the Cloud

How to Build MicroStrategy Projects on Top of Big Data Sources in the Cloud How to Build MicroStrategy Projects on Top of Big Data Sources in the Cloud Jochen Demuth, Director, Partner Engineering Use Cases for Big Data in the Cloud Four broad categories and their value Traditional

More information

Oracle Data Mining Hands On Lab

Oracle Data Mining Hands On Lab Oracle Data Mining Hands On Lab Material provided by Oracle Corporation Vlamis Software Solutions is one of the most respected training organizations in the Oracle Business Intelligence community because

More information

Tax Fraud in Increasing

Tax Fraud in Increasing Preventing Fraud with Through Analytics Satya Bhamidipati Data Scientist Business Analytics Product Group Copyright 2014 Oracle and/or its affiliates. All rights reserved. 2 Tax Fraud in Increasing 27%

More information

The basic data mining algorithms introduced may be enhanced in a number of ways.

The basic data mining algorithms introduced may be enhanced in a number of ways. DATA MINING TECHNOLOGIES AND IMPLEMENTATIONS The basic data mining algorithms introduced may be enhanced in a number of ways. Data mining algorithms have traditionally assumed data is memory resident,

More information

Geospatial Platforms For Enabling Workflows

Geospatial Platforms For Enabling Workflows Geospatial Platforms For Enabling Workflows Steven Hagan Vice President Oracle Database Server Technologies November, 2015 Evolution of Enabling Workflows HENRY FORD 100 YEARS AGO Industrialized the Manufacturing

More information

Introduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p.

Introduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p. Introduction p. xvii Introduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p. 9 State of the Practice in Analytics p. 11 BI Versus

More information

Anomaly and Fraud Detection with Oracle Data Mining

Anomaly and Fraud Detection with Oracle Data Mining Oracle 11g DB Data Warehousing ETL OLAP Statistics Anomaly and Fraud Detection with Oracle Data Mining Data Mining Charlie Berger Sr. Director Product Management, Data Mining Technologies

More information

KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES

KnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES Translating data into business value requires the right data mining and modeling techniques which uncover important patterns within

More information

Oracle Data Mining 11g Release 2

Oracle Data Mining 11g Release 2 An Oracle White Paper February 2012 Oracle Data Mining 11g Release 2 Competing on In-Database Analytics Disclaimer The following is intended to outline our general product direction. It is intended for

More information

Make Better Decisions Through Predictive Intelligence

Make Better Decisions Through Predictive Intelligence IBM SPSS Modeler Professional Make Better Decisions Through Predictive Intelligence Highlights Easily access, prepare and model structured data with this intuitive, visual data mining workbench Rapidly

More information

IBM SPSS Modeler Premium

IBM SPSS Modeler Premium IBM SPSS Modeler Premium Improve model accuracy with structured and unstructured data, entity analytics and social network analysis Highlights Solve business problems faster with analytical techniques

More information

Big Data and Predictive Analytics: Fiserv Data Mining Case Study [CON8631] Data Warehouse and Big Data

Big Data and Predictive Analytics: Fiserv Data Mining Case Study [CON8631] Data Warehouse and Big Data Big Data and Predictive Analytics: Fiserv Data Mining Case Study [CON8631] Data Warehouse and Big Data Miguel Barrera - Director, Risk Analytics, Fiserv, Inc. Julia Minkowski - Risk Manager, Fiserv, Inc.

More information

Geospatial Technology Innovations and Convergence

Geospatial Technology Innovations and Convergence Geospatial Technology Innovations and Convergence Processing Big and Fast Data: Best with a Multi-Model Database Steven Hagan Vice President Oracle Database Server Technologies August, 2015 Data Volume

More information

Azure Machine Learning, SQL Data Mining and R

Azure Machine Learning, SQL Data Mining and R Azure Machine Learning, SQL Data Mining and R Day-by-day Agenda Prerequisites No formal prerequisites. Basic knowledge of SQL Server Data Tools, Excel and any analytical experience helps. Best of all:

More information

Ganzheitliches Datenmanagement

Ganzheitliches Datenmanagement Ganzheitliches Datenmanagement für Hadoop Michael Kohs, Senior Sales Consultant @mikchaos The Problem with Big Data Projects in 2016 Relational, Mainframe Documents and Emails Data Modeler Data Scientist

More information

What Are They Thinking? With Oracle Application Express and Oracle Data Miner

What Are They Thinking? With Oracle Application Express and Oracle Data Miner What Are They Thinking? With Oracle Application Express and Oracle Data Miner Roel Hartman Brendan Tierney Agenda Who are we The Scenario Graphs & Charts in APEX - Live Demo Oracle Data Miner & DBA tasks

More information

How To Make Sense Of Data With Altilia

How To Make Sense Of Data With Altilia HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to

More information

Geospatial Platforms For Enabling Workflows

Geospatial Platforms For Enabling Workflows Geospatial Platforms For Enabling Workflows Steven Hagan Vice President Oracle Database Server Technologies May, 2015 Evolution of Enabling Workflows HENRY FORD 100 YEARS AGO Industrialized the Manufacturing

More information

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R

Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Practical Data Science with Azure Machine Learning, SQL Data Mining, and R Overview This 4-day class is the first of the two data science courses taught by Rafal Lukawiecki. Some of the topics will be

More information

Quick start. A project with SpagoBI 3.x

Quick start. A project with SpagoBI 3.x Quick start. A project with SpagoBI 3.x Summary: 1 SPAGOBI...2 2 SOFTWARE DOWNLOAD...4 3 SOFTWARE INSTALLATION AND CONFIGURATION...5 3.1 Installing SpagoBI Server...5 3.2Installing SpagoBI Studio and Meta...6

More information

Semantic Interoperability

Semantic Interoperability Ivan Herman Semantic Interoperability Olle Olsson Swedish W3C Office Swedish Institute of Computer Science (SICS) Stockholm Apr 27 2011 (2) Background Stockholm Apr 27, 2011 (2) Trends: from

More information

Knowledge Discovery from patents using KMX Text Analytics

Knowledge Discovery from patents using KMX Text Analytics Knowledge Discovery from patents using KMX Text Analytics Dr. Anton Heijs [email protected] Treparel Abstract In this white paper we discuss how the KMX technology of Treparel can help searchers

More information

HOW TO DO A SMART DATA PROJECT

HOW TO DO A SMART DATA PROJECT April 2014 Smart Data Strategies HOW TO DO A SMART DATA PROJECT Guideline www.altiliagroup.com Summary ALTILIA s approach to Smart Data PROJECTS 3 1. BUSINESS USE CASE DEFINITION 4 2. PROJECT PLANNING

More information

Data Mining Analytics for Business Intelligence and Decision Support

Data Mining Analytics for Business Intelligence and Decision Support Data Mining Analytics for Business Intelligence and Decision Support Chid Apte, T.J. Watson Research Center, IBM Research Division Knowledge Discovery and Data Mining (KDD) techniques are used for analyzing

More information

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management

Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management Design and Implementation of a Semantic Web Solution for Real-time Reservoir Management Ram Soma 2, Amol Bakshi 1, Kanwal Gupta 3, Will Da Sie 2, Viktor Prasanna 1 1 University of Southern California,

More information

KnowledgeSEEKER Marketing Edition

KnowledgeSEEKER Marketing Edition KnowledgeSEEKER Marketing Edition Predictive Analytics for Marketing The Easiest to Use Marketing Analytics Tool KnowledgeSEEKER Marketing Edition is a predictive analytics tool designed for marketers

More information

BUSINESS VALUE OF SEMANTIC TECHNOLOGY

BUSINESS VALUE OF SEMANTIC TECHNOLOGY BUSINESS VALUE OF SEMANTIC TECHNOLOGY Preliminary Findings Industry Advisory Council Emerging Technology (ET) SIG Information Sharing & Collaboration Committee July 15, 2005 Mills Davis Managing Director

More information

Introduction to Ontologies

Introduction to Ontologies Technological challenges Introduction to Ontologies Combining relational databases and ontologies Author : Marc Lieber Date : 21-Jan-2014 BASEL BERN LAUSANNE ZÜRICH DÜSSELDORF FRANKFURT A.M. FREIBURG I.BR.

More information

Improve Model Accuracy with Unstructured Data

Improve Model Accuracy with Unstructured Data IBM SPSS Modeler Premium Improve Model Accuracy with Unstructured Data Highlights Easily access, prepare and integrate structured data and text, Web and survey data Support the entire data mining process

More information

Forecasting, Prediction Models, and Times Series Analysis with Oracle Business Intelligence and Analytics. Rittman Mead BI Forum 2013

Forecasting, Prediction Models, and Times Series Analysis with Oracle Business Intelligence and Analytics. Rittman Mead BI Forum 2013 Forecasting, Prediction Models, and Times Series Analysis with Oracle Business Intelligence and Analytics Rittman Mead BI Forum 2013 Dan Vlamis and Tim Vlamis Vlamis Software Solutions 816-781-2880 http://www.vlamis.com

More information

R Tools Evaluation. A review by Analytics @ Global BI / Local & Regional Capabilities. Telefónica CCDO May 2015

R Tools Evaluation. A review by Analytics @ Global BI / Local & Regional Capabilities. Telefónica CCDO May 2015 R Tools Evaluation A review by Analytics @ Global BI / Local & Regional Capabilities Telefónica CCDO May 2015 R Features What is? Most widely used data analysis software Used by 2M+ data scientists, statisticians

More information

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing

More information

Oracle Advanced Analytics - Option to Oracle Database: Oracle R Enterprise and Oracle Data Mining. Data Warehouse Global Leaders Winter 2013

Oracle Advanced Analytics - Option to Oracle Database: Oracle R Enterprise and Oracle Data Mining. Data Warehouse Global Leaders Winter 2013 Oracle Advanced Analytics - Option to Oracle Database: Oracle R Enterprise and Oracle Data Mining Data Warehouse Global Leaders Winter 2013 Dan Vlamis, Vlamis Software Solutions Tim Vlamis, Vlamis Software

More information

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013

Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 Complexity and Scalability in Semantic Graph Analysis Semantic Days 2013 James Maltby, Ph.D 1 Outline of Presentation Semantic Graph Analytics Database Architectures In-memory Semantic Database Formulation

More information

Fusion Applications Overview of Business Intelligence and Reporting components

Fusion Applications Overview of Business Intelligence and Reporting components Fusion Applications Overview of Business Intelligence and Reporting components This document briefly lists the components, their common acronyms and the functionality that they bring to Fusion Applications.

More information

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved.

Mike Maxey. Senior Director Product Marketing Greenplum A Division of EMC. Copyright 2011 EMC Corporation. All rights reserved. Mike Maxey Senior Director Product Marketing Greenplum A Division of EMC 1 Greenplum Becomes the Foundation of EMC s Big Data Analytics (July 2010) E M C A C Q U I R E S G R E E N P L U M For three years,

More information

E6895 Advanced Big Data Analytics Lecture 4:! Data Store

E6895 Advanced Big Data Analytics Lecture 4:! Data Store E6895 Advanced Big Data Analytics Lecture 4:! Data Store Ching-Yung Lin, Ph.D. Adjunct Professor, Dept. of Electrical Engineering and Computer Science Mgr., Dept. of Network Science and Big Data Analytics,

More information

Oracle Data Mining. Concepts 11g Release 2 (11.2) E16808-07

Oracle Data Mining. Concepts 11g Release 2 (11.2) E16808-07 Oracle Data Mining Concepts 11g Release 2 (11.2) E16808-07 June 2013 Oracle Data Mining Concepts, 11g Release 2 (11.2) E16808-07 Copyright 2005, 2013, Oracle and/or its affiliates. All rights reserved.

More information

Pentaho Data Mining Last Modified on January 22, 2007

Pentaho Data Mining Last Modified on January 22, 2007 Pentaho Data Mining Copyright 2007 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For the latest information, please visit our web site at www.pentaho.org

More information

IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance

IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Data Sheet IBM Cognos 8 Business Intelligence Analysis Discover the factors driving business performance Overview Multidimensional analysis is a powerful means of extracting maximum value from your corporate

More information

Big Data Analytics Platform @ Nokia

Big Data Analytics Platform @ Nokia Big Data Analytics Platform @ Nokia 1 Selecting the Right Tool for the Right Workload Yekesa Kosuru Nokia Location & Commerce Strata + Hadoop World NY - Oct 25, 2012 Agenda Big Data Analytics Platform

More information

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes Highly competitive enterprises are increasingly finding ways to maximize and accelerate

More information

<Insert Picture Here> Oracle SQL Developer 3.0: Overview and New Features

<Insert Picture Here> Oracle SQL Developer 3.0: Overview and New Features 1 Oracle SQL Developer 3.0: Overview and New Features Sue Harper Senior Principal Product Manager The following is intended to outline our general product direction. It is intended

More information

Sunnie Chung. Cleveland State University

Sunnie Chung. Cleveland State University Sunnie Chung Cleveland State University Data Scientist Big Data Processing Data Mining 2 INTERSECT of Computer Scientists and Statisticians with Knowledge of Data Mining AND Big data Processing Skills:

More information

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco

Decoding the Big Data Deluge a Virtual Approach. Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco Decoding the Big Data Deluge a Virtual Approach Dan Luongo, Global Lead, Field Solution Engineering Data Virtualization Business Unit, Cisco High-volume, velocity and variety information assets that demand

More information