DBpedia German: Extensions and Applications

Size: px
Start display at page:

Download "DBpedia German: Extensions and Applications"

Transcription

1 DBpedia German: Extensions and Applications Alexandru-Aurelian Todor FU-Berlin, Innovationsforum Semantic Media Web, 7. Oktober 2014

2 Overview Why DBpedia? New Developments in DBpedia German Problems in DBpedia 2

3 Why DBpedia? Knowledge Bases are the core of Intelligent Web Applications Gazetteerless NER Question Answering Engines Document Enrichment Relation Extraction Event Detection Large web companies are developing their own alternatives Google Knowledge Graph/Freebase Microsoft Satori KB Wikidata Yahoo Knowledge Graph IBM Watson API, Wolfram Alpha etc. 3

4 DBpedia-LOD Cloud 4

5 International DBpedia Chapters Goal: provide additional resources for extraction, access, services, language specific endpoints and services language specific extension German Chapter addresses German Language URL: 5

6 DBpedia German: What We Offer DBpedia German Data Dumps: DBpedia German SPARQL Endpoint: DBpedia German Spotlight: DBpedia German Live: DBpedia German Live Changesets: Improved interlinking data, e.g. Linked Hypernyms Dataset (hypernyms from the first sentences) 6

7 DBpedia German Statistics Property Value Triples 146 Million Classes 206 Entities 4.3 Million Distinct Subjects 7.6 Million Properties Distinct Objects 36.7 Million Category Improvement Mappings >30% Missing Labels >300% 7

8 DBpedia German Infrastructure Main Sever 1: CPU: 2x Hexacore Xeon Ivy Bridge => 24 vcores Memory: 256 GB Ram SDD ~ 1TB Raid 5 Array HDD ~ 10 TB Raid 5 Array Secondary Servers 2: CPU: Quadcore Xeon Sandy Bridge: =>16 vcores Memory: 32 GB x2 HDD: 1TB RPM x2 8

9 Problems In DBpedia Ontology & Missing Data Missing Labels Missing Types Editing Capabilities Administration 9

10 Missing Labels Why Missing Labels are a problem? People don t understand the ontology New classes and properties are created needlessly Not a true multilingual ontology Language Missing Class Labels Missing Property Labels German French Spanish Italian Polish

11 Missing Labels How Do We Address the Problem Automatically Translate Labels using Translation services Present translation suggestions to editors in a batch mode Allow editors to edit and commit multiple translations at the same time 11

12 Missing Labels: Missingbot Bot-framework for editing the MappingsWiki Rest service for communicating with the mappings wiki and other applications Plugins for the Mappings Wiki in order to review added information 12

13 MissingBot Label Translations 14

14 Missing Types Why are Missing Types a Problem? rdf:type statements are the main way we query a KB Without precise type information there is no easy way to say Which CDU politicians born in berlin List all capitals in Europe List all actors (schauschpieler) in Berlin Without precise type information NER annotations are imprecise You can t filter out or select specific entities Ex: annotate only politicians, or software companies in a text document 15

15 Missing Types Solution: Linked Hypernyms Dataset Cooperation with the Prague University of Economics Extract type information from Hypernyms Significant improvement over instance-types dataset DBpedia Instance Types LHD 1.0 LHD 2.0 Nr. of resources New resources N/A Improvement N/A 52.5 % 44.3% 16

16 LHD Examples from the German DBpedia Dbpedia Resource Dbpedia Types LHD 1.0 Type LHD 2.0 Types esource/brad_pitt esource/tom_hanks wikidata:q5 owl:thing schemaorg:person wikidata:q dul:agent dul:naturalperson dbo:agent dbo:person dbo:actor dbo:actor same dbo:actor dbo:actor esource/wladimir_wladi mirowitsch_putin same dbo:politician dbo:politician esource/barack_obama esource/berlin esource/leipzig same dbo:politician dbo:politician schemaorg:place odp:location dbo:place wikidata:q532 opengis:_feature Same + dbo:populatedplace dbo:settlement urce/capital_city age/großstadt dbo:place dbo:place 17

17 Missing Ontology Editing Capabilities Why are Missing Ontology Editing Capabilities a problem? No good overview of the Ontology No efficient Way to rename or reorganize classes No efficient way to align the ontology with other ontologies 18

18 Missing Ontology Editing Capabilities Web Protégé Integration How to solve the Ontology Editing Problem? Use an advanced Collaborative Ontology Editor Solve the compatibility problem by integrating the editor into the existing framework Solve authentication and synchronization problems Architecture: 19

19 Missing Editing Capabilities 20

20 Why Administration is a Problem Configuring the different DBpedia services is a very complex task DBpedia Static: configuring the abstract extraction, generating datasets and importing them into virtuoso DBpedia Live: creating a Syncwiki, configuring the live extraction and an endpoint for the streaming updates DBpedia Spotlight: configuring a Hadoop cluster for dataset generation and then configuring a rest service DBpedia Lookup: generating the index for the lookup service Debugging: Problems are very specific to a configuration, there is no way to inspect specific issues without replicating the envirtonment 21

21 Addressing the Administration Problem Container Virtualisation Package the different dbpedia services in docker containers Share Conainers together with the configuration Docker Build once run everywhere Filesystem-level versioning Small containers Easy deployment Docker HUB Share Containers Push and Pull ndru/ 22

22 Addressing the Administration Problem: DBpedia+Docker DBpedia Spotlight Static Endpoint Live Endpoint Static Extraction Dockerized DBpedia Live Extraction DBpedia SyncWiki 23

23 Thank You!

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia

Linked Open Data Infrastructure for Public Sector Information: Example from Serbia Proceedings of the I-SEMANTICS 2012 Posters & Demonstrations Track, pp. 26-30, 2012. Copyright 2012 for the individual papers by the papers' authors. Copying permitted only for private and academic purposes.

More information

LinkZoo: A linked data platform for collaborative management of heterogeneous resources

LinkZoo: A linked data platform for collaborative management of heterogeneous resources LinkZoo: A linked data platform for collaborative management of heterogeneous resources Marios Meimaris, George Alexiou, George Papastefanatos Institute for the Management of Information Systems, Research

More information

QAD Business Intelligence Release Notes

QAD Business Intelligence Release Notes QAD Business Intelligence Release Notes September 2008 These release notes include information about the latest QAD Business Intelligence (QAD BI) fixes and changes. These changes may affect the way you

More information

Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company

Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company Semantic SharePoint Technical Briefing Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company What is Semantic SP? a joint venture between iquest and Semantic Web Company, initiated in

More information

Pcounter. Category Characteristics. Unified print room management Print policies and rules Product-based job processing Print queue management

Pcounter. Category Characteristics. Unified print room management Print policies and rules Product-based job processing Print queue management Pcounter Category Characteristics Output Management Unified print room management Print policies and rules Product-based job processing Print queue management TAKING CONTROL WITH PROFESSIONAL OUTPUT MANAGEMENT

More information

XProtect Corporate 2013

XProtect Corporate 2013 Release note Milestone XProtect Corporate 2013 May 21, 2013 It is with great pleasure that we announce the release of: XProtect Corporate 2013 High performance for high security XProtect Corporate is powerful

More information

D3.3.1: Sematic tagging and open data publication tools

D3.3.1: Sematic tagging and open data publication tools COMPETITIVINESS AND INNOVATION FRAMEWORK PROGRAMME CIP-ICT-PSP-2013-7 Pilot Type B WP3 Service platform integration and deployment in cloud infrastructure D3.3.1: Sematic tagging and open data publication

More information

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study

Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study Revealing Trends and Insights in Online Hiring Market Using Linking Open Data Cloud: Active Hiring a Use Case Study Amar-Djalil Mezaour 1, Julien Law-To 1, Robert Isele 3, Thomas Schandl 2, and Gerd Zechmeister

More information

HADOOP AT NOKIA JOSH DEVINS, NOKIA HADOOP MEETUP, JANUARY 2011 BERLIN

HADOOP AT NOKIA JOSH DEVINS, NOKIA HADOOP MEETUP, JANUARY 2011 BERLIN HADOOP AT NOKIA JOSH DEVINS, NOKIA HADOOP MEETUP, JANUARY 2011 BERLIN Two parts: * technical setup * applications before starting Question: Hadoop experience levels from none to some to lots, and what

More information

N8103-149/150/151/160 RAID Controller. N8103-156 MegaRAID CacheCade. Feature Overview

N8103-149/150/151/160 RAID Controller. N8103-156 MegaRAID CacheCade. Feature Overview N8103-149/150/151/160 RAID Controller N8103-156 MegaRAID CacheCade Feature Overview April 2012 Rev.1.0 NEC Corporation Contents 1 Introduction... 3 2 Types of RAID Controllers... 3 3 New Features of RAID

More information

Low-cost Open Data As-a-Service in the Cloud

Low-cost Open Data As-a-Service in the Cloud Low-cost Open Data As-a-Service in the Cloud Marin Dimitrov, Alex Simov, Yavor Petkov Ontotext AD, Bulgaria {first.last}@ontotext.com Abstract. In this paper we present the architecture and prototype of

More information

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud)

Open Cloud System. (Integration of Eucalyptus, Hadoop and AppScale into deployment of University Private Cloud) Open Cloud System (Integration of Eucalyptus, Hadoop and into deployment of University Private Cloud) Thinn Thu Naing University of Computer Studies, Yangon 25 th October 2011 Open Cloud System University

More information

Very Large Enterprise Network Deployment, 25,000+ Users

Very Large Enterprise Network Deployment, 25,000+ Users Very Large Enterprise Network Deployment, 25,000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering

More information

CA Email Supervision Architecture Stack

CA Email Supervision Architecture Stack SOLUTION BRIEF SEPTEMBER 2014 CA Email Supervision Architecture Stack Leo Garcia CA Technologies, Sr Services Consultant 2 SOLUTION BRIEF: CA EMAIL SUPERVISION ARCHITECTURE STACK ca.com Executive Summary

More information

Appendix A: Inventory of enrichment efforts and tools initiated in the context of the Europeana Network

Appendix A: Inventory of enrichment efforts and tools initiated in the context of the Europeana Network 1/12 Task Force on Enrichment and Evaluation Appendix A: Inventory of enrichment efforts and tools initiated in the context of the Europeana 29/10/2015 Project Name Type of enrichments Tool for manual

More information

White Paper November 2015. Technical Comparison of Perspectium Replicator vs Traditional Enterprise Service Buses

White Paper November 2015. Technical Comparison of Perspectium Replicator vs Traditional Enterprise Service Buses White Paper November 2015 Technical Comparison of Perspectium Replicator vs Traditional Enterprise Service Buses Our Evolutionary Approach to Integration With the proliferation of SaaS adoption, a gap

More information

K@ A collaborative platform for knowledge management

K@ A collaborative platform for knowledge management White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index

More information

Microsoft Dynamics CRM 2011 Guide to features and requirements

Microsoft Dynamics CRM 2011 Guide to features and requirements Guide to features and requirements New or existing Dynamics CRM Users, here s what you need to know about CRM 2011! This guide explains what new features are available and what hardware and software requirements

More information

Storage Sync for Hyper-V. Installation Guide for Microsoft Hyper-V

Storage Sync for Hyper-V. Installation Guide for Microsoft Hyper-V Installation Guide for Microsoft Hyper-V Egnyte Inc. 1890 N. Shoreline Blvd. Mountain View, CA 94043, USA Phone: 877-7EGNYTE (877-734-6983) www.egnyte.com 2013 by Egnyte Inc. All rights reserved. Revised

More information

owncloud Enterprise Edition on IBM Infrastructure

owncloud Enterprise Edition on IBM Infrastructure owncloud Enterprise Edition on IBM Infrastructure A Performance and Sizing Study for Large User Number Scenarios Dr. Oliver Oberst IBM Frank Karlitschek owncloud Page 1 of 10 Introduction One aspect of

More information

Cost-Effective Business Intelligence with Red Hat and Open Source

Cost-Effective Business Intelligence with Red Hat and Open Source Cost-Effective Business Intelligence with Red Hat and Open Source Sherman Wood Director, Business Intelligence, Jaspersoft September 3, 2009 1 Agenda Introductions Quick survey What is BI?: reporting,

More information

Sage 200 On Premise. System Requirements and Prerequisites

Sage 200 On Premise. System Requirements and Prerequisites Sage 200 On Premise System Requirements and Prerequisites v2013 Copyright statement Sage (UK) Limited and Sage Hibernia Limited, 2013. All rights reserved. If this documentation includes advice or information

More information

Graph Database Performance: An Oracle Perspective

Graph Database Performance: An Oracle Perspective Graph Database Performance: An Oracle Perspective Xavier Lopez, Ph.D. Senior Director, Product Management 1 Copyright 2012, Oracle and/or its affiliates. All rights reserved. Program Agenda Broad Perspective

More information

BIG DATA AGGREGATOR STASINOS KONSTANTOPOULOS NCSR DEMOKRITOS, GREECE. Big Data Europe

BIG DATA AGGREGATOR STASINOS KONSTANTOPOULOS NCSR DEMOKRITOS, GREECE. Big Data Europe BIG DATA AGGREGATOR STASINOS KONSTANTOPOULOS NCSR DEMOKRITOS, GREECE Big Data Europe The Big Data Aggregator The Big Data Aggregator: o A general-purpose architecture for processing Big Data o An implementation

More information

LDIF - Linked Data Integration Framework

LDIF - Linked Data Integration Framework LDIF - Linked Data Integration Framework Andreas Schultz 1, Andrea Matteini 2, Robert Isele 1, Christian Bizer 1, and Christian Becker 2 1. Web-based Systems Group, Freie Universität Berlin, Germany a.schultz@fu-berlin.de,

More information

Big Graph Analytics on Neo4j with Apache Spark. Michael Hunger Original work by Kenny Bastani Berlin Buzzwords, Open Stage

Big Graph Analytics on Neo4j with Apache Spark. Michael Hunger Original work by Kenny Bastani Berlin Buzzwords, Open Stage Big Graph Analytics on Neo4j with Apache Spark Michael Hunger Original work by Kenny Bastani Berlin Buzzwords, Open Stage My background I only make it to the Open Stages :) Probably because Apache Neo4j

More information

Publishing Linked Data Requires More than Just Using a Tool

Publishing Linked Data Requires More than Just Using a Tool Publishing Linked Data Requires More than Just Using a Tool G. Atemezing 1, F. Gandon 2, G. Kepeklian 3, F. Scharffe 4, R. Troncy 1, B. Vatant 5, S. Villata 2 1 EURECOM, 2 Inria, 3 Atos Origin, 4 LIRMM,

More information

Microsoft SharePoint Server 2010

Microsoft SharePoint Server 2010 Microsoft SharePoint Server 2010 Small Farm Performance Study Dell SharePoint Solutions Ravikanth Chaganti and Quocdat Nguyen November 2010 THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY, AND MAY

More information

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software

Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software WHITEPAPER Accelerating Enterprise Applications and Reducing TCO with SanDisk ZetaScale Software SanDisk ZetaScale software unlocks the full benefits of flash for In-Memory Compute and NoSQL applications

More information

Comparing BlackBerry Solutions

Comparing BlackBerry Solutions Comparing Solutions solution comparison chart Messaging features Enterprise Express Cloud s Supported messaging environments Microsoft Exchange, IBM Lotus Domino, and Novell GroupWise Microsoft Exchange

More information

Building a BI Solution in the Cloud

Building a BI Solution in the Cloud Building a BI Solution in the Cloud Stacia Varga, Principal Consultant Email: stacia@datainspirations.com Twitter: @_StaciaV_ 2 SQLSaturday #467 Sponsors Stacia (Misner) Varga Over 30 years of IT experience,

More information

Hardware and Software Requirements for Server Applications

Hardware and Software Requirements for Server Applications Hardware and Software Requirements for Server Applications IVS Enterprise Server Version 11.3+ RMG Networks Holding Corporation All Rights Reserved. Rev 9/14 TABLE OF CONTENTS INTRODUCTION...3 Minimum

More information

Very Large Enterprise Network, Deployment, 25000+ Users

Very Large Enterprise Network, Deployment, 25000+ Users Very Large Enterprise Network, Deployment, 25000+ Users Websense software can be deployed in different configurations, depending on the size and characteristics of the network, and the organization s filtering

More information

Cisco TelePresence Management Suite

Cisco TelePresence Management Suite Data Sheet Centralized orchestration and management across the telepresence network Product Overview Figure 1. Cisco TelePresence Management Suite (Cisco TMS) provides complete control and management of

More information

A Performance Analysis of Distributed Indexing using Terrier

A Performance Analysis of Distributed Indexing using Terrier A Performance Analysis of Distributed Indexing using Terrier Amaury Couste Jakub Kozłowski William Martin Indexing Indexing Used by search

More information

Cisco TelePresence Management Suite

Cisco TelePresence Management Suite Data Sheet Product Overview Figure 1. Cisco TelePresence Management Suite (Cisco TMS) provides complete control and management of telepresence conferencing and media services infrastructure and endpoints,

More information

Microsoft SharePoint Server 2010

Microsoft SharePoint Server 2010 Microsoft SharePoint Server 2010 Medium Farm Solution Performance Study Dell SharePoint Solutions Ravikanth Chaganti and Quocdat Nguyen August 2010 THIS WHITE PAPER IS FOR INFORMATIONAL PURPOSES ONLY,

More information

ELIS Multimedia Lab. Linked Open Data. Sam Coppens MMLab IBBT - UGent

ELIS Multimedia Lab. Linked Open Data. Sam Coppens MMLab IBBT - UGent Linked Open Data Sam Coppens MMLab IBBT - UGent Overview: Linked Open Data: Principles Interlinking Data LOD Server Tools Linked Open Data: Principles Term Linked Data was first coined by Tim Berners Lee

More information

Geospatial Data and the Semantic Web. The GeoKnow Project. Sebastian Hellmann AKSW/KILT research group, Leipzig University & DBpedia Association

Geospatial Data and the Semantic Web. The GeoKnow Project. Sebastian Hellmann AKSW/KILT research group, Leipzig University & DBpedia Association spatial Data and the Semantic Web The Geo Project Sebastian Hellmann AKSW/KILT research group, Leipzig University & DBpedia Association # About the project Collaborative Project 2012-2015 Information and

More information

D&B360. Installation and Administration Guide. for Microsoft Dynamics CRM. Version 3.0

D&B360. Installation and Administration Guide. for Microsoft Dynamics CRM. Version 3.0 D&B360 Installation and Administration Guide for Microsoft Dynamics CRM Version 3.0 D&B360 is a trademark or registered trademark of Dun and Bradstreet, Incorporated. Other trademarks used herein are the

More information

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015

Pulsar Realtime Analytics At Scale. Tony Ng April 14, 2015 Pulsar Realtime Analytics At Scale Tony Ng April 14, 2015 Big Data Trends Bigger data volumes More data sources DBs, logs, behavioral & business event streams, sensors Faster analysis Next day to hours

More information

Mediasite EX server deployment guide

Mediasite EX server deployment guide Mediasite EX server deployment guide 2008 Sonic Foundry, Inc. All rights reserved. No part of this document may be copied and/or redistributed without the consent of Sonic Foundry, Inc. Additional copies

More information

Sharp Remote Device Manager (SRDM) Server Software Setup Guide

Sharp Remote Device Manager (SRDM) Server Software Setup Guide Sharp Remote Device Manager (SRDM) Server Software Setup Guide This Guide explains how to install the software which is required in order to use Sharp Remote Device Manager (SRDM). SRDM is a web-based

More information

Soma: Linked Data Infrastructure

Soma: Linked Data Infrastructure Soma: Linked Data Infrastructure What is Soma? It s Big Data Candy for the Cloud. The Soma platform helps Data Scientist to collaborate together to discover and share new facts from large datasets hosted

More information

VIVO Dashboard A Drupal-based tool for harvesting and executing sophisticated queries against data from a VIVO instance

VIVO Dashboard A Drupal-based tool for harvesting and executing sophisticated queries against data from a VIVO instance VIVO Dashboard A Drupal-based tool for harvesting and executing sophisticated queries against data from a VIVO instance! Paul Albert, Miles Worthington and Don Carpenter Chapter I: The Problem Administrators

More information

Cisco TelePresence Management Suite Extension for Microsoft Exchange

Cisco TelePresence Management Suite Extension for Microsoft Exchange Data Sheet Cisco TelePresence Management Suite Extension for Microsoft Exchange Product Overview Cisco TelePresence Management Suite Extension for Microsoft Exchange (Cisco TMSXE) is an extension for Cisco

More information

When talking about hosting

When talking about hosting d o s Cloud Hosting - Amazon Web Services Thomas Floracks When talking about hosting for web applications most companies think about renting servers or buying their own servers. The servers and the network

More information

Microsoft Dynamics CRM 2011 New Features

Microsoft Dynamics CRM 2011 New Features Microsoft Dynamics CRM 2011 New Features Jim Wang Darren Liu Chapter No. 1 "Setting Up the Development Workspace" In this package, you will find: A Biography of the authors of the book A preview chapter

More information

Adonis Technical Requirements

Adonis Technical Requirements Information Sheet Adonis Technical Requirements CONTENTS Contents... 1 Adonis Project Implementation... 1 Host Installation / Onboard Installation Full replication (LARGER Vessels):... 1 Onboard installation

More information

Data-Flow Awareness in Parallel Data Processing

Data-Flow Awareness in Parallel Data Processing Data-Flow Awareness in Parallel Data Processing D. Bednárek, J. Dokulil *, J. Yaghob, F. Zavoral Charles University Prague, Czech Republic * University of Vienna, Austria 6 th International Symposium on

More information

Software and Hardware Requirements

Software and Hardware Requirements Software and Hardware Requirements Version: 5 Last Changed: 17/07/2012 Arena Group Limited, a a company registered registered in England in England Wales, and Wales, registered registered no. 02168309.

More information

COMP9321 Web Application Engineering

COMP9321 Web Application Engineering COMP9321 Web Application Engineering Semester 2, 2015 Dr. Amin Beheshti Service Oriented Computing Group, CSE, UNSW Australia Week 11 (Part II) http://webapps.cse.unsw.edu.au/webcms2/course/index.php?cid=2411

More information

GATE Mímir and cloud services. Multi-paradigm indexing and search tool Pay-as-you-go large-scale annotation

GATE Mímir and cloud services. Multi-paradigm indexing and search tool Pay-as-you-go large-scale annotation GATE Mímir and cloud services Multi-paradigm indexing and search tool Pay-as-you-go large-scale annotation GATE Mímir GATE Mímir is an indexing system for GATE documents. Mímir can index: Text: the original

More information

HEAT Endpoint Management and Security Suite 8.4. Server Install Guide

HEAT Endpoint Management and Security Suite 8.4. Server Install Guide 8.4 Server Install Guide - 2 - Notices Version Information Server Install Guide; - HEAT Endpoint Management and Security Suite Version 8.4 - Published: May 2016 Document Number: 02_202_8.4_161251152 Copyright

More information

Geo-coding and Smart Space Platforms Integration Agent Performance Testing and Analysis

Geo-coding and Smart Space Platforms Integration Agent Performance Testing and Analysis rusmart 14, Saint-Petersburg, 27-29 August 2014 Geo-coding and Smart Space Platforms Integration Agent Performance Testing and Analysis Kirill Yudenok, ETU, OSLL rusmart 14 28 August 2014 3 Geo-coded Smart

More information

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model

LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin Background About Macmillan

More information

Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints

Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints Benchmarking the Performance of Storage Systems that expose SPARQL Endpoints Christian Bizer 1 and Andreas Schultz 1 1 Freie Universität Berlin, Web-based Systems Group, Garystr. 21, 14195 Berlin, Germany

More information

Data-Gov Wiki: Towards Linked Government Data

Data-Gov Wiki: Towards Linked Government Data Data-Gov Wiki: Towards Linked Government Data Li Ding 1, Dominic DiFranzo 1, Sarah Magidson 2, Deborah L. McGuinness 1, and Jim Hendler 1 1 Tetherless World Constellation Rensselaer Polytechnic Institute

More information

McAfee Network Security Platform 8.2

McAfee Network Security Platform 8.2 8.2.7.71-8.2.3.84 Manager-Mxx30-series Release Notes McAfee Network Security Platform 8.2 Revision B Contents About this release New features Enhancements Resolved Issues Installation instructions Known

More information

Setting Up the Development Workspace

Setting Up the Development Workspace Setting Up the Development Workspace The first step in this project is to set up a workspace, so you can build, test, and verify all of the components that we are going to put together in Microsoft Dynamics

More information

White paper on. From. Hexaware Technologies Limited (HTL)

White paper on. From. Hexaware Technologies Limited (HTL) White paper on ewise Knowledge Management Framework TM From Hexaware Technologies Limited (HTL) Hexaware Technologies Limited. All rights reserved. Copying or Distributing without prior permission is prohibited

More information

Legal Notices... 2. Introduction... 3

Legal Notices... 2. Introduction... 3 HP Asset Manager Asset Manager 5.10 Sizing Guide Using the Oracle Database Server, or IBM DB2 Database Server, or Microsoft SQL Server Legal Notices... 2 Introduction... 3 Asset Manager Architecture...

More information

Building the Multilingual Web of Data: A Hands-on tutorial (ISWC 2014, Riva del Garda - Italy)

Building the Multilingual Web of Data: A Hands-on tutorial (ISWC 2014, Riva del Garda - Italy) Building the Multilingual Web of Data: A Hands-on tutorial (ISWC 2014, Riva del Garda - Italy) Multilingual Word Sense Disambiguation and Entity Linking on the Web based on BabelNet Roberto Navigli, Tiziano

More information

Client-aware Cloud Storage

Client-aware Cloud Storage Client-aware Cloud Storage Feng Chen Computer Science & Engineering Louisiana State University Michael Mesnier Circuits & Systems Research Intel Labs Scott Hahn Circuits & Systems Research Intel Labs Cloud

More information

Benchmarking Cassandra on Violin

Benchmarking Cassandra on Violin Technical White Paper Report Technical Report Benchmarking Cassandra on Violin Accelerating Cassandra Performance and Reducing Read Latency With Violin Memory Flash-based Storage Arrays Version 1.0 Abstract

More information

Industry 4.0 and Big Data

Industry 4.0 and Big Data Industry 4.0 and Big Data Marek Obitko, mobitko@ra.rockwell.com Senior Research Engineer 03/25/2015 PUBLIC PUBLIC - 5058-CO900H 2 Background Joint work with Czech Institute of Informatics, Robotics and

More information

Deliverable 2.1.4. 150 Billion Triple dataset hosted on the LOD2 Knowledge Store Cluster. LOD2 Creating Knowledge out of Interlinked Data

Deliverable 2.1.4. 150 Billion Triple dataset hosted on the LOD2 Knowledge Store Cluster. LOD2 Creating Knowledge out of Interlinked Data Collaborative Project LOD2 Creating Knowledge out of Interlinked Data Project Number: 257943 Start Date of Project: 01/09/2010 Duration: 48 months Deliverable 2.1.4 150 Billion Triple dataset hosted on

More information

MapReduce and Distributed Data Analysis. Sergei Vassilvitskii Google Research

MapReduce and Distributed Data Analysis. Sergei Vassilvitskii Google Research MapReduce and Distributed Data Analysis Google Research 1 Dealing With Massive Data 2 2 Dealing With Massive Data Polynomial Memory Sublinear RAM Sketches External Memory Property Testing 3 3 Dealing With

More information

Oracle Big Data SQL Technical Update

Oracle Big Data SQL Technical Update Oracle Big Data SQL Technical Update Jean-Pierre Dijcks Oracle Redwood City, CA, USA Keywords: Big Data, Hadoop, NoSQL Databases, Relational Databases, SQL, Security, Performance Introduction This technical

More information

Arrow ECS sp. z o.o. Oracle Partner Academy training environment with Oracle Virtualization. Oracle Partner HUB

Arrow ECS sp. z o.o. Oracle Partner Academy training environment with Oracle Virtualization. Oracle Partner HUB Oracle Partner Academy training environment with Oracle Virtualization Technology Oracle Partner HUB Overview Description of technology The idea of creating new training centre was to attain light and

More information

iway Roadmap: 2011 and Beyond Dave Watson SVP, iway Software

iway Roadmap: 2011 and Beyond Dave Watson SVP, iway Software iway Roadmap: 2011 and Beyond Dave Watson SVP, iway Software iway Software Products DataMigrator Core Integration Server iway Service Manager Information Management/Data Governance B2B Gateway Managed

More information

MS 8912A: Customization and Configuration in Microsoft Dynamics CRM 4.0

MS 8912A: Customization and Configuration in Microsoft Dynamics CRM 4.0 MS 8912A: Customization and Configuration in Microsoft Dynamics CRM 4.0 Description: This three-day instructor-led course provides students with the tools to configure and customize Microsoft Dynamics

More information

Toolbox 4.3. System Requirements

Toolbox 4.3. System Requirements Toolbox 4.3 February 2015 Contents Introduction... 2 Requirements for Toolbox 4.3... 3 Toolbox Applications... 3 Installing on Multiple Computers... 3 Concurrent Loading, Importing, Processing... 4 Client...

More information

Development of nosql data storage for the ATLAS PanDA Monitoring System

Development of nosql data storage for the ATLAS PanDA Monitoring System Development of nosql data storage for the ATLAS PanDA Monitoring System M.Potekhin Brookhaven National Laboratory, Upton, NY11973, USA E-mail: potekhin@bnl.gov Abstract. For several years the PanDA Workload

More information

XenDesktop 7 Database Sizing

XenDesktop 7 Database Sizing XenDesktop 7 Database Sizing Contents Disclaimer... 3 Overview... 3 High Level Considerations... 3 Site Database... 3 Impact of failure... 4 Monitoring Database... 4 Impact of failure... 4 Configuration

More information

Windows Compute Cluster Server 2003. Miron Krokhmal CTO

Windows Compute Cluster Server 2003. Miron Krokhmal CTO Windows Compute Cluster Server 2003 Miron Krokhmal CTO Agenda The Windows compute cluster architecture o Hardware and software requirements o Supported network topologies o Deployment strategies, including

More information

Big Data Technologies Compared June 2014

Big Data Technologies Compared June 2014 Big Data Technologies Compared June 2014 Agenda What is Big Data Big Data Technology Comparison Summary Other Big Data Technologies Questions 2 What is Big Data by Example The SKA Telescope is a new development

More information

Querying DBpedia Using HIVE-QL

Querying DBpedia Using HIVE-QL Querying DBpedia Using HIVE-QL AHMED SALAMA ISMAIL 1, HAYTHAM AL-FEEL 2, HODA M. O.MOKHTAR 3 Information Systems Department, Faculty of Computers and Information 1, 2 Fayoum University 3 Cairo University

More information

a division of Technical Overview Xenos Enterprise Server 2.0

a division of Technical Overview Xenos Enterprise Server 2.0 Technical Overview Enterprise Server 2.0 Enterprise Server Architecture The Enterprise Server (ES) platform addresses the HVTO business challenges facing today s enterprise. It provides robust, flexible

More information

Big Data and Analytics: Challenges and Opportunities

Big Data and Analytics: Challenges and Opportunities Big Data and Analytics: Challenges and Opportunities Dr. Amin Beheshti Lecturer and Senior Research Associate University of New South Wales, Australia (Service Oriented Computing Group, CSE) Talk: Sharif

More information

Business applications:

Business applications: Consorzio COMETA - Progetto PI2S2 UNIONE EUROPEA Business applications: the COMETA approach Prof. Antonio Puliafito University of Messina Open Grid Forum (OGF25) Catania, 2-6.03.2009 www.consorzio-cometa.it

More information

Deploying and administering Microsoft Dynamics CRM Online and Microsoft Dynamics CRM 2015

Deploying and administering Microsoft Dynamics CRM Online and Microsoft Dynamics CRM 2015 Deploying and administering Microsoft Dynamics CRM Online and Microsoft Dynamics CRM 2015 Version 7.1.1 This document contains the information that was titled the Microsoft Dynamics CRM Implementation

More information

Managing a local Galaxy Instance. Anushka Brownley / Adam Kraut BioTeam Inc.

Managing a local Galaxy Instance. Anushka Brownley / Adam Kraut BioTeam Inc. Managing a local Galaxy Instance Anushka Brownley / Adam Kraut BioTeam Inc. Agenda Who are we Why a local installation Local infrastructure Local installation Tips and Tricks SlipStream Appliance WHO ARE

More information

An Oracle White Paper June 2011. Oracle Database Firewall 5.0 Sizing Best Practices

An Oracle White Paper June 2011. Oracle Database Firewall 5.0 Sizing Best Practices An Oracle White Paper June 2011 Oracle Database Firewall 5.0 Sizing Best Practices Introduction... 1 Component Overview... 1 Database Firewall Deployment Modes... 2 Sizing Hardware Requirements... 2 Database

More information

SEAIP 2009 Presentation

SEAIP 2009 Presentation SEAIP 2009 Presentation By David Tan Chair of Yahoo! Hadoop SIG, 2008-2009,Singapore EXCO Member of SGF SIG Imperial College (UK), Institute of Fluid Science (Japan) & Chicago BOOTH GSB (USA) Alumni Email:

More information

Managing Enterprise Devices and Apps using System Center Configuration Manager

Managing Enterprise Devices and Apps using System Center Configuration Manager Course 20696B: Managing Enterprise Devices and Apps using System Center Configuration Manager Course Details Course Outline Module 1: Managing Desktops and Devices in the Enterprise This module explains

More information

MS 10978A Introduction to Azure for Developers

MS 10978A Introduction to Azure for Developers MS 10978A Introduction to Azure for Developers Description: Days: 5 Prerequisites: This course offers students the opportunity to learn about Microsoft Azure development by taking an existing ASP.NET MVC

More information

Yet Another Triple Store Benchmark? Practical Experiences with Real-World Data

Yet Another Triple Store Benchmark? Practical Experiences with Real-World Data Yet Another Triple Store Benchmark? Practical Experiences with Real-World Data Martin Voigt, Annett Mitschick, and Jonas Schulz Dresden University of Technology, Institute for Software and Multimedia Technology,

More information

IMPLEMENTING GREEN IT

IMPLEMENTING GREEN IT Saint Petersburg State University of Information Technologies, Mechanics and Optics Department of Telecommunication Systems IMPLEMENTING GREEN IT APPROACH FOR TRANSFERRING BIG DATA OVER PARALLEL DATA LINK

More information

Cisco Data Preparation

Cisco Data Preparation Data Sheet Cisco Data Preparation Unleash your business analysts to develop the insights that drive better business outcomes, sooner, from all your data. As self-service business intelligence (BI) and

More information

Hadoop Architecture. Part 1

Hadoop Architecture. Part 1 Hadoop Architecture Part 1 Node, Rack and Cluster: A node is simply a computer, typically non-enterprise, commodity hardware for nodes that contain data. Consider we have Node 1.Then we can add more nodes,

More information

Accelerating Hadoop MapReduce Using an In-Memory Data Grid

Accelerating Hadoop MapReduce Using an In-Memory Data Grid Accelerating Hadoop MapReduce Using an In-Memory Data Grid By David L. Brinker and William L. Bain, ScaleOut Software, Inc. 2013 ScaleOut Software, Inc. 12/27/2012 H adoop has been widely embraced for

More information

Wikidata. Semantic Web in Libraries December 2014. A Free Collaborative Knowledge Base. Markus Krötzsch TU Dresden

Wikidata. Semantic Web in Libraries December 2014. A Free Collaborative Knowledge Base. Markus Krötzsch TU Dresden Technische Universität Dresden Fakultät Informatik Wikidata A Free Collaborative Knowledge Base Markus Krötzsch TU Dresden Semantic Web in Libraries December 2014 Where is Wikipedia Going? Wikipedia in

More information

BlackBerry Enterprise Server for Microsoft Exchange Version: 5.0 Service Pack: 2. Feature and Technical Overview

BlackBerry Enterprise Server for Microsoft Exchange Version: 5.0 Service Pack: 2. Feature and Technical Overview BlackBerry Enterprise Server for Microsoft Exchange Version: 5.0 Service Pack: 2 Feature and Technical Overview Published: 2010-06-16 SWDT305802-1108946-0615123042-001 Contents 1 Overview: BlackBerry Enterprise

More information

xpaaerns on Spark, Shark, Tachyon and Mesos

xpaaerns on Spark, Shark, Tachyon and Mesos xpaaerns on Spark, Shark, Tachyon and Mesos Spark Summit 2014 Claudiu Barbura Sr. Director of Engineering A>geo Agenda xpa&erns Architecture From Hadoop to BDAS & our contribu

More information

Enabling Technologies for Distributed Computing

Enabling Technologies for Distributed Computing Enabling Technologies for Distributed Computing Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing, UNF Multi-core CPUs and Multithreading Technologies

More information

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms

Distributed File System. MCSN N. Tonellotto Complements of Distributed Enabling Platforms Distributed File System 1 How do we get data to the workers? NAS Compute Nodes SAN 2 Distributed File System Don t move data to workers move workers to the data! Store data on the local disks of nodes

More information

COLINDA - Conference Linked Data

COLINDA - Conference Linked Data Undefined 1 (0) 1 5 1 IOS Press COLINDA - Conference Linked Data Editor(s): Name Surname, University, Country Solicited review(s): Name Surname, University, Country Open review(s): Name Surname, University,

More information

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW

HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW HADOOP ON ORACLE ZFS STORAGE A TECHNICAL OVERVIEW 757 Maleta Lane, Suite 201 Castle Rock, CO 80108 Brett Weninger, Managing Director brett.weninger@adurant.com Dave Smelker, Managing Principal dave.smelker@adurant.com

More information

White Paper. Recording Server Virtualization

White Paper. Recording Server Virtualization White Paper Recording Server Virtualization Prepared by: Mike Sherwood, Senior Solutions Engineer Milestone Systems 23 March 2011 Table of Contents Introduction... 3 Target audience and white paper purpose...

More information