Berlin-Brandenburg Academy of Sciences and Humanities (BBAW) resources / services

1 Berlin-Brandenburg Academy of Sciences and Humanities (BBAW) resources / services
Speakers: Kai Zimmer and Jörg Didakowski
CLARIN Workshop WP2, February 2009

2 BBAW/DWDS
The BBAW and its 40 long-term projects offer many resources:
- Digital Dictionary of the German Language (DWDS): corpora, dictionaries and a `language information platform`; the project is also developing natural language processing tools and a search engine
- German Text Archive (DTA): texts from the 14th to the 19th century in an `active archive`
- TELOTA: technical service section for projects at the academy

3 BBAW/DWDS
DWDS publicly available corpora (via web interface):
- German reference corpus (balanced over categories and decades)
- newspapers: Die Zeit (updated daily, current), Berliner Zeitung, Tagesspiegel, Potsdamer Neueste Nachrichten (PNN)
- spoken language corpus
- historic corpora
- Jewish periodicals (by Compact Memory)

4 DDC
DWDS uses ddc-concordance (OSS, LGPL) as an online corpus search engine. Features are:
- statistical queries, not approximations
- regular expression, phrase, distance and truncation (left/right) search
- sentence-based or document-based search
- search for word forms (for English, German and Russian)
- indexing of metadata and annotations
- document relevance ranking
- it's fast
- scalable to huge corpora and load, due to its clustering architecture
- clients for Python, Perl, PHP and C++ (the network protocol is easy to implement in other programming languages)

5 DDC
DDC's query language is completely available in the XML-RPC service.

6 DDC/C4
The clustering architecture of the DDC search engine is primarily used for performance and scaling purposes. It also makes it possible to connect separate corpora hosted in different places, as in the C4 project (similar to Dieter's DAM-LR EU project):

7 DDC/C4
The C4 project consists of four participants:
- Austrian Academy Corpus (AAC, Vienna)
- Swiss Text Corpus (Basel)
- Corpus Southtirol (Italy)
- Berlin corpus (DWDS/BBAW, Germany)
Each participating country adds a balanced subcorpus of ~20 million tokens to a `shared` corpus. The results of a search query are sorted and merged by DDC. Authentication is handled by simple MySQL databases.
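The slide says only that DDC sorts and merges the results from the participating corpora; it does not say how. As a rough illustration of the general idea, the following sketch merges per-corpus hit lists that are each already sorted into one globally sorted list. The hit layout (sort key, corpus name, text) and the function name are assumptions made for this example, not DDC's actual data model or API.

```python
import heapq
from typing import Iterable, List, Tuple

# Hypothetical hit layout: (sort key, corpus name, matching sentence).
Hit = Tuple[str, str, str]


def merge_sorted_hits(per_corpus_hits: Iterable[List[Hit]]) -> List[Hit]:
    """Merge hit lists that are each already sorted by their sort key.

    heapq.merge performs a k-way merge without re-sorting everything,
    which is the usual approach when every node returns sorted results.
    """
    return list(heapq.merge(*per_corpus_hits))


if __name__ == "__main__":
    aac = [("1905", "AAC", "example sentence a"), ("1950", "AAC", "example sentence b")]
    dwds = [("1910", "DWDS", "example sentence c"), ("1990", "DWDS", "example sentence d")]
    for hit in merge_sorted_hits([aac, dwds]):
        print(hit)
```

A k-way merge keeps the work on the merging node small, because each subcorpus server only has to deliver its own hits in sorted order.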

8 DDC/C4

9 DDC/C4

10 On with Jörg's presentation about our XML-RPC services...

11 Web Services
- The web services are currently for internal use in our project network.
- They allow efficient and easy access to textual resources and language processing tools.
- The web services for language processing tools are based on XML-RPC.
- The web services for textual resources are based on DDC.
- An XML-RPC based service repository manages the services.

12 XML-RPC
XML-RPC is a remote procedure call protocol that works over the Internet. An XML-RPC message is an HTTP POST request whose body is XML. A procedure is executed on the server, and the value it returns is also formatted in XML.
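To make the message format concrete, here is a small sketch that uses Python 3's standard `xmlrpc.client` module to build the XML body of a request and of a response, without any network traffic. The method name `example.add` and its arguments are invented for illustration and are not part of the BBAW services.

```python
import xmlrpc.client


def build_request(method, *params):
    """Return the XML body that would be sent in the HTTP POST request."""
    return xmlrpc.client.dumps(params, methodname=method)


def build_response(value):
    """Return the XML body a server would send back for a return value."""
    return xmlrpc.client.dumps((value,), methodresponse=True)


def parse_request(body):
    """Decode a request body back into (method name, parameter tuple)."""
    params, method = xmlrpc.client.loads(body)
    return method, params


if __name__ == "__main__":
    request_body = build_request("example.add", 2, 3)  # hypothetical method
    print(request_body)
    print(build_response(5))
    print(parse_request(request_body))
```

Printing the two bodies shows the `methodCall` and `methodResponse` XML documents that the protocol exchanges.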

13 Service Repository
User administration (based on a MySQL database):
- Authorization management
- Granular configuration
- Individual unlocking of services
- Time-sensitive authorization
Service administration (based on a MySQL database):
- Integration of services
- IP address and port
- Service name, version, description, ID, maintainer, etc.
- Logging information

14 Language Processing Tools
- They are for German.
- Most of them are based on finite-state techniques.
- Most of them are rule-based.
- They are implemented in C and C++.

15 ToMaSoTaTh
ToMaSoTaTh combines different tasks:
- Tokenizing
- Morphological analysis (TAGH Morphology)
- TextToSound (using SAMPA)
- Tagging (moot)
- Thesaurus (Lexikonet)
The components can be applied individually.
Input: plain text
Output: one token per line with tab-separated information
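A client that receives this kind of output has to split it back into tokens and their annotations. The sketch below shows one way to do that. The slide does not specify which columns the tool emits or in what order, so the code only assumes that the first column is the token and treats everything after it as opaque annotation fields; the sample input is invented.

```python
def parse_token_lines(output):
    """Parse 'one token per line, tab-separated' text into (token, fields) pairs.

    Assumption: the first column is the token; the remaining columns are
    returned as a list because their meaning is not documented here.
    """
    rows = []
    for line in output.splitlines():
        if not line.strip():
            continue  # skip blank lines, e.g. between sentences
        token, *fields = line.split("\t")
        rows.append((token, fields))
    return rows


if __name__ == "__main__":
    sample = "Das\tART\nHaus\tNN\tHaus\n"  # hypothetical tool output
    for token, fields in parse_token_lines(sample):
        print(token, fields)
```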

16 Meinten Sie (Did You Mean?)
This tool calculates corrections of typos, based on an edit distance that is precompiled over a word list.
Input: token
Output: token list (proposals)
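The idea of proposing corrections by edit distance can be sketched as follows. Note the difference from the tool described on the slide: the BBAW service precompiles the edit distance over its word list, whereas this example simply computes the Levenshtein distance at query time against a tiny, invented word list. The function names and the distance threshold are assumptions for illustration.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                  # delete ca
                           cur[j - 1] + 1,               # insert cb
                           prev[j - 1] + (ca != cb)))    # substitute or match
        prev = cur
    return prev[-1]


def suggest(token, word_list, max_distance=2):
    """Return words within max_distance of token, closest first."""
    scored = sorted((edit_distance(token, word), word) for word in word_list)
    return [word for distance, word in scored if distance <= max_distance]


if __name__ == "__main__":
    words = ["Haus", "Hase", "Baum", "Maus"]  # hypothetical word list
    print(suggest("Hasu", words))
```

Computing distances on the fly is linear in the size of the word list per query, which is presumably why a production service precompiles this step.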

17 SynCoP
Grammar/specification-driven parser
Implemented systems:
- (partial) dependency parsing
- named entity recognition and classification (person names, location names, organization names)
Input: plain text or tokenized text in XML
Output: TIGER-XML oriented format

18 Thank you for your attention!

22 Architecture

23 Admin Panel

24 Using Services (as a client)
First, connect to the server:
    import xmlrpclib  # Python 2 module; called xmlrpc.client in Python 3
    server = xmlrpclib.ServerProxy("…")
The server gives the client a session_id:
    session_id = server.dwds.login("jantenner", "test123")
The session_id expires after 15 minutes.
Then a service can be used via a function call:
    print(server.dwds.processor.lts.tomata.analyse(session_id, …))
    print(server.dwds.resource.kerncorpus.query(session_id, …))
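The slide states that a session_id expires after 15 minutes but does not describe how the server enforces this. Purely as an illustration of time-limited sessions, the sketch below keeps issue times in memory and rejects session IDs older than the limit. The class, its method names and the choice to count from login time (rather than from last use) are all assumptions; the real service repository stores its data in MySQL and may work differently.

```python
import uuid

SESSION_TTL_SECONDS = 15 * 60  # the 15-minute limit mentioned on the slide


class SessionStore:
    """Hypothetical in-memory session table with a fixed lifetime."""

    def __init__(self, ttl=SESSION_TTL_SECONDS):
        self.ttl = ttl
        self._issued_at = {}

    def login(self, now):
        """Issue a new random session_id, remembering when it was issued."""
        session_id = uuid.uuid4().hex
        self._issued_at[session_id] = now
        return session_id

    def is_valid(self, session_id, now):
        """True if the session exists and is younger than the lifetime."""
        issued = self._issued_at.get(session_id)
        return issued is not None and now - issued < self.ttl


if __name__ == "__main__":
    store = SessionStore()
    sid = store.login(now=0.0)
    print(store.is_valid(sid, now=60.0))      # one minute after login
    print(store.is_valid(sid, now=16 * 60.0))  # sixteen minutes after login
```

Passing the current time in as a parameter keeps the example deterministic; a real server would read the clock itself.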


More information

A Framework-based Online Question Answering System. Oliver Scheuer, Dan Shen, Dietrich Klakow

A Framework-based Online Question Answering System. Oliver Scheuer, Dan Shen, Dietrich Klakow A Framework-based Online Question Answering System Oliver Scheuer, Dan Shen, Dietrich Klakow Outline General Structure for Online QA System Problems in General Structure Framework-based Online QA system

More information

Electronic Document Workflow Platform for KBA Customers

Electronic Document Workflow Platform for KBA Customers Electronic Document Workflow Platform for KBA Customers Contents...1 I. Introduction...2 II. What is the point? How does it work?...3 A. Document Workflow System...3 B. Document Circulation Rules...4 C.

More information

Configuring Cisco Secure ACS v5.5 to use RADIUS for Orchestrator Authentication

Configuring Cisco Secure ACS v5.5 to use RADIUS for Orchestrator Authentication Configuring Cisco Secure ACS v5.5 to use RADIUS for Orchestrator Authentication This document outlines the procedure for configuring Cisco Secure Access Control System to provide RADIUS services for Orchestrator

More information

SERVICE ORIENTED EVENT ASSESSMENT CLOSING THE GAP OF COMPLIANCE MANAGEMENT

SERVICE ORIENTED EVENT ASSESSMENT CLOSING THE GAP OF COMPLIANCE MANAGEMENT IBM Software Group SERVICE ORIENTED EVENT ASSESSMENT CLOSING THE GAP OF COMPLIANCE MANAGEMENT Dieter Riexinger IT Architect 09.10.2009 2009 IBM Corporation Agenda Introduction Legal obligations and regulations

More information

Inmagic Content Server v9 Standard Configuration Technical Guidelines

Inmagic Content Server v9 Standard Configuration Technical Guidelines Inmagic Content Server v9.0 Standard Configuration Technical Guidelines 5/2006 Page 1 of 15 Inmagic Content Server v9 Standard Configuration Technical Guidelines Last Updated: May, 2006 Inmagic, Inc. All

More information

Log Analyzer for Dummies. GIAC GCIH Gold Certification Author: Emilio Valente evalente@sdsc.edu April 2008

Log Analyzer for Dummies. GIAC GCIH Gold Certification Author: Emilio Valente evalente@sdsc.edu April 2008 Log Analyzer for Dummies GIAC GCIH Gold Certification Author: Emilio Valente evalente@sdsc.edu April 2008 Road map Objective Introduction Brief description of a Syslogger What companies offer Components

More information

Design of a Universal Web Application Installer

Design of a Universal Web Application Installer Design of a Universal Web Application Installer Version 1.1 By Brad Touesnard CS4983 Senior Report Supervised by Prof. Andrew McAllister Fredericton, New Brunswick 2 April 2005 UNIVERSITY OF NEW BRUNSWICK

More information

Electronic Document Management Using Inverted Files System

Electronic Document Management Using Inverted Files System EPJ Web of Conferences 68, 0 00 04 (2014) DOI: 10.1051/ epjconf/ 20146800004 C Owned by the authors, published by EDP Sciences, 2014 Electronic Document Management Using Inverted Files System Derwin Suhartono,

More information

GT 6.0 GRAM5 Key Concepts

GT 6.0 GRAM5 Key Concepts GT 6.0 GRAM5 Key Concepts GT 6.0 GRAM5 Key Concepts Overview The Globus Toolkit provides GRAM5: a service to submit, monitor, and cancel jobs on Grid computing resources. In GRAM, a job consists of a computation

More information

Log Mining Based on Hadoop s Map and Reduce Technique

Log Mining Based on Hadoop s Map and Reduce Technique Log Mining Based on Hadoop s Map and Reduce Technique ABSTRACT: Anuja Pandit Department of Computer Science, anujapandit25@gmail.com Amruta Deshpande Department of Computer Science, amrutadeshpande1991@gmail.com

More information

Oracle Universal Content Management 10.1.3

Oracle Universal Content Management 10.1.3 Date: 2007/04/16-10.1.3 Oracle Universal Content Management 10.1.3 Document Management Quick Start Tutorial Oracle Universal Content Management 10.1.3 Document Management Quick Start Guide Page 1 Contents

More information

A sustainable archiving software solution for The Language Archive

A sustainable archiving software solution for The Language Archive A sustainable archiving software solution for The Language Archive Paul Trilsbeek, Daan Broeder, Willem Elbers, André Moreira The Language Archive Max Planck Institute for Psycholinguistics Outline History

More information

Managing DICOM Image Metadata with Desktop Operating Systems Native User Interface

Managing DICOM Image Metadata with Desktop Operating Systems Native User Interface Managing DICOM Image Metadata with Desktop Operating Systems Native User Interface Chia-Chi Teng, Member, IEEE Abstract Picture Archiving and Communication System (PACS) is commonly used in the hospital

More information

PoS(EGICF12-EMITC2)091

PoS(EGICF12-EMITC2)091 Performance testing of distributed computational resources in the software development phase, Eva Cernakova and Marek Kocan P. J. Safarik University in Kosice, Kosice, Slovak Republic E-mail: jcernak@upjs.sk

More information

Analysis of Web Archives. Vinay Goel Senior Data Engineer

Analysis of Web Archives. Vinay Goel Senior Data Engineer Analysis of Web Archives Vinay Goel Senior Data Engineer Internet Archive Established in 1996 501(c)(3) non profit organization 20+ PB (compressed) of publicly accessible archival material Technology partner

More information

IT services for analyses of various data samples

IT services for analyses of various data samples IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical

More information

Apache Lucene. Searching the Web and Everything Else. Daniel Naber Mindquarry GmbH ID 380

Apache Lucene. Searching the Web and Everything Else. Daniel Naber Mindquarry GmbH ID 380 Apache Lucene Searching the Web and Everything Else Daniel Naber Mindquarry GmbH ID 380 AGENDA 2 > What's a search engine > Lucene Java Features Code example > Solr Features Integration > Nutch Features

More information

Your Question. Net Report Answer

Your Question. Net Report Answer Your Question Article: 00120 Question: How to Configure External Authentication for Net Report Web Portal Net Report Answer Introduction Security devices can be used to control access to network resources.

More information