EPL660: Information Retrieval and Search Engines Lab 7

Size: px
Start display at page:

Download "EPL660: Information Retrieval and Search Engines Lab 7"

Transcription

1 EPL660: Information Retrieval and Search Engines Lab 7 Παύλος Αντωνίου Γραφείο: B109, ΘΕΕ01 University of Cyprus Department of Computer Science

2 Apache Tika What is Apache Tika? Content Analysis Toolkit The Apache Tika toolkit detects and extracts metadata and text content from over a thousand different file types Useful for search engine indexing, content analysis, translation, and much more

3 Supported Document Formats Microsoft Excel, Word, PowerPoint, Visio, Outlook GZIP, bzip2 compression MP3, MIDI, Wave audio XML HTML Java class files Images Java Archive Files Plain text OpenDocument PDF RTF TAR/ZIP You can also extend Tika with your own parsers!!

4 Getting Started with Apache Tika Download a source release from: Build Tika from sources Use Maven build system: $ sudo apt-get install maven Extract tika sources to a folder Use install command: $ mvn install Note: We need Java 7 or higher to build Tika.

5 Build Artifacts The Tika build consists of a number of components and produces the following main binaries: tika-core/target/tika-core-*.jar Tika core library. Contains the core interfaces and classes of Tika, but none of the parser implementations. Depends only on Java 6. tika-parsers/target/tika-parsers-*.jar Tika parsers. Collection of classes that implement the Tika Parser interface based on various external parser libraries. tika-app/target/tika-app-*.jar Tika application. Combines the above components and all the external parser libraries into a single runnable jar with a GUI and a command line interface. tika-server/target/tika-server-*.jar Tika JAX-RS REST application. This is a Jetty web server running Tika REST services as described in this page. tika-bundle/target/tika-bundle-*.jar Tika bundle. An OSGi bundle that combines tika-parsers with non-osgified parser libraries to make them easy to deploy in an OSGi environment.

6 Command Line Utility

7 Tika GUI (--gui)

8 The Parser Interface void parse(inputstream stream, ContentHandler handler, Metadata metadata, ParseContext context) throws IOException, SAXException, TikaException; Input: Document to be parsed Related metadata Output Results as XHTML SAX events Extra metadata

9 InputStream First argument of parse method For reading the document to be parsed Parser implementation will consume this stream but will not close it Closing the stream is the responsibility of the client application that opened it in the first place.

10 XHTML SAX events Parsed content of the document stream is about to be returned to the client application as a sequence of XHTML SAX events. XHTML used to express structured content of the document (not to render documents for browsing) and SAX events enable streamed processing.

11 ContentHandler Second argument of parse method Receives XHTML SAX events produced by parser Parser implementations typically use the XHTMLContentHandler utility class to generate the XHTML output SAX events may be complex to understand Tika provides utility classes to process and convert event stream to other representations e.g. BodyContentHandler class can be used to extract the body of XHTML output and feed it either as SAX events to another content handler or as characters to an output stream, a writer, or simply a string

12 Document Metadata Third argument of parse method Used to pass document metadata both in and out of the parser; expressed as Metadata object some of the more interesting metadata properties: Metadata.RESOURCE_NAME_KEY Metadata.CONTENT_TYPE Metadata.TITLE Metadata.AUTHOR

13 Parse Context Final argument of parse method Inject context-specific information to the parsing process Example of use: when dealing with locale-specific date and number formats in Microsoft Excel spreadsheets

14 Write your Tika application! Download the Java file implementing PDF parsing related tika classes must be imported Problem: number of jar files (and their dependencies) must be downloaded and added to classpath difficult to manually specify and discover all dependency libraries Solution: Apache tool for building and managing any Java-based project excellent dependency management mechanism easy build process

15 Tika application using Maven! Installation: sudo apt-get install maven Create Maven project mvn archetype:generate -DgroupId=com.csdeptucy.app -DartifactId=tikaParser -DarchetypeArtifactId=maven-archetype-quickstart - DinteractiveMode=false Get into project folder cd tikaparser see project structure here POM.xml file core of project s configuration

16 POM file example <project xmlns=" xmlns:xsi=" xsi:schemalocation=" xsd"> <modelversion>4.0.0</modelversion> <groupid>com.mycompany.app</groupid> <artifactid>my-app</artifactid> <version>1.0-snapshot</version> <packaging>jar</packaging> <name>maven Quick Start Archetype</name> <url> <dependencies> <dependency> <groupid>junit</groupid> <artifactid>junit</artifactid> <version>4.8.2</version> <scope>test</scope> </dependency> </dependencies> </project>

17 Maven phases Most common lifecycle phases: validate: validate the project is correct and all necessary information is available compile: compile the source code of the project test: test the compiled source code using a suitable unit testing framework. These tests should not require the code be packaged or deployed package: take the compiled code and package it in its distributable format, such as a JAR integration-test: process and deploy the package if necessary into an environment where integration tests can be run verify: run any checks to verify the package is valid and meets quality criteria install: install the package into the local repository, for use as a dependency in other projects locally deploy: done in an integration or release environment, copies the final package to the remote repository for sharing with other developers and projects clean: cleans up artifacts created by prior builds site: generates site documentation for this project Phases may be executed in sequence mvn clean package

18 Test initial application Test the newly compiled and packaged JAR with the following command: java -cp target/tikaparser-1.0- SNAPSHOT.jar com.csdeptucy.app.app Which will print: Hello World!

19 Unzip LAB07.zip Place java file into tikaparser/src/main/java/com/csdeptucy/app folder Replace old pom.xml file with the given one Clean artifacts form the previous build and regenerate a jar file mvn clean package In case of java.lang.outofmemoryerror: Java heap space error run in terminal: export MAVEN_OPTS=-Xmx1024m mvn clean package Run the application java -cp target/tikaparser-1.0-snapshot-jar-withdependencies.jar com.csdeptucy.app.epl660parser

20 PDF Parsing import java.io.file; import java.io.fileinputstream; import java.io.ioexception; import java.io.inputstream; import org.apache.tika.exception.tikaexception; import org.apache.tika.metadata.metadata; import org.apache.tika.parser.parsecontext; import org.apache.tika.parser.pdf.pdfparser; import org.apache.tika.sax.bodycontenthandler; import org.xml.sax.contenthandler; import org.xml.sax.saxexception; public class EPL660Parser { public static void main(string[] args) { try { } parsepdf(); } catch (IOException SAXException TikaException e) { e.printstacktrace(); }

21 PDF Parsing private static void parsepdf() throws IOException, SAXException, TikaException { InputStream input = new FileInputStream(new File("simple.pdf")); ContentHandler texthandler = new BodyContentHandler(System.out); Metadata metadata = new Metadata(); ParseContext context = new ParseContext(); // parsing the document using PDF parser PDFParser parser = new PDFParser(); parser.parse(input, texthandler, metadata, context); // getting the content of the document System.out.println("Contents of the PDF :" + texthandler.tostring()); // getting metadata of the document System.out.println("Metadata of the PDF:"); String[] metadatanames = metadata.names(); for (String name : metadatanames) { System.out.println(name + " : " + metadata.get(name)); } } input.close(); }

22 PDF screenshots Content Metadata

23 PDF using Tika Content Metadata

24 Parse all Types of Files Change your PDF parser Parser should: Use AutoDetectParser: AutoDetectParser parser = new AutoDetectParser(); Read all files from a folder Print all the metadata for each file

25 Useful Links utorials/os-apache-tika/index.html

Maven2. Configuration and Build Management. Robert Reiz

Maven2. Configuration and Build Management. Robert Reiz Maven2 Configuration and Build Management Robert Reiz A presentation is not a documentation! A presentation should just support the speaker! PLOIN Because it's your time Seite 2 1 What is Maven2 2 Short

More information

Hands on exercise for

Hands on exercise for Hands on exercise for João Miguel Pereira 2011 0 Prerequisites, assumptions and notes Have Maven 2 installed in your computer Have Eclipse installed in your computer (Recommended: Indigo Version) I m assuming

More information

Software project management. and. Maven

Software project management. and. Maven Software project management and Maven Problem area Large software projects usually contain tens or even hundreds of projects/modules Will become messy and incomprehensible ibl if the projects don t adhere

More information

Build management & Continuous integration. with Maven & Hudson

Build management & Continuous integration. with Maven & Hudson Build management & Continuous integration with Maven & Hudson About me Tim te Beek tim.te.beek@nbic.nl Computer science student Bioinformatics Research Support Overview Build automation with Maven Repository

More information

Maven or how to automate java builds, tests and version management with open source tools

Maven or how to automate java builds, tests and version management with open source tools Maven or how to automate java builds, tests and version management with open source tools Erik Putrycz Software Engineer, Apption Software erik.putrycz@gmail.com Outlook What is Maven Maven Concepts and

More information

by Charles Souillard CTO and co-founder, BonitaSoft

by Charles Souillard CTO and co-founder, BonitaSoft C ustom Application Development w i t h Bonita Execution Engine by Charles Souillard CTO and co-founder, BonitaSoft Contents 1. Introduction 2. Understanding object models 3. Using APIs 4. Configuring

More information

Continuous integration in OSGi projects using Maven (v:0.1) Sergio Blanco Diez

Continuous integration in OSGi projects using Maven (v:0.1) Sergio Blanco Diez Continuous integration in OSGi projects using Maven (v:0.1) Sergio Blanco Diez December 1, 2009 Contents 1 Introduction 2 2 Maven 4 2.1 What is Maven?..................................... 4 2.2 How does

More information

Presentation of Enterprise Service Bus(ESB) and. Apache ServiceMix. Håkon Sagehaug 03.04.2008

Presentation of Enterprise Service Bus(ESB) and. Apache ServiceMix. Håkon Sagehaug 03.04.2008 Presentation of Enterprise Service Bus(ESB) and Apache ServiceMix Håkon Sagehaug 03.04.2008 Outline Enterprise Service Bus, what is is Apache Service Mix Java Business Integration(JBI) Tutorial, creating

More information

Continuous Integration Multi-Stage Builds for Quality Assurance

Continuous Integration Multi-Stage Builds for Quality Assurance Continuous Integration Multi-Stage Builds for Quality Assurance Dr. Beat Fluri Comerge AG ABOUT MSc ETH in Computer Science Dr. Inform. UZH, s.e.a.l. group Over 8 years of experience in object-oriented

More information

Software project management. and. Maven

Software project management. and. Maven Software project management and Maven Problem area Large software projects usually contain tens or even hundreds of projects/modules Will become messy if the projects don t adhere to some common principles

More information

Developer s Guide. How to Develop a Communiqué Digital Asset Management Solution

Developer s Guide. How to Develop a Communiqué Digital Asset Management Solution Developer s Guide How to Develop a Communiqué Digital Asset Management Solution 1 PURPOSE 3 2 CQ DAM OVERVIEW 4 2.1 2.2 Key CQ DAM Features 4 2.2 How CQ DAM Works 6 2.2.1 Unified Architecture 7 2.2.2 Asset

More information

COMPUTACIÓN ORIENTADA A SERVICIOS (PRÁCTICA) Dr. Mauricio Arroqui EXA-UNICEN

COMPUTACIÓN ORIENTADA A SERVICIOS (PRÁCTICA) Dr. Mauricio Arroqui EXA-UNICEN COMPUTACIÓN ORIENTADA A SERVICIOS (PRÁCTICA) Dr. Mauricio Arroqui EXA-UNICEN Actividad Crear un servicio REST y un cliente para el mismo ejercicio realizado durante la práctica para SOAP. Se requiere la

More information

About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer. AVRO Tutorial

About the Tutorial. Audience. Prerequisites. Copyright & Disclaimer. AVRO Tutorial i About the Tutorial Apache Avro is a language-neutral data serialization system, developed by Doug Cutting, the father of Hadoop. This is a brief tutorial that provides an overview of how to set up Avro

More information

Content. Development Tools 2(63)

Content. Development Tools 2(63) Development Tools Content Project management and build, Maven Version control, Git Code coverage, JaCoCo Profiling, NetBeans Static Analyzer, NetBeans Continuous integration, Hudson Development Tools 2(63)

More information

Sonatype CLM for Maven. Sonatype CLM for Maven

Sonatype CLM for Maven. Sonatype CLM for Maven Sonatype CLM for Maven i Sonatype CLM for Maven Sonatype CLM for Maven ii Contents 1 Introduction 1 2 Creating a Component Index 3 2.1 Excluding Module Information Files in Continuous Integration Tools...........

More information

Apache Tika for Enabling Metadata Interoperability

Apache Tika for Enabling Metadata Interoperability Apache Tika for Enabling Metadata Interoperability Apache: Big Data Europe September 28 30, 2015 Budapest, Hungary Presented by Michael Starch (NASA JPL) and Nick Burch (Quan,cate) Proposed by Giuseppe

More information

Kohsuke Kawaguchi Sun Microsystems, Inc. hk2.dev.java.net, glassfish.dev.java.net. Session ID

Kohsuke Kawaguchi Sun Microsystems, Inc. hk2.dev.java.net, glassfish.dev.java.net. Session ID 1 Kohsuke Kawaguchi Sun Microsystems, Inc. hk2.dev.java.net, glassfish.dev.java.net Session ID 2 What s GlassFish v3? JavaEE 6 API for REST (JAX-RS) Better web framework support (Servlet 3.0) WebBeans,

More information

Maven2 Reference. Invoking Maven General Syntax: Prints help debugging output, very useful to diagnose. Creating a new Project (jar) Example:

Maven2 Reference. Invoking Maven General Syntax: Prints help debugging output, very useful to diagnose. Creating a new Project (jar) Example: Maven2 Reference Invoking Maven General Syntax: mvn plugin:target [-Doption1 -Doption2 dots] mvn help mvn -X... Prints help debugging output, very useful to diagnose Creating a new Project (jar) mvn archetype:create

More information

Meister Going Beyond Maven

Meister Going Beyond Maven Meister Going Beyond Maven A technical whitepaper comparing OpenMake Meister and Apache Maven OpenMake Software 312.440.9545 800.359.8049 Winners of the 2009 Jolt Award Introduction There are many similarities

More information

How To Run Apa Tika On A Microsoft Macbook Or Ipa.Net (For Linux) Or Ipad (For Windows) (For Macbook) (Or Ipa) (On Linux) (Minor) (Large

How To Run Apa Tika On A Microsoft Macbook Or Ipa.Net (For Linux) Or Ipad (For Windows) (For Macbook) (Or Ipa) (On Linux) (Minor) (Large What's with all the 1s and 0s? Making sense of binary data at scale with Apache Tika Nick Burch CTO, Quanticate Those 1s and 0s Apache Tika the basics Detection Binary formats Text formats Extending Tika

More information

Sonatype CLM Enforcement Points - Continuous Integration (CI) Sonatype CLM Enforcement Points - Continuous Integration (CI)

Sonatype CLM Enforcement Points - Continuous Integration (CI) Sonatype CLM Enforcement Points - Continuous Integration (CI) Sonatype CLM Enforcement Points - Continuous Integration (CI) i Sonatype CLM Enforcement Points - Continuous Integration (CI) Sonatype CLM Enforcement Points - Continuous Integration (CI) ii Contents 1

More information

LAB 2 SPARK / D-STREAM PROGRAMMING SCIENTIFIC APPLICATIONS FOR IOT WORKSHOP

LAB 2 SPARK / D-STREAM PROGRAMMING SCIENTIFIC APPLICATIONS FOR IOT WORKSHOP LAB 2 SPARK / D-STREAM PROGRAMMING SCIENTIFIC APPLICATIONS FOR IOT WORKSHOP ICTP, Trieste, March 24th 2015 The objectives of this session are: Understand the Spark RDD programming model Familiarize with

More information

Drupal CMS for marketing sites

Drupal CMS for marketing sites Drupal CMS for marketing sites Intro Sample sites: End to End flow Folder Structure Project setup Content Folder Data Store (Drupal CMS) Importing/Exporting Content Database Migrations Backend Config Unit

More information

CI/CD Cheatsheet. Lars Fabian Tuchel Date: 18.March 2014 DOC:

CI/CD Cheatsheet. Lars Fabian Tuchel Date: 18.March 2014 DOC: CI/CD Cheatsheet Title: CI/CD Cheatsheet Author: Lars Fabian Tuchel Date: 18.March 2014 DOC: Table of Contents 1. Build Pipeline Chart 5 2. Build. 6 2.1. Xpert.ivy. 6 2.1.1. Maven Settings 6 2.1.2. Project

More information

IBM Tivoli Workload Scheduler Integration Workbench V8.6.: How to customize your automation environment by creating a custom Job Type plug-in

IBM Tivoli Workload Scheduler Integration Workbench V8.6.: How to customize your automation environment by creating a custom Job Type plug-in IBM Tivoli Workload Scheduler Integration Workbench V8.6.: How to customize your automation environment by creating a custom Job Type plug-in Author(s): Marco Ganci Abstract This document describes how

More information

Tutorial- Counting Words in File(s) using MapReduce

Tutorial- Counting Words in File(s) using MapReduce Tutorial- Counting Words in File(s) using MapReduce 1 Overview This document serves as a tutorial to setup and run a simple application in Hadoop MapReduce framework. A job in Hadoop MapReduce usually

More information

Tutorial Reference Manual. Java WireFusion 4.1

Tutorial Reference Manual. Java WireFusion 4.1 Tutorial Reference Manual Java WireFusion 4.1 Contents INTRODUCTION...1 About this Manual...2 REQUIREMENTS...3 User Requirements...3 System Requirements...3 SHORTCUTS...4 DEVELOPMENT ENVIRONMENT...5 Menu

More information

Mind The Gap! Setting Up A Code Structure Building Bridges

Mind The Gap! Setting Up A Code Structure Building Bridges Mind The Gap! Setting Up A Code Structure Building Bridges Representation Of Architectural Concepts In Code Structures Why do we need architecture? Complex business problems too many details to keep overview

More information

Continuous Integration Part 2

Continuous Integration Part 2 1 Continuous Integration Part 2 This blog post is a follow up to my blog post Continuous Integration (CI), in which I described how to execute test cases in Code Tester (CT) in a CI environment. What I

More information

IBM WebSphere Adapter for Email 7.0.0.0. Quick Start Tutorials

IBM WebSphere Adapter for Email 7.0.0.0. Quick Start Tutorials IBM WebSphere Adapter for Email 7.0.0.0 Quick Start Tutorials Note: Before using this information and the product it supports, read the information in "Notices" on page 182. This edition applies to version

More information

WIRIS quizzes web services Getting started with PHP and Java

WIRIS quizzes web services Getting started with PHP and Java WIRIS quizzes web services Getting started with PHP and Java Document Release: 1.3 2011 march, Maths for More www.wiris.com Summary This document provides client examples for PHP and Java. Contents WIRIS

More information

IKAN ALM Architecture. Closing the Gap Enterprise-wide Application Lifecycle Management

IKAN ALM Architecture. Closing the Gap Enterprise-wide Application Lifecycle Management IKAN ALM Architecture Closing the Gap Enterprise-wide Application Lifecycle Management Table of contents IKAN ALM SERVER Architecture...4 IKAN ALM AGENT Architecture...6 Interaction between the IKAN ALM

More information

Builder User Guide. Version 5.4. Visual Rules Suite - Builder. Bosch Software Innovations

Builder User Guide. Version 5.4. Visual Rules Suite - Builder. Bosch Software Innovations Visual Rules Suite - Builder Builder User Guide Version 5.4 Bosch Software Innovations Americas: Bosch Software Innovations Corp. 161 N. Clark Street Suite 3500 Chicago, Illinois 60601/USA Tel. +1 312

More information

Hello World RESTful web service tutorial

Hello World RESTful web service tutorial Hello World RESTful web service tutorial Balázs Simon (sbalazs@iit.bme.hu), BME IIT, 2015 1 Introduction This document describes how to create a Hello World RESTful web service in Eclipse using JAX-RS

More information

Java Forum Nord 2015. Dirk Mahler

Java Forum Nord 2015. Dirk Mahler by Java Forum Nord 2015 Dirk Mahler Black Boxes Called Artifacts Software As A Graph jqassistant Let s Explore Libraries! 2 Yes We Scan Software Analysis Using jqassistant 3 Artifact Result of a build/integration

More information

Developing Web Services with Apache CXF and Axis2

Developing Web Services with Apache CXF and Axis2 Developing Web Services with Apache CXF and Axis2 By Kent Ka Iok Tong Copyright 2005-2010 TipTec Development Publisher: TipTec Development Author's email: freemant2000@yahoo.com Book website: http://www.agileskills2.org

More information

Service Integration course. Cassandra

Service Integration course. Cassandra Budapest University of Technology and Economics Department of Measurement and Information Systems Fault Tolerant Systems Research Group Service Integration course Cassandra Oszkár Semeráth Gábor Szárnyas

More information

Overview of Web Services API

Overview of Web Services API 1 CHAPTER The Cisco IP Interoperability and Collaboration System (IPICS) 4.5(x) application programming interface (API) provides a web services-based API that enables the management and control of various

More information

Introduction to Programming Tools. Anjana & Shankar September,2010

Introduction to Programming Tools. Anjana & Shankar September,2010 Introduction to Programming Tools Anjana & Shankar September,2010 Contents Essentials tooling concepts in S/W development Build system Version Control System Testing Tools Continuous Integration Issue

More information

Creating a Simple, Multithreaded Chat System with Java

Creating a Simple, Multithreaded Chat System with Java Creating a Simple, Multithreaded Chat System with Java Introduction by George Crawford III In this edition of Objective Viewpoint, you will learn how to develop a simple chat system. The program will demonstrate

More information

EMC Documentum Composer

EMC Documentum Composer EMC Documentum Composer Version 6.5 User Guide P/N 300 007 217 A02 EMC Corporation Corporate Headquarters: Hopkinton, MA 01748 9103 1 508 435 1000 www.emc.com Copyright 2008 EMC Corporation. All rights

More information

Software Quality Exercise 2

Software Quality Exercise 2 Software Quality Exercise 2 Testing and Debugging 1 Information 1.1 Dates Release: 12.03.2012 12.15pm Deadline: 19.03.2012 12.15pm Discussion: 26.03.2012 1.2 Formalities Please submit your solution as

More information

Integration with Other Tools

Integration with Other Tools Integration with Other Tools In this chapter, we will cover: ff ff ff ff ff ff ff ff ff Configuring Eclipse and Maven for Selenium WebDriver test development Configuring IntelliJ IDEA and Maven for Selenium

More information

! E6893 Big Data Analytics:! Demo Session II: Mahout working with Eclipse and Maven for Collaborative Filtering

! E6893 Big Data Analytics:! Demo Session II: Mahout working with Eclipse and Maven for Collaborative Filtering E6893 Big Data Analytics: Demo Session II: Mahout working with Eclipse and Maven for Collaborative Filtering Aonan Zhang Dept. of Electrical Engineering 1 October 9th, 2014 Mahout Brief Review The Apache

More information

Creating Custom Web Pages for cagrid Services

Creating Custom Web Pages for cagrid Services Creating Custom Web Pages for cagrid Services Creating Custom Web Pages for cagrid Services Contents Overview Changing the Default Behavior Subclassing the AXIS Servlet Installing and Configuring the Custom

More information

Java 7 Recipes. Freddy Guime. vk» (,\['«** g!p#« Carl Dea. Josh Juneau. John O'Conner

Java 7 Recipes. Freddy Guime. vk» (,\['«** g!p#« Carl Dea. Josh Juneau. John O'Conner 1 vk» Java 7 Recipes (,\['«** - < g!p#«josh Juneau Carl Dea Freddy Guime John O'Conner Contents J Contents at a Glance About the Authors About the Technical Reviewers Acknowledgments Introduction iv xvi

More information

SETTING UP YOUR JAVA DEVELOPER ENVIRONMENT

SETTING UP YOUR JAVA DEVELOPER ENVIRONMENT SETTING UP YOUR JAVA DEVELOPER ENVIRONMENT Summary This tipsheet describes how to set up your local developer environment for integrating with Salesforce. This tipsheet describes how to set up your local

More information

Java Language Tools COPYRIGHTED MATERIAL. Part 1. In this part...

Java Language Tools COPYRIGHTED MATERIAL. Part 1. In this part... Part 1 Java Language Tools This beginning, ground-level part presents reference information for setting up the Java development environment and for compiling and running Java programs. This includes downloading

More information

XML nyelvek és alkalmazások

XML nyelvek és alkalmazások THE INTERNET,mapped on the opposite page, is a scalefree network in that XML nyelvek és alkalmazások XML kezelés Javaban dis.'~tj port,from THE INTERNET,mapped on the opposite page, is a scalefree network

More information

SOLoist Automation of Class IDs Assignment

SOLoist Automation of Class IDs Assignment SOL Software d.o.o. www.sol.rs Public SOLoist Automation of Class IDs Assignment Project: SOLoist V4 Document Type: Project Documentation (PD) Document Version:. Date:.6.25 SOLoist - Trademark of SOL Software

More information

Creating an application with the Virgo Web Server

Creating an application with the Virgo Web Server Creating an application with the Virgo Web Server GreenPages: a demonstration Christopher Frost Ben Hale Rob Harrop Glyn Normington Steve Powell Andy Wilkinson 2.1.0.M02-incubation Abstract Spring application

More information

Teaming Up for Software Development

Teaming Up for Software Development Departamento de Informática Universidade do Minho Engenharia de Aplicações Introduction Agenda In the end of the session the attendee should be able to: Identify several super-sets of tools used in software

More information

Integrating your Maven Build and Tomcat Deployment

Integrating your Maven Build and Tomcat Deployment Integrating your Maven Build and Tomcat Deployment Maven Publishing Plugin for Tcat Server MuleSource and the MuleSource logo are trademarks of MuleSource Inc. in the United States and/or other countries.

More information

NIST/ITL CSD Biometric Conformance Test Software on Apache Hadoop. September 2014. National Institute of Standards and Technology (NIST)

NIST/ITL CSD Biometric Conformance Test Software on Apache Hadoop. September 2014. National Institute of Standards and Technology (NIST) NIST/ITL CSD Biometric Conformance Test Software on Apache Hadoop September 2014 Dylan Yaga NIST/ITL CSD Lead Software Designer Fernando Podio NIST/ITL CSD Project Manager National Institute of Standards

More information

European Access Point for Truck Parking Data

European Access Point for Truck Parking Data Delegated Regulation (EU) N 885/2013 of 15 May 2013 with regard to the provision of information services for safe and secure parking places for trucks and commercial vehicles European Access Point for

More information

Repository Management with Nexus

Repository Management with Nexus Repository Management with Nexus i Repository Management with Nexus Ed. 4.0 Repository Management with Nexus ii Contents 1 Introducing Nexus Repository Manager 1 2 Concepts 7 3 Installing and Running Nexus

More information

NetBeans e lo sviluppo di applicazioni Java/JavaFX per Facebook. Corrado De Bari corrado.debari@sun.com Sun Microsystems Italia

NetBeans e lo sviluppo di applicazioni Java/JavaFX per Facebook. Corrado De Bari corrado.debari@sun.com Sun Microsystems Italia NetBeans e lo sviluppo di applicazioni Java/JavaFX per Facebook Agenda Goals Architecture Samples step by step References Why JavaFX for Facebook Facebook as a world wide content manager: Photos and Videos

More information

GOOGLE DOCS. 1. Creating an account

GOOGLE DOCS. 1. Creating an account GOOGLE DOCS Google Docs allows you to create and share your work online using free software that operates much like Microsoft Word, Excel, and PowerPoint. Here are some features: Create, edit and upload

More information

Word Count Code using MR2 Classes and API

Word Count Code using MR2 Classes and API EDUREKA Word Count Code using MR2 Classes and API A Guide to Understand the Execution of Word Count edureka! A guide to understand the execution and flow of word count WRITE YOU FIRST MRV2 PROGRAM AND

More information

Vaidya Guide. Table of contents

Vaidya Guide. Table of contents Table of contents 1 Purpose... 2 2 Prerequisites...2 3 Overview... 2 4 Terminology... 2 5 How to Execute the Hadoop Vaidya Tool...4 6 How to Write and Execute your own Tests... 4 1 Purpose This document

More information

Builder User Guide. Version 6.0.1. Visual Rules Suite - Builder. Bosch Software Innovations

Builder User Guide. Version 6.0.1. Visual Rules Suite - Builder. Bosch Software Innovations Visual Rules Suite - Builder Builder User Guide Version 6.0.1 Bosch Software Innovations Americas: Bosch Software Innovations Corp. 161 N. Clark Street Suite 3500 Chicago, Illinois 60601/USA Tel. +1 312

More information

Oracle Universal Content Management 10.1.3

Oracle Universal Content Management 10.1.3 Date: 2007/04/16-10.1.3 Oracle Universal Content Management 10.1.3 Document Management Quick Start Tutorial Oracle Universal Content Management 10.1.3 Document Management Quick Start Guide Page 1 Contents

More information

FUSE-ESB4 An open-source OSGi based platform for EAI and SOA

FUSE-ESB4 An open-source OSGi based platform for EAI and SOA FUSE-ESB4 An open-source OSGi based platform for EAI and SOA Introduction to FUSE-ESB4 It's a powerful OSGi based multi component container based on ServiceMix4 http://servicemix.apache.org/smx4/index.html

More information

1 Building, Deploying and Testing DPES application

1 Building, Deploying and Testing DPES application 1 Building, Deploying and Testing DPES application This chapter provides updated instructions for accessing the sources code, developing, building and deploying the DPES application in the user environment.

More information

LICENSE4J FLOATING LICENSE SERVER USER GUIDE

LICENSE4J FLOATING LICENSE SERVER USER GUIDE LICENSE4J FLOATING LICENSE SERVER USER GUIDE VERSION 4.5.5 LICENSE4J www.license4j.com Table of Contents Getting Started... 2 Floating License Usage... 2 Installation... 4 Windows Installation... 4 Linux

More information

Implementing SQI via SOAP Web-Services

Implementing SQI via SOAP Web-Services IST-2001-37264 Creating a Smart Space for Learning Implementing SQI via SOAP Web-Services Date: 10-02-2004 Version: 0.7 Editor(s): Stefan Brantner, Thomas Zillinger (BearingPoint) 1 1 Java Archive for

More information

Tutorial 5: Developing Java applications

Tutorial 5: Developing Java applications Tutorial 5: Developing Java applications p. 1 Tutorial 5: Developing Java applications Georgios Gousios gousiosg@aueb.gr Department of Management Science and Technology Athens University of Economics and

More information

Setting up Hadoop with MongoDB on Windows 7 64-bit

Setting up Hadoop with MongoDB on Windows 7 64-bit SGT WHITE PAPER Setting up Hadoop with MongoDB on Windows 7 64-bit HCCP Big Data Lab 2015 SGT, Inc. All Rights Reserved 7701 Greenbelt Road, Suite 400, Greenbelt, MD 20770 Tel: (301) 614-8600 Fax: (301)

More information

Contents. Apache Log4j. What is logging. Disadvantages 15/01/2013. What are the advantages of logging? Enterprise Systems Log4j and Maven

Contents. Apache Log4j. What is logging. Disadvantages 15/01/2013. What are the advantages of logging? Enterprise Systems Log4j and Maven Enterprise Systems Log4j and Maven Behzad Bordbar Lecture 4 Log4j and slf4j What is logging Advantages Architecture Maven What is maven Terminology Demo Contents 1 2 Apache Log4j This will be a brief lecture:

More information

Automated performance testing using Maven & JMeter. George Barnett, Atlassian Software Systems @georgebarnett

Automated performance testing using Maven & JMeter. George Barnett, Atlassian Software Systems @georgebarnett Automated performance testing using Maven & JMeter George Barnett, Atlassian Software Systems @georgebarnett Create controllable JMeter tests Configure Maven to create a repeatable cycle Run this build

More information

CSE 70: Software Development Pipeline Build Process, XML, Repositories

CSE 70: Software Development Pipeline Build Process, XML, Repositories CSE 70: Software Development Pipeline Build Process, XML, Repositories Ingolf Krueger Department of Computer Science & Engineering University of California, San Diego La Jolla, CA 92093-0114, USA California

More information

SDK Code Examples Version 2.4.2

SDK Code Examples Version 2.4.2 Version 2.4.2 This edition of SDK Code Examples refers to version 2.4.2 of. This document created or updated on February 27, 2014. Please send your comments and suggestions to: Black Duck Software, Incorporated

More information

FOCUS ON YOUR FEATURES

FOCUS ON YOUR FEATURES FOCUS ON YOUR FEATURES DROPWIZARD TAKES CARE OF THE REST Felix Braun @ JavaLand 2015 DROPWIZARD'S HIGHLIGHTS Develop & deploy a RESTful microservice in 5 minutes Application start-up time under 2 seconds

More information

AVRO - SERIALIZATION

AVRO - SERIALIZATION http://www.tutorialspoint.com/avro/avro_serialization.htm AVRO - SERIALIZATION Copyright tutorialspoint.com What is Serialization? Serialization is the process of translating data structures or objects

More information

B.Sc (Honours) - Software Development

B.Sc (Honours) - Software Development Galway-Mayo Institute of Technology B.Sc (Honours) - Software Development E-Commerce Development Technologies II Lab Session Using the Java URLConnection Class The purpose of this lab session is to: (i)

More information

Amazon Glacier. Developer Guide API Version 2012-06-01

Amazon Glacier. Developer Guide API Version 2012-06-01 Amazon Glacier Developer Guide Amazon Glacier: Developer Guide Copyright 2016 Amazon Web Services, Inc. and/or its affiliates. All rights reserved. Amazon's trademarks and trade dress may not be used in

More information

Enterprise Content Management with Microsoft SharePoint

Enterprise Content Management with Microsoft SharePoint Enterprise Content Management with Microsoft SharePoint Overview of ECM Services and Features in Microsoft Office SharePoint Server 2007 and Windows SharePoint Services 3.0. A KnowledgeLake, Inc. White

More information

Overview of DatadiagramML

Overview of DatadiagramML Overview of DatadiagramML Microsoft Corporation March 2004 Applies to: Microsoft Office Visio 2003 Summary: This document describes the elements in the DatadiagramML Schema that are important to document

More information

Repository Management with Nexus

Repository Management with Nexus Repository Management with Nexus i Repository Management with Nexus Ed. 4.0 Repository Management with Nexus ii Contents 1 Introducing Sonatype Nexus 1 2 Concepts 6 3 Installing and Running Nexus 16 4

More information

SparkLab May 2015 An Introduction to

SparkLab May 2015 An Introduction to SparkLab May 2015 An Introduction to & Apostolos N. Papadopoulos Assistant Professor Data Engineering Lab, Department of Informatics, Aristotle University of Thessaloniki Abstract Welcome to SparkLab!

More information

Multiple vulnerabilities in Apache Foundation Struts 2 framework. Csaba Barta and László Tóth

Multiple vulnerabilities in Apache Foundation Struts 2 framework. Csaba Barta and László Tóth Multiple vulnerabilities in Apache Foundation Struts 2 framework Csaba Barta and László Tóth 12. June 2008 Content Content... 2 Summary... 3 Directory traversal vulnerability in static content serving...

More information

Practice Fusion API Client Installation Guide for Windows

Practice Fusion API Client Installation Guide for Windows Practice Fusion API Client Installation Guide for Windows Quickly and easily connect your Results Information System with Practice Fusion s Electronic Health Record (EHR) System Table of Contents Introduction

More information

ORACLE GOLDENGATE BIG DATA ADAPTER FOR HIVE

ORACLE GOLDENGATE BIG DATA ADAPTER FOR HIVE ORACLE GOLDENGATE BIG DATA ADAPTER FOR HIVE Version 1.0 Oracle Corporation i Table of Contents TABLE OF CONTENTS... 2 1. INTRODUCTION... 3 1.1. FUNCTIONALITY... 3 1.2. SUPPORTED OPERATIONS... 4 1.3. UNSUPPORTED

More information

Developing Eclipse Plug-ins* Learning Objectives. Any Eclipse product is composed of plug-ins

Developing Eclipse Plug-ins* Learning Objectives. Any Eclipse product is composed of plug-ins Developing Eclipse Plug-ins* Wolfgang Emmerich Professor of Distributed Computing University College London http://sse.cs.ucl.ac.uk * Based on M. Pawlowski et al: Fundamentals of Eclipse Plug-in and RCP

More information

An Overview of Java. overview-1

An Overview of Java. overview-1 An Overview of Java overview-1 Contents What is Java Major Java features Java virtual machine Java programming language Java class libraries (API) GUI Support in Java Networking and Threads in Java overview-2

More information

Hadoop Streaming. Table of contents

Hadoop Streaming. Table of contents Table of contents 1 Hadoop Streaming...3 2 How Streaming Works... 3 3 Streaming Command Options...4 3.1 Specifying a Java Class as the Mapper/Reducer... 5 3.2 Packaging Files With Job Submissions... 5

More information

Using Impatica for Power Point

Using Impatica for Power Point Using Impatica for Power Point What is Impatica? Impatica is a tool that will help you to compress PowerPoint presentations and convert them into a more efficient format for web delivery. Impatica for

More information

MarkLogic Server. Java Application Developer s Guide. MarkLogic 8 February, 2015. Copyright 2015 MarkLogic Corporation. All rights reserved.

MarkLogic Server. Java Application Developer s Guide. MarkLogic 8 February, 2015. Copyright 2015 MarkLogic Corporation. All rights reserved. Java Application Developer s Guide 1 MarkLogic 8 February, 2015 Last Revised: 8.0-3, June, 2015 Copyright 2015 MarkLogic Corporation. All rights reserved. Table of Contents Table of Contents Java Application

More information

N CYCLES software solutions. XML White Paper. Where XML Fits in Enterprise Applications. May 2001

N CYCLES software solutions. XML White Paper. Where XML Fits in Enterprise Applications. May 2001 N CYCLES software solutions White Paper Where Fits in Enterprise Applications May 2001 65 Germantown Court 1616 West Gate Circle Suite 205 Nashville, TN 37027 Cordova, TN 38125 Phone 901-756-2705 Phone

More information

A Sample OFBiz application implementing remote access via RMI and SOAP Table of contents

A Sample OFBiz application implementing remote access via RMI and SOAP Table of contents A Sample OFBiz application implementing remote access via RMI and SOAP Table of contents 1 About this document... 2 2 Introduction... 2 3 Defining the data model... 2 4 Populating the database tables with

More information

Zebra and MapReduce. Table of contents. 1 Overview...2 2 Hadoop MapReduce APIs...2 3 Zebra MapReduce APIs...2 4 Zebra MapReduce Examples...

Zebra and MapReduce. Table of contents. 1 Overview...2 2 Hadoop MapReduce APIs...2 3 Zebra MapReduce APIs...2 4 Zebra MapReduce Examples... Table of contents 1 Overview...2 2 Hadoop MapReduce APIs...2 3 Zebra MapReduce APIs...2 4 Zebra MapReduce Examples... 2 1. Overview MapReduce allows you to take full advantage of Zebra's capabilities.

More information

Introduction to XML Applications

Introduction to XML Applications EMC White Paper Introduction to XML Applications Umair Nauman Abstract: This document provides an overview of XML Applications. This is not a comprehensive guide to XML Applications and is intended for

More information

D5.4.4 Integrated SemaGrow Stack API components

D5.4.4 Integrated SemaGrow Stack API components ICT Seventh Framework Programme (ICT FP7) Grant Agreement No: 318497 Data Intensive Techniques to Boost the Real Time Performance of Global Agricultural Data Infrastructures Deliverable Form Project Reference

More information

Enterprise Service Bus

Enterprise Service Bus We tested: Talend ESB 5.2.1 Enterprise Service Bus Dr. Götz Güttich Talend Enterprise Service Bus 5.2.1 is an open source, modular solution that allows enterprises to integrate existing or new applications

More information

Display Zipped Files within D2L Content Window

Display Zipped Files within D2L Content Window Display Zipped Files within D2L Content Window Some presentations (lectures, interactive exercises etc.) may need to be uploaded into Desire2Learn as zipped files, such as narrated Microsoft PowerPoint

More information

Installing Java. Table of contents

Installing Java. Table of contents Table of contents 1 Jargon...3 2 Introduction...4 3 How to install the JDK...4 3.1 Microsoft Windows 95... 4 3.1.1 Installing the JDK... 4 3.1.2 Setting the Path Variable...5 3.2 Microsoft Windows 98...

More information

Developer Guide: Smartphone Mobiliser Applications. Sybase Mobiliser Platform 5.1 SP03

Developer Guide: Smartphone Mobiliser Applications. Sybase Mobiliser Platform 5.1 SP03 Developer Guide: Smartphone Mobiliser Applications Sybase Mobiliser Platform 5.1 SP03 DOCUMENT ID: DC01866-01-0513-01 LAST REVISED: August 2013 Copyright 2013 by Sybase, Inc. All rights reserved. This

More information

Setting up an online Java Jmonitor. server using the. EXPERIMENTAL code from. John Melton GØORX/N6LYT

Setting up an online Java Jmonitor. server using the. EXPERIMENTAL code from. John Melton GØORX/N6LYT Setting up an online Java Jmonitor server using the EXPERIMENTAL code from John Melton GØORX/N6LYT This is NOT a commercial endeavor. John created jmonitor as an one EXPERIMENT along his path toward multiple

More information

Talend Component: tjasperreportexec

Talend Component: tjasperreportexec Talend Component: tjasperreportexec Purpose This component creates (compile + fill + export) reports based on Jasper Report designs (jrxml files). Making reports in the ETL system provides multiple advantages:

More information