Selecting a Taxonomy Management Tool. Wendi Pohs InfoClear Consulting #SLATaxo




InfoClear Consulting
What do we do?
Content Analytics Strategy and Implementation, including:
- Taxonomy/Ontology development and maintenance
- Semantic technology and tool selection
- Metadata tagging strategy
- Search engine integration and relevance tuning
- Application design for unstructured data
- Graph database design
6/11/2013 © 2012 InfoClear Consulting

What we'll discuss today
- Overview of the basic, high-level considerations
- A deep dive into how taxonomy management software was selected at IBM
- Questions and discussion

A quick recap: The basic components
Taxonomy (from the Greek: taxis (order) or a classification scheme): A dynamic list of words and phrases that are used to describe what content is about
Taxonomies include:
- Facets: A named group of similar words and phrases, used to provide input to metadata on the backend, and for different aspects of search
- Relationship Types: Named links between words and phrases in facets
- Attributes: Properties of terms (scope notes, dates)
- Entities: People, places and things, often proper nouns that are easily identifiable within content
- Metadata: Named fields that are associated with content, often populated by the words or phrases in the taxonomy
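The components above can be sketched as a small data model. This is only an illustrative sketch, not the structure of any particular taxonomy tool: the class names, the `allowed_values` helper, and the sample terms are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    label: str
    # Attributes are properties of a term, e.g. a scope note or a date.
    attributes: dict = field(default_factory=dict)


@dataclass
class Facet:
    name: str
    terms: dict = field(default_factory=dict)  # label -> Term

    def add(self, label, **attributes):
        term = Term(label, dict(attributes))
        self.terms[label] = term
        return term


@dataclass
class Taxonomy:
    facets: dict = field(default_factory=dict)  # facet name -> Facet
    # Each relationship is (source label, relationship type, target label).
    relationships: list = field(default_factory=list)

    def facet(self, name):
        # Create the facet on first use, then return the stored one.
        if name not in self.facets:
            self.facets[name] = Facet(name)
        return self.facets[name]

    def relate(self, source, rel_type, target):
        self.relationships.append((source, rel_type, target))

    def allowed_values(self, facet_name):
        """Terms of one facet, usable as the value list of a metadata field."""
        return sorted(self.facets[facet_name].terms)


taxonomy = Taxonomy()
products = taxonomy.facet("Products")
products.add("Database software", scope_note="Use for server products only")
products.add("Search software")
taxonomy.facet("Locations").add("Boston")
taxonomy.relate("Search software", "related_to", "Database software")

print(taxonomy.allowed_values("Products"))
```

Keeping relationships as named triples, separate from the facets, is one way to let the same model hold both simple lists and richer structures.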

The Semantic Spectrum: What do you need to manage?
Simple to complex, in order of increasing complexity: List, Taxonomy, Thesaurus, Ontology
- List: Flat list; controlled vocabulary
- Taxonomy: Hierarchical (parent/child). The relationship between a parent and a child can be relatively underspecified or ill-defined.
- Thesaurus: Equivalence (synonym); Hierarchical (broader than/narrower than); Related (associated). A thesaurus is typically used to associate the rough meaning of a term to the rough meaning of another term. Scope and history notes.
- Ontology: Classes (general things) in the many domains of interest; Instances (specific things); Relationships between those things; Properties (attributes) of those things; Functions of and processes involving those things; Constraints on and rules involving those things.
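The thesaurus level of the spectrum can be illustrated with a short sketch that stores each relationship together with its reciprocal. The codes BT, NT, RT, UF and USE are the conventional thesaurus abbreviations (broader term, narrower term, related term, used for, use); the `Thesaurus` class and the sample terms are hypothetical.

```python
from collections import defaultdict

# Reciprocal of each relationship type: BT <-> NT, RT <-> RT, UF <-> USE.
RECIPROCAL = {"BT": "NT", "NT": "BT", "RT": "RT", "UF": "USE", "USE": "UF"}


class Thesaurus:
    def __init__(self):
        # term -> relationship type -> set of linked terms
        self.links = defaultdict(lambda: defaultdict(set))

    def relate(self, term, rel_type, other):
        """Record the link and its reciprocal in one step."""
        self.links[term][rel_type].add(other)
        self.links[other][RECIPROCAL[rel_type]].add(term)

    def related(self, term, rel_type):
        return sorted(self.links[term][rel_type])

    def preferred(self, label):
        """Resolve a non-preferred label (synonym) to its preferred term.

        Assumes each synonym points at a single preferred term.
        """
        use = self.links[label]["USE"]
        return next(iter(use)) if use else label


thesaurus = Thesaurus()
thesaurus.relate("Dogs", "BT", "Mammals")              # Mammals is broader than Dogs
thesaurus.relate("Dogs", "RT", "Veterinary medicine")  # associative link
thesaurus.relate("Dogs", "UF", "Canines")              # Dogs is used for Canines

print(thesaurus.related("Mammals", "NT"))
print(thesaurus.preferred("Canines"))
```

Writing both directions at once is the kind of bookkeeping that is easy to get wrong in a spreadsheet and that a dedicated tool is expected to handle.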

How are taxonomies used?
[Diagram. Labels: Model, Load, Manipulate, Manage Change; Publish, Transform, Integrate (.XML*); Data Mgmnt, Content Mgmnt, Tagging Systems, Search, Text analytics]

As reference sources for other processes
- Allowed Values Lists for Metadata: A collected list of metadata values, most often used in Content Management systems
- Tagging systems: Input to manual and automated processes for applying metadata to content
- Auto-Categorization: A set of techniques, primarily automated, used for grouping similar content together
- Entity extraction: Identifying and extracting specific terms and phrases from content, either from an Authority List, or based on known patterns in text
- Transaction systems: Support integration and mapping of similar metadata from different systems
- Web sites: Support site navigation and faceted search
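As one illustration of a taxonomy acting as a reference source, entity extraction from an Authority List can be sketched with a regular expression built from the list. This is a minimal sketch under stated assumptions: the function names and sample list are hypothetical, matching is case-insensitive, and terms are assumed to begin and end with a letter or digit so that word boundaries apply. Production text analytics tools do considerably more.

```python
import re


def build_matcher(authority_list):
    # Try longer terms first so "New York City" wins over "New York".
    ordered = sorted(authority_list, key=len, reverse=True)
    pattern = r"\b(?:" + "|".join(re.escape(term) for term in ordered) + r")\b"
    return re.compile(pattern, re.IGNORECASE)


def extract_entities(text, authority_list):
    """Return authority terms found in the text, in order of first appearance."""
    canonical = {term.lower(): term for term in authority_list}
    found = []
    for match in build_matcher(authority_list).finditer(text):
        term = canonical[match.group(0).lower()]
        if term not in found:
            found.append(term)
    return found


authority = ["IBM", "New York", "New York City"]
text = "IBM opened an office in New York City; ibm staff moved from Boston."
print(extract_entities(text, authority))
```

"Boston" is not returned because it is not in the authority list; finding it would need the pattern-based approach the slide mentions as the alternative.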

Why buy a taxonomy management tool?
- Your taxonomy spreadsheet has become too cumbersome
  - No real tree view
  - Hard to visualize related terms
  - Hard when more than one person maintains the taxonomy
- You need better hooks for integration

How can you justify your purchase?
Your purchase justification is most often based on consuming applications, and not on the taxonomy tool itself, so you need to understand how you will use it:
- For auto-categorization?
- As part of a content management system?
- For Web site navigation?
- To support mapping between terms in legacy systems?
- For records or data management?

And you need to understand who will use the tool
- Are you working alone or will you be collaborating with others?
- Do you need to support other languages?
- Will you need IT support for installation and/or integration with other systems?
- Will this become an Enterprise tool?

What do the tools offer? 1. Data modeling
Building your taxonomy: facets, mappings, relationship types, attributes
Considerations
- Do you need to manage a standard thesaurus with BT/NT relationship types?
- Do you need other named relationships?
- How does the tool handle synonyms?
- How complex are your attributes?
- How does the tool handle polyhierarchy?
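The polyhierarchy question above is easier to judge with a concrete case. In the sketch below a term may have more than one broader term, and the function lists every path from that term to a top term. The `parents` mapping and the sample terms are hypothetical, and the hierarchy is assumed to contain no cycles.

```python
def paths_to_root(term, parents):
    """Return every broader-term path from `term` up to a top term."""
    broader = parents.get(term, [])
    if not broader:
        return [[term]]  # a top term: the path ends here
    paths = []
    for parent in broader:
        for path in paths_to_root(parent, parents):
            paths.append([term] + path)
    return paths


# "Tomatoes" sits under two parents, which is what makes this a polyhierarchy.
parents = {
    "Tomatoes": ["Fruits", "Vegetables"],
    "Fruits": ["Food"],
    "Vegetables": ["Food"],
}

for path in paths_to_root("Tomatoes", parents):
    print(" > ".join(path))
```

A tool that supports polyhierarchy has to show one term in several places in the tree while still storing it only once.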

What do the tools offer? 2. Editing
- Creating, renaming, merging and deleting terms, promoting and demoting terms within hierarchies, mapping terms
- Managing relationships and attributes
Considerations
- What is your most common editing task? Does the tool easily support it?
- Can you change many terms at once?
- How easy is it to find the terms you need to edit?
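Merging is a good editing task to test during an evaluation, because it touches labels, synonyms and hierarchy at once. The sketch below shows one plausible merge behavior, assumed here for illustration: the dropped term's label survives as a synonym of the kept term, and its children are re-parented. The dictionary layout and the `merge_terms` function are hypothetical, and real tools may behave differently.

```python
def merge_terms(taxonomy, keep, drop):
    """Merge `drop` into `keep`, keeping the old label as a synonym."""
    kept = taxonomy[keep]
    dropped = taxonomy.pop(drop)
    kept["synonyms"] = sorted(
        set(kept["synonyms"]) | set(dropped["synonyms"]) | {drop}
    )
    # If the kept term was a child of the dropped one, move it up a level.
    if kept["parent"] == drop:
        kept["parent"] = dropped["parent"]
    # Re-parent the remaining children of the dropped term.
    for term in taxonomy.values():
        if term["parent"] == drop:
            term["parent"] = keep
    return taxonomy


taxonomy = {
    "Software": {"parent": None, "synonyms": []},
    "Search engines": {"parent": "Software", "synonyms": ["Search tools"]},
    "Search software": {"parent": "Software", "synonyms": []},
    "Enterprise search": {"parent": "Search engines", "synonyms": []},
}

merge_terms(taxonomy, keep="Search software", drop="Search engines")
print(taxonomy["Search software"]["synonyms"])
print(taxonomy["Enterprise search"]["parent"])
```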

What do the tools offer? 3. Import/Export functionality
To and from lists, spreadsheets, XML and other formats
Considerations
- Are the taxonomy standards, like SKOS or ZTHES, supported by the tool?
- How robust is the spreadsheet import functionality?
- What formats do your consuming applications need?
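To make the SKOS question concrete, here is a minimal sketch of what a SKOS export in Turtle syntax might look like, written with the standard library only. The SKOS properties used (`skos:Concept`, `skos:prefLabel`, `skos:altLabel`, `skos:broader`) are part of the W3C SKOS vocabulary. The base URI, the `slug` helper, and the input layout are assumptions for the example, and labels are assumed to contain no double quotes or backslashes.

```python
import re


def slug(label):
    """Turn a label into a simple local name, e.g. 'Search software' -> 'search-software'."""
    return re.sub(r"[^A-Za-z0-9]+", "-", label).strip("-").lower()


def to_skos_turtle(terms, base="http://example.org/taxonomy/"):
    lines = [
        "@prefix skos: <http://www.w3.org/2004/02/skos/core#> .",
        f"@prefix ex: <{base}> .",
        "",
    ]
    for term in terms:
        label = term["label"]
        statements = ["a skos:Concept", f'skos:prefLabel "{label}"@en']
        for alt in term.get("synonyms", []):
            statements.append(f'skos:altLabel "{alt}"@en')
        for parent in term.get("broader", []):
            statements.append(f"skos:broader ex:{slug(parent)}")
        # One subject per term; its statements are separated by " ;".
        lines.append(f"ex:{slug(label)} " + " ;\n    ".join(statements) + " .")
    return "\n".join(lines)


terms = [
    {"label": "Software"},
    {
        "label": "Search software",
        "synonyms": ["Search engines"],
        "broader": ["Software"],
    },
]
turtle = to_skos_turtle(terms)
print(turtle)
```

During an evaluation, a useful check is whether a tool's own export can be loaded by the consuming applications without hand editing.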

What do the tools offer? 4. Workflow
- Draft => Approved/Published
- Change notification
- Email support
Considerations
These vary widely between tools, so think about what you really need, for example:
- Clear indication of term status if your taxonomy has many editors
- Clear indication of when a term has been approved or published
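A term workflow with change notification can be sketched as a small state machine. The transitions below (draft to approved to published, with a route back to draft) are an assumption made for the example, since tools differ on this point. The `TermWorkflow` class is hypothetical, and the `notify` callback stands in for whatever email or messaging hook a tool provides.

```python
# Allowed status changes; assumed for illustration, tools vary.
TRANSITIONS = {
    "draft": {"approved"},
    "approved": {"published", "draft"},
    "published": {"draft"},
}


class TermWorkflow:
    def __init__(self, label, notify=None):
        self.label = label
        self.status = "draft"
        self.history = ["draft"]
        # notify receives one message string per status change.
        self.notify = notify or (lambda message: None)

    def move_to(self, new_status):
        if new_status not in TRANSITIONS[self.status]:
            raise ValueError(
                f"{self.label}: cannot go from {self.status} to {new_status}"
            )
        old_status = self.status
        self.status = new_status
        self.history.append(new_status)
        self.notify(f"{self.label}: {old_status} -> {new_status}")


messages = []
term = TermWorkflow("Search software", notify=messages.append)
term.move_to("approved")
term.move_to("published")
print(term.status, messages)
```

Keeping the status history on the term gives editors the clear indication of when a term was approved or published.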

What do the tools offer? 5. Integration interfaces
- APIs
- XML outputs
Considerations
- Understand the tools your developers use: open source, Java, or C# tools?
- Are there built-in connectors or integration components?
- Does the tool operate on your platform or server (Windows, Linux, Mac)?
- Include IT in your purchase decisions

What do the tools offer? 6. Advanced features, for example:
- Visualization of terms and relationships
- Built-in auto-classification tools
- Built-in term mapping tools
- Robust versioning and archiving support
- Multi-lingual support
- Text analytics
Considerations
- Visualization tools are important if you need to sell your taxonomy to management
- Auto-classification tools can require a slightly different design methodology. Understand the primary purpose of the tool
- Versioning and archiving are not givens
- Your environment and needs are unique

The intangibles
Company viability and support
- You can often judge responsiveness by the technical support you receive as you evaluate the software
- Take the time to get customer references
Considerations
- Taxonomy management is often an adjunct function when it is part of other tools
- Smaller companies can be more agile, but new features are often driven by the loudest customer voice
- Consider tools that are easy to integrate with the related technologies that you use

General product considerations
Commercial products
- Suites of tools
- Formal support
- Formal roadmaps and development schedules
- Customization is almost always necessary
- Taxonomy management modules are sometimes created to support other, primary applications
Open source products
- Most flexibility for homegrown solutions
- Crowd-sourced, but large user community

Questions?
Need more info about specific tools? wpohs@infoclearonline.com
Or visit our Website at: http://www.infoclearonline.com