Auto-Classification in SharePoint. How BA Insight AutoClassifier Integrates with the SharePoint Managed Metadata Service



Similar documents
How To Use B Insight'S New Search Engine On Sharepoint

CONCEPTCLASSIFIER FOR SHAREPOINT

Delivering Smart Answers!

2007 to 2010 SharePoint Migration - Take Time to Reorganize

Developing Microsoft SharePoint Server 2013 Advanced Solutions MOC 20489

Taxonomies in Practice Welcome to the second decade of online taxonomy construction

Developing Microsoft SharePoint Server 2013 Advanced Solutions

UTILIZING COMPOUND TERM PROCESSING TO ADDRESS RECORDS MANAGEMENT CHALLENGES

Developing Microsoft SharePoint Server 2013 Advanced Solutions

Digital Marketplace - G-Cloud

Course 20489B: Developing Microsoft SharePoint Server 2013 Advanced Solutions OVERVIEW

DE-20489B Developing Microsoft SharePoint Server 2013 Advanced Solutions

Layer2 Knowledge Management Suite for SharePoint V3

Course Outline: Course 20489B: Developing Microsoft SharePoint Server 2013 Advanced Solutions

Auto-Classification for Document Archiving and Records Declaration

Intelligent document management for the legal industry

Compliance in Office 365 What You Should Know. Don Miller Vice President of Sales Concept Searching

SharePoint Term Store & Taxonomy Design Harold Brenneman Lighthouse Microsoft Technology Group

IBM Enterprise Content Management Product Strategy

Pivotal CRM 6.0. Benefit for your organization : a solution that can support your business needs

Get results with modern, personalized digital experiences

Semantic SharePoint. Technical Briefing. Helmut Nagy, Semantic Web Company Andreas Blumauer, Semantic Web Company

IBM Software Group Thought Leadership Whitepaper. IBM Customer Experience Suite and Real-Time Web Analytics

CA IT Client Manager Asset Inventory and Discovery

Describe how to utilize the Publishing API to access publishing settings and content.

Microsoft FAST Search Server 2010 for SharePoint Evaluation Guide

IBM Content Navigator

Reduce Cost, Time, and Risk ediscovery and Records Management in SharePoint

Spend Enrichment: Making better decisions starts with accurate data

Collaboration. Michael McCabe Information Architect black and white solutions for a grey world

Semaphore Overview. A Smartlogic White Paper. Executive Summary

Session 3: Leveraging Taxonomy Term Store for SharePoint: Defining a Multi-Taxonomy Structure for Content Management

Governance in Digital Asset Management

The Smart Archive strategy from IBM

Configuring SharePoint 2013 Document Management and Search. Scott Jamison Chief Architect & CEO Jornata scott.jamison@jornata.com

Taxonomy Enterprise System Search Makes Finding Files Easy

Case Study - MetaVis Migrator

Introduction to Epinomy. Big Data Semantics

84% of Migration Projects Fail Getting it Right in SharePoint

Transform records management

Analance Data Integration Technical Whitepaper

Veritas ediscovery Platform

Symantec ediscovery Platform, powered by Clearwell

CA Repository for Distributed. Systems r2.3. Benefits. Overview. The CA Advantage

Things Made Easy: One Click CMS Integration with Solr & Drupal

Analance Data Integration Technical Whitepaper

The Value of Taxonomy Management Research Results

Microsoft SQL Server Business Intelligence and Teradata Database

Office Business Applications (OBA) for Healthcare Organizations. Make better decisions using the tools you already know

ZAP Business Intelligence Application for Microsoft Dynamics

Best Practices for Architecting Taxonomy and Metadata in an Open Source Environment

How To Make Sense Of Data With Altilia

harmon.ie Delivers the Business Value of Office 365 Migrations

Oracle Big Data SQL Technical Update

The Power of Classifying in SharePoint 2010

Enhancing Lotus Domino search

MatchPoint Technical Features Tutorial Colygon AG Version 1.0

Unlocking The Value of the Deep Web. Harvesting Big Data that Google Doesn t Reach

What do Big Data & HAVEn mean? Robert Lejnert HP Autonomy

Microsoft Dynamics AX 2012 A New Generation in ERP

SharePoint Comparison of Features

Infor10 Corporate Performance Management (PM10)

Top 5 reasons to choose HP Information Archiving

How To Write An Ecm For Mammoet

Expanding Uniformance. Driving Digital Intelligence through Unified Data, Analytics, and Visualization

Layer2 Term Set Glossary App for SharePoint 2013

KPMG Unlocks Hidden Value in Client Information with Smartlogic Semaphore

Sage X3 People. A powerful, simple and flexible human resource management software

ORACLE PROJECT MANAGEMENT

Microsoft Enterprise Search for IT Professionals Course 10802A; 3 Days, Instructor-led

SOLUTION BRIEF CA ERwin Modeling. How can I understand, manage and govern complex data assets and improve business agility?

Accelerate BI Initiatives With Self-Service Data Discovery And Integration

The Core Pillars of AN EFFECTIVE DOCUMENT MANAGEMENT SOLUTION

Lost in Space? Methodology for a Guided Drill-Through Analysis Out of the Wormhole

WHAT'S NEW IN SHAREPOINT 2013 WEB CONTENT MANAGEMENT

Top 5 reasons to choose HP Information Archiving

Microsoft in an Integrated. Update. decisions faster. Presented by Steve Studer for the AIIM

Integrated archiving: streamlining compliance and discovery through content and business process management

IBM Content Analytics with Enterprise Search, Version 3.0


Course Code NCS2013: SharePoint 2013 No-code Solutions for Office 365 and On-premises

Document Management. Document Management for the Agile Enterprise. AuraTech Pte Ltd

CENTRALIZED CONTROL CENTERS FOR THE OIL & GAS INDUSTRY A detailed analysis on Business challenges and Technical adoption.

Capitalize on Big Data for Competitive Advantage with Bedrock TM, an integrated Management Platform for Hadoop Data Lakes

Get More Value from Your Reference Data Make it Meaningful with TopBraid RDM

Session 2: Designing Information Architecture for SharePoint: Making Sense in a World of SharePoint Architecture

Big Data and Semantic Web in Manufacturing. Nitesh Khilwani, PhD Chief Engineer, Samsung Research Institute Noida, India

Transcription:

How BA Insight AutoClassifier Integrates with the SharePoint Managed Metadata Service BA Insight 2015

Table of Contents Abstract... 3 Findability and the Value of Metadata... 3 Finding Information is Hard... 3 Metadata is Essential for Findability... 3 Microsoft SharePoint Incorporates Metadata... 3 How the BA Insight AutoClassifier Improves SharePoint... 4 The Challenge: Creating and Managing Metadata... 4 Using the AutoClassifier... 5 Overview of the AutoClassifier... 5 Managing Taxonomies and Rules... 5 Benefits Summary... 6 How the AutoClassifier Integrates with MMS... 7 Summary... 9 About BA Insight... 9 Copyright 2015 BA Insight. All Rights Reserved. Page 2

Abstract Metadata is essential for findability, yet the vast majority of content within organizations has poor metadata. Creating metadata is challenging, and doing it by machine is the only practical way to get consistent, quality metadata on a broad basis. SharePoint provides a facility for referencing and managing metadata called the Managed Metadata Service (MMS). The BA Insight AutoClassifier leverages this to provide a rules-based engine fully integrated with SharePoint. This paper discusses the need for Metadata and demonstrates how to use the AutoClassifier and MMS to create metadata and improve findability. Findability and the Value of Metadata Finding Information is Hard With the constant growth of stored data, finding information not only becomes difficult, but it is also critical for business success. In a Findability survey conducted by FindWise in 2014, 45% of respondents reported that finding the information they are looking for within their organization is difficult or very difficult. Metadata is Essential for Findability The exponential growth of information requires new methods for managing content and search. If documents are not tagged with the appropriate metadata, then queries return every document that contains the search term, with no refinement of search results. To meet today s information challenges, metadata must be used to enrich the end user search experience. With good metadata on content, search results are more relevant as priority is given to documents in which the query matches the metadata (over documents in which the query matches only the text within the document). Also, faceted search, which is likewise called refinement or faceted navigation, can be used to quickly drill down into search results to get to the right information. Microsoft SharePoint Incorporates Metadata Microsoft introduced the Managed Metadata Service (MMS) as part of SharePoint 2010 and enhanced it in SharePoint 2013. MMS includes a term store that holds term sets (also known as taxonomies or ontologies). SharePoint Out of the Box also includes verbatim extractors (string matchers) to populate managed properties in search and lets users manually tag content. Copyright 2015 BA Insight. All Rights Reserved. Page 3

How the BA Insight AutoClassifier Improves SharePoint The BA Insight AutoClassifier, which is the core of BA Insight s Classification software, leverages MMS to precisely define the metadata tags and automate the process of classifying information to improve the user experience. Taxonomies that are stored in the term store are used to define the tags used as metadata on documents. The AutoClassifier automatically tags these documents and also assigns content types based on these taxonomies. The AutoClassifier rules that specify how documents are tagged are stored in the term store as well, unifying the management of terms and rules, making administration much simpler. The Challenge: Creating and Managing Metadata In order to meet today s information challenges, metadata must be: fully customizable in order to meet organizational requirements precise in order to eliminate false positives standardized and comprehensive across all subsets of information automatically and immediately applied with manual update capabilities utilized to enrich the end user search experience While it is well known that metadata is the key to effective search, in enterprises, good metadata is rare. On the web, pages are tagged and curated manually by people who have a strong incentive to do so and a large SEO industry behind them. In the enterprise, it is impossible to tag everything by hand. Requiring authors and content contributors to tag everything usually backfires as it is very time consuming and people often avoid tagging or circumvent systems set up to force them to tag. For example, one organization found that over 40% of their repository was tagged with the first term on the dropdown list. People simply picked the quickest choice, not the correct choice. Inconsistent tagging and incomplete or simply wrong metadata results in poor search and also degrades compliance. In this environment, automated tagging is the only practical solution. Manually generated or processgenerated metadata can be supplemented, or corrected, by machine-generated metadata very costeffectively. Machines may not be perfect at tagging based on human language, but they are much more consistent and often more accurate than humans. Copyright 2015 BA Insight. All Rights Reserved. Page 4

Using the AutoClassifier Overview of the AutoClassifier The BA Insight AutoClassifier, which is part of our Classification software, provides automatic tagging and classification of content based upon a taxonomy, using a robust rules engine. Instead of manually tagging documents, BA Insight AutoClassifier can be used to automatically classify crawled documents according to a defined taxonomy. AutoClassifier automatically adds the defined taxonomy node names, or terms, to the MMS term store and tags the matching documents after these files are crawled. Managing Taxonomies and Rules The AutoClassifier includes a taxonomy manager to develop both taxonomies and rules. These are stored in the term store and are usable throughout SharePoint since they are fully integrated with MMS. Each taxonomy node contains associated terms that are stored in the term store. A tag, or taxonomy node name, is applied to a crawled document if the document is a match based on the associated rule. Rules provide control over the application of metadata. They establish the conditions under which metadata should be applied to a document and may involve multiple criteria. For example, a rule might specify that a matching document must include the terms President and Bush and exclude the word plant in order to be classified with the metadata tag Presidents. Rules are generated automatically by default, and can then be specified within the taxonomy manager. Taxonomies can also be developed in the same environment. Copyright 2015 BA Insight. All Rights Reserved. Page 5

As the taxonomy is developed, the AutoClassifier test bench can be used to: Test taxonomy rules See the matching documents View the highlighted matching terms within the text of the classified document Edit rules as required The AutoClassifier automatically applies these rules to content in SharePoint or to content outside of SharePoint that is crawled for search. This results in consistent metadata on all content. When users search, their results are more relevant, and the refiners are fully populated resulting in better exploration. End users: Experience faster search Get a more accurate set of matching documents Enjoy a more intuitive search experience See pages that are generated based on the taxonomy hierarchy Benefits Summary Metadata is the backbone of successful information architecture. Microsoft s Managed Metadata Service is a powerful facility that centrally controls taxonomies, enterprise keywords, and content types. With the addition of the BA Insight AutoClassifier, an organization can realize the full potential of an organized information architecture that enriches the end user search experience. Copyright 2015 BA Insight. All Rights Reserved. Page 6

How the AutoClassifier Integrates with MMS BA Insight AutoClassifier integrates seamlessly with SharePoint MMS to provide a customized set of metadata where the terms are stored in the SharePoint term store and automatically applied to crawled documents as they are indexed by AutoClassifier. The terms and content types that are defined in the AutoClassifier taxonomies are automatically stored in the SharePoint term store: Copyright 2015 BA Insight. All Rights Reserved. Page 7

The BA Insight AutoClassifier automatically tags crawled documents in SharePoint. The AutoClassifier document tagging within SharePoint can be verified. Documents can be automatically or manually tagged. The BA Insight AutoClassifier is an intuitive, easy-to-install application. A custom taxonomy can be developed, or the sample taxonomy shipped with AutoClassifier can be tweaked. Copyright 2015 BA Insight. All Rights Reserved. Page 8

Summary BA Insight AutoClassifier enables organizations to meet the challenges of customization, precision, and auto-classification while fully integrating with Microsoft SharePoint by leveraging MMS. It provides a rulesbased engine, applying text analytics and semantic processing to add consistent, quality metadata to content surfaced within SharePoint. About BA Insight BA Insight s Software Portfolio accelerates and future-proofs the creation of cloud-based and on-premise SharePoint portals for enterprises, transforming how users find information. Our software dramatically reduces the time, cost, effort, and risk of implementing SharePoint portals while providing a greatly improved user experience. Customers use our Applications to improve productivity by finding the right information faster using Visual Refiners, Smart Previews, User-Generated InfoSites, and Matter Comparison; our Classification software to increase findability using auto-tagging, metadata generation, and text analytics; and our pre-built connectors and federation to provide secure connectivity to a wide variety of content systems. We serve organizations of all sizes around the globe including the Australian Government Department of Defence, CA Technologies, Chevron, Deloitte, Ford Motor Company, Keurig Green Mountain, Pfizer, and Travers Smith. BA Insight is a Microsoft Gold Certified Partner and member of the Microsoft Business- Critical SharePoint Program. Visit www.bainsight.com for more information and follow us at @BAinsight. Copyright 2015 BA Insight. All Rights Reserved. Page 9