Walk Before You Run. Prerequisites to Linked Data. Kenning Arlitsch Dean of the Library @kenning_msu

Similar documents

SEARCH ENGINE OPTIMIZATION FOR DIGITAL COLLECTIONS. Kenning Arlitsch Patrick OBrien Sandra McIntyre

Digital Training Search Engine Optimization. Presented by: Aris Tianto Head of Search at

OCLC CONTENTdm and the WorldCat Digital Collection Gateway Overview

DIGITAL MARKETING BASICS: SEO

A Basic Guide To Onsite SEO

Advanced & Actionable SEO for WordPress. Ryan Erwin Internet Marketing Chicago LLC. Internetmarekting.chicago.com

OCLC CONTENTdm. Geri Ingram Community Manager. Overview. Spring 2015 CONTENTdm User Conference Goucher College Baltimore MD May 27, 2015

SEO Audit Report For Ravenna Way Ravenna Way Boynton Beach FL 33437

Website Search Engine Optimization. Presented by: David Backes Senior Account Executive Anvil Media Inc.

Janet Bartoli Search Engine Optimization Consulting Small Business Packages

SEO BEST PRACTICES IPRODUCTION. SEO Best Practices. November 2013 V3. 1 iproduction

Learn how your CMS can have a HUGE impact on your SEO efforts

How to Drive More Traffic to Your Event Website

2015 SEO AND Beyond. Enter the Search Engines for Business.

Sixth International Conference on Webometrics, Informetrics and Scientometrics & Eleventh COLLNET Meeting, October 19 22, 2010, University of Mysore,

Authorship and SEO Search, Social and Tools to Promote You and Your Authors

1. SEO INFORMATION...2

SEO Tutorial PDF for Beginners

SEO Workshop Today s Coach Lynn Stevenson. SEO Analyst

Website Audit Reports

Mark E. Pruzansky MD. Local SEO Action Plan for. About your Local SEO Action Plan. Technical SEO. 301 Redirects. XML Sitemap. Robots.

Identifying Ethical SEO

An Approach to Give First Rank for Website and Webpage Through SEO

Content Marketing Templates

BUILDING YOUR SITE FOR SEO

Pizza SEO: Effective Web. Effective Web Audit. Effective Web Audit. Copyright Pizza SEO Ltd.

GOOGLE ANALYTICS TERMS

DrupalCamp Edinburgh Drupal 7 SEO. How to gain an advantage over your non-drupal competition. Diarmy Simpson

Campaign Goals, Objectives and Timeline SEO & Pay Per Click Process SEO Case Studies SEO & PPC Strategy On Page SEO Off Page SEO Pricing Plans Why Us

30 Website Audit Report. 6 Website Audit Report. 18 Website Audit Report. 12 Website Audit Report. Package Name 3

Sreekariyam P.O,Trivandrum - 17 Kerala Ph M info@acewaretechnology.com Web

How to catch Google s attention Search Engine Optimization and Marketing. Linda Schuster President/CEO (502) x115

Search Engine Optimisation Guide May 2009

New Features Relevant to Webmasters

The Ultimate Guide to Magento SEO Part 1: Basic website setup

SCIENCE TIME SEARCH MARKETING TRENDS 2014

Geo Targeting Server location, country-targeting, language declarations & hreflang

EARCM ENG NE. Did you know that SEO increases traffic, leads and sales? SEQ = More Website Visitors More Traffic = More Leads More Leads = More Sales

SEO Basics for Starters

[Ramit Solutions] SEO SMO- SEM - PPC. [Internet / Online Marketing Concepts] SEO Training Concepts SEO TEAM Ramit Solutions

Search engine optimisation (SEO)

SEO CAMPAIGN REVIEW FOR PLANNED PARENTHOOD FEDERATION OF AMERICA, INC.

Search Engine Optimization Content is Key. Emerald Web Sites-SEO 1

SEO Hats. On-Page Optimisation. Internet Technical Terms. Website Architecture. How the Search Engine works. Search Engine Parameters

Contents: Executive Summary

Increasing Traffic to Your Website Through Search Engine Optimization (SEO) Techniques

Baidu: Webmaster Tools Overview and Guidelines

SEO = More Website Visitors More Traffic = More Leads More Leads= More Sales

SEARCH ENGINE OPTIMIZATION

Yandex: Webmaster Tools Overview and Guidelines

Seolize Short Guide on SEO and how to use Seolize C o p y r i g h t i a n n e t v2. 1

Your Blueprint websites Content Management System (CMS).

Chapter 6. Attracting Buyers with Search, Semantic, and Recommendation Technology

But isn t SEO complicated? AND expensive? What about the changing Google algorithms? How can I make my site SEOfriendly?

Get More Hits to Your Website

11 Core Elements Of A Successful Digital and Content Marketing Campaign. RODA marketing is a full service digital marketing and consulting agency.

Professional Diploma in Digital Marketing Module 2: Search Engine Optimisation Version 4.0 Location: Oslo/Norway Lecturer: Nina Furu

SEO AND CONTENT MANAGEMENT SYSTEM

Fraunhofer FOKUS. Fraunhofer Institute for Open Communication Systems Kaiserin-Augusta-Allee Berlin, Germany.

SEO SUITE ULTIMATE GUIDE 1

10 SEO Tricks for Prestashop

Search Marketing Training

The Almighty SEO Guide For Small Businesses

SEO Training SYLLABUS by SEOOFINDIA.COM

Search Engine Optimization - From Automatic Repetitive Steps To Subtle Site Development

Ranking Web of repositories: End users point of view?

Digital Asset Management Developing your Institutional Repository

UBER SEO. Affordable Online Marketing for Startups & Small Business. Provided By: EBWAY Crea2ve Solu2ons

Transcription:

Walk Before You Run Prerequisites to Linked Data Kenning Arlitsch Dean of the Library @kenning_msu

First, Take Care of Basics Linked Data applications will not matter if search engines can t first find library websites and repositories, crawl them, and understand the metadata provided.

Agenda Traditional SEO (Search Engine Optimization) Optimize hardware, software, websites, metadata Semantic Web Optimization Semantic Identity Schema.org Projects at MSU Using a vocabulary understood by search engines Improve machine comprehension

Funded Research 2011-2014 Getting Found: Search Engine Optimization for Digital Repositories 2014-2017 Measuring Up: Assessing Accuracy of Reported Use and Impact of Digital Repositories Partners OCLC Research Association of Research Libraries University of New Mexico

Part 1 of 3 SEARCH ENGINE OPTIMIZATION

SEO Building Blocks Priority 1 Increase Reach Get objects indexed by search engines Priority 2 Increase Visibility in SERP Provide robust descriptive content Priority 3 Get Relevant Increase click-through rates (CTR)

Why it Matters

Where College Students Begin Research 2005 2010 DeRosa, Cathy, et al. Perceptions of Libraries, 2010: Context and Community: A Report to the OCLC Membership, OCLC, 2010.

Need more reasons? Americans submit 18 billion search queries to search engines each month* 12 billion to Google sites (67%) 3.5 billion to Microsoft sites (19%) 1.8 billion to Yahoo! Sites (10%) 355 million to Ask Networks (.02%) 224 million to AOL, Inc. (.01%) How much of that traffic is directed to our libraries? * http://www.comscore.com/insights/market-rankings/comscore-releases-november- 2014-U.S.-Desktop-Search-Engine-Rankings

Our Research Inspiration Ten years building digital libraries Mountain West Digital Library Utah Digital Newspapers Western Waters Digital Library Western Soundscape Archive Were they being used?

Basic SEO improved Utah s collection accessibility in Google Google Index Ratio - All Collections* 12% Average 51% 79% 92% 0% 25% 50% 75% 100% 07/05/10 04/04/11 11/30/11 12/05/13 Google Index Ratio = URLs submitted / URLs Indexed by Google ~150 collections containing ~170,00 URLs (07/2010) and ~170 collections containing ~282,000 URLs (12/2013)

...Producing significant increases in the average number of page views per day Avg. Page Views / Day content.lib.utah.edu

resulting in more referrals and visitors 12 week comparison 2010 vs. 2012

Technical Barriers to SE Crawlers Website Design Graphics Confusing site hierarchies and paths Slow servers CMS often lack canonical links Metadata Schema not understood by SE Not unique Inconsistent/inaccurate

Challenge is presenting structured data SE s can identify, parse and digest Human Readable Wolfinger, N. H., & McKeever, M. (2006, July). Thanks for nothing: changes in income and labor force participation for never-married mothers since 1982. In 101st American Sociological Association (ASA) Annual Meeting; 2006 Aug 11-14; Montreal, Canada (No. 2006-07-04, pp. 1-42). Institute of Public & International Affairs (IPIA), University of Utah. Google Scholar Understandable

Google Scholar can read and understand! Google Scholar

Organizational/Cultural Themes Traditional SEO is an afterthought Librarians think too small re potential traffic Organizational communication is poor Analytics are usually poorly implemented Vendors are slow to catch on to SEO problems Because we don t demand it

How does a library justify SEO? Promote use of collections Accountability to funding organizations Improve institutional reputation Increase citation rates Attract research funding Attract students and faculty

Recommended SEO Process Institutionalize SEO Strategic Plan/driven by administration Accurate Measurement Tools Traditional SEO Get Indexed = Index Ratio Get Visible = Search Engine Results Page (SERP) Semantic Web Optimization Get Relevant = Click Through Rates (CTR)

Part 2 of 3 SEMANTIC WEB OPTIMIZATION

Semantic Identity Library organizations are poorly defined and represented on the Semantic Web

Google s Knowledge Graph A knowledge base to enhance search results with semanticsearch information gathered from a wide variety of sources Source: Wikipedia

Google s Perception of MSU Lib - 2012

Where does Google get its information?

Trusted Sources for Search Engines No Wikipedia presence? You don t exist as an important entity or thing You exist as a string of (confusing) text Other influences on the Knowledge Graph FreeBase (phasing out) Wikidata Google Places/Google My Business Google+

Wikipedia - 2012

DBPedia entry - 2012

2014 DBpedia entry

Remember, this was 2012

MSU Library - 2014

Part 3 of 3 TWO MSU SCHEMA.ORG PROJECTS

Schema.org Common vocabulary for describing things on the web Supported by Bing, Google, Yahoo and Yandex On-page markup helps search engines understand the information on webpages and provide richer results. https://support.google.com/webmasters/answer /1211158?hl=en 37

Hypothesis Implementing Schema.org will Improve machine understanding of content Improve rich snippets shown in SERP Increase click-through rates from SERP More traffic More users finding what they re looking for

Project 1: A Controlled Experiment by Jason Clark (with Michelle Gollehon) Two digital collections Similar size/content/date range Photos and historical documents 1 optimized with Schema.org (Schultz) 1 control (Brook)

A Revised Digital Library Architecture Collection Page (home page) arc.lib.montana.edu/schultz-0010/ About Pages (about page, topics page) arc.lib.montana.edu/schultz-0010/about.php arc.lib.montana.edu/schultz-0010/topics.php Item Pages (individual record page) arc.lib.montana.edu/schultz-0010/item/31 Sitemap and rel=canonical work arc.lib.montana.edu/schultz-0010/

Results

Project 2: Schema.org IR Data Model by Jeff Mixter and Patrick OBrien 1909 DC records from the Montana State University ScholarWorks IR scholarworks.montana.edu Started with Schema.org as the base Created an extension vocabulary using the same mechanics and conventions used in Schema.org It is published as RDFa http://purl.org/montana-state/library/ 44

schema: http://schema.org/ dcterms: http://purl.org/dc/terms/ mont: http://purl.org/montana-state/library/ 45

Syndication of RDF data Loaded all 1909 RDF descriptions back into the ScholarWorks repository Tweaked DSpace to pull and display JSON-ld data Newly created entities loaded into a Triple Store with a Pubby frontend Amazon Web Services 46

47 Google Webmaster Tools

CTR? Not so much, yet

Semantic Web Team Kenning Arlitsch, Dean @kenning_msu Patrick OBrien, Semantic Web Director @sempob Jeff Mixter, Research Associate, OCLC Research Jason Clark, Head of Lib Informatics and Computing @jaclark Scott Young, Digital Initiatives Librarian @hei_scott Doralyn Rossmann, Head of Coll Development @doralyn Jean Godby, Senior Research Associate, OCLC Research

Relevant Publications Arlitsch, Kenning, and Patrick S. OBrien. (2013) Improving the visibility and use of digital repositories through SEO. Chicago: ALA TechSource. ISBN-13: 978-1-55570-906-8 Mixter, Jeff, Patrick OBrien and Kenning Arlitsch. Describing Theses and Dissertations using Schema.org, Proceedings of the International Conference on Dublin Core and Metadata Applications 2014, Dublin Core Metadata Initiative: 138-146. Arlitsch, Kenning. Being Irrelevant: How Library Data Interchange Standards have kept us off the Internet, Journal of Library Administration, 54, no. 7 (2014): 609-619. Arlitsch, Kenning, Patrick OBrien, Jason A. Clark, Scott W.H. Young and Doralyn Rossmann. Demonstrating Library Value at Network Scale: Leveraging the Semantic Web with New Knowledge Work, Journal of Library Administration, 54, no. 5 (2014): 413-425. Arlitsch, Kenning, Patrick OBrien, and Brian Rossmann. "Managing Search Engine Optimization: An Introduction for Library Administrators." Journal of Library Administration 53, no. 2-3 (2013): 177-188. Arlitsch, Kenning, and Patrick S. O'Brien. "Invisible institutional repositories: Addressing the low indexing ratios of IRs in Google Scholar." Library Hi Tech 30, no. 1 (2012): 60-81.