A Taxonomy of Web Search by Andrei Broder
|
|
|
- Blake Moore
- 10 years ago
- Views:
Transcription
1 A Taxonomy of Web Search by Andrei Broder 2012
2 Outline Motivation 1 Motivation
3 Outline Motivation 1 Motivation
4 Aims of the Paper Point out the difference between classic IR and web search Introduce and analyze a taxonomy of web searches Show how search engines deal with web-specific needs
5 The Classical Model for IR
6 Web-spesific Needs
7 Outline Motivation 1 Motivation
8 Classification of Web Queries 1 Informational 2 Navigational 3 Transactional
9 Informational Queries Acquire some information assumed to be present on one or more web pages Information is in static form No further interaction is predicted Example: Where will WC 2018 be held WC 2018
10 Navigational Queries To reach a particular site User visited it in the past or assumes that it exists Only one right result Example: What is the official website of IBM? official website IBM
11 Transactional Queries Perform some web-mediated activity Further interaction is expected Main categories: shopping, finding servers, downloading various types of files Example: I need an accommodation in Rome. hotel Rome
12 Outline Motivation 1 Motivation
13 User Survey A survey of AltaVista users presented to random users users are self selected a pop-up window with the questions Questions to distinguish type of the query.
14 User Survey Questions
15 Log Analysis A random set of 1000 queries from the daily AltaVista log Only English queries Sexually oriented queries are removed Queries that are neither navigational, nor transactional are assumed to be informational
16 Results Motivation Table: Query Classification Type of query User Survey Query Log Analysis Navigational 24.5% 20% Informational 39% 48% Transactional 36% 30%
17 Outline Motivation 1 Motivation
18 First generation ( ) On-page data, close to classic IR, mostly informational queries AltaVista, Excite, WebCrawler, etc. Second generation ( ) Off-page, use of web-specific data such as link analysis, anchor-text, and click-through data, informational and navigational queries Google, DirectHit Third generation (2000-now) Attempt to ask "the need behind a query" Data from multiple sources (San Francisco : hotel reservation links, map server, weather server etc.) Support for informational, navigational, transactional queries
19 First generation ( ) On-page data, close to classic IR, mostly informational queries AltaVista, Excite, WebCrawler, etc. Second generation ( ) Off-page, use of web-specific data such as link analysis, anchor-text, and click-through data, informational and navigational queries Google, DirectHit Third generation (2000-now) Attempt to ask "the need behind a query" Data from multiple sources (San Francisco : hotel reservation links, map server, weather server etc.) Support for informational, navigational, transactional queries
20 First generation ( ) On-page data, close to classic IR, mostly informational queries AltaVista, Excite, WebCrawler, etc. Second generation ( ) Off-page, use of web-specific data such as link analysis, anchor-text, and click-through data, informational and navigational queries Google, DirectHit Third generation (2000-now) Attempt to ask "the need behind a query" Data from multiple sources (San Francisco : hotel reservation links, map server, weather server etc.) Support for informational, navigational, transactional queries
21 Outline Motivation 1 Motivation
22 Motivation Web search is task-driven. Search engines need to deal with different types of queries. The main aim of third generation search engines is to deal efficiently with transactional queries via semantic analyses (understanding what the query is about) and blending of various external databases.
23 Questions Motivation
A COMPREHENSIVE REVIEW ON SEARCH ENGINE OPTIMIZATION
Volume 4, No. 1, January 2013 Journal of Global Research in Computer Science REVIEW ARTICLE Available Online at www.jgrcs.info A COMPREHENSIVE REVIEW ON SEARCH ENGINE OPTIMIZATION 1 Er.Tanveer Singh, 2
SEARCH ENGINE OPTIMIZATION
SEARCH ENGINE OPTIMIZATION WEBSITE ANALYSIS REPORT FOR miaatravel.com Version 1.0 M AY 2 4, 2 0 1 3 Amendments History R E V I S I O N H I S T O R Y The following table contains the history of all amendments
Search Engine Optimization
Module Presenter s Manual Search Engine Optimization Effective from: April 2015 Ver. 1.0 Presenter s Manual Aptech Limited Page 1 Amendment Record Version No. Effective Date Change Replaced Pages 1.0 April
Search Taxonomy. Web Search. Search Engine Optimization. Information Retrieval
Information Retrieval INFO 4300 / CS 4300! Retrieval models Older models» Boolean retrieval» Vector Space model Probabilistic Models» BM25» Language models Web search» Learning to Rank Search Taxonomy!
Digital Training Search Engine Optimization. Presented by: Aris Tianto Head of Search at InboundID [email protected] @atianto
Digital Training Search Engine Optimization Presented by: Aris Tianto Head of Search at InboundID [email protected] @atianto Why Is Search Important Why search is important? Total Internet users in Indonesia
Technical challenges in web advertising
Technical challenges in web advertising Andrei Broder Yahoo! Research 1 Disclaimer This talk presents the opinions of the author. It does not necessarily reflect the views of Yahoo! Inc. 2 Advertising
SEARCH ENGINE OPTIMIZATION(SEO) Basics of SEO
SEARCH ENGINE OPTIMIZATION(SEO) Basics of SEO What is SEO? SEO is an abbreviation for search engine optimization. SEO is the process of improving the volume or quality of traffic to a web site from search
Computational Advertising Andrei Broder Yahoo! Research. SCECR, May 30, 2009
Computational Advertising Andrei Broder Yahoo! Research SCECR, May 30, 2009 Disclaimers This talk presents the opinions of the author. It does not necessarily reflect the views of Yahoo! Inc or any other
LO C AL S E O. Cheat Sheet
LO C AL S E O Cheat Sheet What is local SEO? As a result of the rapid growth of mobile, local SEO has grown significantly in recent years with businesses aiming to take advantage of the improved connectivity
An Empirical Analysis of Sponsored Search Performance in Search Engine Advertising. Anindya Ghose Sha Yang
An Empirical Analysis of Sponsored Search Performance in Search Engine Advertising Anindya Ghose Sha Yang Stern School of Business New York University Outline Background Research Question and Summary of
Be found higher in organic search with Yellow Search Optimisation
Be found higher in organic search with Yellow Search Optimisation Yellow Search Optimisation (SEO) helps your website get found further up the organic search results, increasing your visibility to reach
Internet Explorer Security Settings. Help Sheet. Client Services. Version 4 Definitive 21 July 2009
Internet Explorer Security Settings Help Sheet Client Services Contents About this document 2 Audience... 2 Scope... 2 Related documentation... 2 Adding Præmium to your list of trusted sites 3 Pop up blocker
RSA Event Source Configuration Guide. Microsoft Internet Information Services
Configuration Guide Microsoft Internet Information Services Last Modified: Thursday, February 13, 2014 Event Source (Device) Product Information Vendor Microsoft Event Source (Device) Internet Information
Active Interest Media File Transfer Server Initial Client Install Documentation
Active Interest Media File Transfer Server Initial Client Install Documentation TABLE OF CONTENTS The Login Screen... Pg. 2 Firefox Java Enhanced WebClient. Pg. 3 Internet Explorer v7 Enhanced WebClient
How to Log in to LDRPS-Web v10 (L10) https://enterprise.strohlservices.com
How to Log in to LDRPS-Web v10 (L10) https://enterprise.strohlservices.com Contents First Time Login Instructions... 1 1) Use the Internet Explorer (IE) Web browser*... 1 2) Install the.net Framework...
The Core Pillars of AN EFFECTIVE DOCUMENT MANAGEMENT SOLUTION
The Core Pillars of AN EFFECTIVE DOCUMENT MANAGEMENT SOLUTION Amanda Perran 6 Time MVP Microsoft SharePoint Server Practice Lead, SharePoint - Plato vts Microsoft Co-Author of Beginning SharePoint 2007
UBER SEO. Affordable Online Marketing for Startups & Small Business. Provided By: EBWAY Crea2ve Solu2ons www.ebwaycrea2ve.com
UBER SEO Affordable Online Marketing for Startups & Small Business Provided By: EBWAY Crea2ve Solu2ons www.ebwaycrea2ve.com What is UBER SEO? EBWAY Creative provides SEO, SEM and SMO services, specifically
Outlook Data File navigate to the PST file that you want to open, select it and choose OK. The file will now appear as a folder in Outlook.
Migrate Archived Outlook Items Outlook includes archiving functionality that is used to free up space on the mail server by moving older items from the mail server to PST files stored on your computer
2011-2012 Search Engine Optimization (SEO)
2011-2012 Search Engine Optimization (SEO) Page 1 About TheMediaCrew TheMediaCrew -a leading search engine marketing and Optimization Company. We offer a complete range of search engine marketing services
Search Engine Optimization with Jahia
Search Engine Optimization with Jahia Thomas Messerli 12 Octobre 2009 Copyright 2009 by Graduate Institute Table of Contents 1. Executive Summary...3 2. About Search Engine Optimization...4 3. Optimizing
Director of Marketing, Cote Family Companies
Frank Soukup III Frank Soukup III Director of Marketing, Cote Family Companies Overview Understand what it is What it does How does it affect you How to use it for your advantage How to use blogs & social
ipad Set Up Guide: Staff! 1 of! 20
ipad Set Up Guide: Staff! 1 of! 20 Follow the step-by-step directions in this document to activate your ipad; set up Lotus Notes Traveler; install and configure Google Chrome and Google Drive; and set
SEO Workshop Keyword and Competitor Research and On Page Optimisation
SEO Workshop Keyword and Competitor Research and On Page Optimisation Marketing & Public Relations Department University of Newcastle April 2014 SEO Workshop Contents 2 What is SEO? STEP 1: Define Purpose
Data Warehousing in the Age of Big Data
Data Warehousing in the Age of Big Data Krish Krishnan AMSTERDAM BOSTON HEIDELBERG LONDON NEW YORK OXFORD * PARIS SAN DIEGO SAN FRANCISCO SINGAPORE SYDNEY TOKYO Morgan Kaufmann is an imprint of Elsevier
CASE STUDY. Online B2B Marketing and Lead Generation Tactics Increase Sales SEARCH - MARKETING - SOCIAL - MOBILE - ADVERTISING
CASE STUDY Online B2B Marketing and Lead Generation Tactics Increase Sales SEARCH - MARKETING - SOCIAL - MOBILE - ADVERTISING MES Hybrid Document Systems Depends on KEO Marketing s Online B2B Marketing
Improving Webpage Visibility in Search Engines by Enhancing Keyword Density Using Improved On-Page Optimization Technique
Improving Webpage Visibility in Search Engines by Enhancing Keyword Density Using Improved On-Page Optimization Technique Meenakshi Bansal Assistant Professor Department of Computer Engineering, YCOE,
EBOX Digital Content Management System (CMS) User Guide For Site Owners & Administrators
EBOX Digital Content Management System (CMS) User Guide For Site Owners & Administrators Version 1.0 Last Updated on 15 th October 2011 Table of Contents Introduction... 3 File Manager... 5 Site Log...
Internet Explorer Settings for use with Privia
Internet Explorer Settings for use with Privia The following document is intended for users who are running Privia and Internet Explorer who either cannot install the Privia client or the client is not
SEO Workshop Today s Coach Lynn Stevenson. SEO Analyst
SEO Workshop Today s Coach Lynn Stevenson SEO Analyst Overview Introduction to SEO Importance of Content SEO Content Best Practices Keyword Research Optimizing Content Common Pitfalls Social Media and
000-608. IBM WebSphere Process Server V7.0 Deployment Exam. http://www.examskey.com/000-608.html
IBM 000-608 IBM WebSphere Process Server V7.0 Deployment Exam TYPE: DEMO http://www.examskey.com/000-608.html Examskey IBM 000-608 exam demo product is here for you to test the quality of the product.
Managing Documents in the Citrix XenApp Remote Desktop
Introduction Managing Documents in the Citrix XenApp Remote Desktop What is a Citrix XenApp Remote Desktop? It is a virtualized instance of MS Windows with only enough software to run TAS in a controlled
Professional Diploma in Digital Marketing Module 2: Search Engine Optimisation Version 4.0 Location: Oslo/Norway Lecturer: Nina Furu
Professional Diploma in Digital Marketing Module 2: Search Engine Optimisation Version 4.0 Location: Oslo/Norway Lecturer: Nina Furu Programme Structure Search Engine Optimisation PROFESSIONAL DIPLOMA
Website Standards Association. Business Website Search Engine Optimization
Website Standards Association Business Website Search Engine Optimization Copyright 2008 Website Standards Association Page 1 1. FOREWORD...3 2. PURPOSE AND SCOPE...4 2.1. PURPOSE...4 2.2. SCOPE...4 2.3.
Edwin Analytics Getting Started Guide
Edwin Analytics Getting Started Guide This guide provides assistance for accessing and using Edwin Analytics, the Department of Elementary and Secondary Education s (ESE) online tool for expanding data
ClicktoFax Service Usage Manual
ClicktoFax Service Usage Manual 1. Log in to Fax Service 2. Configure your account 3. Send a fax 4. Receive a fax/search for Faxes/View Faxes 5. Logout 6. Additional Support 1. Log into fax service: a.
Blackboard Learning System: Student Instructional Guide
Blackboard Learning System: Student Instructional Guide This manual was prepared to assist students in the understanding, orientation, and usage of the Blackboard Learning System online course management
Dynamic Content for Executive Recruitment Firm
Dynamic Content for Executive Recruitment Firm Added dynamic functionality to existing static HTML site for a Philadelphia-area firm specializing in executive recruitment for the healthcare industry. This
SEARCH ENGINE OPTIMIZATION Jakub Zilincan 1. Introduction. Search Engine Optimization
SEARCH ENGINE OPTIMIZATION Jakub Zilincan 1 Abstract: Search engine optimization techniques, often shortened to SEO, should lead to first positions in organic search results. Some optimization techniques
F-Secure Messaging Security Gateway. Deployment Guide
F-Secure Messaging Security Gateway Deployment Guide TOC F-Secure Messaging Security Gateway Contents Chapter 1: Deploying F-Secure Messaging Security Gateway...3 1.1 The typical product deployment model...4
Google Trusted Stores Setup in Magento
Google Trusted Stores Setup in Magento Google Trusted Stores is a free badging program that can improve your conversion rate and average order size by reassuring potential customers you offer a great shopping
Get More Value from Your Reference Data Make it Meaningful with TopBraid RDM
Get More Value from Your Reference Data Make it Meaningful with TopBraid RDM Bob DuCharme Data Governance and Information Quality Conference June 9 TopQuadrant Company Focus: TopQuadrant was founded in
Horizontal IoT Application Development using Semantic Web Technologies
Horizontal IoT Application Development using Semantic Web Technologies Soumya Kanti Datta Research Engineer Communication Systems Department Email: [email protected] Roadmap Introduction Challenges
How We Did It. Unique data model abstraction layer to integrate, but de-couple EHR data from patient website design.
EHR Accessibility The Big Idea: Provide a standardized and improved user experience for ALL disabled and abled patients while interacting with their providers Electronic Health Records System (EHR). The
Configuring a Custom Load Evaluator Use the XenApp1 virtual machine, logged on as the XenApp\administrator user for this task.
Lab 8 User name: Administrator Password: Password1 Contents Exercise 8-1: Assigning a Custom Load Evaluator... 1 Scenario... 1 Configuring a Custom Load Evaluator... 1 Assigning a Load Evaluator to a Server...
How to configure and use a TraumaCad Plug-in with Philips isite
How to configure and use a TraumaCad Plug-in with Philips isite After isite plug-in is installed in the isite Enterprise / Radiology environment (and according to the location of the plug-in system preferences
Problem: Logging on to UT Southwestern Student Center
FAQ UT Southwestern Student Center Page 1 If you are experiencing problems logging into the site or accessing a data link, please try the remedies listed here first. Most problems can be quickly resolved
A taxonomy of mobile applications in tourism
University of Wollongong Research Online Faculty of Commerce - Papers (Archive) Faculty of Business 2012 A taxonomy of mobile applications in tourism Heather Kennedy-Eden University of Wollongong, [email protected]
MiraCosta College now offers two ways to access your student virtual desktop.
MiraCosta College now offers two ways to access your student virtual desktop. We now feature the new VMware Horizon View HTML access option available from https://view.miracosta.edu. MiraCosta recommends
Practical Web Analytics for User Experience
Practical Web Analytics for User Experience How Analytics Can Help You Understand Your Users Michael Beasley UX Designer, ITHAKA Ypsilanti, Michigan, USA üf IBs fmij ELSEVIER Amsterdam Boston Heidelberg
Topics in Website Testing. [Reading assignment: Chapter 14, pp. 211-227]
Topics in Website Testing [Reading assignment: Chapter 14, pp. 211-227] How to test a website Easiest way to start is by treating the web site as a black box. Look at a sample website such as www.apple.com
SEO Training SYLLABUS by SEOOFINDIA.COM
1 Foundation Course SEO Training SYLLABUS by SEOOFINDIA.COM Search Engine Optimization Training Course Internet and Search Engine Basics Internet Marketing Importance of Internet Marketing Types of Internet
Best Practice Search Engine Optimisation
Best Practice Search Engine Optimisation October 2007 Lead Hitwise Analyst: Australia Heather Hopkins, Hitwise UK Search Marketing Services Contents 1 Introduction 1 2 Search Engines 101 2 2.1 2.2 2.3
Augmented Search for Web Applications. New frontier in big log data analysis and application intelligence
Augmented Search for Web Applications New frontier in big log data analysis and application intelligence Business white paper May 2015 Web applications are the most common business applications today.
3CX IP PBX with Twilio Elastic SIP Trunking Interconnection Guide
3CX IP PBX with Twilio Elastic SIP Trunking Interconnection Guide Hello and welcome to our guide on how to set up a 3CX IP PBX for use with Twilio s Elastic SIP Trunking service. This guide covers the
Search Engine Optimization Proposal
Search Engine Optimization Proposal Focus on Search Engine Ranking Improvement & Website Optimization (+ suggestions on Social Media + Mobile Devices + Local Market + Website Integration & Development
Google Places Optimization (FAQ)
Google Places Optimization (FAQ) 1. What is local Search? Local search is any search aimed at finding something within a specific geographic area like hotel in Los Angles. Most of the time Google delivers
Web Beacons Guidelines for Notice and Choice
Web Beacons Guidelines for Notice and Choice The following statement was developed by a coalition of companies 1 in an effort to guide the appropriate use of Web Beacons. 2 The coalition is made up of
Understanding User Goals in Web Search
Understanding User Goals in Web Search Daniel E. Rose Yahoo! Inc. 701 First Avenue, MS B201 Sunnyvale, CA 94089 USA +1 408 349 7992 [email protected] Danny Levinson Yahoo! Inc. 144 Fourth Avenue SW,
Optimizing a large dynamically generated website for search engine crawling and ranking
Optimizing a large dynamically generated website for search engine crawling and ranking Research and implementation of a search engine optimization solution for a Fredhopper implementation Johan Köhne
SEO Training To Attract More Clients
SEO Training To Attract More Clients Welcome to the first of 3 webinars: SEO Basics Today s agenda: Inbound marketing concepts... Today s agenda: Inbound marketing concepts... What is SEO & why you should
How to Query, View & Print Documents in BDM. Banner Document Management (BDM)
(BDM) How to Query, View & Print Documents in BDM 1 Table of Contents 1. Overview 3 2. Creating & Saving Queries 2.1 Creating Queries within an Application 4 2.2 Creating a Cross Application Query 5 3.
CITY OF NAPLES VENDOR REGISTRATION TUTORIAL VENDOR SELF SERVICE (VSS) VENDOR REGISTATION TUTORIAL
CITY OF NAPLES VENDOR REGISTRATION TUTORIAL VENDOR SELF SERVICE (VSS) 3/5/2015 VENDOR REGISTATION TUTORIAL Start Vendor Registration Process: Please start by going to the City of Naples website located
LEARNING RESOURCE CENTRE GUIDE TO OFFICE 365
LEARNING RESOURCE CENTRE GUIDE TO OFFICE 365 LEARNING RESOURCE CENTRE OCTOBER 2014/2015 Table of Contents Explanation of One Drive and Microsoft Office Online... 3 How to create a document and folder...
Analyzing Chinese-English Mixed Language Queries in a Web Search Engine
Analyzing Chinese-English Mixed Language Queries in a Web Search Engine Hengyi Fu School of Information Florida State University 142 Collegiate Loop, FL 32306 [email protected] Shuheng Wu School of Information
Siri: A Virtual Personal Assistant An Ontology-driven Application for the Masses
Siri: A Virtual Personal Assistant An Ontology-driven Application for the Masses 2010 Siri, Inc. All rights reserved. Adam Cheyer and Tom Gruber cofounders, Siri It was imaginable 20 years ago. Apple's
2 Downloading Access Manager 3.1 SP4 IR1
Novell Access Manager 3.1 SP4 IR1 Readme May 2012 Novell This Readme describes the Novell Access Manager 3.1 SP4 IR1 release. Section 1, Documentation, on page 1 Section 2, Downloading Access Manager 3.1
Hadoop. MPDL-Frühstück 9. Dezember 2013 MPDL INTERN
Hadoop MPDL-Frühstück 9. Dezember 2013 MPDL INTERN Understanding Hadoop Understanding Hadoop What's Hadoop about? Apache Hadoop project (started 2008) downloadable open-source software library (current
CHAPTER 20 TESING WEB APPLICATIONS. Overview
CHAPTER 20 TESING WEB APPLICATIONS Overview The chapter describes the Web testing. Web testing is a collection of activities whose purpose is to uncover errors in WebApp content, function, usability, navigability,
SEO TRAINING COURSES GLASGOW & EDINBURGH. Online Marketing Workshops to fit all budgets
SEO TRAINING COURSES GLASGOW & EDINBURGH Online Marketing Workshops to fit all budgets SEO TRAINING COURSES GLASGOW & EDINBURGH BASIC INFORMATION SIGNUP CONSUMERS INTERACT WITH SEARCH ENGINES ON-PAGE OPTIMISATION
SWOT Assessment: CoreMedia, CoreMedia 7
SWOT Assessment: CoreMedia, CoreMedia 7 Analyzing the strengths, weaknesses, opportunities, and threats Reference Code: IT014-002848 Publication Date: 09 Dec 2013 Author: Sue Clarke SUMMARY Catalyst Web
