Reti di Calcolatori! Web Search

1 Reti di Calcolatori! Web Search

2 Search use (iProspect Survey, 4/04)

3 Without search engines the web wouldn't scale
1. No incentive in creating content unless it can be easily found; other finding methods haven't kept pace (taxonomies, bookmarks, etc.)
2. The web is both a technology artifact and a social environment: "The Web has become the new normal in the American way of life; those who don't go online constitute an ever-shrinking minority." [Pew Foundation report, January 2005]
3. Search engines make aggregation of interest possible, creating incentives for very specialized niche players. Economical: specialized stores, providers, etc. Social: narrow interests, specialized communities, etc.
4. The acceptance of search interaction makes "unlimited selection" stores possible: Amazon, Netflix, etc.
5. Search turned out to be the best mechanism for advertising on the web, a $15+ B industry. It is growing very fast, but the entire US advertising industry is $250B, so there is huge room to grow. Sponsored search marketing is about $10B.

4 The coarse-level dynamics
[Figure: content creators, content aggregators and content consumers, connected by flows labeled Feeds, Crawls, Editorial, Advertisement, Subscription and Transaction]

5 Brief (non-technical) history
Early keyword-based engines: Altavista, Excite, Infoseek, Inktomi, ca. 1995-1997
Paid placement ranking: Goto.com (morphed into Overture.com → Yahoo!)
Your search ranking depended on how much you paid
Auction for keywords: "casino" was expensive!

6 Brief (non-technical) history
1998+: Link-based ranking pioneered by Google
Blew away all early engines save Inktomi
Great user experience in search of a business model
Meanwhile Goto/Overture's annual revenues were nearing $1 billion
Result: Google added paid-placement "ads" to the side, independent of search results
Yahoo follows suit, acquiring Overture (for paid placement) and Inktomi (for search)

7 Ads vs. search results
Google has maintained that ads (based on vendors bidding for keywords) do not affect vendors' rankings in search results
[Screenshot: results page for the query "miele", with Sponsored Links from appliance and vacuum-cleaner retailers next to Web Results 1-10 of about 7,310,000 (miele.com, miele.co.uk, miele.de, miele.at)]

8 Ads vs. search results
Other vendors (Yahoo, MSN) have made similar statements from time to time
Any of them can change anytime
We will focus primarily on search results independent of paid placement ads, although the latter is a fascinating technical subject in itself

9 Web search basics
[Figure: architecture diagram showing the user, Search, the web spider, the indexer, the Web, the indexes and the ad indexes; the embedded screenshot is the "miele" results page with Sponsored Links and Web Results]

10 User Needs
Need [Brod02, RL04]
Informational: want to learn about something (~40% / 65%), e.g. "low hemoglobin"
Navigational: want to go to that page (~25% / 15%), e.g. "United Airlines"
Transactional: want to do something (web-mediated) (~35% / 20%): access a service, downloads, shop
Gray areas: find a good hub; exploratory search ("see what's there")
Example queries: Seattle weather, Mars surface images, Canon S410, Car rental Brasil

11 Web search users
Make ill-defined queries
Short: AV 2001: 2.54 terms avg, 80% < 3 words; AV 1998: 2.35 terms avg, 88% < 3 words [Silv98]
Imprecise terms
Sub-optimal syntax (most queries without operator)
Low effort
Wide variance in needs, expectations, knowledge, bandwidth
Specific behavior
85% look over one result screen only (mostly above the fold)
78% of queries are not modified (one query/session)
Follow links: "the scent of information"...

12 Query Distribution Power law: few popular broad queries, many rare specific queries

13 How far do people look for results? (Source: iprospect.com WhitePaper_2006_SearchEngineUserBehavior.pdf)

14 The Web corpus
The Web: no design/co-ordination
Distributed content creation, linking, democratization of publishing
Content includes truth, lies, obsolete information, contradictions
Unstructured (text, html, ...), semi-structured (XML, annotated photos), structured (databases)
Scale much larger than previous text corpora, but corporate records are catching up
Growth slowed down from initial "volume doubling every few months", but still expanding
Content can be dynamically generated

15 The Web: Dynamic content
A page without a static html version, e.g., current status of flight AA129 or current availability of rooms at a hotel
Usually assembled at the time of a request from a browser
Typically, the URL has a "?" character in it
[Figure: a browser requests AA129 from an application server backed by back-end databases]

16 Dynamic content
Most dynamic content is ignored by web spiders
Many reasons, including malicious spider traps
Some dynamic content (news stories from subscriptions) is sometimes delivered as static content
Application-specific spidering
Spiders commonly view web pages just as Lynx (a text browser) would
Note: even static pages are typically assembled on the fly (e.g., headers are common)

17 The web: size
What is being measured? Number of hosts; number of (static) html pages; volume of data
Number of hosts: Netcraft survey, a monthly report on how many web hosts & servers are out there
Number of pages: numerous estimates (will discuss later)

18 The Web as a Directed Graph
[Figure: Page A points to Page B through a hyperlink with anchor text]
Assumption 1: A hyperlink between pages denotes author-perceived relevance (quality signal)
Assumption 2: The anchor of the hyperlink describes the target page (textual context)

19 Anchor Text
WWW Worm - McBryan [Mcbr94]
For the query "ibm", how to distinguish between:
IBM's home page (mostly graphical)
IBM's copyright page (high term freq. for "ibm")
Rival's spam page (arbitrarily high term freq.)
[Figure: links with anchor text "ibm", "ibm.com" and "IBM home page" pointing to one page]
A million pieces of anchor text with "ibm" send a strong signal

20 Indexing anchor text
When indexing a document D, include anchor text from links pointing to D.
[Figure: three pages linking to D, containing "Armonk, NY-based computer giant IBM announced today", "Joe's computer hardware links: Compaq, HP, IBM" and "Big Blue today announced record profits for the quarter"]

21 Indexing anchor text Can sometimes have unexpected side effects - e.g., evil empire. Can index anchor text with less weight.
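
The idea of crediting anchor text to the target page, at a lower weight, can be sketched roughly as follows. This is only an illustration: the function name `build_index`, the whitespace tokenizer, the example URLs and the two weight constants are assumptions, not something the slides specify.

```python
from collections import defaultdict

# Hypothetical weights: body terms count fully, anchor terms count less.
BODY_WEIGHT = 1.0
ANCHOR_WEIGHT = 0.5


def build_index(pages, links):
    """Build term -> {url: weight} from page bodies and inbound anchor text.

    pages: dict mapping url -> body text
    links: iterable of (source_url, anchor_text, target_url)
    """
    index = defaultdict(lambda: defaultdict(float))
    for url, body in pages.items():
        for term in body.lower().split():
            index[term][url] += BODY_WEIGHT
    for _source, anchor, target in links:
        # Anchor text is credited to the page the link points to.
        for term in anchor.lower().split():
            index[term][target] += ANCHOR_WEIGHT
    return index


# Made-up example data: the target page never mentions "ibm" in its own text.
pages = {
    "www.ibm.com": "welcome products services",
    "joes-links.example": "computer hardware links",
}
links = [
    ("joes-links.example", "IBM", "www.ibm.com"),
    ("news.example", "Big Blue", "www.ibm.com"),
]
index = build_index(pages, links)
```

With this data, a lookup of "ibm" or "blue" finds www.ibm.com through anchor text alone, which is also why a phrase like "evil empire" can attach itself to a page that never uses it.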

22 Anchor Text
Other applications
Weighting/filtering links in the graph: HITS [Chak98], Hilltop [Bhar01]
Generating page descriptions from anchor text [Amit98, Amit00]

23 Citation Analysis
Citation frequency
Co-citation coupling frequency: cocitations with a given author measure impact
Cocitation analysis [Mcca90]: convert frequencies to correlation coefficients, do multivariate analysis/clustering, validate conclusions
E.g., cocitation in the "Geography and GIS" web shows communities [Lars96]
Bibliographic coupling frequency: articles that co-cite the same articles are related
Citation indexing: who is a given author cited by? (Garfield [Garf72]) E.g., Science Citation Index ( ), CiteSeer ( ) [Lawr99a]
Pagerank preview: Pinski and Narin, 1976

24 Query-independent ordering First generation: using link counts as simple measures of popularity. Two basic suggestions: Undirected popularity: Each page gets a score = the number of in-links plus the number of out-links (3+2=5). Directed popularity: Score of a page = number of its in-links (3).

25 Query processing First retrieve all pages meeting the text query (say venture capital). Order these by their link popularity (either variant on the previous page).
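
As a rough sketch of the two popularity variants and the ordering step, assuming the link graph is given as a list of (source, target) pairs and the text query has already produced a set of matching pages (the function names and the toy graph are illustrative only):

```python
from collections import Counter


def link_popularity(edges):
    """Return (directed, undirected) popularity scores from (source, target) links."""
    in_links = Counter(target for _source, target in edges)
    out_links = Counter(source for source, _target in edges)
    pages = set(in_links) | set(out_links)
    directed = {p: in_links[p] for p in pages}
    undirected = {p: in_links[p] + out_links[p] for p in pages}
    return directed, undirected


def rank(matching_pages, scores):
    """Order pages that already match the text query by descending popularity."""
    return sorted(matching_pages, key=lambda p: scores.get(p, 0), reverse=True)


# Page "p" has 3 in-links and 2 out-links, mirroring the (3+2=5) example.
edges = [("a", "p"), ("b", "p"), ("c", "p"), ("p", "d"), ("p", "e")]
directed, undirected = link_popularity(edges)
ranked = rank(["d", "p", "a"], directed)
```

Here `directed["p"]` is 3 and `undirected["p"]` is 5. Both scores depend only on raw link counts, which is what makes them easy to spam.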

26 Spamming simple popularity Exercise: How do you spam each of the following heuristics so your page gets a high score? Each page gets a score = the number of inlinks plus the number of out-links. Score of a page = number of its in-links.

27 Pagerank scoring
Imagine a browser doing a random walk on web pages:
Start at a random page
At each step, go out of the current page along one of the links on that page, equiprobably
[Figure: a page with three out-links, each followed with probability 1/3]
In the steady state each page has a long-term visit rate; use this as the page's score.

28 Not quite enough
The web is full of dead-ends.
Random walk can get stuck in dead-ends.
Makes no sense to talk about long-term visit rates.

29 Teleporting At a dead end, jump to a random web page. At any non-dead end, with probability 10%, jump to a random web page. With remaining probability (90%), go out on a random link. 10% - a parameter.
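
A minimal sketch of how the teleporting rule could be turned into transition probabilities, assuming pages are numbered 0 to n-1, each out-link list has no duplicates, and a "random web page" means one chosen uniformly from all n pages (including the current one). The function name and the three-page graph are made up for illustration.

```python
def transition_matrix(out_links, teleport=0.10):
    """Build the row-stochastic matrix for the teleporting random walk.

    out_links: list where out_links[i] is the list of pages that page i links to
    teleport: probability of jumping to a random page at a non-dead-end
    """
    n = len(out_links)
    matrix = []
    for targets in out_links:
        if not targets:
            # Dead end: jump to a page chosen uniformly at random.
            row = [1.0 / n] * n
        else:
            # Teleport share is spread over all pages, the rest over the out-links.
            row = [teleport / n] * n
            for j in targets:
                row[j] += (1.0 - teleport) / len(targets)
        matrix.append(row)
    return matrix


# Hypothetical 3-page graph: 0 -> 1, 0 -> 2, 1 -> 2, and page 2 is a dead end.
P = transition_matrix([[1, 2], [2], []])
```

Every row sums to 1, and every entry is positive, so the walk can no longer get stuck in one part of the graph.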

30 Result of teleporting Now cannot get stuck locally. There is a long-term rate at which any page is visited (not obvious, will show this). How do we compute this visit rate?

31 Markov chains
A Markov chain consists of $n$ states, plus an $n \times n$ transition probability matrix $P$.
At each step, we are in exactly one of the states.
For $1 \le i, j \le n$, the matrix entry $P_{ij}$ tells us the probability of $j$ being the next state, given we are currently in state $i$.
$P_{ii} > 0$ is OK.
[Figure: state $i$ with an arrow labeled $P_{ij}$ to state $j$]

32 Markov chains
Clearly, for all $i$, $\sum_{j=1}^{n} P_{ij} = 1$.
Markov chains are abstractions of random walks.
Exercise: represent the teleporting random walk from 3 slides ago as a Markov chain, for this case: [Figure: example graph]

33 Ergodic Markov chains
A Markov chain is ergodic if
you have a path from any state to any other, and
you can be in any state at every time step, with non-zero probability.
[Figure: a chain that is not ergodic (even/odd)]

34 Ergodic Markov chains
For any ergodic Markov chain, there is a unique long-term visit rate for each state: the steady-state distribution.
Over a long time-period, we visit each state in proportion to this rate.
It doesn't matter where we start.

35 Probability vectors
A probability (row) vector $x = (x_1, \ldots, x_n)$ tells us where the walk is at any point.
E.g., $(0\,0 \ldots 1 \ldots 0\,0)$, with the 1 in position $i$ ($1 \le i \le n$), means we're in state $i$.
More generally, the vector $x = (x_1, \ldots, x_n)$ means the walk is in state $i$ with probability $x_i$, where $\sum_{i=1}^{n} x_i = 1$.

36 Change in probability vector
If the probability vector is $x = (x_1, \ldots, x_n)$ at this step, what is it at the next step?
Recall that row $i$ of the transition probability matrix $P$ tells us where we go next from state $i$.
So from $x$, our next state is distributed as $xP$.

37 Steady state example
The steady state looks like a vector of probabilities $a = (a_1, \ldots, a_n)$: $a_i$ is the probability that we are in state $i$.
[Figure: two-state chain with transition probabilities 1/4 and 3/4]
For this example, $a_1 = 1/4$ and $a_2 = 3/4$.

38 How do we compute this vector?
Let $a = (a_1, \ldots, a_n)$ denote the row vector of steady-state probabilities.
If our current position is described by $a$, then the next step is distributed as $aP$.
But $a$ is the steady state, so $a = aP$.
Solving this matrix equation gives us $a$.
So $a$ is the (left) eigenvector for $P$. (Corresponds to the "principal" eigenvector of $P$ with the largest eigenvalue.)
Transition probability matrices always have largest eigenvalue 1.
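
As a worked check of the steady-state equation on the earlier two-state example, assume its transition matrix sends the walk to state 1 with probability 1/4 and to state 2 with probability 3/4 from either state. This matrix is an assumption; the figure itself is not preserved, but it is consistent with the stated answer.

```latex
\begin{align*}
P &= \begin{pmatrix} 1/4 & 3/4 \\ 1/4 & 3/4 \end{pmatrix},
  \qquad a = (a_1, a_2), \qquad a_1 + a_2 = 1 \\
a_1 &= \tfrac{1}{4} a_1 + \tfrac{1}{4} a_2 = \tfrac{1}{4}(a_1 + a_2) = \tfrac{1}{4} \\
a_2 &= \tfrac{3}{4} a_1 + \tfrac{3}{4} a_2 = \tfrac{3}{4}(a_1 + a_2) = \tfrac{3}{4}
\end{align*}
```

The two component equations come from reading the steady-state condition column by column, and the normalization $a_1 + a_2 = 1$ pins down the unique solution.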

39 One way of computing a
Recall, regardless of where we start, we eventually reach the steady state $a$.
Start with any distribution (say $x = (1\,0 \ldots 0)$).
After one step, we're at $xP$; after two steps at $xP^2$, then $xP^3$ and so on.
"Eventually" means for "large" $k$, $xP^k = a$.
Algorithm: multiply $x$ by increasing powers of $P$ until the product looks stable.
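
The algorithm above can be sketched in plain Python roughly as follows. The stopping rule (largest per-entry change below `tolerance`), the `max_steps` cap and the example matrix are assumptions chosen for illustration; the slides only say to stop when the product "looks stable".

```python
def step(x, P):
    """One step of the walk: return the row vector x multiplied by P."""
    n = len(P)
    return [sum(x[i] * P[i][j] for i in range(n)) for j in range(n)]


def steady_state(P, tolerance=1e-12, max_steps=10_000):
    """Multiply x by P repeatedly until the vector stops changing."""
    x = [1.0] + [0.0] * (len(P) - 1)  # start in the first state: x = (1 0 ... 0)
    for _ in range(max_steps):
        nxt = step(x, P)
        if max(abs(a - b) for a, b in zip(x, nxt)) < tolerance:
            return nxt
        x = nxt
    return x


# Assumed matrix for a two-state chain whose steady state is (1/4, 3/4).
P = [[0.25, 0.75], [0.25, 0.75]]
a = steady_state(P)
```

Computing $xP$ step by step, rather than forming the powers of $P$ explicitly, needs only one vector-matrix product per iteration, which matters when $P$ has one row per web page.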

40 Pagerank summary
Preprocessing: Given graph of links, build matrix $P$. From it compute $a$. The entry $a_i$ is a number between 0 and 1: the pagerank of page $i$.
Query processing: Retrieve pages meeting query. Rank them by their pagerank. Order is query-independent.

41 Resources IIR Chapter 19


More information


