Search engines: ranking algorithms
|
|
|
- Philomena Ellis
- 10 years ago
- Views:
Transcription
1 Search engines: ranking algorithms Gianna M. Del Corso Dipartimento di Informatica, Università di Pisa, Italy ESP, 25 Marzo 2015
2 1 Statistics 2 Search Engines Ranking Algorithms HITS
3 Web Analytics Estimated size: up to 45 billion pages Indexed pages: at least 2 billion pages Average size of a web page is 1.3 MB... Up 35% in last year Petabytes (10 15 bytes) for storing just the indexed pages!!
4 Web Analytics Web pages are dynamical objects Most of web pages content changes daily Average lifespan of a page is around 10 days. A huge quantity of data!! Find a document without search engines it s like looking for a needle in a haystack.
5 Statistics Market shares
6 Statistics Market shares Sheet1 Google 67 Microsoft 17.9 Yahoo! 11.3 Ask 2.7 AOL 1.3 Google Microsoft Yahoo! Ask AOL [comscore, July 2013]
7 Google s Numbers [NASDAQ:GOOG]
8 Google s Numbers Sheet1 year Ad Revenues Other Rev websites network other % 1.20% Revenue Sources % 1.06% % 1.09% % 3.06% % % 97% 99% 97% 3.22% 97% 96% 67 96% % % 3.70% % 3.62% % 5.11% Ad Revenues 0.6 Other Rev % 1.06% 1.09% 3.06% 3.22% 3.70% 3.62% 5.11% [investor.google.com/financial/tables.html]
9 Search Engines The dream of search engines: catalog everything published on the web Web content fruition by query Rank by relevance to the query
10 The Anatomy of a Search Engine Page Repository Spiders Indexer Collection Analysis Queries Query Engine Results Ranking Text Structure Utility Spider Control Indexes
11 Ranking Algorithms... At the very beginning (mid 90s) The order of pages returned by the engine for a given query was controlled by the owner of the page, who could increase the ranking score increasing the frequency of certain terms, font color, font size etc. SPAM
12 Ranking Algorithms... At the very beginning (mid 90s) The order of pages returned by the engine for a given query was controlled by the owner of the page, who could increase the ranking score increasing the frequency of certain terms, font color, font size etc. SPAM
13 Ranking Algorithms Modern engines In 1998 two similar ideas HITS (Jon Kleinberg) (S. Brin and L. Page) The importance of a page should not depend on the owner of the page!
14 Ranking Algorithms Basic intuition Look at the linkage structure p q Adding a link from page p to page q is a sort of vote from the author of p to q. Idea If the content of a page is interesting it will receive many votes, i.e. it will have many inlinks.
15 Ranking Algorithms Web Graph The Web is seen as a directed graph: Each page is a node Each hyperlink is an edge G =
16 Ranking Algorithms Web Graph The importance of a web page is determined by the structure of the web graph At first approximation: Contents of pages is not used! Aim: The owner of a page cannot boost the importance of its pages!
17 HITS An example query=the best automobile makers in the last 4 years Intention get back a list of top car brands and their official web sites Textual ranking pages returned might be very different Top companies might not even use the terms automobile makers.they might use the term car manufacturer instead
18 HITS HITS Each page has associated two scores a i authority score h i hub score A page is a good authority if it is linked by many good hubs A page is a good hub if it is linked by many good authorities To every page p we associate two scores: { ap = i I(p) h i h p = i O(p) a i { a (i) = G T h (i 1) h (i) = Ga (i) Hub pages advertise authoritative pages!
19 HITS HITS
20 HITS HITS G T G and GG T are non negative G T G and GG T are semipositive defined real nonnegative eigenvalues { h (i) = GG T h (i 1) a (i) = G T Ga (i 1) h is the dominant eigenvector of GG T a is the dominant eigenvector of G T G For Perron-Frobenius Theorem the eigenvector associated to the dominant eigenvalue has nonnegative entries!
21 HITS HITS At query time Find relevant pages Construct the graph starting with those bunch of pages Compute dominant eigenvector of G T G Sort the dominant eigenvalue and return the pages in accordance with the ordering induced
22 Google s Is a static ranking schema At query time relevant pages are retrieved neighbours The ranking of pages is based on the of pages which is precomputed
23 PR home doc3 doc1 doc2 doc3 home doc1 1. doc 3 2. doc 1 3. doc 2 doc2
24 A page is important if is voted by important pages The vote is expressed by a link
25 A page distribute its importance equally to its neighbors The importance of a page is the sum of the importances of pages which points to it π j = i I(j) π i outdeg(i) G = [ ] P = [ / /2 1/2 0 1/ /3 1/3 0 1/3 0 ]
26 It is called Random surfer model The web surfer jumps from page to page following hyperlinks. The probability of jumping to a page depends of the number of links in that page. Starting with a vector z (0), compute z (k) j = i I(j) z (k 1) p ij, p ij = 1 outdeg(i) Equivalent to compute the stationary distribution of the Markov chain with transition matrix P. Equivalent to compute the left eigenvector of P corresponding to eigenvalue 1.
27 It is called Random surfer model The web surfer jumps from page to page following hyperlinks. The probability of jumping to a page depends of the number of links in that page. Starting with a vector z (0), compute z (k) j = i I(j) z (k 1) p ij, p ij = 1 outdeg(i) Equivalent to compute the stationary distribution of the Markov chain with transition matrix P. Equivalent to compute the left eigenvector of P corresponding to eigenvalue 1.
28 It is called Random surfer model The web surfer jumps from page to page following hyperlinks. The probability of jumping to a page depends of the number of links in that page. Starting with a vector z (0), compute z (k) j = i I(j) z (k 1) p ij, p ij = 1 outdeg(i) Equivalent to compute the stationary distribution of the Markov chain with transition matrix P. Equivalent to compute the left eigenvector of P corresponding to eigenvalue 1.
29 Two problems: Presence of dangling nodes P cannot be stochastic P might not have the eigenvalue 1 Presence of cycles P can be reducible: the random surfer is trapped Uniqueness of eigenvalue 1 not guaranteed
30 Perron-Frobenius Theorem Let A 0 be an irreducible matrix there exists an eigenvector equal to the spectral radius of A, with algebraic multiplicity 1 there exists an eigenvector x > 0 such that Ax = ρ(a)x. if A > 0, then ρ(a) is the unique eigenvalue with maximum modulo.
31 Presence of dangling nodes P = P + dv T { 1 if page i is dangling d i = 0 otherwise v i = 1/n; P = / /2 1/2 0 1/ /3 1/3 0 1/3 0 P = /5 1/5 1/5 1/5 1/5 0 1/ /2 1/2 0 1/ /3 1/3 0 1/3 0
32 Presence of cycles Force irreducibility by adding artificial arcs chosen by the random surfer with small probability α. P = (1 α) P = (1 α) P + αev T, [ /5 1/5 1/5 1/5 1/5 0 1/ /2 1/2 0 1/ /3 1/3 0 1/3 0 Typical values of α is ] + α [ 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 1/5 ].
33 : a toy eample P = Computing the largest eigenvector of P we get z T [0.38, 0.52, 0.59, 0.27, 0.40], which corresponds to the following order of importance of pages [3, 2, 5, 1, 4].
34 P is sparse, P is full. The vector y = P T x, for x 0, such that x 1 = 1 can be computed as follows y = (1 α)p T x γ = x y, y = y + γv. The eigenvalues of P and P are interrelated: λ 1 ( P) = λ 1 ( P) = 1, λj ( P) = (1 α) λj ( P), j > 1. For the web graph λ 2 ( P) = (1 α).
Part 1: Link Analysis & Page Rank
Chapter 8: Graph Data Part 1: Link Analysis & Page Rank Based on Leskovec, Rajaraman, Ullman 214: Mining of Massive Datasets 1 Exam on the 5th of February, 216, 14. to 16. If you wish to attend, please
Big Data Technology Motivating NoSQL Databases: Computing Page Importance Metrics at Crawl Time
Big Data Technology Motivating NoSQL Databases: Computing Page Importance Metrics at Crawl Time Edward Bortnikov & Ronny Lempel Yahoo! Labs, Haifa Class Outline Link-based page importance measures Why
The PageRank Citation Ranking: Bring Order to the Web
The PageRank Citation Ranking: Bring Order to the Web presented by: Xiaoxi Pang 25.Nov 2010 1 / 20 Outline Introduction A ranking for every page on the Web Implementation Convergence Properties Personalized
DATA ANALYSIS II. Matrix Algorithms
DATA ANALYSIS II Matrix Algorithms Similarity Matrix Given a dataset D = {x i }, i=1,..,n consisting of n points in R d, let A denote the n n symmetric similarity matrix between the points, given as where
Perron vector Optimization applied to search engines
Perron vector Optimization applied to search engines Olivier Fercoq INRIA Saclay and CMAP Ecole Polytechnique May 18, 2011 Web page ranking The core of search engines Semantic rankings (keywords) Hyperlink
Enhancing the Ranking of a Web Page in the Ocean of Data
Database Systems Journal vol. IV, no. 3/2013 3 Enhancing the Ranking of a Web Page in the Ocean of Data Hitesh KUMAR SHARMA University of Petroleum and Energy Studies, India [email protected] In today
Graph Algorithms and Graph Databases. Dr. Daisy Zhe Wang CISE Department University of Florida August 27th 2014
Graph Algorithms and Graph Databases Dr. Daisy Zhe Wang CISE Department University of Florida August 27th 2014 1 Google Knowledge Graph -- Entities and Relationships 2 Graph Data! Facebook Social Network
Corso di Biblioteche Digitali
Corso di Biblioteche Digitali Vittore Casarosa [email protected] tel. 050-315 3115 cell. 348-397 2168 Ricevimento dopo la lezione o per appuntamento Valutazione finale 70-75% esame orale 25-30% progetto
Asking Hard Graph Questions. Paul Burkhardt. February 3, 2014
Beyond Watson: Predictive Analytics and Big Data U.S. National Security Agency Research Directorate - R6 Technical Report February 3, 2014 300 years before Watson there was Euler! The first (Jeopardy!)
Practical Graph Mining with R. 5. Link Analysis
Practical Graph Mining with R 5. Link Analysis Outline Link Analysis Concepts Metrics for Analyzing Networks PageRank HITS Link Prediction 2 Link Analysis Concepts Link A relationship between two entities
International Journal of Engineering Research-Online A Peer Reviewed International Journal Articles are freely available online:http://www.ijoer.
RESEARCH ARTICLE SURVEY ON PAGERANK ALGORITHMS USING WEB-LINK STRUCTURE SOWMYA.M 1, V.S.SREELAXMI 2, MUNESHWARA M.S 3, ANIL G.N 4 Department of CSE, BMS Institute of Technology, Avalahalli, Yelahanka,
1 o Semestre 2007/2008
Departamento de Engenharia Informática Instituto Superior Técnico 1 o Semestre 2007/2008 Outline 1 2 3 4 5 Outline 1 2 3 4 5 Exploiting Text How is text exploited? Two main directions Extraction Extraction
Google s PageRank: The Math Behind the Search Engine
Google s PageRank: The Math Behind the Search Engine Rebecca S. Wills Department of Mathematics North Carolina State University Raleigh, NC 27695 [email protected] May, 2006 Introduction Approximately 9
Link Analysis. Chapter 5. 5.1 PageRank
Chapter 5 Link Analysis One of the biggest changes in our lives in the decade following the turn of the century was the availability of efficient and accurate Web search, through search engines such as
Walk-Based Centrality and Communicability Measures for Network Analysis
Walk-Based Centrality and Communicability Measures for Network Analysis Michele Benzi Department of Mathematics and Computer Science Emory University Atlanta, Georgia, USA Workshop on Innovative Clustering
Search engine ranking
Proceedings of the 7 th International Conference on Applied Informatics Eger, Hungary, January 28 31, 2007. Vol. 2. pp. 417 422. Search engine ranking Mária Princz Faculty of Technical Engineering, University
Web Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari [email protected]
Web Mining Margherita Berardi LACAM Dipartimento di Informatica Università degli Studi di Bari [email protected] Bari, 24 Aprile 2003 Overview Introduction Knowledge discovery from text (Web Content
Cloud and Big Data Summer School, Stockholm, Aug., 2015 Jeffrey D. Ullman
Cloud and Big Data Summer School, Stockholm, Aug., 2015 Jeffrey D. Ullman 2 Intuition: solve the recursive equation: a page is important if important pages link to it. Technically, importance = the principal
ELA http://math.technion.ac.il/iic/ela
SIGN PATTERNS THAT ALLOW EVENTUAL POSITIVITY ABRAHAM BERMAN, MINERVA CATRAL, LUZ M. DEALBA, ABED ELHASHASH, FRANK J. HALL, LESLIE HOGBEN, IN-JAE KIM, D. D. OLESKY, PABLO TARAZAGA, MICHAEL J. TSATSOMEROS,
The Mathematics Behind Google s PageRank
The Mathematics Behind Google s PageRank Ilse Ipsen Department of Mathematics North Carolina State University Raleigh, USA Joint work with Rebecca Wills Man p.1 Two Factors Determine where Google displays
THE $25,000,000,000 EIGENVECTOR THE LINEAR ALGEBRA BEHIND GOOGLE
THE $5,,, EIGENVECTOR THE LINEAR ALGEBRA BEHIND GOOGLE KURT BRYAN AND TANYA LEISE Abstract. Google s success derives in large part from its PageRank algorithm, which ranks the importance of webpages according
How To Understand The Network Of A Network
Roles in Networks Roles in Networks Motivation for work: Let topology define network roles. Work by Kleinberg on directed graphs, used topology to define two types of roles: authorities and hubs. (Each
Part 2: Community Detection
Chapter 8: Graph Data Part 2: Community Detection Based on Leskovec, Rajaraman, Ullman 2014: Mining of Massive Datasets Big Data Management and Analytics Outline Community Detection - Social networks -
arxiv:1201.3120v3 [math.na] 1 Oct 2012
RANKING HUBS AND AUTHORITIES USING MATRIX FUNCTIONS MICHELE BENZI, ERNESTO ESTRADA, AND CHRISTINE KLYMKO arxiv:1201.3120v3 [math.na] 1 Oct 2012 Abstract. The notions of subgraph centrality and communicability,
NEW VERSION OF DECISION SUPPORT SYSTEM FOR EVALUATING TAKEOVER BIDS IN PRIVATIZATION OF THE PUBLIC ENTERPRISES AND SERVICES
NEW VERSION OF DECISION SUPPORT SYSTEM FOR EVALUATING TAKEOVER BIDS IN PRIVATIZATION OF THE PUBLIC ENTERPRISES AND SERVICES Silvija Vlah Kristina Soric Visnja Vojvodic Rosenzweig Department of Mathematics
Web Search. 2 o Semestre 2012/2013
Dados na Dados na Departamento de Engenharia Informática Instituto Superior Técnico 2 o Semestre 2012/2013 Bibliography Dados na Bing Liu, Data Mining: Exploring Hyperlinks, Contents, and Usage Data, 2nd
RANKING WEB PAGES RELEVANT TO SEARCH KEYWORDS
ISBN: 978-972-8924-93-5 2009 IADIS RANKING WEB PAGES RELEVANT TO SEARCH KEYWORDS Ben Choi & Sumit Tyagi Computer Science, Louisiana Tech University, USA ABSTRACT In this paper we propose new methods for
NETZCOPE - a tool to analyze and display complex R&D collaboration networks
The Task Concepts from Spectral Graph Theory EU R&D Network Analysis Netzcope Screenshots NETZCOPE - a tool to analyze and display complex R&D collaboration networks L. Streit & O. Strogan BiBoS, Univ.
The world s largest matrix computation. (This chapter is out of date and needs a major overhaul.)
Chapter 7 Google PageRank The world s largest matrix computation. (This chapter is out of date and needs a major overhaul.) One of the reasons why Google TM is such an effective search engine is the PageRank
Structure Preserving Model Reduction for Logistic Networks
Structure Preserving Model Reduction for Logistic Networks Fabian Wirth Institute of Mathematics University of Würzburg Workshop on Stochastic Models of Manufacturing Systems Einhoven, June 24 25, 2010.
A Graph theoretical approach to Network Vulnerability Analysis and Countermeasures
A Graph theoretical approach to Network Vulnerability Analysis and Countermeasures Dr.Thaier Hamid University of Bedfordshire, UK Prof. Carsten Maple, University of Bedfordshire, UK ABSTRACT Computer networks
Master s Theory Exam Spring 2006
Spring 2006 This exam contains 7 questions. You should attempt them all. Each question is divided into parts to help lead you through the material. You should attempt to complete as much of each problem
Social Network Mining
Social Network Mining Data Mining November 11, 2013 Frank Takes ([email protected]) LIACS, Universiteit Leiden Overview Social Network Analysis Graph Mining Online Social Networks Friendship Graph Semantics
Soft Clustering with Projections: PCA, ICA, and Laplacian
1 Soft Clustering with Projections: PCA, ICA, and Laplacian David Gleich and Leonid Zhukov Abstract In this paper we present a comparison of three projection methods that use the eigenvectors of a matrix
Big Data Analytics. Lucas Rego Drumond
Big Data Analytics Lucas Rego Drumond Information Systems and Machine Learning Lab (ISMLL) Institute of Computer Science University of Hildesheim, Germany MapReduce II MapReduce II 1 / 33 Outline 1. Introduction
Algorithms for representing network centrality, groups and density and clustered graph representation
COSIN IST 2001 33555 COevolution and Self-organization In dynamical Networks Algorithms for representing network centrality, groups and density and clustered graph representation Deliverable Number: D06
USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS
USING SPECTRAL RADIUS RATIO FOR NODE DEGREE TO ANALYZE THE EVOLUTION OF SCALE- FREE NETWORKS AND SMALL-WORLD NETWORKS Natarajan Meghanathan Jackson State University, 1400 Lynch St, Jackson, MS, USA [email protected]
Power Rankings: Math for March Madness
Power Rankings: Math for March Madness James A. Swenson University of Wisconsin Platteville [email protected] March 5, 2011 Math Club Madison Area Technical College James A. Swenson (UWP) Power Rankings:
Search Engine Optimisation (SEO) Factsheet
Search Engine Optimisation (SEO) Factsheet SEO is a complex element of our industry and many clients do not fully understand what is involved in getting their site ranked on common search engines such
Removing Web Spam Links from Search Engine Results
Removing Web Spam Links from Search Engine Results Manuel EGELE [email protected], 1 Overview Search Engine Optimization and definition of web spam Motivation Approach Inferring importance of features
An Introduction to the Use of Bayesian Network to Analyze Gene Expression Data
n Introduction to the Use of ayesian Network to nalyze Gene Expression Data Cristina Manfredotti Dipartimento di Informatica, Sistemistica e Comunicazione (D.I.S.Co. Università degli Studi Milano-icocca
Monte Carlo methods in PageRank computation: When one iteration is sufficient
Monte Carlo methods in PageRank computation: When one iteration is sufficient K.Avrachenkov, N. Litvak, D. Nemirovsky, N. Osipova Abstract PageRank is one of the principle criteria according to which Google
Healthcare Analytics. Aryya Gangopadhyay UMBC
Healthcare Analytics Aryya Gangopadhyay UMBC Two of many projects Integrated network approach to personalized medicine Multidimensional and multimodal Dynamic Analyze interactions HealthMask Need for sharing
Social Network Analysis
Social Network Analysis Challenges in Computer Science April 1, 2014 Frank Takes ([email protected]) LIACS, Leiden University Overview Context Social Network Analysis Online Social Networks Friendship Graph
Dynamical Clustering of Personalized Web Search Results
Dynamical Clustering of Personalized Web Search Results Xuehua Shen CS Dept, UIUC [email protected] Hong Cheng CS Dept, UIUC [email protected] Abstract Most current search engines present the user a ranked
A COMPREHENSIVE REVIEW ON SEARCH ENGINE OPTIMIZATION
Volume 4, No. 1, January 2013 Journal of Global Research in Computer Science REVIEW ARTICLE Available Online at www.jgrcs.info A COMPREHENSIVE REVIEW ON SEARCH ENGINE OPTIMIZATION 1 Er.Tanveer Singh, 2
Anna Paananen. Comparative Analysis of Yandex and Google Search Engines
Anna Paananen Comparative Analysis of Yandex and Google Search Engines Helsinki Metropolia University of Applied Sciences Master s Degree Information Technology Master s Thesis 26 May 2012 PREFACE Working
BOOLEAN CONSENSUS FOR SOCIETIES OF ROBOTS
Workshop on New frontiers of Robotics - Interdep. Research Center E. Piaggio June 2-22, 22 - Pisa (Italy) BOOLEAN CONSENSUS FOR SOCIETIES OF ROBOTS Adriano Fagiolini DIEETCAM, College of Engineering, University
Rank one SVD: un algorithm pour la visualisation d une matrice non négative
Rank one SVD: un algorithm pour la visualisation d une matrice non négative L. Labiod and M. Nadif LIPADE - Universite ParisDescartes, France ECAIS 2013 November 7, 2013 Outline Outline 1 Data visualization
Dynamics of information spread on networks. Kristina Lerman USC Information Sciences Institute
Dynamics of information spread on networks Kristina Lerman USC Information Sciences Institute Information spread in online social networks Diffusion of activation on a graph, where each infected (activated)
[Ramit Solutions] www.ramitsolutions.com SEO SMO- SEM - PPC. [Internet / Online Marketing Concepts] SEO Training Concepts SEO TEAM Ramit Solutions
[Ramit Solutions] www.ramitsolutions.com SEO SMO- SEM - PPC [Internet / Online Marketing Concepts] SEO Training Concepts SEO TEAM Ramit Solutions [2014-2016] By Lathish Difference between Offline Marketing
Lecture 10: HBase! Claudia Hauff (Web Information Systems)! [email protected]
Big Data Processing, 2014/15 Lecture 10: HBase!! Claudia Hauff (Web Information Systems)! [email protected] 1 Course content Introduction Data streams 1 & 2 The MapReduce paradigm Looking behind the
Search Engines. Stephen Shaw <[email protected]> 18th of February, 2014. Netsoc
Search Engines Stephen Shaw Netsoc 18th of February, 2014 Me M.Sc. Artificial Intelligence, University of Edinburgh Would recommend B.A. (Mod.) Computer Science, Linguistics, French,
The Perron-Frobenius theorem.
Chapter 9 The Perron-Frobenius theorem. The theorem we will discuss in this chapter (to be stated below) about matrices with non-negative entries, was proved, for matrices with strictly positive entries,
Director of Marketing, Cote Family Companies
Frank Soukup III Frank Soukup III Director of Marketing, Cote Family Companies Overview Understand what it is What it does How does it affect you How to use it for your advantage How to use blogs & social
LABEL PROPAGATION ON GRAPHS. SEMI-SUPERVISED LEARNING. ----Changsheng Liu 10-30-2014
LABEL PROPAGATION ON GRAPHS. SEMI-SUPERVISED LEARNING ----Changsheng Liu 10-30-2014 Agenda Semi Supervised Learning Topics in Semi Supervised Learning Label Propagation Local and global consistency Graph
Social Media Mining. Graph Essentials
Graph Essentials Graph Basics Measures Graph and Essentials Metrics 2 2 Nodes and Edges A network is a graph nodes, actors, or vertices (plural of vertex) Connections, edges or ties Edge Node Measures
So today we shall continue our discussion on the search engines and web crawlers. (Refer Slide Time: 01:02)
Internet Technology Prof. Indranil Sengupta Department of Computer Science and Engineering Indian Institute of Technology, Kharagpur Lecture No #39 Search Engines and Web Crawler :: Part 2 So today we
Efficient Identification of Starters and Followers in Social Media
Efficient Identification of Starters and Followers in Social Media Michael Mathioudakis Department of Computer Science University of Toronto [email protected] Nick Koudas Department of Computer Science
Step 3: Go to Column C. Use the function AVERAGE to calculate the mean values of n = 5. Column C is the column of the means.
EXAMPLES - SAMPLING DISTRIBUTION EXCEL INSTRUCTIONS This exercise illustrates the process of the sampling distribution as stated in the Central Limit Theorem. Enter the actual data in Column A in MICROSOFT
KEYWORD SEARCH OVER PROBABILISTIC RDF GRAPHS
ABSTRACT KEYWORD SEARCH OVER PROBABILISTIC RDF GRAPHS In many real applications, RDF (Resource Description Framework) has been widely used as a W3C standard to describe data in the Semantic Web. In practice,
LECTURE 4. Last time: Lecture outline
LECTURE 4 Last time: Types of convergence Weak Law of Large Numbers Strong Law of Large Numbers Asymptotic Equipartition Property Lecture outline Stochastic processes Markov chains Entropy rate Random
Content Delivery Networks. Shaxun Chen April 21, 2009
Content Delivery Networks Shaxun Chen April 21, 2009 Outline Introduction to CDN An Industry Example: Akamai A Research Example: CDN over Mobile Networks Conclusion Outline Introduction to CDN An Industry
Analysis of a Production/Inventory System with Multiple Retailers
Analysis of a Production/Inventory System with Multiple Retailers Ann M. Noblesse 1, Robert N. Boute 1,2, Marc R. Lambrecht 1, Benny Van Houdt 3 1 Research Center for Operations Management, University
Incorporating Participant Reputation in Community-driven Question Answering Systems
Incorporating Participant Reputation in Community-driven Question Answering Systems Liangjie Hong, Zaihan Yang and Brian D. Davison Department of Computer Science and Engineering Lehigh University, Bethlehem,
10. Search Engine Marketing
10. Search Engine Marketing What s inside: We look at the difference between paid and organic search results and look through the key terms and concepts that will help you understand this relationship.
Google Analytics and Google Analytics Premium: limits and quotas
Table Of Contents Data collection & Processing limits Accounts and Profiles Reports Admin Area Google Analytics data fields Lengths Google Analytics API Data collection & Processing limits 10 million hits
A Google-like Model of Road Network Dynamics and its Application to Regulation and Control
A Google-like Model of Road Network Dynamics and its Application to Regulation and Control Emanuele Crisostomi, Steve Kirkland, Robert Shorten August, 2010 Abstract Inspired by the ability of Markov chains
Mining Social-Network Graphs
342 Chapter 10 Mining Social-Network Graphs There is much information to be gained by analyzing the large-scale data that is derived from social networks. The best-known example of a social network is
graphical Systems for Website Design
2005 Linux Web Host. All rights reserved. The content of this manual is furnished under license and may be used or copied only in accordance with this license. No part of this publication may be reproduced,
Conductance, the Normalized Laplacian, and Cheeger s Inequality
Spectral Graph Theory Lecture 6 Conductance, the Normalized Laplacian, and Cheeger s Inequality Daniel A. Spielman September 21, 2015 Disclaimer These notes are not necessarily an accurate representation
Healthcare data analytics. Da-Wei Wang Institute of Information Science [email protected]
Healthcare data analytics Da-Wei Wang Institute of Information Science [email protected] Outline Data Science Enabling technologies Grand goals Issues Google flu trend Privacy Conclusion Analytics
Spidering and Filtering Web Pages for Vertical Search Engines
Spidering and Filtering Web Pages for Vertical Search Engines Michael Chau The University of Arizona [email protected] 1 Introduction The size of the Web is growing exponentially. The number of indexable
Social Media Mining. Network Measures
Klout Measures and Metrics 22 Why Do We Need Measures? Who are the central figures (influential individuals) in the network? What interaction patterns are common in friends? Who are the like-minded users
CS54100: Database Systems
CS54100: Database Systems Cloud Databases: The Next Post- Relational World 18 April 2012 Prof. Chris Clifton Beyond RDBMS The Relational Model is too limiting! Simple data model doesn t capture semantics
