Project Deliverables. CS 361, Lecture 28. Outline. Project Deliverables. Administrative. Project Comments



Similar documents
1 Computing the Standard Deviation of Sample Means

5 Boolean Decision Trees (February 11)

I. Chi-squared Distributions

CS103A Handout 23 Winter 2002 February 22, 2002 Solving Recurrence Relations

A Constant-Factor Approximation Algorithm for the Link Building Problem

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

Soving Recurrence Relations

Chapter 6: Variance, the law of large numbers and the Monte-Carlo method

Present Value Factor To bring one dollar in the future back to present, one uses the Present Value Factor (PVF): Concept 9: Present Value

CHAPTER 3 THE TIME VALUE OF MONEY

ODBC. Getting Started With Sage Timberline Office ODBC

Properties of MLE: consistency, asymptotic normality. Fisher information.

5: Introduction to Estimation

Incremental calculation of weighted mean and variance

CS103X: Discrete Structures Homework 4 Solutions

Sequences and Series

Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

1. C. The formula for the confidence interval for a population mean is: x t, which was

FIBONACCI NUMBERS: AN APPLICATION OF LINEAR ALGEBRA. 1. Powers of a matrix

Concept: Types of algorithms

In nite Sequences. Dr. Philippe B. Laval Kennesaw State University. October 9, 2008

5.4 Amortization. Question 1: How do you find the present value of an annuity? Question 2: How is a loan amortized?

Measures of Spread and Boxplots Discrete Math, Section 9.4

The Stable Marriage Problem

Annuities Under Random Rates of Interest II By Abraham Zaks. Technion I.I.T. Haifa ISRAEL and Haifa University Haifa ISRAEL.

WHEN IS THE (CO)SINE OF A RATIONAL ANGLE EQUAL TO A RATIONAL NUMBER?

Week 3 Conditional probabilities, Bayes formula, WEEK 3 page 1 Expected value of a random variable

Math C067 Sampling Distributions

.04. This means $1000 is multiplied by 1.02 five times, once for each of the remaining sixmonth

Chapter 14 Nonparametric Statistics

Professional Networking

Lecture 4: Cauchy sequences, Bolzano-Weierstrass, and the Squeeze theorem

Section 11.3: The Integral Test

where: T = number of years of cash flow in investment's life n = the year in which the cash flow X n i = IRR = the internal rate of return

Lecture 2: Karger s Min Cut Algorithm

Convexity, Inequalities, and Norms

Chair for Network Architectures and Services Institute of Informatics TU München Prof. Carle. Network Security. Chapter 2 Basics

Lesson 17 Pearson s Correlation Coefficient

Asymptotic Growth of Functions

Discrete Mathematics and Probability Theory Spring 2014 Anant Sahai Note 13

NATIONAL SENIOR CERTIFICATE GRADE 12

Lesson 15 ANOVA (analysis of variance)

Approximating Area under a curve with rectangles. To find the area under a curve we approximate the area using rectangles and then use limits to find

Non-life insurance mathematics. Nils F. Haavardsson, University of Oslo and DNB Skadeforsikring

Systems Design Project: Indoor Location of Wireless Devices

The Power of Free Branching in a General Model of Backtracking and Dynamic Programming Algorithms


I. Why is there a time value to money (TVM)?

Department of Computer Science, University of Otago

Definition. A variable X that takes on values X 1, X 2, X 3,...X k with respective frequencies f 1, f 2, f 3,...f k has mean

Repeating Decimals are decimal numbers that have number(s) after the decimal point that repeat in a pattern.

SEQUENCES AND SERIES

Lecture 3. denote the orthogonal complement of S k. Then. 1 x S k. n. 2 x T Ax = ( ) λ x. with x = 1, we have. i = λ k x 2 = λ k.

The following example will help us understand The Sampling Distribution of the Mean. C1 C2 C3 C4 C5 50 miles 84 miles 38 miles 120 miles 48 miles

Running Time ( 3.1) Analysis of Algorithms. Experimental Studies ( 3.1.1) Limitations of Experiments. Pseudocode ( 3.1.2) Theoretical Analysis

INVESTMENT PERFORMANCE COUNCIL (IPC)

LECTURE 13: Cross-validation

Vladimir N. Burkov, Dmitri A. Novikov MODELS AND METHODS OF MULTIPROJECTS MANAGEMENT

Handling. Collection Calls

Determining the sample size

1 Correlation and Regression Analysis

Time Value of Money, NPV and IRR equation solving with the TI-86

Hypergeometric Distributions

A Recursive Formula for Moments of a Binomial Distribution

S. Tanny MAT 344 Spring be the minimum number of moves required.

Chapter 5: Inner Product Spaces

FI A CIAL MATHEMATICS

Designing Incentives for Online Question and Answer Forums

AP Calculus BC 2003 Scoring Guidelines Form B

CS100: Introduction to Computer Science

AP Calculus AB 2006 Scoring Guidelines Form B

Chapter 7 Methods of Finding Estimators

Example 2 Find the square root of 0. The only square root of 0 is 0 (since 0 is not positive or negative, so those choices don t exist here).

Amendments to employer debt Regulations

How To Solve The Homewor Problem Beautifully

Elementary Theory of Russian Roulette

Confidence Intervals for One Mean

Baan Service Master Data Management

How to set up your GMC Online account

The analysis of the Cournot oligopoly model considering the subjective motive in the strategy selection

3 Basic Definitions of Probability Theory

Math 113 HW #11 Solutions


GCSE STATISTICS. 4) How to calculate the range: The difference between the biggest number and the smallest number.

INVESTMENT PERFORMANCE COUNCIL (IPC) Guidance Statement on Calculation Methodology

Trigonometric Form of a Complex Number. The Complex Plane. axis. ( 2, 1) or 2 i FIGURE The absolute value of the complex number z a bi is

Modified Line Search Method for Global Optimization

CDs Bought at a Bank verses CD s Bought from a Brokerage. Floyd Vest

Confidence Intervals

CME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 8

Lecture 4: Cheeger s Inequality

THE REGRESSION MODEL IN MATRIX FORM. For simple linear regression, meaning one predictor, the model is. for i = 1, 2, 3,, n

Now here is the important step

Ranking Irregularities When Evaluating Alternatives by Using Some ELECTRE Methods

Groups of diverse problem solvers can outperform groups of high-ability problem solvers

Factoring x n 1: cyclotomic and Aurifeuillian polynomials Paul Garrett <garrett@math.umn.edu>

CHAPTER 7: Central Limit Theorem: CLT for Averages (Means)

Transcription:

Project Deliverables CS 361, Lecture 28 Jared Saia Uiversity of New Mexico Each Group should tur i oe group project cosistig of: About 6-12 pages of text (ca be loger with appedix) 6-12 figures (please put multiple figures o oe page where possible) Task list givig which tasks were performed by which group members. This should be siged by all members of the group. 3 Outlie Project Deliverables Each idividual should tur i the followig: Admiistrative Master Theorem Search Egies A evaluatio of the cotributio to the group project of every member of your group o a scale of 1(poor) to 10(excellet). To do this, write dow the ame of each member of your group (beside yourself), ad put a umber betwee 1 ad 10 ext to each ame. Please be hoest ad professioal i your evaluatio. Your evaluatio will be oe factor used to determie the project grade for each member of your group. 1 4 Admiistrative Project Commets The project is due this Thursday i class. This deadlie is strict - late projects will recieve o credit. (A partially completed project tured i o time will get some credit but a complete project tured i late will get o credit) To give you more time to work o the project, I m ot goig to have a hw due o Thursday Before you tur i the project: Reread The Top Project Mistakes sectio i the project descriptio sectio o the course web page You will loose poits if you make these same mistakes 2 5

Master Method Last I-Class Exercise The recurrece T () = at (/b) + f() ca be solved as follows: If a f(/b) f()/k for some costat K > 1, the T () = Θ(f()). If a f(/b) K f() for some costat K > 1, the T () = Θ( log b a ). If a f(/b) = f(), the T () = Θ(f() log b ). Cosider the recurrece: T () = 2T (/4) + log If we write this as T () = at (/b) + f(), the a = 2,b = 4,f() = log f() = log ad a f(/b) = /2 log(/4) a f(/b) is a costat factor smaller tha f(), so the root ode domiates. Thus the solutio is T () = Θ( log ) 6 9 Proof I-Class Exercise If f() is a costat factor larger tha a f(b/), the the sum is a descedig geometric series. The sum of ay geometric series is a costat times its largest term. I this case, the largest term is the first term f(). If f() is a costat factor smaller tha a f(b/), the the sum is a ascedig geometric series. The sum of ay geometric series is a costat times its largest term. I this case, this is the last term, which by our earlier argumet is Θ( log b a ). Fially, if a f(b/) = f(), the each of the L terms i the summatio is equal to f(). Cosider the recurrece: T () = 4T (/2) + 2 Q: What is f() ad a f(/b)? Q: Which of the three cases does the recurrece fall uder (whe is large)? Q: What is the solutio to this recurrece? 7 10 Example Limits of Master Theorem T () = T (/2) + log If we write this as T () = at (/b) + f(), the a = 1,b = 2,f() = log Here a f(/b) = /2 log /2 is smaller tha f() = log by a costat factor, so T () = Θ( log ) Cosider the recurrece: T () = 2T (/2) + / log We ca t apply the master thm here, because a f(/b) = /(log 1) is t equal to / lg, but the differece is t a costat factor We ca use the recursio tree method 8 11

Limits of Master Theorem Example Algorithmic Applicatio Cosider the recurrece: T () = 2T (/2) + / log Not hard to see that the sum of the i-th level is /(log i) Depth of tree is log T () = log log i i=0 = log j j=1 = log 1 j j=1 (1) (2) (3) = Θ( log log ) (4) Moder search egies are begiig to use algorithmic tools Google ad Clever are two good search egies that use algorithmic techiques Their rakig algorithms are based o liear algebra (they make use of the eigevectors of a certai type of matrix) These algorithms could ot have bee developed without rigorous mathematical aalysis 12 15 Take Away The Two Types of Search Queries The Master Theorem is a fast way to solve certai types of recurreces However it is ot as powerful as the recursio tree method Whe you see a appropriate recurrece, first try the Master Theorem, if this does t work, try recursio tree method or aihilators. Two types of search egie queries Specific Queries: E.g. Does Netscape support the JDK 1.1 codesigig API? Geeral Queries: E.g. Fid Iformatio about Java 13 16 The Deep Ed of the Sadbox The Abudace Problem That s the deep ed of the sadbox. I do t like to go there. A lephrechau lives there ad he tells me to bur thigs - Ralph from the Simpsos For Geeral Queries, we have Beyod 361: The Abudace Problem: The umber of relevat pages is far too large for a huma to digest. CS461: More advaced data structures ad algorithms, differet classes of algorithms (greedy, dyamic programmig), advaced mathematical aalysis of algorithms (amortizatio, proofs of correctess, graph theory) Approximatio Algorithms, Radomized Algorithms, Computatioal Geometry, Computatioal Biology, Graph Theoretic Algorithms, Cryptography, Olie Algorithms, Experimetal Algorithms, Smoothed Aalysis, Complexity Theory, etc, etc. 14 More specifically: There are ow hudreds of millios of pages o the web For most search queries, the umber of pages that cotai the query words is o the order of thousads A perso typically wats to look at less tha five pages 17

What we wat Crash Course i Graph Theory A Graph is: Oe way to hadle the abudace problem: Filter out from the huge set of relevat pages, those few pages which are most authoritative or defiitive To do this, we eed a otio of what makes a page authoritative A set of odes, V, ad A set of edges, E, where each edge is directed from some ode i V to some other ode i V Note: A tree is a just a graph without cycles There are may useful algorithms kow for graphs, so it s useful to frame problems i terms of Graph Theory where possible. 18 21 A Example Problem The Web as a Graph If we search for all pages cotaiig the word Microsoft we get back over a millio pages. There is othig about cotet of the page www.microsoft.com that makes it stad out. This page does t use the term microsoft most frequetly. This page does t use the term microsoft most promietly. There is othig completely uique i the textual cotet of the page that idetifies it as beig authoritative. What is it that makes the page www.microsoft.com most authoritative? Let each page i the web, be a ode of a graph Let there be a edge from ode x to ode y iff there is a lik from page x to page y This formulatio allows us to brig lots of great tools of graph theory to bear o the problem of lik aalysis. 19 22 Lik Aalysis Popularity ad Relevace Lik Structure is oe exteral measure of authority of a page If page x liks to page y, x is coferrig authority o page y There are some potetial pitfalls here Liks do t always cofer authority (e.g. they could be for avigatio or to ads) There is a balace betwee popularity ad relevace Cosider the simple heuristic for returig authoritative pages: of all pages cotaiig the search term, retur those which have the greatest umber of i-liks. This heuristic fails May authoritative pages for a search term do ot cotai that term. Examples: seach egies, automobile maufactures May pages which are ot authoritative for a give term would be icorrectly idetified. Examples: www.yahoo.com ad www.microsoft.com would be cosidered authoritative for ay search term these pages cotaied. 20 23

The Clever Approach Key observatio: Certai pages poit maily to sites that are authoritative. These pages are called good Hubs Examples: www.yahoo.com Jared s Big Page Of Liks To Cool Sites Questio: But the how do we defie what makes a page a good Hub? The Clever algorithm ad Google solve this problem 24 The Clever Approach We use a mutually-recursive defiitio of the otio of Authoritative ad Hub-like Good Hub pages poit to may good Authoritative pages Good Authoritative pages are poited to by may good Hub pages 25 The Clever Algorithm High Level Idea: Clever collects all pages cotaiig the search term, or pages that are eighbors of pages cotaiig the search term. It the iteratively propogates authority scores ad hub scores All pages start with equal authority ad hub scores. I each step: The pages with hub weight cofer authority weight to the pages they poit to. The pages with authority weight cofer hub weight to the pages that poit to them 26