Introduction to Information Retrieval

Size: px
Start display at page:

Download "Introduction to Information Retrieval"

Transcription

1 Introduction to Information Retrieval Christof Monz and Maarten de Rijke Spring 2002 Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 1

2 Today s Program Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 2

3 Today s Program What s Information Retrieval? Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 2

4 Today s Program What s Information Retrieval? Some administrative stuff Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 2

5 Today s Program What s Information Retrieval? Some administrative stuff Overview of the course Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 2

6 Today s Program What s Information Retrieval? Some administrative stuff Overview of the course Grading, homework etc. Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 2

7 Today s Program What s Information Retrieval? Some administrative stuff Overview of the course Grading, homework etc. How to represent information Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 2

8 Today s Program What s Information Retrieval? Some administrative stuff Overview of the course Grading, homework etc. How to represent information Our first retrieval model: boolean retrieval Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 2

9 What is Information Retrieval? Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

10 What is Information Retrieval? Finding relevant information in large collections of data Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

11 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

12 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Give me information on the history of the Kennedys Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

13 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Give me information on the history of the Kennedys An article about the Kennedys Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

14 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Give me information on the history of the Kennedys An article about the Kennedys (text retrieval) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

15 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Give me information on the history of the Kennedys An article about the Kennedys (text retrieval) What does a brain tumor look like on a CT-scan Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

16 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Give me information on the history of the Kennedys An article about the Kennedys (text retrieval) What does a brain tumor look like on a CT-scan A picture of a brain tumor Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

17 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Give me information on the history of the Kennedys An article about the Kennedys (text retrieval) What does a brain tumor look like on a CT-scan A picture of a brain tumor (image retrieval) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

18 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Give me information on the history of the Kennedys An article about the Kennedys (text retrieval) What does a brain tumor look like on a CT-scan A picture of a brain tumor (image retrieval) It goes like this: hmm hmm hahmmm... Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

19 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Give me information on the history of the Kennedys An article about the Kennedys (text retrieval) What does a brain tumor look like on a CT-scan A picture of a brain tumor (image retrieval) It goes like this: hmm hmm hahmmm... A certain song Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

20 What is Information Retrieval? Finding relevant information in large collections of data In such a collection you may want to find: Give me information on the history of the Kennedys An article about the Kennedys (text retrieval) What does a brain tumor look like on a CT-scan A picture of a brain tumor (image retrieval) It goes like this: hmm hmm hahmmm... A certain song (music retrieval) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 3

21 Text Retrieval Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 4

22 Text Retrieval Online library catalogs (OPAC) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 4

23 Text Retrieval Online library catalogs (OPAC) Internet search engines, such as AltaVista, Google, Ilse Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 4

24 Text Retrieval Online library catalogs (OPAC) Internet search engines, such as AltaVista, Google, Ilse Specialized systems (aka vendors): Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 4

25 Text Retrieval Online library catalogs (OPAC) Internet search engines, such as AltaVista, Google, Ilse Specialized systems (aka vendors): MEDLINE (medical articles) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 4

26 Text Retrieval Online library catalogs (OPAC) Internet search engines, such as AltaVista, Google, Ilse Specialized systems (aka vendors): MEDLINE (medical articles) Lexis-Nexis (legal, business, academic,... ) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 4

27 Text Retrieval Online library catalogs (OPAC) Internet search engines, such as AltaVista, Google, Ilse Specialized systems (aka vendors): MEDLINE (medical articles) Lexis-Nexis (legal, business, academic,... ) Westlaw (legal articles) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 4

28 Text Retrieval Online library catalogs (OPAC) Internet search engines, such as AltaVista, Google, Ilse Specialized systems (aka vendors): MEDLINE (medical articles) Lexis-Nexis (legal, business, academic,... ) Westlaw (legal articles) Dialog (business information) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 4

29 Retrieval vs. Browsing Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 5

30 Retrieval vs. Browsing Popular Web Directories: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 5

31 Retrieval vs. Browsing Popular Web Directories: Yahoo!, Open Directory Project (dmoz) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 5

32 Retrieval vs. Browsing Popular Web Directories: Yahoo!, Open Directory Project (dmoz) The user has to guess the right directories to find the information Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 5

33 Retrieval vs. Browsing Popular Web Directories: Yahoo!, Open Directory Project (dmoz) The user has to guess the right directories to find the information The user has to adapt to the designers conceptualization of the directory Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 5

34 Retrieval vs. Browsing Popular Web Directories: Yahoo!, Open Directory Project (dmoz) The user has to guess the right directories to find the information The user has to adapt to the designers conceptualization of the directory The goal of information retrieval is to provide immediate random access to the data Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 5

35 Retrieval vs. Browsing Popular Web Directories: Yahoo!, Open Directory Project (dmoz) The user has to guess the right directories to find the information The user has to adapt to the designers conceptualization of the directory The goal of information retrieval is to provide immediate random access to the data The user can specifiy his information need Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 5

36 IR vs. Database Querying Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 6

37 IR vs. Database Querying IR is not the same thing as querying a database Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 6

38 IR vs. Database Querying IR is not the same thing as querying a database Database querying assumes that the data is in a standardized format Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 6

39 IR vs. Database Querying IR is not the same thing as querying a database Database querying assumes that the data is in a standardized format Transforming all information, news articles, web sites into a database format is difficult and impossible for large data collections Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 6

40 IR vs. Database Querying IR is not the same thing as querying a database Database querying assumes that the data is in a standardized format Transforming all information, news articles, web sites into a database format is difficult and impossible for large data collections Text retrieval can work with plain, unformatted data Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 6

41 Relevance as Similarity Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 7

42 Relevance as Similarity A fundamental idea within IR is: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 7

43 Relevance as Similarity A fundamental idea within IR is: A document is relevant to a query if they are similar Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 7

44 Relevance as Similarity A fundamental idea within IR is: A document is relevant to a query if they are similar Similarity can be defined as Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 7

45 Relevance as Similarity A fundamental idea within IR is: A document is relevant to a query if they are similar Similarity can be defined as string matching/comparison Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 7

46 Relevance as Similarity A fundamental idea within IR is: A document is relevant to a query if they are similar Similarity can be defined as string matching/comparison similar vocabulary Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 7

47 Relevance as Similarity A fundamental idea within IR is: A document is relevant to a query if they are similar Similarity can be defined as string matching/comparison similar vocabulary same meaning of text Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 7

48 The Ubiquity of IR Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 8

49 The Ubiquity of IR Information filtering Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 8

50 The Ubiquity of IR Information filtering routing Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 8

51 The Ubiquity of IR Information filtering routing Text categorization Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 8

52 The Ubiquity of IR Information filtering routing Text categorization Detecting information structure Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 8

53 The Ubiquity of IR Information filtering routing Text categorization Detecting information structure Hyperlink generation Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 8

54 The Ubiquity of IR Information filtering routing Text categorization Detecting information structure Hyperlink generation Topic/Information detection/screening Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 8

55 The Ubiquity of IR Information filtering routing Text categorization Detecting information structure Hyperlink generation Topic/Information detection/screening Portal development and maintenance Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 8

56 The Ubiquity of IR Information filtering routing Text categorization Detecting information structure Hyperlink generation Topic/Information detection/screening Portal development and maintenance Question Answering Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 8

57 Some Research Groups in IR Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 9

58 Some Research Groups in IR Industrial IR research: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 9

59 Some Research Groups in IR Industrial IR research: AT&T, NEC, Sun Microsystems, Microsoft, G&E Research, Sabir Research, NTT, AltaVista, Xerox, Q-Go, GO.com (Infoseek), Lexiquest, Answers.com, AnswerLogics, Google, Ask-Jeeves, Lucent Technologies, IBM... Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 9

60 Some Research Groups in IR Industrial IR research: AT&T, NEC, Sun Microsystems, Microsoft, G&E Research, Sabir Research, NTT, AltaVista, Xerox, Q-Go, GO.com (Infoseek), Lexiquest, Answers.com, AnswerLogics, Google, Ask-Jeeves, Lucent Technologies, IBM... Academic IR Groups: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 9

61 Some Research Groups in IR Industrial IR research: AT&T, NEC, Sun Microsystems, Microsoft, G&E Research, Sabir Research, NTT, AltaVista, Xerox, Q-Go, GO.com (Infoseek), Lexiquest, Answers.com, AnswerLogics, Google, Ask-Jeeves, Lucent Technologies, IBM... Academic IR Groups: Cornell, Massachusetts, Twente, Glasgow, Sheffield, Dortmund, Dublin, Stanford, Syracruse, Virginia Tech, Pisa... Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 9

62 Some Research Groups in IR Industrial IR research: AT&T, NEC, Sun Microsystems, Microsoft, G&E Research, Sabir Research, NTT, AltaVista, Xerox, Q-Go, GO.com (Infoseek), Lexiquest, Answers.com, AnswerLogics, Google, Ask-Jeeves, Lucent Technologies, IBM... Academic IR Groups: Cornell, Massachusetts, Twente, Glasgow, Sheffield, Dortmund, Dublin, Stanford, Syracruse, Virginia Tech, Pisa... Other: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 9

63 Some Research Groups in IR Industrial IR research: AT&T, NEC, Sun Microsystems, Microsoft, G&E Research, Sabir Research, NTT, AltaVista, Xerox, Q-Go, GO.com (Infoseek), Lexiquest, Answers.com, AnswerLogics, Google, Ask-Jeeves, Lucent Technologies, IBM... Academic IR Groups: Cornell, Massachusetts, Twente, Glasgow, Sheffield, Dortmund, Dublin, Stanford, Syracruse, Virginia Tech, Pisa... Other: CIA, DARPA, ERCIM, Mitre, NIST... Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 9

64 History of IR Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 10

65 History of IR 1950: Calvin N. Moors coins the term Information Retrieval Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 10

66 History of IR 1950: Calvin N. Moors coins the term Information Retrieval 1959: Luhn describes statistical retrieval Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 10

67 History of IR 1950: Calvin N. Moors coins the term Information Retrieval 1959: Luhn describes statistical retrieval 1960: Maron and Kuhns define a probabilistic model of IR Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 10

68 History of IR 1950: Calvin N. Moors coins the term Information Retrieval 1959: Luhn describes statistical retrieval 1960: Maron and Kuhns define a probabilistic model of IR 1966: Cranfield project defines evaluation measures Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 10

69 History of IR 1950: Calvin N. Moors coins the term Information Retrieval 1959: Luhn describes statistical retrieval 1960: Maron and Kuhns define a probabilistic model of IR 1966: Cranfield project defines evaluation measures 1968: Gerard Salton s first book about the SMART retrieval system Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 10

70 History of IR 1950: Calvin N. Moors coins the term Information Retrieval 1959: Luhn describes statistical retrieval 1960: Maron and Kuhns define a probabilistic model of IR 1966: Cranfield project defines evaluation measures 1968: Gerard Salton s first book about the SMART retrieval system 1972: Lockheed introduces DIALOG as commercial online service Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 10

71 History of IR 1950: Calvin N. Moors coins the term Information Retrieval 1959: Luhn describes statistical retrieval 1960: Maron and Kuhns define a probabilistic model of IR 1966: Cranfield project defines evaluation measures 1968: Gerard Salton s first book about the SMART retrieval system 1972: Lockheed introduces DIALOG as commercial online service Late 1980 s: First PC systems incorporate retrieval Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 10

72 History of IR Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 11

73 History of IR Early 1990 s: Cheap disks lead to the information storage revolution Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 11

74 History of IR Early 1990 s: Cheap disks lead to the information storage revolution 1992: Westlaw is the first large-scale information service using probabilistic retrieval Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 11

75 History of IR Early 1990 s: Cheap disks lead to the information storage revolution 1992: Westlaw is the first large-scale information service using probabilistic retrieval Mid 1990 s: Multi-media databases Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 11

76 History of IR Early 1990 s: Cheap disks lead to the information storage revolution 1992: Westlaw is the first large-scale information service using probabilistic retrieval Mid 1990 s: Multi-media databases 1994: The internet and web explosion Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 11

77 History of IR Early 1990 s: Cheap disks lead to the information storage revolution 1992: Westlaw is the first large-scale information service using probabilistic retrieval Mid 1990 s: Multi-media databases 1994: The internet and web explosion 1995: IR techniques are incorporated in all kinds of information management applications Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 11

78 Overview of the Course Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 12

79 Overview of the Course Basic IR models (week 1 & 2) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 12

80 Overview of the Course Basic IR models (week 1 & 2) Evaluating the quality of IR methods (week 3) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 12

81 Overview of the Course Basic IR models (week 1 & 2) Evaluating the quality of IR methods (week 3) Text representation (week 4) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 12

82 Overview of the Course Basic IR models (week 1 & 2) Evaluating the quality of IR methods (week 3) Text representation (week 4) Components of an IR system (week 5 & 6) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 12

83 Overview of the Course Basic IR models (week 1 & 2) Evaluating the quality of IR methods (week 3) Text representation (week 4) Components of an IR system (week 5 & 6) Improving effectiveness and efficiency (week 6 & 7) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 12

84 Overview of the Course Basic IR models (week 1 & 2) Evaluating the quality of IR methods (week 3) Text representation (week 4) Components of an IR system (week 5 & 6) Improving effectiveness and efficiency (week 6 & 7) Web-based IR (week 8 & 9) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 12

85 Overview of the Course Basic IR models (week 1 & 2) Evaluating the quality of IR methods (week 3) Text representation (week 4) Components of an IR system (week 5 & 6) Improving effectiveness and efficiency (week 6 & 7) Web-based IR (week 8 & 9) Current research themes (week 10) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 12

86 Objectives of the Course Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 13

87 Objectives of the Course At the end of the course you will be able to... Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 13

88 Objectives of the Course At the end of the course you will be able to... Exploit web specific information when searching Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 13

89 Objectives of the Course At the end of the course you will be able to... Exploit web specific information when searching Understand the core components of modern IR systems Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 13

90 Objectives of the Course At the end of the course you will be able to... Exploit web specific information when searching Understand the core components of modern IR systems Understand the potential of IR techniques for today s information society Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 13

91 Objectives of the Course At the end of the course you will be able to... Exploit web specific information when searching Understand the core components of modern IR systems Understand the potential of IR techniques for today s information society Build your own search engine (in principle) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 13

92 Objectives of the Course At the end of the course you will be able to... Exploit web specific information when searching Understand the core components of modern IR systems Understand the potential of IR techniques for today s information society Build your own search engine (in principle) Make some serious dough Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 13

93 Grading etc. Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 14

94 Grading etc. Prerequisites: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 14

95 Prerequisites: Grading etc. Computer literacy (including an account on gene plus the ability to use the unix command line interface) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 14

96 Prerequisites: Grading etc. Computer literacy (including an account on gene plus the ability to use the unix command line interface) Assessment: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 14

97 Prerequisites: Grading etc. Computer literacy (including an account on gene plus the ability to use the unix command line interface) Assessment: Weekly reading assignments (1 or 2 papers per week) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 14

98 Prerequisites: Grading etc. Computer literacy (including an account on gene plus the ability to use the unix command line interface) Assessment: Weekly reading assignments (1 or 2 papers per week) (3-5) assignments Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 14

99 Prerequisites: Grading etc. Computer literacy (including an account on gene plus the ability to use the unix command line interface) Assessment: Weekly reading assignments (1 or 2 papers per week) (3-5) assignments Final exam Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 14

100 Prerequisites: Grading etc. Computer literacy (including an account on gene plus the ability to use the unix command line interface) Assessment: Weekly reading assignments (1 or 2 papers per week) (3-5) assignments Final exam Final mark is obtained as the average of the final exam (60%), assignments (30%) and reading (10%) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 14

101 Web Site of the Course Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 15

102 Web Site of the Course URL: christof/courses/ir/ Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 15

103 Web Site of the Course URL: christof/courses/ir/ Features of the web site: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 15

104 Web Site of the Course URL: christof/courses/ir/ Features of the web site: Some of the reading material is available online Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 15

105 Web Site of the Course URL: christof/courses/ir/ Features of the web site: Some of the reading material is available online Links to universities, companies and people relevant to IR Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 15

106 Web Site of the Course URL: christof/courses/ir/ Features of the web site: Some of the reading material is available online Links to universities, companies and people relevant to IR Printer-friendly versions of the transparancies Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 15

107 Web Site of the Course URL: christof/courses/ir/ Features of the web site: Some of the reading material is available online Links to universities, companies and people relevant to IR Printer-friendly versions of the transparancies Fill out the online form to be added to the mailing list for this course (important!) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 15

108 Retrieval Models Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 16

109 Retrieval Models A retrieval model is an idealization or abstraction of an actual retrieval process Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 16

110 Retrieval Models A retrieval model is an idealization or abstraction of an actual retrieval process Conclusions derived from a model depend on whether the model is a good approximation of the retrieval situation Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 16

111 Retrieval Models A retrieval model is an idealization or abstraction of an actual retrieval process Conclusions derived from a model depend on whether the model is a good approximation of the retrieval situation Note that a retrieval model is not the same thing as a retrieval implementation Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 16

112 Retrieval Models Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 17

113 Retrieval Models document representations User identify relevant information query formulation display documents to the user Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 17

114 Components of a Retrieval Model Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 18

115 Components of a Retrieval Model The user: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 18

116 Components of a Retrieval Model The user: Search expert (e.g., librarian) vs. non-expert Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 18

117 Components of a Retrieval Model The user: Search expert (e.g., librarian) vs. non-expert Backgound of the user (knowledge of the topic) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 18

118 Components of a Retrieval Model The user: Search expert (e.g., librarian) vs. non-expert Backgound of the user (knowledge of the topic) In-depth searching vs. just-wanna-get-an-idea searching Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 18

119 Components of a Retrieval Model The user: Search expert (e.g., librarian) vs. non-expert Backgound of the user (knowledge of the topic) In-depth searching vs. just-wanna-get-an-idea searching The documents: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 18

120 Components of a Retrieval Model The user: Search expert (e.g., librarian) vs. non-expert Backgound of the user (knowledge of the topic) In-depth searching vs. just-wanna-get-an-idea searching The documents: Different languages Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 18

121 Components of a Retrieval Model The user: Search expert (e.g., librarian) vs. non-expert Backgound of the user (knowledge of the topic) In-depth searching vs. just-wanna-get-an-idea searching The documents: Different languages Semi-structured (e.g. HTML or XML) vs. plain Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 18

122 Document Representation Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 19

123 Meta-descriptions Document Representation Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 19

124 Document Representation Meta-descriptions Field information (author, title, date) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 19

125 Document Representation Meta-descriptions Field information (author, title, date) Key words Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 19

126 Document Representation Meta-descriptions Field information (author, title, date) Key words - Predefined Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 19

127 Document Representation Meta-descriptions Field information (author, title, date) Key words - Predefined - Manually extracted (by author/editor) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 19

128 Document Representation Meta-descriptions Field information (author, title, date) Key words - Predefined - Manually extracted (by author/editor) Content: automatically identifying what the document is about Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 19

129 Document Representation Controlled Vocabulary Free Text Manual Automatic Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 20

130 Document Representation Controlled Vocabulary Free Text Manual Current indexing practice Automatic Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 20

131 Document Representation Manual Automatic Controlled Current indexing Text categorization Vocabulary practice intelligent IR Free Text Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 20

132 Document Representation Manual Automatic Controlled Current indexing Text categorization Vocabulary practice intelligent IR Current indexing Free Text practice Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 20

133 Document Representation Manual Automatic Controlled Current indexing Text categorization Vocabulary practice intelligent IR Current indexing Text search engines Free Text practice statistical IR Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 20

134 Controlled Vocabularies Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 21

135 Examples are: Controlled Vocabularies Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 21

136 Examples are: Controlled Vocabularies ACM Computing Classification System Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 21

137 Examples are: Controlled Vocabularies ACM Computing Classification System An article on Web search engines would (probably) be classified as H.3.5 where: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 21

138 Examples are: Controlled Vocabularies ACM Computing Classification System An article on Web search engines would (probably) be classified as H.3.5 where: - H: Information Systems Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 21

139 Examples are: Controlled Vocabularies ACM Computing Classification System An article on Web search engines would (probably) be classified as H.3.5 where: - H: Information Systems - H.3: Information Storage and Retrieval Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 21

140 Examples are: Controlled Vocabularies ACM Computing Classification System An article on Web search engines would (probably) be classified as H.3.5 where: - H: Information Systems - H.3: Information Storage and Retrieval - H.3.5: Online Information Services Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 21

141 Examples are: Controlled Vocabularies ACM Computing Classification System An article on Web search engines would (probably) be classified as H.3.5 where: - H: Information Systems - H.3: Information Storage and Retrieval - H.3.5: Online Information Services NLM Medical Subject Headings (MeSH) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 21

142 Examples are: Controlled Vocabularies ACM Computing Classification System An article on Web search engines would (probably) be classified as H.3.5 where: - H: Information Systems - H.3: Information Storage and Retrieval - H.3.5: Online Information Services NLM Medical Subject Headings (MeSH) Yahoo! Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 21

143 Manual vs. Automatic Indexing Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 22

144 Manual vs. Automatic Indexing Pros of manual indexing: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 22

145 Manual vs. Automatic Indexing Pros of manual indexing: + Human judgements are most reliable Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 22

146 Manual vs. Automatic Indexing Pros of manual indexing: + Human judgements are most reliable + Searching controlled vocabularies is more efficient Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 22

147 Manual vs. Automatic Indexing Pros of manual indexing: + Human judgements are most reliable + Searching controlled vocabularies is more efficient Cons of manual indexing: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 22

148 Manual vs. Automatic Indexing Pros of manual indexing: + Human judgements are most reliable + Searching controlled vocabularies is more efficient Cons of manual indexing: Time consuming Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 22

149 Manual vs. Automatic Indexing Pros of manual indexing: + Human judgements are most reliable + Searching controlled vocabularies is more efficient Cons of manual indexing: Time consuming The person using the retrieval system has to be familiar with the classification system Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 22

150 Manual vs. Automatic Indexing Pros of manual indexing: + Human judgements are most reliable + Searching controlled vocabularies is more efficient Cons of manual indexing: Time consuming The person using the retrieval system has to be familiar with the classification system Classification systems are sometimes incoherent Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 22

151 Automatic Content Representation Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 23

152 Automatic Content Representation Using natural language understanding? Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 23

153 Automatic Content Representation Using natural language understanding? Computationally too expensive in real-world settings Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 23

154 Automatic Content Representation Using natural language understanding? Computationally too expensive in real-world settings Coverage Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 23

155 Automatic Content Representation Using natural language understanding? Computationally too expensive in real-world settings Coverage Language dependence Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 23

156 Automatic Content Representation Using natural language understanding? Computationally too expensive in real-world settings Coverage Language dependence The resulting representations may be too explicit to deal with the vagueness of a user s information need Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 23

157 Automatic Content Representation Using natural language understanding? Computationally too expensive in real-world settings Coverage Language dependence The resulting representations may be too explicit to deal with the vagueness of a user s information need Alternative: a document is simply an unstructured set of words appearing in it: bag of words Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 23

158 Bag-of-Words Approach Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

159 Bag-of-Words Approach A document is an unordered list of words Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

160 Bag-of-Words Approach A document is an unordered list of words Grammatical information is lost Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

161 Bag-of-Words Approach A document is an unordered list of words Grammatical information is lost Tokenization: What is a word? Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

162 Bag-of-Words Approach A document is an unordered list of words Grammatical information is lost Tokenization: What is a word? Is White House one or two words? Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

163 Bag-of-Words Approach A document is an unordered list of words Grammatical information is lost Tokenization: What is a word? Is White House one or two words? Case folding Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

164 Bag-of-Words Approach A document is an unordered list of words Grammatical information is lost Tokenization: What is a word? Is White House one or two words? Case folding President Bush becomes president, bush Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

165 Bag-of-Words Approach A document is an unordered list of words Grammatical information is lost Tokenization: What is a word? Is White House one or two words? Case folding President Bush becomes president, bush Stemming or lemmatization Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

166 Bag-of-Words Approach A document is an unordered list of words Grammatical information is lost Tokenization: What is a word? Is White House one or two words? Case folding President Bush becomes president, bush Stemming or lemmatization Morphological information is thrown away Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

167 Bag-of-Words Approach A document is an unordered list of words Grammatical information is lost Tokenization: What is a word? Is White House one or two words? Case folding President Bush becomes president, bush Stemming or lemmatization Morphological information is thrown away agreements becomes agreement (lemmatization) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

168 Bag-of-Words Approach A document is an unordered list of words Grammatical information is lost Tokenization: What is a word? Is White House one or two words? Case folding President Bush becomes president, bush Stemming or lemmatization Morphological information is thrown away agreements becomes agreement (lemmatization) or even agree (stemming) Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 24

169 Example Bag of Words Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 25

170 Example Bag of Words Scientists have found compelling new evidence of possible ancient microscopic life on Mars, derived from magnetic crystals in a meteorite that fell to Earth from the red planet, NASA announced on Monday. Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 25

171 Example Bag of Words Scientists have found compelling new evidence of possible ancient microscopic life on Mars, derived from magnetic crystals in a meteorite that fell to Earth from the red planet, NASA announced on Monday. a, ancient, announced, compelling, crystals, derived, earth, evidence, fell, found, from (2 ), have, in, life, magnetic, mars, meteorite, microscopic, monday, nasa, new, of, on (2 ), planet, possible, red, scientists, that, the, to Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 25

172 What is this about? Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 26

173 What is this about?? added, al, an, and, ballots, been, completed, count, county (2 ), even, former, gore, ground, had, hand, have (2 ), he, if, in (2 ), independent, lost, many, miamidade, might, new, not, of, president, presidential, requested, shows, study, that, the, vice, votes, would Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 26

174 What is this about?? = added, al, an, and, ballots, been, completed, count, county (2 ), even, former, gore, ground, had, hand, have (2 ), he, if, in (2 ), independent, lost, many, miamidade, might, new, not, of, president, presidential, requested, shows, study, that, the, vice, votes, would An independent study shows former Vice President Al Gore would not have added many new votes in Miami-Dade County and might even have lost ground in that county, if the hand count of presidential ballots he requested had been completed. Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 26

175 Boolean Retrieval Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

176 Boolean Retrieval Boolean operators are: AND (NEAR), OR, NOT Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

177 Boolean Retrieval Boolean operators are: AND (NEAR), OR, NOT The semantics of the Boolean operators: Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

178 Boolean Retrieval Boolean operators are: AND (NEAR), OR, NOT The semantics of the Boolean operators: t 1 AND t 2 = Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

179 Boolean Retrieval Boolean operators are: AND (NEAR), OR, NOT The semantics of the Boolean operators: t 1 AND t 2 = {d t 1 r(d)} {d t 2 r(d)} Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

180 Boolean Retrieval Boolean operators are: AND (NEAR), OR, NOT The semantics of the Boolean operators: t 1 AND t 2 = {d t 1 r(d)} {d t 2 r(d)} Documents whose representation contains t 1 and t 2 Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

181 Boolean Retrieval Boolean operators are: AND (NEAR), OR, NOT The semantics of the Boolean operators: t 1 AND t 2 = {d t 1 r(d)} {d t 2 r(d)} Documents whose representation contains t 1 and t 2 t 1 OR t 2 = Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

182 Boolean Retrieval Boolean operators are: AND (NEAR), OR, NOT The semantics of the Boolean operators: t 1 AND t 2 = {d t 1 r(d)} {d t 2 r(d)} Documents whose representation contains t 1 and t 2 t 1 OR t 2 = {d t 1 r(d)} {d t 2 r(d)} Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

183 Boolean Retrieval Boolean operators are: AND (NEAR), OR, NOT The semantics of the Boolean operators: t 1 AND t 2 = {d t 1 r(d)} {d t 2 r(d)} Documents whose representation contains t 1 and t 2 t 1 OR t 2 = {d t 1 r(d)} {d t 2 r(d)} Documents whose representation contains t 1 or t 2 Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

184 Boolean Retrieval Boolean operators are: AND (NEAR), OR, NOT The semantics of the Boolean operators: t 1 AND t 2 = {d t 1 r(d)} {d t 2 r(d)} Documents whose representation contains t 1 and t 2 t 1 OR t 2 = {d t 1 r(d)} {d t 2 r(d)} Documents whose representation contains t 1 or t 2 NOT t 1 = Introduction to Information Retrieval, Spring 2002, Week 1 Copyright c Christof Monz & Maarten de Rijke 27

Text Analytics. Introduction to Information Retrieval. Ulf Leser

Text Analytics. Introduction to Information Retrieval. Ulf Leser Text Analytics Introduction to Information Retrieval Ulf Leser Summary of last lecture Ulf Leser: Text Analytics, Winter Semester 2010/2011 2 Content of this Lecture Information Retrieval Introduction

More information

Search and Information Retrieval

Search and Information Retrieval Search and Information Retrieval Search on the Web 1 is a daily activity for many people throughout the world Search and communication are most popular uses of the computer Applications involving search

More information

Text Analytics. Introduction to Information Retrieval. Ulf Leser

Text Analytics. Introduction to Information Retrieval. Ulf Leser Text Analytics Introduction to Information Retrieval Ulf Leser Content of this Lecture Information Retrieval Introduction Documents Queries Related topics A first idea: Boolean queries and vector space

More information

Web Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it

Web Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it Web Mining Margherita Berardi LACAM Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it Bari, 24 Aprile 2003 Overview Introduction Knowledge discovery from text (Web Content

More information

Database Systems. Lecture 1: Introduction

Database Systems. Lecture 1: Introduction Database Systems Lecture 1: Introduction General Information Professor: Leonid Libkin Contact: libkin@ed.ac.uk Lectures: Tuesday, 11:10am 1 pm, AT LT4 Website: http://homepages.inf.ed.ac.uk/libkin/teach/dbs09/index.html

More information

Introduction to IR Systems: Supporting Boolean Text Search. Information Retrieval. IR vs. DBMS. Chapter 27, Part A

Introduction to IR Systems: Supporting Boolean Text Search. Information Retrieval. IR vs. DBMS. Chapter 27, Part A Introduction to IR Systems: Supporting Boolean Text Search Chapter 27, Part A Database Management Systems, R. Ramakrishnan 1 Information Retrieval A research field traditionally separate from Databases

More information

Precision and Relative Recall of Search Engines: A Comparative Study of Google and Yahoo

Precision and Relative Recall of Search Engines: A Comparative Study of Google and Yahoo and Relative Recall of Engines: A Comparative Study of Google and Yahoo B.T. Sampath Kumar J.N. Prakash Kuvempu University Abstract This paper compared the retrieval effectiveness of the Google and Yahoo.

More information

Technical challenges in web advertising

Technical challenges in web advertising Technical challenges in web advertising Andrei Broder Yahoo! Research 1 Disclaimer This talk presents the opinions of the author. It does not necessarily reflect the views of Yahoo! Inc. 2 Advertising

More information

Content Management Software Drupal : Open Source Software to create library website

Content Management Software Drupal : Open Source Software to create library website Content Management Software Drupal : Open Source Software to create library website S.Satish, Asst Library & Information Officer National Institute of Epidemiology (ICMR) R-127, Third Avenue, Tamil Nadu

More information

Information access through information technology

Information access through information technology Information access through information technology 1 Created to support an invited lecture at the International Conference MDGICT 2009 in Tamil Nadu, India, December 2009 by Paul.Nieuwenhuysen@vub.ac.be

More information

Mining Text Data: An Introduction

Mining Text Data: An Introduction Bölüm 10. Metin ve WEB Madenciliği http://ceng.gazi.edu.tr/~ozdemir Mining Text Data: An Introduction Data Mining / Knowledge Discovery Structured Data Multimedia Free Text Hypertext HomeLoan ( Frank Rizzo

More information

Combining RDF and Agent-Based Architectures for Semantic Interoperability in Digital Libraries

Combining RDF and Agent-Based Architectures for Semantic Interoperability in Digital Libraries Combining RDF and Agent-Based Architectures for Semantic Interoperability in Digital Libraries Norbert Fuhr, Claus-Peter Klas University of Dortmund, Germany {fuhr,klas}@ls6.cs.uni-dortmund.de 1 Introduction

More information

Introduction to Database Systems CS4320. Instructor: Christoph Koch koch@cs.cornell.edu CS 4320 1

Introduction to Database Systems CS4320. Instructor: Christoph Koch koch@cs.cornell.edu CS 4320 1 Introduction to Database Systems CS4320 Instructor: Christoph Koch koch@cs.cornell.edu CS 4320 1 CS4320/1: Introduction to Database Systems Underlying theme: How do I build a data management system? CS4320

More information

Module 5 The Internet as an Information Resource

Module 5 The Internet as an Information Resource Module 5 The Internet as an Information Resource Lesson 2 How to search for Information on the Internet. UNESCO EIPICT MODULE 5. LESSON 2 1 Scope What are the ways to find information on the Internet?

More information

College of Communication and Information. Library and Information Science

College of Communication and Information. Library and Information Science 510 CHILDREN S LITERATURE AND RELATED MATERIALS. (3) A survey of children s literature, traditional and modern. Reading and evaluation of books with multimedia materials with emphasis on the needs and

More information

Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset.

Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset. White Paper Using LSI for Implementing Document Management Systems Turning unstructured data from a liability to an asset. Using LSI for Implementing Document Management Systems By Mike Harrison, Director,

More information

Medical Information-Retrieval Systems. Dong Peng Medical Informatics Group

Medical Information-Retrieval Systems. Dong Peng Medical Informatics Group Medical Information-Retrieval Systems Dong Peng Medical Informatics Group Outline Evolution of medical Information-Retrieval (IR). The information retrieval process. The trend of medical information retrieval

More information

Flattening Enterprise Knowledge

Flattening Enterprise Knowledge Flattening Enterprise Knowledge Do you Control Your Content or Does Your Content Control You? 1 Executive Summary: Enterprise Content Management (ECM) is a common buzz term and every IT manager knows it

More information

Organization of Information

Organization of Information 1 College of Information Studies University of Maryland Organization of Information Syllabus for LBSC 670 Online Course Fall 2011 Developed by Prof. T. Kanti Srikantaiah, Ph.D. Email: tsrikant@umd.edu

More information

Module Two - Searching Tools

Module Two - Searching Tools Introduction Module Two - Searching Tools There are various searching tools available to health professionals in both print and electronic formats. These include, among others: Online Public Access Catalogues,

More information

Challenges in Running a Commercial Web Search Engine. Amit Singhal

Challenges in Running a Commercial Web Search Engine. Amit Singhal Challenges in Running a Commercial Web Search Engine Amit Singhal Overview Introduction/History Search Engine Spam Evaluation Challenge Google Introduction Crawling Follow links to find information Indexing

More information

Welcome to echalk A Guide For Students. Introduction. Contents:

Welcome to echalk A Guide For Students. Introduction. Contents: Welcome to echalk A Guide For Students Introduction echalk is an online learning environment that connects students, teachers, parents and administrators within your school and district. echalk provides

More information

Emerging Career Trends for Information Professionals: A Snapshot of Job Titles in Summer 2013

Emerging Career Trends for Information Professionals: A Snapshot of Job Titles in Summer 2013 Emerging Career Trends for Information Professionals: A Snapshot of Job Titles in Summer 2013 Introduction This report provides an informal snapshot regarding some of the latest career trends for information

More information

Information Need Assessment in Information Retrieval

Information Need Assessment in Information Retrieval Information Need Assessment in Information Retrieval Beyond Lists and Queries Frank Wissbrock Department of Computer Science Paderborn University, Germany frankw@upb.de Abstract. The goal of every information

More information

DIABLO VALLEY COLLEGE CATALOG 2014-2015

DIABLO VALLEY COLLEGE CATALOG 2014-2015 COMPUTER SCIENCE COMSC The computer science department offers courses in three general areas, each targeted to serve students with specific needs: 1. General education students seeking a computer literacy

More information

College of Communication and Information. Library and Information Science

College of Communication and Information. Library and Information Science 510 CHILDREN S LITERATURE AND RELATED MATERIALS. (3) A survey of children s literature, traditional and modern. Reading and evaluation of books with multimedia materials with emphasis on the needs and

More information

Audit Management Reference

Audit Management Reference www.novell.com/documentation Audit Management Reference ZENworks 11 Support Pack 3 February 2014 Legal Notices Novell, Inc., makes no representations or warranties with respect to the contents or use of

More information

LDAP andUsers Profile - A Quick Comparison

LDAP andUsers Profile - A Quick Comparison Using LDAP in a Filtering Service for a Digital Library João Ferreira (**) José Luis Borbinha (*) INESC Instituto de Enghenharia de Sistemas e Computatores José Delgado (*) INESC Instituto de Enghenharia

More information

Online Assessment of Information Competence

Online Assessment of Information Competence Online Assessment of Information Competence Kathy Dabbour, Assessment Coordinator Oviatt Library May 15, 2009 1 Information Competence @ CSUN CSUN's General Education IC outcome: Students will progressively

More information

Building a Spanish MMTx by using Automatic Translation and Biomedical Ontologies

Building a Spanish MMTx by using Automatic Translation and Biomedical Ontologies Building a Spanish MMTx by using Automatic Translation and Biomedical Ontologies Francisco Carrero 1, José Carlos Cortizo 1,2, José María Gómez 3 1 Universidad Europea de Madrid, C/Tajo s/n, Villaviciosa

More information

A.I. in health informatics lecture 1 introduction & stuff kevin small & byron wallace

A.I. in health informatics lecture 1 introduction & stuff kevin small & byron wallace A.I. in health informatics lecture 1 introduction & stuff kevin small & byron wallace what is this class about? health informatics managing and making sense of biomedical information but mostly from an

More information

Chapter 5 Use the technological Tools for accessing Information 5.1 Overview of Databases are organized collections of information.

Chapter 5 Use the technological Tools for accessing Information 5.1 Overview of Databases are organized collections of information. Chapter 5 Use the technological Tools for accessing Information 5.1 Overview of Databases are organized collections of information. Ex: Online Catalogs. 1. Structure of a Database - Contains record and

More information

RARITAN VALLEY COMMUNITY COLLEGE COMPUTER SCIENCE (CS) DEPARTMENT. CISY 102 - Computer Literacy

RARITAN VALLEY COMMUNITY COLLEGE COMPUTER SCIENCE (CS) DEPARTMENT. CISY 102 - Computer Literacy I. Basic Course Information RARITAN VALLEY COMMUNITY COLLEGE COMPUTER SCIENCE (CS) DEPARTMENT CISY 102 - Computer Literacy A. Course Number and Title: CISY-102, Computer Literacy B. Date of Proposal or

More information

Digital Libraries and Content Management

Digital Libraries and Content Management Digital Libraries and Content Management Database Research Group, University of Rostock 4th European IBM Content Manager and Media Workshop, September 2002, Essen 0. Overview 1. Content Management Systems

More information

An Overview of a Role of Natural Language Processing in An Intelligent Information Retrieval System

An Overview of a Role of Natural Language Processing in An Intelligent Information Retrieval System An Overview of a Role of Natural Language Processing in An Intelligent Information Retrieval System Asanee Kawtrakul ABSTRACT In information-age society, advanced retrieval technique and the automatic

More information

Text Analytics Software Choosing the Right Fit

Text Analytics Software Choosing the Right Fit Text Analytics Software Choosing the Right Fit Tom Reamy Chief Knowledge Architect KAPS Group http://www.kapsgroup.com Text Analytics World San Francisco, 2013 Agenda Introduction Text Analytics Basics

More information

Content Analyst's Cerebrant Combines SaaS Discovery, Machine Learning, and Content to Perform Next-Generation Research

Content Analyst's Cerebrant Combines SaaS Discovery, Machine Learning, and Content to Perform Next-Generation Research INSIGHT Content Analyst's Cerebrant Combines SaaS Discovery, Machine Learning, and Content to Perform Next-Generation Research David Schubmehl IDC OPINION Organizations are looking for better ways to perform

More information

"An examination of the cultural impact of traditional and future information systems on diverse populations."

An examination of the cultural impact of traditional and future information systems on diverse populations. 1 of 6 7/21/2011 9:38 AM Home Courses Syllabi Fall 10 Updated Wed, 03/23/2011-16:59 COURSE NAME, NUMBER AND PREREQUISITES: [Prerequisite: IRLS 504 or consent of the instructor.] This course satisfies the

More information

Extract Archived Data from SAP ERP

Extract Archived Data from SAP ERP How-to Guide SAP NetWeaver 7.0 How To Extract Archived Data from SAP ERP Version 1.00 May 2006 Applicable Releases: SAP NetWeaver 7.0 (BI capability) Copyright 2008 SAP AG. All rights reserved. No part

More information

One of the main reasons for the Web s success

One of the main reasons for the Web s success Editor: Peiya Liu Siemens Corporate Research Metadata Standards for Web-Based Resources Achim Steinacker University of Technology, Darmstadt Amir Ghavam University of Ottawa Ralf Steinmetz German National

More information

1 o Semestre 2007/2008

1 o Semestre 2007/2008 Departamento de Engenharia Informática Instituto Superior Técnico 1 o Semestre 2007/2008 Outline 1 2 3 4 5 Outline 1 2 3 4 5 Exploiting Text How is text exploited? Two main directions Extraction Extraction

More information

College of Communications and Information Studies

College of Communications and Information Studies 510 CHILDREN S LITERATURE AND RELATED MATERIALS. (3) A survey of children s literature, traditional and modern. Reading and evaluation of books with multimedia materials with emphasis on the needs and

More information

Five Phases. The History of the Internet and World-Wide-Web. Long Distance LAN. internet. Internet. Tool Building

Five Phases. The History of the Internet and World-Wide-Web. Long Distance LAN. internet. Internet. Tool Building Five Phases The History of the Internet and World-Wide-Web Charles Severance Michigan State University Long Distance Networking 1966-1973 Network of Networks internet 1974-1985 internet becomes Internet

More information

Using SAP Logon Tickets for Single Sign on to Microsoft based web applications

Using SAP Logon Tickets for Single Sign on to Microsoft based web applications Collaboration Technology Support Center - Microsoft - Collaboration Brief March 2005 Using SAP Logon Tickets for Single Sign on to Microsoft based web applications André Fischer, Project Manager CTSC,

More information

CONCEPTCLASSIFIER FOR SHAREPOINT

CONCEPTCLASSIFIER FOR SHAREPOINT CONCEPTCLASSIFIER FOR SHAREPOINT PRODUCT OVERVIEW The only SharePoint 2007 and 2010 solution that delivers automatic conceptual metadata generation, auto-classification and powerful taxonomy tools running

More information

Installation & User Guide

Installation & User Guide SharePoint List Filter Plus Web Part Installation & User Guide Copyright 2005-2011 KWizCom Corporation. All rights reserved. Company Headquarters KWizCom 50 McIntosh Drive, Unit 109 Markham, Ontario ON

More information

ONLINE CONSUMER BEHAVIOR AND ADVERTISEMENT

ONLINE CONSUMER BEHAVIOR AND ADVERTISEMENT ONLINE CONSUMER BEHAVIOR AND ADVERTISEMENT Spring 2011 e-commerce Implementation Understanding customer decision making How will customer decide what to purchase and who to buy from? What factors will

More information

Get the most value from your surveys with text analysis

Get the most value from your surveys with text analysis PASW Text Analytics for Surveys 3.0 Specifications Get the most value from your surveys with text analysis The words people use to answer a question tell you a lot about what they think and feel. That

More information

Integration of Universal Worklist into Microsoft Office SharePoint

Integration of Universal Worklist into Microsoft Office SharePoint Integration of Universal Worklist into Microsoft Office SharePoint Applies to: SAP NetWeaver Portal 7.01 SP3 Microsoft Office SharePoint 2007 For more information, visit the Portal and Collaboration homepage.

More information

CDMG 3607 Digital Asset Management INSTRUCTIONAL OBJECTIVES

CDMG 3607 Digital Asset Management INSTRUCTIONAL OBJECTIVES New York City College of Technology The City University of New York Department of Communication Design CDMG 3607 Digital Asset Management Course Description This course focuses on the terminology, techniques,

More information

Please see current textbook prices at www.rcgc.bncollege.com

Please see current textbook prices at www.rcgc.bncollege.com ENG235: AMERICAN FILM CLASSICS SYLLABUS LECTURE HOURS/CREDITS: 3/3 CATALOG DESCRIPTION Prerequisite: ENG101 English Composition I This survey of the American film industry emphasizes its development as

More information

that differ from that of a basic online search:

that differ from that of a basic online search: Searching Online Databases: A Brief Tutorial Searching an online databaseutilizes methods that differ from that of a basic online search: Controlled vocabulary Indexed terms or Keywords Subject Headings

More information

PMML and UIMA Based Frameworks for Deploying Analytic Applications and Services

PMML and UIMA Based Frameworks for Deploying Analytic Applications and Services PMML and UIMA Based Frameworks for Deploying Analytic Applications and Services David Ferrucci 1, Robert L. Grossman 2 and Anthony Levas 1 1. Introduction - The Challenges of Deploying Analytic Applications

More information

CS2Bh: Current Technologies. Introduction to XML and Relational Databases. Introduction to Databases. Why databases? Why not use XML?

CS2Bh: Current Technologies. Introduction to XML and Relational Databases. Introduction to Databases. Why databases? Why not use XML? CS2Bh: Current Technologies Introduction to XML and Relational Databases Spring 2005 Introduction to Databases CS2 Spring 2005 (LN5) 1 Why databases? Why not use XML? What is missing from XML: Consistency

More information

Best of the Solar System

Best of the Solar System Best of the Solar System Topic Area: Solar system, planets and moons Purpose: Introduce students to planetary research and familiarize them with the planets and their features. Key Questions: What are

More information

Lecture Overview. Web 2.0, Tagging, Multimedia, Folksonomies, Lecture, Important, Must Attend, Web 2.0 Definition. Web 2.

Lecture Overview. Web 2.0, Tagging, Multimedia, Folksonomies, Lecture, Important, Must Attend, Web 2.0 Definition. Web 2. Lecture Overview Web 2.0, Tagging, Multimedia, Folksonomies, Lecture, Important, Must Attend, Martin Halvey Introduction to Web 2.0 Overview of Tagging Systems Overview of tagging Design and attributes

More information

Search Taxonomy. Web Search. Search Engine Optimization. Information Retrieval

Search Taxonomy. Web Search. Search Engine Optimization. Information Retrieval Information Retrieval INFO 4300 / CS 4300! Retrieval models Older models» Boolean retrieval» Vector Space model Probabilistic Models» BM25» Language models Web search» Learning to Rank Search Taxonomy!

More information

HR Data Retrieval in a LDAP- Enabled Directory Service

HR Data Retrieval in a LDAP- Enabled Directory Service HR Data Retrieval in a LDAP- Enabled Directory Service HELP.PORTMANAGER Release 50A Copyright Copyright 2001 SAP AG. All rights reserved. No part of this publication may be reproduced or transmitted in

More information

Xcelsius Dashboards on SAP NetWaver BW Implementation Best Practices

Xcelsius Dashboards on SAP NetWaver BW Implementation Best Practices Xcelsius Dashboards on SAP NetWaver BW Implementation Best Practices Patrice Le Bihan, SAP Intelligence Platform & NetWeaver RIG, Americas Dr. Gerd Schöffl, SAP Intelligence Platform & NetWeaver RIG, EMEA

More information

How to Create Web Dynpro-Based iviews. Based on SAP NetWeaver 04 Stack 09. Jochen Guertler

How to Create Web Dynpro-Based iviews. Based on SAP NetWeaver 04 Stack 09. Jochen Guertler How to Create Web Dynpro-Based iviews Based on SAP NetWeaver 04 Stack 09 Jochen Guertler Copyright Copyright 2004 SAP AG. All rights reserved. No part of this publication may be reproduced or transmitted

More information

Websense Certified Engineer Web Security Professional Examination Specification

Websense Certified Engineer Web Security Professional Examination Specification Websense Certified Engineer Web Security Professional Examination Specification Introduction This is an exam specification for the Websense Certified Engineer - Web Security Professional examination. The

More information

Publish Acrolinx Terminology Changes via RSS

Publish Acrolinx Terminology Changes via RSS Publish Acrolinx Terminology Changes via RSS Only a limited number of people regularly access the Acrolinx Dashboard to monitor updates to terminology, but everybody uses an email program all the time.

More information

Digital Marketing Training Institute

Digital Marketing Training Institute Our USP Live Training Expert Faculty Personalized Training Post Training Support Trusted Institute 5+ Years Experience Flexible Batches Certified Trainers Digital Marketing Training Institute Mumbai Branch:

More information

CHAPTER 9: THE EVOLVING INTERNET

CHAPTER 9: THE EVOLVING INTERNET CHAPTER 9: THE EVOLVING INTERNET Multiple Choice: 1. What was the department of the U.S. government that developed the initial stages of the Internet? A. Department of Commerce B. Department of Defense

More information

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY

ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY ONTOLOGY-BASED APPROACH TO DEVELOPMENT OF ADJUSTABLE KNOWLEDGE INTERNET PORTAL FOR SUPPORT OF RESEARCH ACTIVITIY Yu. A. Zagorulko, O. I. Borovikova, S. V. Bulgakov, E. A. Sidorova 1 A.P.Ershov s Institute

More information

Taxonomy Enterprise System Search Makes Finding Files Easy

Taxonomy Enterprise System Search Makes Finding Files Easy Taxonomy Enterprise System Search Makes Finding Files Easy 1 Your Regular Enterprise Search System Can be Improved by Integrating it With the Taxonomy Enterprise Search System Regular Enterprise Search

More information

SQL Server 2005 Reporting Services (SSRS)

SQL Server 2005 Reporting Services (SSRS) SQL Server 2005 Reporting Services (SSRS) Author: Alex Payne and Brian Welcker Published: May 2005 Summary: SQL Server 2005 Reporting Services is a key component of SQL Server 2005. Reporting Services

More information

Automatic Document Categorization A Hummingbird White Paper

Automatic Document Categorization A Hummingbird White Paper Automatic Document Categorization A Hummingbird White Paper Automatic Document Categorization While every attempt has been made to ensure the accuracy and completeness of the information in this document,

More information

Why dread a bump on the head?

Why dread a bump on the head? Why dread a bump on the head? The neuroscience of traumatic brain injury Lesson 6: Exploring the data behind brain injury I. Overview This lesson exposes students to the role data access and analysis can

More information

WHITEPAPER MANAGE YOUR PAPER MATERIAL WITHIN YOUR CRM SYSTEM

WHITEPAPER MANAGE YOUR PAPER MATERIAL WITHIN YOUR CRM SYSTEM WHITEPAPER MANAGE YOUR PAPER MATERIAL WITHIN YOUR CRM SYSTEM WHITEPAPER MANAGE YOUR PAPER MATERIAL WITHIN YOUR CRM SYSTEM 2 ABOUT There are two main types of data within the enterprise: structured and

More information

Sanjeev Kumar. contribute

Sanjeev Kumar. contribute RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 sanjeevk@iasri.res.in 1. Introduction The field of data mining and knowledgee discovery is emerging as a

More information

US Patent and Trademark Office Department of Commerce

US Patent and Trademark Office Department of Commerce US Patent and Trademark Office Department of Commerce Request for Comments Regarding Prior Art Resources for Use in the Examination of Software-Related Patent Applications [Docket No.: PTO-P-2013-0064]

More information

Unit 5.1 The Database Concept

Unit 5.1 The Database Concept Unit 5.1 The Database Concept Candidates should be able to: What is a Database? A database is a persistent, organised store of related data. Persistent Data and structures are maintained when data handling

More information

Lecture 1: Introduction and the Boolean Model

Lecture 1: Introduction and the Boolean Model Lecture 1: Introduction and the Boolean Model Information Retrieval Computer Science Tripos Part II Simone Teufel Natural Language and Information Processing (NLIP) Group Simone.Teufel@cl.cam.ac.uk 1 Overview

More information

University of Massachusetts Boston Applied Linguistics Graduate Program. APLING 601 Introduction to Linguistics. Syllabus

University of Massachusetts Boston Applied Linguistics Graduate Program. APLING 601 Introduction to Linguistics. Syllabus University of Massachusetts Boston Applied Linguistics Graduate Program APLING 601 Introduction to Linguistics Syllabus Course Description: This course examines the nature and origin of language, the history

More information

Text Analytics Evaluation Case Study - Amdocs

Text Analytics Evaluation Case Study - Amdocs Text Analytics Evaluation Case Study - Amdocs Tom Reamy Chief Knowledge Architect KAPS Group http://www.kapsgroup.com Text Analytics World October 20 New York Agenda Introduction Text Analytics Basics

More information

Folksonomies versus Automatic Keyword Extraction: An Empirical Study

Folksonomies versus Automatic Keyword Extraction: An Empirical Study Folksonomies versus Automatic Keyword Extraction: An Empirical Study Hend S. Al-Khalifa and Hugh C. Davis Learning Technology Research Group, ECS, University of Southampton, Southampton, SO17 1BJ, UK {hsak04r/hcd}@ecs.soton.ac.uk

More information

IT Challenges for the Library and Information Studies Sector

IT Challenges for the Library and Information Studies Sector IT Challenges for the Library and Information Studies Sector This document is intended to facilitate and stimulate discussion at the e-science Scoping Study Expert Seminar for Library and Information Studies.

More information

The Value of Taxonomy Management Research Results

The Value of Taxonomy Management Research Results Taxonomy Strategies November 28, 2012 Copyright 2012 Taxonomy Strategies. All rights reserved. The Value of Taxonomy Management Research Results Joseph A Busch, Principal What does taxonomy do for search?

More information

Microsoft Windows SharePoint

Microsoft Windows SharePoint Microsoft Windows SharePoint SharePoint Basics Introduction What is Microsoft SharePoint? SharePoint is a tool to connect people and information. It provides a central site for sharing information with

More information

Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives

Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives Intelligent Search for Answering Clinical Questions Coronado Group, Ltd. Innovation Initiatives Search The Way You Think Copyright 2009 Coronado, Ltd. All rights reserved. All other product names and logos

More information

Audit TM. The Security Auditing Component of. Out-of-the-Box

Audit TM. The Security Auditing Component of. Out-of-the-Box Audit TM The Security Auditing Component of Out-of-the-Box This guide is intended to provide a quick reference and tutorial to the principal features of Audit. Please refer to the User Manual for more

More information

Richmond Systems. SupportDesk Web Interface User Guide

Richmond Systems. SupportDesk Web Interface User Guide Richmond Systems SupportDesk Web Interface User Guide 1 Contents SUPPORTDESK WEB INTERFACE...3 INTRODUCTION TO THE WEB INTERFACE...3 FEATURES OF THE WEB INTERFACE...3 HELPDESK SPECIALIST LOGIN...4 SEARCHING

More information

PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.

PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS. PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS Project Project Title Area of Abstract No Specialization 1. Software

More information

Malay A. Dalal Madhav Erraguntla Perakath Benjamin. Knowledge Based Systems, Inc. (KBSI) College Station, TX 77840, U.S.A.

Malay A. Dalal Madhav Erraguntla Perakath Benjamin. Knowledge Based Systems, Inc. (KBSI) College Station, TX 77840, U.S.A. AN INTRODUCTION TO USING PROSIM FOR BUSINESS PROCESS SIMULATION AND ANALYSIS Malay A. Dalal Madhav Erraguntla Perakath Benjamin Knowledge Based Systems, Inc. (KBSI) College Station, TX 77840, U.S.A. ABSTRACT

More information

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001

ICOM 6005 Database Management Systems Design. Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001 ICOM 6005 Database Management Systems Design Dr. Manuel Rodríguez Martínez Electrical and Computer Engineering Department Lecture 2 August 23, 2001 Readings Read Chapter 1 of text book ICOM 6005 Dr. Manuel

More information

USE PATTERN OF ELECTRONIC INFORMATION RESOURCES IN THE COLLEGE LIBRARIES IN KERALA: AN ANALYTICAL STUDY

USE PATTERN OF ELECTRONIC INFORMATION RESOURCES IN THE COLLEGE LIBRARIES IN KERALA: AN ANALYTICAL STUDY USE PATTERN OF ELECTRONIC INFORMATION RESOURCES IN THE COLLEGE LIBRARIES IN KERALA: AN ANALYTICAL STUDY Thesis submitted to the University of Calicut for the award of the Degree of DOCTER OF PHILOSOPHY

More information

Software design (Cont.)

Software design (Cont.) Package diagrams Architectural styles Software design (Cont.) Design modelling technique: Package Diagrams Package: A module containing any number of classes Packages can be nested arbitrarily E.g.: Java

More information

If you see "Skip installation of the current version and test the currently installed version of Java" then select that hyperlink.

If you see Skip installation of the current version and test the currently installed version of Java then select that hyperlink. Workstation, Browser, Java, Connections, Proxy Servers, & Firewall Information March 2, 2015 Contents I. Workstation and Browser Configurations A. Internet Explorer B. Mozilla Firefox C. Google Chrome

More information

Definition of Information Need

Definition of Information Need Part I Definition of Information Need CHAPTER 1 The Importance of Information Need Information need is the motivation people think and feel to seek information, but it is a complex concept that divides

More information

No Stress Tech Guide To Crystal Reports XI: For Beginners. By Dr. Indera E. Murphy

No Stress Tech Guide To Crystal Reports XI: For Beginners. By Dr. Indera E. Murphy No Stress Tech Guide To Crystal Reports XI: For Beginners By Dr. Indera E. Murphy Published By: Tolana Publishing PO Box 719 Teaneck, NJ 07666 USA Find us online at www.tolana.com Inquiries may be sent

More information

Email: justinjia@ust.hk Office: LSK 5045 Begin subject: [ISOM3360]...

Email: justinjia@ust.hk Office: LSK 5045 Begin subject: [ISOM3360]... Business Intelligence and Data Mining ISOM 3360: Spring 2015 Instructor Contact Office Hours Course Schedule and Classroom Course Webpage Jia Jia, ISOM Email: justinjia@ust.hk Office: LSK 5045 Begin subject:

More information

A person or organization with a legitimate interest in a given situation, action or enterprise Stakeholders are any groups, individuals, agencies, or

A person or organization with a legitimate interest in a given situation, action or enterprise Stakeholders are any groups, individuals, agencies, or A person or organization with a legitimate interest in a given situation, action or enterprise Stakeholders are any groups, individuals, agencies, or organizations that have common direction, clients or

More information

RARITAN VALLEY COMMUNITY COLLEGE ACADEMIC COURSE OUTLINE CISY 233 INTRODUCTION TO PHP

RARITAN VALLEY COMMUNITY COLLEGE ACADEMIC COURSE OUTLINE CISY 233 INTRODUCTION TO PHP RARITAN VALLEY COMMUNITY COLLEGE ACADEMIC COURSE OUTLINE CISY 233 INTRODUCTION TO PHP I. Basic Course Information A. Course Number and Title: CISY 233 Introduction to PHP B. New or Modified Course: Modified

More information

Responsive Web Design. vs. Mobile Web App: What s Best for Your Enterprise? A WhitePaper by RapidValue Solutions

Responsive Web Design. vs. Mobile Web App: What s Best for Your Enterprise? A WhitePaper by RapidValue Solutions Responsive Web Design vs. Mobile Web App: What s Best for Your Enterprise? A WhitePaper by RapidValue Solutions The New Design Trend: Build a Website; Enable Self-optimization Across All Mobile De vices

More information

Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project

Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project Bridging CAQDAS with text mining: Text analyst s toolbox for Big Data: Science in the Media Project Ahmet Suerdem Istanbul Bilgi University; LSE Methodology Dept. Science in the media project is funded

More information

Annotea and Semantic Web Supported Collaboration

Annotea and Semantic Web Supported Collaboration Annotea and Semantic Web Supported Collaboration Marja-Riitta Koivunen, Ph.D. Annotea project Abstract Like any other technology, the Semantic Web cannot succeed if the applications using it do not serve

More information

E-Business Technologies for the Future

E-Business Technologies for the Future E-Business Technologies for the Future Michael B. Spring Department of Information Science and Telecommunications University of Pittsburgh spring@imap.pitt.edu http://www.sis.pitt.edu/~spring Overview

More information

LASTLINE WHITEPAPER. Large-Scale Detection of Malicious Web Pages

LASTLINE WHITEPAPER. Large-Scale Detection of Malicious Web Pages LASTLINE WHITEPAPER Large-Scale Detection of Malicious Web Pages Abstract Malicious web pages that host drive-by-download exploits have become a popular means for compromising hosts on the Internet and,

More information

Integrated Library Systems (ILS) Glossary

Integrated Library Systems (ILS) Glossary Integrated Library Systems (ILS) Glossary Acquisitions Selecting, ordering and receiving new materials and maintaining accurate records. Authority files Lists of preferred headings in a library catalogue,

More information