QDquaderni. UP-DRES User Profiling for a Dynamic REcommendation System E. Messina, D. Toscani, F. Archetti. university of milano bicocca
|
|
- Violet Scott
- 8 years ago
- Views:
Transcription
1 A01 084/01
2
3 university of milano bicocca QDquaderni department of informatics, systems and communication UP-DRES User Profiling for a Dynamic REcommendation System E. Messina, D. Toscani, F. Archetti research report n. 1 march 2006
4 Copyright MMVI ARACNE editrice S.r.l. via Raffaele Garofalo, 133 A/B Roma (06) ISBN ISSN I diritti di traduzione, di memorizzazione elettronica, di riproduzione e di adattamento anche parziale, con qualsiasi mezzo, sono riservati per tutti i Paesi. Non sono assolutamente consentite le fotocopie senza il permesso scritto dell Editore. I edizione: aprile 2006
5 5 1. Introduction The Proposed System Content Extractor Taxonomy Builder Recommendation Manager Conclusions and Future Work...23 References...24
6
7 7 UP-DRES User Profiling for a Dynamic REcommendation System Enza Messina 1, Daniele Toscani 1,2, Francesco Archetti 1,2 1 DISCO, Università degli Studi di Milano Bicocca, Via Bicocca degli Arcimboldi, Milano, Italy messina@disco.unimib.it 2 Consorzio Milano Ricerche, Via Cicognara 7, Milano, Italy {archetti,toscani}@milanoricerche.it Abstract. The WWW is actually the most dynamic and attractive information exchange place. Finding useful information is hard due to huge data amount, varied topics and unstructured contents. In this paper we present a web browsing support system that proposes personalized contents. It is integrated in the content management system and it runs on the server hosting the site. It processes periodically site contents, extracting vectors of the most significant words. A topology tree is defined applying hierarchical clustering. During online browsing, viewed contents are processed and mapped in the vector space previously defined. The centroid of these vectors is compared with the topology tree nodes centroids to find the most similar; its contents are presented to the user as link suggestions or dynamically created pages. Personal profile is saved after every session and included in the analysis during same user s subsequent visits, avoiding the cold start problem. 1. Introduction Today s world is sometimes called the information society, to point out the growing importance that information is assuming. It is easy for everyone to consult knowledge sources and to publish them. Automatic systems help in this process, but they also generate a huge amount of monitoring and derived data. The practical effect is that, at a certain stage, people will be confronted with more information than they can effectively process: this situation is known as information overload [4] [17]. This means that part of that informa
8 8 Messina et al tion will be ignored, forgotten, distorted or otherwise lost. The web is the most evolving media and reflects these trends: finding information on it is becoming more and more difficult and time consuming. Users want to find useful and interesting contents during the navigation; on the other hand, portal administrators of e-commerce and services sites want to attract visitors. Every person perceives the definition of useful and interesting in a different way: this is the reason why systems that provide personalized suggestions based on user preferences, a.k.a. recommendation systems, are required. In order to derive models for representing web users and identifying their interests three different approaches may be found in the literature: collaborative filtering, content-based analysis, browsing behaviour modelling; this classification depends on the basis of the data source used. People interacting with collaborative filtering based systems have to actively express an interest, rating the contents they are viewing. This allows the system to give friendly suggestions (filter) based on the opinions of others users of the same service (from this the term collaborative ). In [12], for example, the authors proposed an filter which asks a small group of users to formulate queries in a special language, in order to determine the usefulness. Other collaborative filtering systems have been proposed in [18] and [25]. Even in these cases, an active and explicit participation from the user community is required: each user has to rate the content of Usenet news articles. A form of automation is introduced here by applying a k-nearest neighbour algorithm to find groups with similar interests. In [24] rating weights are defined to be proportional to the time spent viewing a page. In [31] the Usenet news posting are used to rate the liking of web sites, creating a list of the top endorsed sites. In a recent work, Sugiyama [30], user s profiles are derived from the choices made after a query submission to a search engine and from the contents of the pages selected from the query results. A modified collaborative filtering is then applied to a user-term matrix (instead of user-item matrix in classic collaborative filtering). Users term vectors are then clusterized to find homogeneous communities. Content based recommendation systems build a model of the web pages contents and compare it with the contents which are of interest for the user.
9 UP-DRES User Profiling for a Dynamic REcommendation System 9 Collaborative filtering is here implicit, in the sense that user s choices are helpful to state the relevance of similar items. The main techniques applied in this field can be grouped in clustering [3] [6], bayesian networks [6] and rulebased systems [27]. A content based approach to learn human interests automatically through a divisive hierarchical clustering algorithm has been proposed in [16]. Each page can be assigned to one or more nodes in the hierarchy, which is used for learning and predicting interests: the root is the user s general long-term interest and leaves represent short-term specific domains. In [28] information coming from multiple information resources is aggregated in order to create a recommendation list as reply to queries in which different query elements can be assigned by the user. An interesting application can be found in [13] where a system which presents links of interest in a box integrated into the Internet Explorer browser is presented. Here an ontology is built by clustering vectors of words extracted from web pages. In [23] the computer science ontology described in [22] is used for bootstrapping the current user s interests, in order to overcome the cold start problem arising when the user is unknown to the system. Documents viewed by the user are associated to a topic by using a variant of the nearest neighbour algorithm. Collaborative filtering is then performed on a user-topic matrix. In another system the content based approach is combined with collaborative filtering [1]. It ranks web pages through a topic filter and this information is reinforced by the user s feedbacks. Content personalized web pages present different information to different users and diverge from link personalization, which only adapts the link anchor structure and leaves unmodified the substantial information part. Early studies in [5] present the idea of a newspaper that allows for interactive personalization. In My Yahoo! [21] user s preferences are collected from explicit indication or semi-automated inference from navigation activity, asking the user to choose from general areas to more specific topics. The browsing behaviour modelling approach analyzes the interactions between the user and the web. Like in [35] web-server logs are used as data source to track user s browsing pattern into web sites. These logs, that are collected automatically from web server applications, provide information about activities performed by a user from the moment he/she enters a web site
10 10 Messina et al. to the moment he/she leaves it [8], including time spent viewing a page, and allow us to separate browsing sessions. Sessions clustering is useful to discover both groups of users, exhibiting similar browsing patterns, and groups of pages, with related contents (pages are clusterized on the basis of how often they appear together across navigation patterns). Algorithms for sessions clustering can be classified into two approaches: similarity-based and model-based (or probabilistic) [7]. Compared to similarity-based methods, which assign user to a cluster only on the basis of a given session similarity measure, model-based methods offer better interpretability: each model directly characterizes the corresponding cluster. Model-based clustering techniques have been widely used and have shown promising results in many applications involving web data [2] [33]. More specifically, in the model based approach the users sessions clusters are generated as follows: 1. A user arrives at the web site in a particular time and is assigned to a cluster with some probability. The number of clusters is determined by using several probabilistic methods, such as BIC (Bayesian Information Criterion), bayesian approximations, or bootstrap methods [11]. 2. The behaviour of each cluster is governed by a statistical model and the user s behaviour is generated from this model. Each cluster has a data-generating model with different components. Clusters are defined by learning the parameters of one or more (in the case of a mixture) probability distribution function, used to assign people to the various clusters, and the number of components. The number of components of the model can be determined by model selection techniques and parameters can be estimated using maximum likelihood algorithms, e.g. the EM (Expectation- Maximization) [9]. Other approaches that don t need user s active participation to the model creation are WebWatcher [14] and Letizia [19] [20], which extract information on users from their browsing behaviour. Some critics can be moved to the fact that they propose a persistent model and don t care about user s interest changes. For a complete review of the system based on implicit user participation see [15]. In this paper we propose a web profiling system particularly suitable for improving the services offered by dynamic web sites, whose contents are composed from a repository of documents related to different arguments. It
11 UP-DRES User Profiling for a Dynamic REcommendation System 11 combines the content based analysis with browsing behaviour modelling, in the sense that we follow the users during their visits and, on the basis of the contents that they are viewing, we identify their behaviour and consequently their interests. Sometimes people have to answer many questions about preferences or demographic data when they register to a web site. Profiles created in this way are generally static and have to be kept updated under the responsibility of the user. However, only few of them are willing to spend time doing seemingly useless operations, also if this will ensure a better personalization. The results are incomplete, unreliable profiles. The proposed approach does not require human interaction, because it extracts information about user preferences from the contents of the visited web pages. Another advantage of our system is that, being integrated in the content management application, it operates online, collecting the requests made by user without the need of web server logs data. In fact, log files ideally represent a good source of data to infer the browsing behaviour but practically, as stated in [2][33], they have to be cleaned and processed to reconstruct the users navigation sessions; this process can be very hard and sometimes impossible, due to technical reasons concerned mainly with privacy and security procedures that hide personal data. In addition, today the world wide web is migrating towards a dynamic structure, in which pages are not published in simple HTML format, but contains executable code and dynamic access to resources, and logs are losing the traditional function of lists of requested web pages, to become records of content management applications status, from which it is difficult to obtain useful information. The rest of the paper is organized as follows: the general architecture of the system is described in Section 2, where we introduce all of its modules: Content Extractor, responsible to manage documents and convert them in a machine-tractable form, Taxonomy Builder, that creates a document hierarchy based on topics, Recommendation Manager, which creates Sort and Long Term Profiles of users, on the basis of contents that they view. In sections 3 to 5 are given detailed descriptions of each of these modules. Finally, in Section 6 we present our conclusions and future work directions.
12 12 Messina et al. 2. The Proposed System In this section we present a synthesis of the architecture of the system that allows us to profile web users dynamically, in order to help them during the navigation process. In Fig. 1 we show the system s main modules: Content Extractor, Taxonomy Builder and Recommendation Manager. The activation of these modules and the data exchanges between them are governed by the super-module UP-DRES, which acts as a supervisor. Fig. 1. Overview of the system Some external elements take part in UP-DRES functioning. The Document Repository contains all the textual elements that can be used to compose the Web Site pages. The application that manages the Web Site is able to intercept the User s request and send them to the Document Repository, in order to select the documents to introduce to the UP-DRES system for the classification process.
13 UP-DRES User Profiling for a Dynamic REcommendation System 13 The system, through the Recommendation Manager module, combines the user s Short Term Profile (STP), obtained by analysing the user behaviour during the current session, with a Long Term Profile (LTP) built as a weighted sum of the previously constructed user s STPs. Typical web pages are composed of text, images, multimedia contents and applications stored in a file system area called Document Repository. Profiles are obtained by considering the contents of the pages visited by the user. They are used by the Recommendation Manager Module to decide, through a maximization matching procedure, which information to present next on the web site by choosing it from the currently available Document Repository. Contents shown on the web page should therefore automatically capture the visitor s preferences by using as indicator of interest the choices made by the user by clicking on a given page and the time spent visiting such page. The system runs on the server side, as a process integrated in the content management system which manages the web pages publication. In order to maximize the matching between the user s preferences expressed during the navigation pattern and the information currently available in the Document Repository, the Content Extractor browses periodically (offline) the Document Repository to take snapshots of the web site contents and it builds a matrix, which is its vector space representation, as described in Section 3. This matrix is then used as input by the Taxonomy Builder module to generate the Web Site Taxonomy, as explained in Section 4. As a visiting session starts, the sequence of pages viewed by the user are processed by the Content Extractor and a STP is dynamically updated at each click. The Recommendation Manager combines opportunely the STP with the Long Term Profile, as described in section 5. This profile combination produces as output a vector of terms which is classified according to the Web Site Taxonomy in order to find the taxonomy node whose contents best matches the user s browsing behaviour and his/her general interests. Recommendation is therefore made generating a selfadapting, personalized web site: contents of the matching class are presented to the user as link suggestion or composed dynamically in a web page. At the end of each session, the STP is integrated in the LTP, which synthesizes the user s browsing history which will be used in the next sessions to refine the recommendation process.
Understanding Web personalization with Web Usage Mining and its Application: Recommender System
Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,
More informationData Mining for Web Personalization
3 Data Mining for Web Personalization Bamshad Mobasher Center for Web Intelligence School of Computer Science, Telecommunication, and Information Systems DePaul University, Chicago, Illinois, USA mobasher@cs.depaul.edu
More informationIntroduction to Data Mining
Introduction to Data Mining 1 Why Data Mining? Explosive Growth of Data Data collection and data availability Automated data collection tools, Internet, smartphones, Major sources of abundant data Business:
More informationIntroduction. A. Bellaachia Page: 1
Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.
More informationWeb Mining using Artificial Ant Colonies : A Survey
Web Mining using Artificial Ant Colonies : A Survey Richa Gupta Department of Computer Science University of Delhi ABSTRACT : Web mining has been very crucial to any organization as it provides useful
More informationChallenges and Opportunities in Data Mining: Personalization
Challenges and Opportunities in Data Mining: Big Data, Predictive User Modeling, and Personalization Bamshad Mobasher School of Computing DePaul University, April 20, 2012 Google Trends: Data Mining vs.
More informationRecommendation Tool Using Collaborative Filtering
Recommendation Tool Using Collaborative Filtering Aditya Mandhare 1, Soniya Nemade 2, M.Kiruthika 3 Student, Computer Engineering Department, FCRIT, Vashi, India 1 Student, Computer Engineering Department,
More informationModel-Based Cluster Analysis for Web Users Sessions
Model-Based Cluster Analysis for Web Users Sessions George Pallis, Lefteris Angelis, and Athena Vakali Department of Informatics, Aristotle University of Thessaloniki, 54124, Thessaloniki, Greece gpallis@ccf.auth.gr
More informationA UPS Framework for Providing Privacy Protection in Personalized Web Search
A UPS Framework for Providing Privacy Protection in Personalized Web Search V. Sai kumar 1, P.N.V.S. Pavan Kumar 2 PG Scholar, Dept. of CSE, G Pulla Reddy Engineering College, Kurnool, Andhra Pradesh,
More informationQDquaderni. Designing Self Adaptive Service Oriented Applications G. Denaro, M. Pezzè, D. Tosi. university of milano bicocca
ARACNE university of milano bicocca QDquaderni department of informatics, systems and communication Designing Self Adaptive Service Oriented Applications G. Denaro, M. Pezzè, D. Tosi research report n.
More informationClustering Technique in Data Mining for Text Documents
Clustering Technique in Data Mining for Text Documents Ms.J.Sathya Priya Assistant Professor Dept Of Information Technology. Velammal Engineering College. Chennai. Ms.S.Priyadharshini Assistant Professor
More informationA Survey on Web Mining From Web Server Log
A Survey on Web Mining From Web Server Log Ripal Patel 1, Mr. Krunal Panchal 2, Mr. Dushyantsinh Rathod 3 1 M.E., 2,3 Assistant Professor, 1,2,3 computer Engineering Department, 1,2 L J Institute of Engineering
More informationSPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
More informationWeb Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it
Web Mining Margherita Berardi LACAM Dipartimento di Informatica Università degli Studi di Bari berardi@di.uniba.it Bari, 24 Aprile 2003 Overview Introduction Knowledge discovery from text (Web Content
More informationA Near Real-Time Personalization for ecommerce Platform Amit Rustagi arustagi@ebay.com
A Near Real-Time Personalization for ecommerce Platform Amit Rustagi arustagi@ebay.com Abstract. In today's competitive environment, you only have a few seconds to help site visitors understand that you
More informationK@ A collaborative platform for knowledge management
White Paper K@ A collaborative platform for knowledge management Quinary SpA www.quinary.com via Pietrasanta 14 20141 Milano Italia t +39 02 3090 1500 f +39 02 3090 1501 Copyright 2004 Quinary SpA Index
More informationairport urbanism airports, landscapes and cities
airport urbanism airports, landscapes and cities 01 Direttore Laura Cipriani Università degli Studi di Trento Comitato scientifico Bernardo Secchi Università Iuav di Venezia Giovanni Corbellini Università
More informationChapter 12: Web Usage Mining
Chapter 12: Web Usage Mining By Bamshad Mobasher With the continued growth and proliferation of e-commerce, Web services, and Web-based information systems, the volumes of clickstream and user data collected
More informationLDAP andUsers Profile - A Quick Comparison
Using LDAP in a Filtering Service for a Digital Library João Ferreira (**) José Luis Borbinha (*) INESC Instituto de Enghenharia de Sistemas e Computatores José Delgado (*) INESC Instituto de Enghenharia
More informationHow To Make Sense Of Data With Altilia
HOW TO MAKE SENSE OF BIG DATA TO BETTER DRIVE BUSINESS PROCESSES, IMPROVE DECISION-MAKING, AND SUCCESSFULLY COMPETE IN TODAY S MARKETS. ALTILIA turns Big Data into Smart Data and enables businesses to
More informationAdvances in Natural and Applied Sciences
AENSI Journals Advances in Natural and Applied Sciences ISSN:1995-0772 EISSN: 1998-1090 Journal home page: www.aensiweb.com/anas Privacy Protection in Personalized Web Search 1 M. Abinaya and 2 D. Vijay
More informationRecommender Systems: Content-based, Knowledge-based, Hybrid. Radek Pelánek
Recommender Systems: Content-based, Knowledge-based, Hybrid Radek Pelánek 2015 Today lecture, basic principles: content-based knowledge-based hybrid, choice of approach,... critiquing, explanations,...
More informationWeb analytics: Data Collected via the Internet
Database Marketing Fall 2016 Web analytics (incl real-time data) Collaborative filtering Facebook advertising Mobile marketing Slide set 8 1 Web analytics: Data Collected via the Internet Customers can
More informationLDA Based Security in Personalized Web Search
LDA Based Security in Personalized Web Search R. Dhivya 1 / PG Scholar, B. Vinodhini 2 /Assistant Professor, S. Karthik 3 /Prof & Dean Department of Computer Science & Engineering SNS College of Technology
More informationCLASSIFICATION AND CLUSTERING METHODS IN THE DECREASING OF THE INTERNET COGNITIVE LOAD
Acta Electrotechnica et Informatica No. 2, Vol. 6, 2006 1 CLASSIFICATION AND CLUSTERING METHODS IN THE DECREASING OF THE INTERNET COGNITIVE LOAD Kristína MACHOVÁ, Ivan KLIMKO Department of Cybernetics
More informationContent-Based Recommendation
Content-Based Recommendation Content-based? Item descriptions to identify items that are of particular interest to the user Example Example Comparing with Noncontent based Items User-based CF Searches
More informationAutomatic Timeline Construction For Computer Forensics Purposes
Automatic Timeline Construction For Computer Forensics Purposes Yoan Chabot, Aurélie Bertaux, Christophe Nicolle and Tahar Kechadi CheckSem Team, Laboratoire Le2i, UMR CNRS 6306 Faculté des sciences Mirande,
More informationKnowledge Pump: Community-centered Collaborative Filtering
Knowledge Pump: Community-centered Collaborative Filtering Natalie Glance, Damián Arregui and Manfred Dardenne Xerox Research Centre Europe, Grenoble Laboratory October 7, 1997 Abstract This article proposes
More informationDESIGNING AND MINING WEB APPLICATIONS: A CONCEPTUAL MODELING APPROACH
DESIGNING AND MINING WEB APPLICATIONS: A CONCEPTUAL MODELING APPROACH Rosa Meo Dipartimento di Informatica, Università di Torino Corso Svizzera, 185-10149 - Torino - Italy E-mail: meo@di.unito.it Tel.:
More informationOptimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2
Optimization of Search Results with Duplicate Page Elimination using Usage Data A. K. Sharma 1, Neelam Duhan 2 1, 2 Department of Computer Engineering, YMCA University of Science & Technology, Faridabad,
More informationA SURVEY ON WEB MINING TOOLS
IMPACT: International Journal of Research in Engineering & Technology (IMPACT: IJRET) ISSN(E): 2321-8843; ISSN(P): 2347-4599 Vol. 3, Issue 10, Oct 2015, 27-34 Impact Journals A SURVEY ON WEB MINING TOOLS
More informationNumerical Algorithms Group
Title: Summary: Using the Component Approach to Craft Customized Data Mining Solutions One definition of data mining is the non-trivial extraction of implicit, previously unknown and potentially useful
More informationSemantic Search in Portals using Ontologies
Semantic Search in Portals using Ontologies Wallace Anacleto Pinheiro Ana Maria de C. Moura Military Institute of Engineering - IME/RJ Department of Computer Engineering - Rio de Janeiro - Brazil [awallace,anamoura]@de9.ime.eb.br
More informationSome Research Challenges for Big Data Analytics of Intelligent Security
Some Research Challenges for Big Data Analytics of Intelligent Security Yuh-Jong Hu hu at cs.nccu.edu.tw Emerging Network Technology (ENT) Lab. Department of Computer Science National Chengchi University,
More informationInformation Retrieval and Web Search Engines
Information Retrieval and Web Search Engines Lecture 7: Document Clustering December 10 th, 2013 Wolf-Tilo Balke and Kinda El Maarry Institut für Informationssysteme Technische Universität Braunschweig
More informationCURRICULUM VITAE. Ilaria.giordani@disco.unimib.it. Phd in computer science
CURRICULUM VITAE PERSONAL INFORMATION Name Address Giordani Ilaria Via Volturno 13 22063 Cantù (Co) Mobile phone number (+ 39) 333.8725026 Phone number (+ 39) 031.712957 E-mail Ilaria.giordani@disco.unimib.it
More informationWeb Usage Mining. from Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, Springer Chapter written by Bamshad Mobasher
Web Usage Mining from Bing Liu. Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data, Springer Chapter written by Bamshad Mobasher Many slides are from a tutorial given by B. Berendt, B. Mobasher,
More informationExperiments in Web Page Classification for Semantic Web
Experiments in Web Page Classification for Semantic Web Asad Satti, Nick Cercone, Vlado Kešelj Faculty of Computer Science, Dalhousie University E-mail: {rashid,nick,vlado}@cs.dal.ca Abstract We address
More informationSpecialty Answering Service. All rights reserved.
0 Contents 1 Introduction... 2 1.1 Types of Dialog Systems... 2 2 Dialog Systems in Contact Centers... 4 2.1 Automated Call Centers... 4 3 History... 3 4 Designing Interactive Dialogs with Structured Data...
More informationPersonalization of Web Search With Protected Privacy
Personalization of Web Search With Protected Privacy S.S DIVYA, R.RUBINI,P.EZHIL Final year, Information Technology,KarpagaVinayaga College Engineering and Technology, Kanchipuram [D.t] Final year, Information
More informationSanjeev Kumar. contribute
RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 sanjeevk@iasri.res.in 1. Introduction The field of data mining and knowledgee discovery is emerging as a
More informationOLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP
Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key
More informationA Study of Web Log Analysis Using Clustering Techniques
A Study of Web Log Analysis Using Clustering Techniques Hemanshu Rana 1, Mayank Patel 2 Assistant Professor, Dept of CSE, M.G Institute of Technical Education, Gujarat India 1 Assistant Professor, Dept
More informationPSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS.
PSG College of Technology, Coimbatore-641 004 Department of Computer & Information Sciences BSc (CT) G1 & G2 Sixth Semester PROJECT DETAILS Project Project Title Area of Abstract No Specialization 1. Software
More informationEnterpriseLink Benefits
EnterpriseLink Benefits GGY AXIS 5001 Yonge Street Suite 1300 Toronto, ON M2N 6P6 Phone: 416-250-6777 Toll free: 1-877-GGY-AXIS Fax: 416-250-6776 Email: axis@ggy.com Web: www.ggy.com Table of Contents
More informationER/Studio Enterprise Portal 1.0.2 User Guide
ER/Studio Enterprise Portal 1.0.2 User Guide Copyright 1994-2008 Embarcadero Technologies, Inc. Embarcadero Technologies, Inc. 100 California Street, 12th Floor San Francisco, CA 94111 U.S.A. All rights
More information131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
More informationBayesian networks - Time-series models - Apache Spark & Scala
Bayesian networks - Time-series models - Apache Spark & Scala Dr John Sandiford, CTO Bayes Server Data Science London Meetup - November 2014 1 Contents Introduction Bayesian networks Latent variables Anomaly
More informationRecommending Web Pages using Item-based Collaborative Filtering Approaches
Recommending Web Pages using Item-based Collaborative Filtering Approaches Sara Cadegnani 1, Francesco Guerra 1, Sergio Ilarri 2, María del Carmen Rodríguez-Hernández 2, Raquel Trillo-Lado 2, and Yannis
More informationElsa C. Augustenborg Gary R. Danielson Andrew E. Beck
Elsa C. Augustenborg Gary R. Danielson Andrew E. Beck Pacific Northwest National Laboratory PNNL-SA-75867 Overview Technical challenges Institutional challenges Architectural approach Examples: Promising
More informationSearch Taxonomy. Web Search. Search Engine Optimization. Information Retrieval
Information Retrieval INFO 4300 / CS 4300! Retrieval models Older models» Boolean retrieval» Vector Space model Probabilistic Models» BM25» Language models Web search» Learning to Rank Search Taxonomy!
More informationitesla Project Innovative Tools for Electrical System Security within Large Areas
itesla Project Innovative Tools for Electrical System Security within Large Areas Samir ISSAD RTE France samir.issad@rte-france.com PSCC 2014 Panel Session 22/08/2014 Advanced data-driven modeling techniques
More informationWEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS
WEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS Biswajit Biswal Oracle Corporation biswajit.biswal@oracle.com ABSTRACT With the World Wide Web (www) s ubiquity increase and the rapid development
More informationHELP DESK SYSTEMS. Using CaseBased Reasoning
HELP DESK SYSTEMS Using CaseBased Reasoning Topics Covered Today What is Help-Desk? Components of HelpDesk Systems Types Of HelpDesk Systems Used Need for CBR in HelpDesk Systems GE Helpdesk using ReMind
More informationInternational Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
More informationData Mining for Fun and Profit
Data Mining for Fun and Profit Data mining is the extraction of implicit, previously unknown, and potentially useful information from data. - Ian H. Witten, Data Mining: Practical Machine Learning Tools
More informationUsing reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management
Using reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management Paper Jean-Louis Amat Abstract One of the main issues of operators
More informationKey words: web usage mining, clustering, e-marketing and e-business, business intelligence; hybrid soft computing.
Volume 5, Issue 3, March 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue:
More informationANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION
ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION K.Vinodkumar 1, Kathiresan.V 2, Divya.K 3 1 MPhil scholar, RVS College of Arts and Science, Coimbatore, India. 2 HOD, Dr.SNS
More informationData Mining Cluster Analysis: Basic Concepts and Algorithms. Lecture Notes for Chapter 8. Introduction to Data Mining
Data Mining Cluster Analysis: Basic Concepts and Algorithms Lecture Notes for Chapter 8 by Tan, Steinbach, Kumar 1 What is Cluster Analysis? Finding groups of objects such that the objects in a group will
More informationClustering. Data Mining. Abraham Otero. Data Mining. Agenda
Clustering 1/46 Agenda Introduction Distance K-nearest neighbors Hierarchical clustering Quick reference 2/46 1 Introduction It seems logical that in a new situation we should act in a similar way as in
More informationBisecting K-Means for Clustering Web Log data
Bisecting K-Means for Clustering Web Log data Ruchika R. Patil Department of Computer Technology YCCE Nagpur, India Amreen Khan Department of Computer Technology YCCE Nagpur, India ABSTRACT Web usage mining
More informationDistributed Computing and Big Data: Hadoop and MapReduce
Distributed Computing and Big Data: Hadoop and MapReduce Bill Keenan, Director Terry Heinze, Architect Thomson Reuters Research & Development Agenda R&D Overview Hadoop and MapReduce Overview Use Case:
More informationReasoning Component Architecture
Architecture of a Spam Filter Application By Avi Pfeffer A spam filter consists of two components. In this article, based on my book Practical Probabilistic Programming, first describe the architecture
More informationAutomatic Document Categorization A Hummingbird White Paper
Automatic Document Categorization A Hummingbird White Paper Automatic Document Categorization While every attempt has been made to ensure the accuracy and completeness of the information in this document,
More informationAcquisition of User Profile for Domain Specific Personalized Access 1
Acquisition of User Profile for Domain Specific Personalized Access 1 Plaban Kumar Bhowmick, Samiran Sarkar, Sudeshna Sarkar, Anupam Basu Department of Computer Science & Engineering, Indian Institute
More informationUser Behavior Analysis Based On Predictive Recommendation System for E-Learning Portal
Abstract ISSN: 2348 9510 User Behavior Analysis Based On Predictive Recommendation System for E-Learning Portal Toshi Sharma Department of CSE Truba College of Engineering & Technology Indore, India toshishm.25@gmail.com
More informationA Generic business rules validation system for ORACLE Applications
A Generic business rules validation system for ORACLE Applications Olivier Francis MARTIN System analyst European Laboratory for Particle Physics - CERN / AS-DB Geneva - SWITZERLAND Jean Francois PERRIN
More informationOpen issues and research trends in Content-based Image Retrieval
Open issues and research trends in Content-based Image Retrieval Raimondo Schettini DISCo Universita di Milano Bicocca schettini@disco.unimib.it www.disco.unimib.it/schettini/ IEEE Signal Processing Society
More informationRequirements for Context-dependent Mobile Access to Information Services
Requirements for Context-dependent Mobile Access to Information Services Augusto Celentano Università Ca Foscari di Venezia Fabio Schreiber, Letizia Tanca Politecnico di Milano MIS 2004, College Park,
More informationSocial Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
More informationDatabase Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
More informationThe Need for Training in Big Data: Experiences and Case Studies
The Need for Training in Big Data: Experiences and Case Studies Guy Lebanon Amazon Background and Disclaimer All opinions are mine; other perspectives are legitimate. Based on my experience as a professor
More informationINTERNET MARKETING. SEO Course Syllabus Modules includes: COURSE BROCHURE
AWA offers a wide-ranging yet comprehensive overview into the world of Internet Marketing and Social Networking, examining the most effective methods for utilizing the power of the internet to conduct
More informationA Framework of Personalized Intelligent Document and Information Management System
A Framework of Personalized Intelligent and Information Management System Xien Fan Department of Computer Science, College of Staten Island, City University of New York, Staten Island, NY 10314, USA Fang
More informationChapter-1 : Introduction 1 CHAPTER - 1. Introduction
Chapter-1 : Introduction 1 CHAPTER - 1 Introduction This thesis presents design of a new Model of the Meta-Search Engine for getting optimized search results. The focus is on new dimension of internet
More informationA Business Process Services Portal
A Business Process Services Portal IBM Research Report RZ 3782 Cédric Favre 1, Zohar Feldman 3, Beat Gfeller 1, Thomas Gschwind 1, Jana Koehler 1, Jochen M. Küster 1, Oleksandr Maistrenko 1, Alexandru
More informationRemoving Web Spam Links from Search Engine Results
Removing Web Spam Links from Search Engine Results Manuel EGELE pizzaman@iseclab.org, 1 Overview Search Engine Optimization and definition of web spam Motivation Approach Inferring importance of features
More informationAn Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
More informationJournal of Global Research in Computer Science RESEARCH SUPPORT SYSTEMS AS AN EFFECTIVE WEB BASED INFORMATION SYSTEM
Volume 2, No. 5, May 2011 Journal of Global Research in Computer Science REVIEW ARTICLE Available Online at www.jgrcs.info RESEARCH SUPPORT SYSTEMS AS AN EFFECTIVE WEB BASED INFORMATION SYSTEM Sheilini
More informationOn Implicitly Discovered OLAP Schema-Specific Preferences in Reporting Tool
This work has been supported by ESF project No. 009/06/DP/...0/09/APIA/VIAA/0 On Implicitly Discovered OLAP Schema-Specific Preferences in Reporting Tool Natalija Kozmina and Darja Solodovnikova Faculty
More informationData Mining System, Functionalities and Applications: A Radical Review
Data Mining System, Functionalities and Applications: A Radical Review Dr. Poonam Chaudhary System Programmer, Kurukshetra University, Kurukshetra Abstract: Data Mining is the process of locating potentially
More informationSEMANTIC VIDEO ANNOTATION IN E-LEARNING FRAMEWORK
SEMANTIC VIDEO ANNOTATION IN E-LEARNING FRAMEWORK Antonella Carbonaro, Rodolfo Ferrini Department of Computer Science University of Bologna Mura Anteo Zamboni 7, I-40127 Bologna, Italy Tel.: +39 0547 338830
More informationWeb Advertising Personalization using Web Content Mining and Web Usage Mining Combination
8 Web Advertising Personalization using Web Content Mining and Web Usage Mining Combination Ketul B. Patel 1, Dr. A.R. Patel 2, Natvar S. Patel 3 1 Research Scholar, Hemchandracharya North Gujarat University,
More informationCo-Creation of Models and Metamodels for Enterprise. Architecture Projects.
Co-Creation of Models and Metamodels for Enterprise Architecture Projects Paola Gómez pa.gomez398@uniandes.edu.co Hector Florez ha.florez39@uniandes.edu.co ABSTRACT The linguistic conformance and the ontological
More informationVisualizing e-government Portal and Its Performance in WEBVS
Visualizing e-government Portal and Its Performance in WEBVS Ho Si Meng, Simon Fong Department of Computer and Information Science University of Macau, Macau SAR ccfong@umac.mo Abstract An e-government
More informationTowards Virtual Course Evaluation Using Web Intelligence
Towards Virtual Course Evaluation Using Web Intelligence M.E. Zorrilla 1, D. Marín 1, and E. Álvarez 2 1 Department of Mathematics, Statistics and Computation, University of Cantabria. Avda. de los Castros
More informationWeb Personalization Based on Static Information and Dynamic User Behavior
Web Personalization Based on Static Information and Dynamic User Behavior Massimiliano Albanese malbanes@unina.it Antonio Picariello picus@unina.it Dipartimento di Informatica e Sistemistica Università
More informationEmail Spam Detection Using Customized SimHash Function
International Journal of Research Studies in Computer Science and Engineering (IJRSCSE) Volume 1, Issue 8, December 2014, PP 35-40 ISSN 2349-4840 (Print) & ISSN 2349-4859 (Online) www.arcjournals.org Email
More informationMUSIC RECOMMENDER SYSTEM
POLITECNICO DI MILANO Facoltà di Ingegneria dell Informazione POLO REGIONALE DI COMO Master of Science in Computer Engineering MUSIC RECOMMENDER SYSTEM Supervisor: Prof. Sara Comai Master Graduation Thesis
More informationA Comparative Approach to Search Engine Ranking Strategies
26 A Comparative Approach to Search Engine Ranking Strategies Dharminder Singh 1, Ashwani Sethi 2 Guru Gobind Singh Collage of Engineering & Technology Guru Kashi University Talwandi Sabo, Bathinda, Punjab
More informationMonitoring Web Browsing Habits of User Using Web Log Analysis and Role-Based Web Accessing Control. Phudinan Singkhamfu, Parinya Suwanasrikham
Monitoring Web Browsing Habits of User Using Web Log Analysis and Role-Based Web Accessing Control Phudinan Singkhamfu, Parinya Suwanasrikham Chiang Mai University, Thailand 0659 The Asian Conference on
More informationAutomated Collaborative Filtering Applications for Online Recruitment Services
Automated Collaborative Filtering Applications for Online Recruitment Services Rachael Rafter, Keith Bradley, Barry Smyth Smart Media Institute, Department of Computer Science, University College Dublin,
More informationInner Classification of Clusters for Online News
Inner Classification of Clusters for Online News Harmandeep Kaur 1, Sheenam Malhotra 2 1 (Computer Science and Engineering Department, Shri Guru Granth Sahib World University Fatehgarh Sahib) 2 (Assistant
More informationA QoS-Aware Web Service Selection Based on Clustering
International Journal of Scientific and Research Publications, Volume 4, Issue 2, February 2014 1 A QoS-Aware Web Service Selection Based on Clustering R.Karthiban PG scholar, Computer Science and Engineering,
More informationDistributed Database for Environmental Data Integration
Distributed Database for Environmental Data Integration A. Amato', V. Di Lecce2, and V. Piuri 3 II Engineering Faculty of Politecnico di Bari - Italy 2 DIASS, Politecnico di Bari, Italy 3Dept Information
More informationK-means Clustering Technique on Search Engine Dataset using Data Mining Tool
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 6 (2013), pp. 505-510 International Research Publications House http://www. irphouse.com /ijict.htm K-means
More informationText Mining for Health Care and Medicine. Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk
Text Mining for Health Care and Medicine Sophia Ananiadou Director National Centre for Text Mining www.nactem.ac.uk The Need for Text Mining MEDLINE 2005: ~14M 2009: ~18M Overwhelming information in textual,
More informationComputational Advertising Andrei Broder Yahoo! Research. SCECR, May 30, 2009
Computational Advertising Andrei Broder Yahoo! Research SCECR, May 30, 2009 Disclaimers This talk presents the opinions of the author. It does not necessarily reflect the views of Yahoo! Inc or any other
More informationBig Data: Rethinking Text Visualization
Big Data: Rethinking Text Visualization Dr. Anton Heijs anton.heijs@treparel.com Treparel April 8, 2013 Abstract In this white paper we discuss text visualization approaches and how these are important
More information