Association rules for improving website effectiveness: case analysis
|
|
|
- Frederick Maxwell
- 10 years ago
- Views:
Transcription
1 Association rules for improving website effectiveness: case analysis Maja Dimitrijević, The Higher Technical School of Professional Studies, Novi Sad, Serbia, Tanja Krunić, The Higher Technical School of Professional Studies, Novi Sad, Serbia, Abstract Association rule mining of the web usage log files can be used to extract patterns of a website visitors behavior. This knowledge can then be utilized to enhance web marketing strategies or improve the web browsing experience. In this paper we apply association rule mining on the web usage log file of an educational institution. We use confidence and lift as the association rule interestingness measures and compare their values in two different time periods. We show how this comparison brings additional information about association rules discovered and helps a webmaster make more informed decisions about the website enhancements. Keywords: web usage mining, association rules, interestingness measures, e-business Introduction With the intense growth of e-commerce in recent years, the use of intelligent data management strategies has become essential in the e-business community. The knowledge about client behavior may potentially be utilized to maximize on-line application effectiveness, increase client satisfaction, and competitiveness. Association rule mining is a data mining method originally invented to extract patterns from transactional databases. Stated simply, an association rule is an implication in the form X Y, where X and Y are sets of items. Association rule mining identifies all such implications existing in a given data set, which satisfy certain constraints, such as minimal support and minimal confidence (Agrawal, Imielinski, & Swami, 1993). An interestingness measure is used to assign a value to each association rule in order to distinguish those rules that are potentially most interesting to a data analyst in a given domain. It usually measures how tightly the itemsets X and Y are correlated. In addition to confidence, which is basically the conditional probability of Y given X, many other statistical measures have been used to determine rule interestingness (Geng & Hamilton, 2006; Tan & Srivastava 2004). Association rule mining can be used to mine web usage log files and extract patterns about website visitor behavior. In this context a web resource is considered an item, while a website visitor session is considered a transaction of items (Anand et al., 2004; Kosala & Blockeel, 2000). 56
2 In this research we mine association rules from the web usage log files of a website of an educational institution. We propose to consider the changes of confidence levels of the association rules found in two different time periods. We discuss how useful the results are for a webmaster in order to make an action to increase the website effectiveness. The rest of the paper is organized as follows. The Related Work section discusses related issues and approaches to web usage association rule mining. The section Discovering Association Rules explains our research method applied to a real life dataset. The section Case discussion from the webmaster s perspective shows a sample of web usage association rules found in our data and the related comments from a webmaster s perspective. Our conclusions are given in the Conclusion section. Related Work Typically association rule algorithms generate a huge number of rules, not all of which are truly interesting to a data analyst. Various methods have been proposed to prune the set of generated rules and discard irrelevant rules during the rule generation phase (Sahaaya & Malarvizhi, 2010). Balcazar (2010) presented a formal approach to determining a rule set basis and thus eliminate redundant rules. Although various pruning methods can significantly reduce the rule set size, it typically still remains huge. An important issue in association rule mining is selecting the right interestingness measures (Huang & Xiangji, 2007). A recent study (Kannan & Bhaskaran, 2009) analyzes the distribution of rule clusters over various interestingness measures. A discussion of the limitations of association rule mining is given in a study by Webb (2011). These include the over-generation of rules, many of which are false discoveries, discovery of rules that are hard to understand, and limitations imposed by the minimal support constraint. The authors propose an approach called filtered top-k association discover that alleviates some of these problems. Association rule mining in the context of web usage data suffers from additional issues. Namely, web usage data is specific in the sense that it contains a large number of tightly correlated items (web resources or web pages) due to the link structure of a website. Web pages that are tightly linked together often co-occur in the same session,, is why the generated set of association rules contains a high number of so-called hard association rules that have very high confidence, but are not truly interesting to the user. To alleviate this problem, additional rule pruning methods need to be applied (Wang, 2005; Dimitrijevic & Bosnjak 2010). A line of research deals with finding additional web usage association rules that may be missed during the classical association rule mining. For example, an approach that discovers association rules between web pages that rarely occur together, but with other pages, where they occur frequently, is taken by Kazienko (2009). This method is used to add new, so called transitive association rules to the typical association rule set, which brings new potentially useful information to a webmaster. In this study, we propose to consider the changes of the association rule interestingness values in two different time periods. We discuss how this brings additional knowledge about the interestingness of web usage association rules to a webmaster. We apply a simple pruning 57
3 strategy in order to alleviate the problem of over-generation of not truly interesting rules. We conduct an empirical study on the website of an educational institution and present the results. Data set Discovering association rules For the purpose of this research, we have conducted an empirical study on the website of an educational institution. The web usage log file mined contains information about all web requests made in November The raw web usage log file contains 649,749 web requests. We also used the results of the web usage log mining of the same educational institution website that contained all web requests made to the same website in the same month of 2010 (Dimitrijevic & Bosnjak, 2011). The raw web usage log file contained 397,741 web requests. Research Method Our web usage log file mining process consists of four phases. Phase 1: Preprocessing The purpose of the first phase is to clean the web usage log file and eliminate irrelevant entries. The log file is preprocessed using WumPrep tool (Dettmar, 2004) to exclude irrelevant entries and to group all entries into visitor sessions. We refer to a session as a set of web resources requested during a website visit. When a website visitor browses through a website and then makes a pause and returns, her/his visit may be considered as one or two sessions. In this work we use the definition of a session as the set of web resources requested from the same IP address where the time between two consecutive requests does not exceed 5 minutes (Anand et al., 2004; Dettmar, 2004). The resulting preprocessed files contain 16,637 visitor sessions made in November 2010 and 36,354 visitor sessions made in November Phase 2: Association rule mining We use our software (Dimitrijevic & Bosnjak, 2011) to mine association rules from the web usage log file. For the purpose of this study, we focus on the so called short rules that contain one url on each side of the rule. Those rules are easier to understand by the webmaster and more likely to cause a webmaster to act and change a website structure. In order to select the most interesting association rules, we combine the confidence and lift interestingness measures. We sort the association rule set according to lift, while using a minimum confidence threshold to prune the non-interesting rules. We set the minimum support threshold value to 0.02 and the minimum confidence threshold value to 0.2. Phase 3: Pruning association rule set In order to minimize the number of hard association rules in our rule set, which result from the inter-connectedness of web pages through the website link structure and are not truly interesting to a webmaster, we apply a simple pruning methodology adopted from (Dimitrijevic & Bosnjak, 2011). All rules that contain directly linked pages are pruned out of the rule set. The term 58
4 directly linked is limited to the links in the body text of the page only and excludes the links through main menus. Phase 4: Joining the rule sets We join the rule sets in order to compare the rules found in November 2010 and in November 2012 and highlight confidence changes in the rules that appear in both sets. The file containing summarized information about rules from both rule sets and their confidence level changes is presented to the webmaster for further analysis and comments. Dynamic Confidence One of the main goals of this study was to investigate how the confidence of the discovered rules changes over time, and how that affects a webmaster when making decisions based on the knowledge contained in those rules. Therefore, we compared the rule sets generated in November 2010 and in November 2012 from the web usage log file of the educational institution. The website of the institution has undergone minimal changes within this period. Some pages have been added, but most of the old pages still exist, and their content had not changed. Hence most rules found in the old rule set were still there in the new set giving us the opportunity to compare the rules and their confidence levels. We joined the rule sets from both periods and highlighted the changes in the confidence levels in both sets. We selected 18 rules found in November 2010 that also appeared in the rule set generated in November We then presented the results to the webmaster in order to find out how useful the information about the changes in the confidence would be in deciding upon possible actions to improve the website structure. The changes in the confidence level of the association rules may reflect the changes in the behavior of the website visitors. The reasons for the confidence level change may be: Certain events in the institution may affect interests of the website visitors. For example, the exam periods or the time of the enrolment may affect the students interests and the way they browse the site. Changes in the website structure may affect the visitor browsing patterns. This is minimized in our study since the website had undergone only minor changes. Confidence change distribution We categorized the rules into three categories: static, intermediate and dynamic. Rules are considered static if the confidence changed by no more than 0.03, dynamic if it changed by more than 0.1 and intermediate otherwise. Chart 1 shows the numbers of association rules across the categories of their confidence level changes. In our sample there were 18 rules in total. Out of those, 4 rules were static, 3 were dynamic, and 11 were in the intermediate category. Rules found in November 2010 were also categorized according to how probable it would be that a webmaster makes an action (changes the website structure) based on the knowledge presented by the rule. The rules were classified into three categories: action expected, intermediate and action not-expected. 59
5 When the webmaster was presented with the changes of the rule confidence levels based on the rules found in November 2012, we asked him/her to re-evaluate the decisions about adding/removing links on the website based on the new information. The webmaster indicated that based on the changes in the confidence level in five out of 18 rules he/she would change his/her previous decision about adding/removing links, which is shown in Chart 2. Static Dynamic Intermediate Chart 1: Dynamic vs. Static rules The webmaster discussed that the new information about the confidence level changes was useful for all sample rules presented to him/her. It helped discriminate between the pages that are browsed in a rather consistent way and those that are browsed differently in the two time periods examined. The consistent rules confirmed the decision to create hard static links between the pages, while the changing rules pointed out pages that are more affected by the changes in the student browsing patterns. Decision change Decision confirmed Chart 2: Rules causing webmaster s decision to change 60
6 Case discussion from the webmaster s perspective In this paragraph, we present a sample discussion about selected rules from the webmaster s perspective. Table 1 shows an example of two rules where the confidence levels decrease. The Annual_exam_schedule page contains information about the annual exam schedule, and the Current_exam_schedule page contains the schedule of exams in the current period. The first rule fell into the dynamic category, and the second rule fell into the intermediate category. Page A Page B Confidence 2010 Confidence 2012 Annual_exam_schedule Current_exam_schedule Current_exam_schedule Annual_exam_schedule Table 1. Based on the knowledge that students who visited the Annual_exam_schedule page also visited the Current_exam_schedule page in November 2010 with the confidence of 0.61, the webmaster had made the decision to add a new link from the Annual_exam_schedule page to the Current_exam_schedule page. However, finding that in November 2012 the confidence decreased significantly, the webmaster decided that the link should not be added. Further, the webmaster decided to watch the association between the two pages in future. On the other hand, the opposite link from the Current_exam_schedule page to the Annual_exam_schedule page had not been added earlier, and the drop in the confidence level of the second rule confirmed the earlier decision of the webmaster. Table 2 contains two rules between the pages related to enrollment. The General_terms page contains information about general terms of enrollment, and the Undergrad_enrollment page contains some more specific information about enrolling to undergraduate studies. The first rule fell into the static category, and the second into the intermediate category. Page A Page B Confidence 2010 Confidence 2012 Undergrad_enrollment General_terms General_terms Undergrad_enrollment Table 2. The confidence for visiting the General_terms page if the Undergrad_enrollment page is visited is high and consistent in November 2010 and November The webmaster decided that a direct link from the page Undergrad_enrollment to the page General_terms should be added. In the opposite case the confidence for visiting the Undergrad_enrollment page if the General_terms page was visited was somewhat lower. However, since the content of two pages is related, the webmaster had considered adding a link from the Undergrad_enrollment page to the General_terms page in 2010, considering 0.37 a high enough confidence value. The drop in 61
7 the confidence in the year 2012 however, made the webmaster sure that the link would not be needed. Conclusion Association rule mining of web usage log files is a method that can bring new, previously unknown knowledge about the website visitor behavior. In this paper we presented how it can be used by a webmaster to improve a website structure. We have conducted an empirical study on a website of an educational institution. Our web usage mining process consisted of four phases: pre-processing, association rule discovery, pruning, and comparing confidence levels of the rules between two time periods. The interestingness measures assigned to the association rules combined with the pruning techniques proved efficient in highlighting the rules that are easy to understand by a webmaster and prompt them to an action. The levels of changes of the rule confidence over time brought additional information to the webmaster helping them decide upon actions that may further improve the website structure. We leave studies employing more sophisticated pruning techniques, which might exploit in more detail the interconnectedness of the web pages, as well as applying other association rule interestingness measures for future work. References Agrawal, R., Imielinski, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. Proceedings of the ACM SIGMOD International Conference on Management of Data, [Location?], pp Anand, S. S., Mulvenna, M., & Chavielier, K. (2004). On the deployment of web usage mining' in web mining: From web to semantic web, pp , Editors: Bettina Berendt, Andreas Hotho, Dunja Mladenic, Maarten van Someren, Myra Spiliopoulou, Gerd Stumme ( ), Berlin: Springer. Balcázar J. L. (2010). Redundancy, reduction schemes, and minimum-size bases for association rules, logical methods. Computer Science, 6(2:3), Dettmar G. (2004). Logfile preprocessing using WUMprep. Talk given at the Web Mining Seminar in Winter semester 2003/04, School of Business and Economics, Humboldt University Berlin, Berlin. Dimitrijevic M., & Bosnjak Z. (2010). Discovering interesting association rules in the web log usage data. Interdisciplinary Journal of Information, Knowledge, and Management, 5, Dimitrijevic M., & Bosnjak Z. (2011). Association rule mining system. Interdisciplinary Journal of Information, Knowledge, and Management, 6, Geng, L., & Hamilton, H. J. (2006). Interestingness measures for data mining: A survey. ACM Computing Surveys (CSUR), 38(3, Article 9). 62
8 Huang, X. (2007). Comparison of interestingness measures for web usage mining: An empirical study, International Journal of Information Technology & Decision Making (IJITDM), 6(1), Kannan S., & Bhaskaran R. (2009). Association rule pruning based on interestingness measures with clustering. International Journal of Computer Science Issues, 6(1), Kazienko, P. (2009). Mining indirect association rules for web recommendation. International Journal of Applied Mathematics and Computer Science, 19(1), Kosala, R., & Blockeel, H. (2000). Web mining research: A survey, SIGKDD Explorations, 2(1), Sahaaya, A. M., & Malarvizhi M. (2010). Improving web navigation technique using weighted order representation. International Journal of Research and Reviews in Computer Science, 1(2), Tan, P., Kumar, V., & Srivastava, J. (2004). Selecting the right interestingness measure for association patterns. Information Systems, 29(4), Webb, G. I. (2011). Filtered top k association discovery. WIREs Data Mining Knowl Discov 2011, 1, doi: /widm.28 Biographies Maja Dimitrijević is a lecturer at the Higher Technical School of Professional Studies in Novi Sad, Serbia. She teaches courses in database structures, object-oriented programming and software engineering. She is currently working on her PhD thesis in the area of data mining. Her current research interests include data mining, web usage mining, database structures and software engineering. She holds an MSC degree in Computer Science from the University of British Columbia, Vancouver, Canada. Tanja Krunić is a lecturer at the Higher Technical School of Professional Studies in Novi Sad, Serbia. She teaches courses in programming, web design and Internet languages and tools. She holds an MSC in mathematics and is currently working towards her PhD in Numerical Analysis from the Faculty of Mathematics and Natural Sciences, Novi Sad. Her research interests include important issues like responsive design, search engine optimization usability, accessibility, privacy, and security on the World Wide Web. 63
Web Usage Association Rule Mining System
Interdisciplinary Journal of Information, Knowledge, and Management Volume 6, 2011 Web Usage Association Rule Mining System Maja Dimitrijević The Advanced School of Technology, Novi Sad, Serbia [email protected]
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
Ontology-Based Filtering Mechanisms for Web Usage Patterns Retrieval
Ontology-Based Filtering Mechanisms for Web Usage Patterns Retrieval Mariângela Vanzin, Karin Becker, and Duncan Dubugras Alcoba Ruiz Faculdade de Informática - Pontifícia Universidade Católica do Rio
Indirect Positive and Negative Association Rules in Web Usage Mining
Indirect Positive and Negative Association Rules in Web Usage Mining Dhaval Patel Department of Computer Engineering, Dharamsinh Desai University Nadiad, Gujarat, India Malay Bhatt Department of Computer
Understanding Web personalization with Web Usage Mining and its Application: Recommender System
Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,
Introduction. A. Bellaachia Page: 1
Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.
Web Mining Patterns Discovery and Analysis Using Custom-Built Apriori Algorithm
International Journal of Engineering Inventions e-issn: 2278-7461, p-issn: 2319-6491 Volume 2, Issue 5 (March 2013) PP: 16-21 Web Mining Patterns Discovery and Analysis Using Custom-Built Apriori Algorithm
An Effective Analysis of Weblog Files to improve Website Performance
An Effective Analysis of Weblog Files to improve Website Performance 1 T.Revathi, 2 M.Praveen Kumar, 3 R.Ravindra Babu, 4 Md.Khaleelur Rahaman, 5 B.Aditya Reddy Department of Information Technology, KL
PREPROCESSING OF WEB LOGS
PREPROCESSING OF WEB LOGS Ms. Dipa Dixit Lecturer Fr.CRIT, Vashi Abstract-Today s real world databases are highly susceptible to noisy, missing and inconsistent data due to their typically huge size data
ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL
International Journal Of Advanced Technology In Engineering And Science Www.Ijates.Com Volume No 03, Special Issue No. 01, February 2015 ISSN (Online): 2348 7550 ASSOCIATION RULE MINING ON WEB LOGS FOR
Importance of Domain Knowledge in Web Recommender Systems
Importance of Domain Knowledge in Web Recommender Systems Saloni Aggarwal Student UIET, Panjab University Chandigarh, India Veenu Mangat Assistant Professor UIET, Panjab University Chandigarh, India ABSTRACT
A Survey on Web Mining From Web Server Log
A Survey on Web Mining From Web Server Log Ripal Patel 1, Mr. Krunal Panchal 2, Mr. Dushyantsinh Rathod 3 1 M.E., 2,3 Assistant Professor, 1,2,3 computer Engineering Department, 1,2 L J Institute of Engineering
COURSE RECOMMENDER SYSTEM IN E-LEARNING
International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand
Selection of Optimal Discount of Retail Assortments with Data Mining Approach
Available online at www.interscience.in Selection of Optimal Discount of Retail Assortments with Data Mining Approach Padmalatha Eddla, Ravinder Reddy, Mamatha Computer Science Department,CBIT, Gandipet,Hyderabad,A.P,India.
Building A Smart Academic Advising System Using Association Rule Mining
Building A Smart Academic Advising System Using Association Rule Mining Raed Shatnawi +962795285056 [email protected] Qutaibah Althebyan +962796536277 [email protected] Baraq Ghalib & Mohammed
Visualizing e-government Portal and Its Performance in WEBVS
Visualizing e-government Portal and Its Performance in WEBVS Ho Si Meng, Simon Fong Department of Computer and Information Science University of Macau, Macau SAR [email protected] Abstract An e-government
DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.
DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,
KOINOTITES: A Web Usage Mining Tool for Personalization
KOINOTITES: A Web Usage Mining Tool for Personalization Dimitrios Pierrakos Inst. of Informatics and Telecommunications, [email protected] Georgios Paliouras Inst. of Informatics and Telecommunications,
Business Lead Generation for Online Real Estate Services: A Case Study
Business Lead Generation for Online Real Estate Services: A Case Study Md. Abdur Rahman, Xinghui Zhao, Maria Gabriella Mosquera, Qigang Gao and Vlado Keselj Faculty Of Computer Science Dalhousie University
II. OLAP(ONLINE ANALYTICAL PROCESSING)
Association Rule Mining Method On OLAP Cube Jigna J. Jadav*, Mahesh Panchal** *( PG-CSE Student, Department of Computer Engineering, Kalol Institute of Technology & Research Centre, Gujarat, India) **
Data Mining of Web Access Logs
Data Mining of Web Access Logs A minor thesis submitted in partial fulfilment of the requirements for the degree of Master of Applied Science in Information Technology Anand S. Lalani School of Computer
FRAMEWORK FOR WEB PERSONALIZATION USING WEB MINING
FRAMEWORK FOR WEB PERSONALIZATION USING WEB MINING Monika Soni 1, Rahul Sharma 2, Vishal Shrivastava 3 1 M. Tech. Scholar, Arya College of Engineering and IT, Rajasthan, India, [email protected] 2 M.
WebAdaptor: Designing Adaptive Web Sites Using Data Mining Techniques
From: FLAIRS-01 Proceedings. Copyright 2001, AAAI (www.aaai.org). All rights reserved. WebAdaptor: Designing Adaptive Web Sites Using Data Mining Techniques Howard J. Hamilton, Xuewei Wang, and Y.Y. Yao
Web Mining as a Tool for Understanding Online Learning
Web Mining as a Tool for Understanding Online Learning Jiye Ai University of Missouri Columbia Columbia, MO USA [email protected] James Laffey University of Missouri Columbia Columbia, MO USA [email protected]
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES
BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 123 CHAPTER 7 BEHAVIOR BASED CREDIT CARD FRAUD DETECTION USING SUPPORT VECTOR MACHINES 7.1 Introduction Even though using SVM presents
72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD
72. Ontology Driven Knowledge Discovery Process: a proposal to integrate Ontology Engineering and KDD Paulo Gottgtroy Auckland University of Technology [email protected] Abstract This paper is
Information Management course
Università degli Studi di Milano Master Degree in Computer Science Information Management course Teacher: Alberto Ceselli Lecture 01 : 06/10/2015 Practical informations: Teacher: Alberto Ceselli ([email protected])
How To Write A Summary Of A Review
PRODUCT REVIEW RANKING SUMMARIZATION N.P.Vadivukkarasi, Research Scholar, Department of Computer Science, Kongu Arts and Science College, Erode. Dr. B. Jayanthi M.C.A., M.Phil., Ph.D., Associate Professor,
A SEMANTICALLY ENRICHED WEB USAGE BASED RECOMMENDATION MODEL
A SEMANTICALLY ENRICHED WEB USAGE BASED RECOMMENDATION MODEL C.Ramesh 1, Dr. K. V. Chalapati Rao 1, Dr. A.Goverdhan 2 1 Department of Computer Science and Engineering, CVR College of Engineering, Ibrahimpatnam,
Arti Tyagi Sunita Choudhary
Volume 5, Issue 3, March 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Web Usage Mining
Database Marketing, Business Intelligence and Knowledge Discovery
Database Marketing, Business Intelligence and Knowledge Discovery Note: Using material from Tan / Steinbach / Kumar (2005) Introduction to Data Mining,, Addison Wesley; and Cios / Pedrycz / Swiniarski
Enhance Preprocessing Technique Distinct User Identification using Web Log Usage data
Enhance Preprocessing Technique Distinct User Identification using Web Log Usage data Sheetal A. Raiyani 1, Shailendra Jain 2 Dept. of CSE(SS),TIT,Bhopal 1, Dept. of CSE,TIT,Bhopal 2 [email protected]
Data Preprocessing and Easy Access Retrieval of Data through Data Ware House
Data Preprocessing and Easy Access Retrieval of Data through Data Ware House Suneetha K.R, Dr. R. Krishnamoorthi Abstract-The World Wide Web (WWW) provides a simple yet effective media for users to search,
Semantically Enhanced Web Personalization Approaches and Techniques
Semantically Enhanced Web Personalization Approaches and Techniques Dario Vuljani, Lidia Rovan, Mirta Baranovi Faculty of Electrical Engineering and Computing, University of Zagreb Unska 3, HR-10000 Zagreb,
FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT
FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT ANURADHA.T Assoc.prof, [email protected] SRI SAI KRISHNA.A [email protected] SATYATEJ.K [email protected] NAGA ANIL KUMAR.G
Identifying the Number of Visitors to improve Website Usability from Educational Institution Web Log Data
Identifying the Number of to improve Website Usability from Educational Institution Web Log Data Arvind K. Sharma Dept. of CSE Jaipur National University, Jaipur, Rajasthan,India P.C. Gupta Dept. of CSI
An application for clickstream analysis
An application for clickstream analysis C. E. Dinucă Abstract In the Internet age there are stored enormous amounts of data daily. Nowadays, using data mining techniques to extract knowledge from web log
Web Advertising Personalization using Web Content Mining and Web Usage Mining Combination
8 Web Advertising Personalization using Web Content Mining and Web Usage Mining Combination Ketul B. Patel 1, Dr. A.R. Patel 2, Natvar S. Patel 3 1 Research Scholar, Hemchandracharya North Gujarat University,
Web Mining. Margherita Berardi LACAM. Dipartimento di Informatica Università degli Studi di Bari [email protected]
Web Mining Margherita Berardi LACAM Dipartimento di Informatica Università degli Studi di Bari [email protected] Bari, 24 Aprile 2003 Overview Introduction Knowledge discovery from text (Web Content
FACULTY STUDY PROGRAMME FOR POSTGRADUATE STUDIES
FACULTY OF CONTEMPORARY SCIENCES AND TECHNOLOGIES STUDY PROGRAMME FOR POSTGRADUATE STUDIES (Master of Science) NAME OF THE PROGRAMME: BUSINESS INFORMATICS STUDIES 262 PROGRAMME DESCRIPTION Business Informatics
Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.
Volume 4, Issue 11, November 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Analytics
ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION
ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION K.Vinodkumar 1, Kathiresan.V 2, Divya.K 3 1 MPhil scholar, RVS College of Arts and Science, Coimbatore, India. 2 HOD, Dr.SNS
Laboratory Module 8 Mining Frequent Itemsets Apriori Algorithm
Laboratory Module 8 Mining Frequent Itemsets Apriori Algorithm Purpose: key concepts in mining frequent itemsets understand the Apriori algorithm run Apriori in Weka GUI and in programatic way 1 Theoretical
DATA MINING CONCEPTS AND TECHNIQUES. Marek Maurizio E-commerce, winter 2011
DATA MINING CONCEPTS AND TECHNIQUES Marek Maurizio E-commerce, winter 2011 INTRODUCTION Overview of data mining Emphasis is placed on basic data mining concepts Techniques for uncovering interesting data
AN EFFICIENT APPROACH TO PERFORM PRE-PROCESSING
AN EFFIIENT APPROAH TO PERFORM PRE-PROESSING S. Prince Mary Research Scholar, Sathyabama University, hennai- 119 [email protected] E. Baburaj Department of omputer Science & Engineering, Sun Engineering
Automatic Recommendation for Online Users Using Web Usage Mining
Automatic Recommendation for Online Users Using Web Usage Mining Ms.Dipa Dixit 1 Mr Jayant Gadge 2 Lecturer 1 Asst.Professor 2 Fr CRIT, Vashi Navi Mumbai 1 Thadomal Shahani Engineering College,Bandra 2
Advanced Preprocessing using Distinct User Identification in web log usage data
Advanced Preprocessing using Distinct User Identification in web log usage data Sheetal A. Raiyani 1, Shailendra Jain 2, Ashwin G. Raiyani 3 Department of CSE (Software System), Technocrats Institute of
New Matrix Approach to Improve Apriori Algorithm
New Matrix Approach to Improve Apriori Algorithm A. Rehab H. Alwa, B. Anasuya V Patil Associate Prof., IT Faculty, Majan College-University College Muscat, Oman, [email protected] Associate
How To Use Data Mining For Knowledge Management In Technology Enhanced Learning
Proceedings of the 6th WSEAS International Conference on Applications of Electrical Engineering, Istanbul, Turkey, May 27-29, 2007 115 Data Mining for Knowledge Management in Technology Enhanced Learning
DEVELOPMENT OF HASH TABLE BASED WEB-READY DATA MINING ENGINE
DEVELOPMENT OF HASH TABLE BASED WEB-READY DATA MINING ENGINE SK MD OBAIDULLAH Department of Computer Science & Engineering, Aliah University, Saltlake, Sector-V, Kol-900091, West Bengal, India [email protected]
Implementation of a New Approach to Mine Web Log Data Using Mater Web Log Analyzer
Implementation of a New Approach to Mine Web Log Data Using Mater Web Log Analyzer Mahadev Yadav 1, Prof. Arvind Upadhyay 2 1,2 Computer Science and Engineering, IES IPS Academy, Indore India Abstract
Web Personalization based on Usage Mining
Web Personalization based on Usage Mining Sharhida Zawani Saad School of Computer Science and Electronic Engineering, University of Essex, Wivenhoe Park, Colchester, Essex, CO4 3SQ, UK [email protected]
On the design concepts for CRM system
Jeong Yong Ahn Department of Computer Science and Informatics, Seonan University, Namwon, Korea Seok Ki Kim Division of Mathematics and Statistical Informatics, Chonbuk National University, Chonju, Korea
SPATIAL DATA CLASSIFICATION AND DATA MINING
, pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal
A UPS Framework for Providing Privacy Protection in Personalized Web Search
A UPS Framework for Providing Privacy Protection in Personalized Web Search V. Sai kumar 1, P.N.V.S. Pavan Kumar 2 PG Scholar, Dept. of CSE, G Pulla Reddy Engineering College, Kurnool, Andhra Pradesh,
Web Mining Functions in an Academic Search Application
132 Informatica Economică vol. 13, no. 3/2009 Web Mining Functions in an Academic Search Application Jeyalatha SIVARAMAKRISHNAN, Vijayakumar BALAKRISHNAN Faculty of Computer Science and Engineering, BITS
A Cube Model for Web Access Sessions and Cluster Analysis
A Cube Model for Web Access Sessions and Cluster Analysis Zhexue Huang, Joe Ng, David W. Cheung E-Business Technology Institute The University of Hong Kong jhuang,kkng,[email protected] Michael K. Ng,
How To Solve The Kd Cup 2010 Challenge
A Lightweight Solution to the Educational Data Mining Challenge Kun Liu Yan Xing Faculty of Automation Guangdong University of Technology Guangzhou, 510090, China [email protected] [email protected]
Static Data Mining Algorithm with Progressive Approach for Mining Knowledge
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 85-93 Research India Publications http://www.ripublication.com Static Data Mining Algorithm with Progressive
Pre-Processing: Procedure on Web Log File for Web Usage Mining
Pre-Processing: Procedure on Web Log File for Web Usage Mining Shaily Langhnoja 1, Mehul Barot 2, Darshak Mehta 3 1 Student M.E.(C.E.), L.D.R.P. ITR, Gandhinagar, India 2 Asst.Professor, C.E. Dept., L.D.R.P.
E-CRM and Web Mining. Objectives, Application Fields and Process of Web Usage Mining for Online Customer Relationship Management.
University of Fribourg, Switzerland Department of Computer Science Information Systems Research Group Seminar Online CRM, 2005 Prof. Dr. Andreas Meier E-CRM and Web Mining. Objectives, Application Fields
Data Mining Applications in Higher Education
Executive report Data Mining Applications in Higher Education Jing Luan, PhD Chief Planning and Research Officer, Cabrillo College Founder, Knowledge Discovery Laboratories Table of contents Introduction..............................................................2
Investigating Clinical Care Pathways Correlated with Outcomes
Investigating Clinical Care Pathways Correlated with Outcomes Geetika T. Lakshmanan, Szabolcs Rozsnyai, Fei Wang IBM T. J. Watson Research Center, NY, USA August 2013 Outline Care Pathways Typical Challenges
Responsive web design Are we ready for the new age?
Responsive web design Are we ready for the new age? Nataša Subić, The Higher Education Technical School of Professional Studies in Novi Sad, Serbia, [email protected] Tanja Krunić, The Higher Education
Web Usage Mining: Identification of Trends Followed by the user through Neural Network
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 7 (2013), pp. 617-624 International Research Publications House http://www. irphouse.com /ijict.htm Web
A COGNITIVE APPROACH IN PATTERN ANALYSIS TOOLS AND TECHNIQUES USING WEB USAGE MINING
A COGNITIVE APPROACH IN PATTERN ANALYSIS TOOLS AND TECHNIQUES USING WEB USAGE MINING M.Gnanavel 1 & Dr.E.R.Naganathan 2 1. Research Scholar, SCSVMV University, Kanchipuram,Tamil Nadu,India. 2. Professor
Web Mining using Artificial Ant Colonies : A Survey
Web Mining using Artificial Ant Colonies : A Survey Richa Gupta Department of Computer Science University of Delhi ABSTRACT : Web mining has been very crucial to any organization as it provides useful
K-means Clustering Technique on Search Engine Dataset using Data Mining Tool
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 6 (2013), pp. 505-510 International Research Publications House http://www. irphouse.com /ijict.htm K-means
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
USING SEMANTIC WEB MINING TECHNOLOGIES FOR PERSONALIZED E-LEARNING EXPERIENCES
USING SEMANTIC WEB MINING TECHNOLOGIES FOR PERSONALIZED E-LEARNING EXPERIENCES Penelope Markellou 1,2, Ioanna Mousourouli 2, Sirmakessis Spiros 1,2,3, Athanasios Tsakalidis 1,2 1 Research Academic Computer
Mining an Online Auctions Data Warehouse
Proceedings of MASPLAS'02 The Mid-Atlantic Student Workshop on Programming Languages and Systems Pace University, April 19, 2002 Mining an Online Auctions Data Warehouse David Ulmer Under the guidance
Mining Association Rules: A Database Perspective
IJCSNS International Journal of Computer Science and Network Security, VOL.8 No.12, December 2008 69 Mining Association Rules: A Database Perspective Dr. Abdallah Alashqur Faculty of Information Technology
Data Mining to Recognize Fail Parts in Manufacturing Process
122 ECTI TRANSACTIONS ON ELECTRICAL ENG., ELECTRONICS, AND COMMUNICATIONS VOL.7, NO.2 August 2009 Data Mining to Recognize Fail Parts in Manufacturing Process Wanida Kanarkard 1, Danaipong Chetchotsak
A Novel Approach for Network Traffic Summarization
A Novel Approach for Network Traffic Summarization Mohiuddin Ahmed, Abdun Naser Mahmood, Michael J. Maher School of Engineering and Information Technology, UNSW Canberra, ACT 2600, Australia, [email protected],[email protected],M.Maher@unsw.
How To Analyze Web Server Log Files, Log Files And Log Files Of A Website With A Web Mining Tool
International Journal of Advanced Computer and Mathematical Sciences ISSN 2230-9624. Vol 4, Issue 1, 2013, pp1-8 http://bipublication.com ANALYSIS OF WEB SERVER LOG FILES TO INCREASE THE EFFECTIVENESS
A Study of Web Log Analysis Using Clustering Techniques
A Study of Web Log Analysis Using Clustering Techniques Hemanshu Rana 1, Mayank Patel 2 Assistant Professor, Dept of CSE, M.G Institute of Technical Education, Gujarat India 1 Assistant Professor, Dept
Identifying User Behavior by Analyzing Web Server Access Log File
IJCSNS International Journal of Computer Science and Network Security, VOL.9 No.4, April 2009 327 Identifying User Behavior by Analyzing Web Server Access Log File K. R. Suneetha, Dr. R. Krishnamoorthi,
Research of Postal Data mining system based on big data
3rd International Conference on Mechatronics, Robotics and Automation (ICMRA 2015) Research of Postal Data mining system based on big data Xia Hu 1, Yanfeng Jin 1, Fan Wang 1 1 Shi Jiazhuang Post & Telecommunication
IT services for analyses of various data samples
IT services for analyses of various data samples Ján Paralič, František Babič, Martin Sarnovský, Peter Butka, Cecília Havrilová, Miroslava Muchová, Michal Puheim, Martin Mikula, Gabriel Tutoky Technical
Using reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management
Using reporting and data mining techniques to improve knowledge of subscribers; applications to customer profiling and fraud management Paper Jean-Louis Amat Abstract One of the main issues of operators
An Enhanced Framework For Performing Pre- Processing On Web Server Logs
An Enhanced Framework For Performing Pre- Processing On Web Server Logs T.Subha Mastan Rao #1, P.Siva Durga Bhavani #2, M.Revathi #3, N.Kiran Kumar #4,V.Sara #5 # Department of information science and
BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL
The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University
