Implementing Advanced Cleaning and End-User Interpretability Technologies in Web Log Mining
|
|
- Clinton Welch
- 8 years ago
- Views:
Transcription
1 109 mplementing Advanced Cleaning and End-User nterpretability Technologies in Web Log Mining Zidrina Pabarskaite School of Computing nformation Systems and Mathematics, South Bank University, 103 Borough Road, London SE OM, UK Abstract. Two new approaches to web log mining are presented. Novel advanced cleaning improves web log mining results. mproved filtering removes pages with no links from other pages. n the data visualisation phase, technical representations of web pages are replaced by user attractive text interpretations. Experiments with the real world problems showed that the proposed techniques significantly increase the quality and usefulness of web log mining results. Keywords. Web log mining, web mining, data cleaning, knowledge representation and visualization. 1. ntroduction The web mining analysis process has evolved over the past decade. A wide range of techniques of web log mining appeared during this time [l], [6], [9]. However, some problems remain and have received little or no attention [7]. Many methods only work well on web sites having simple designs and are of limited use. The complexity of most web sites, the type and appropriate cleaning of the data set, the manner and the sample of the data used, have profound effects on the performance of web log data mining. Typical web log cleaning methodologies removes images and pictures. The analysis concerns the investigation of medidmultimedia files but there are masses of other irrelevant files, which stay untouched during all the analysis process. These can be internal web administrator actions, special purpose files, etc. The complexity of many web sites can influence the data cleaning process and impact the final results as well. t is characteristic of human nature that people want to see simple and understandable structures. Web log analysis tools do not always produce results which are easily understood by investigators. Scientist in [3] pointed the need to develop tools to help better understand mined knowledge. Mining reports generated by software applications in the final stage should be least complicated and more accessible to the majority who are interested in this information. This is the problem of visualization of results. Most of the web data is used not by technicians or web site developers, but by the people making business decisions. n this context, it is important to present results understandable to business decision makers. The new approach presented here proposes to solve visualization problem. This is done by using text tags viewed on the web site and used to define links to other pages. Using this approach, firstly, the request to the web server is send asking for the HTML page code. At the next stage, text tags are extracted. These text tags replace the classical technical URL representation used by the majority of web analysis tools. By doing this, text interpretations can be easily traced and understood by any person who is doing analysis. 2. Complex Web site structures Before data can be analysed it should pass through several pre-processing stages such as data cleaning, feature selection, transformation and etc. [6]. The pre-processing can take up to 80% of the time spend analysing the data. The quality of the final results strongly depends on cleaning process. Existing web log techniques perform rather superficial data cleaning. Web sites are created using different tools and design methodologies, therefore different types of web logs are generated. A cleaning process suitable for one type of web log data, can be irrelevant for another type and produced inaccurate results. For instance, the frame is a structure of the web site (see Fig. 1). t represents certain types of pages and consists of a number of other pages. When a request is submitted, the web browser detects the frame and additionally requests frame 24th nt. Conf. lnformafion Technology nterfaces /T 2002, June 24-27, 2002, Cavtat, Croatia
2 110 related pages. At the same time, the web server logs a number of requests into the log file instead of simply logging just one transaction. This problem has not been seriously investigated by researchers even thought it commonly arises when data is collected from frame designed web sites. This issue prompts the need of removing irrelevant frame pages from web log files. Such cleaning is important because it influences the size of the data set, the speed and the accuracy of the final results. During cleaning process, the size of the data sample is reduced by a factor of 3 to 10 times. /iidex.cfa <frameset... > <frame name="topba" src="topba.cfa"...> <frameset... > <frame name="leitmenu" src="leftnenu.cfa"...> <frame name="m;lir" src="notrecognised.cfa"...> </frameset> </frameset> ii /index.cfa name="topbar" src="topbar.cfa" ( /topbar.cfa 1 <frameset... > <frame name="leftmenu" src="lehenu.cfa"...> <frame name="main" src="notrecognised.cfa"... > name="leftmenu" name="main" src="notrecognised.cfa" =igure 1. The architecture of frame pages The picture (see Fig. 1) presents a typical us frames while developing a web site. The picture shows the HTML code, where index.cfa presents a complex structure of the web page. t consists of a new frame <frameset...>. Every frame has its own name <frame name...>. The first frame contains a page with the name topbar.cfa, the second frame splits into two different web pages: 1eftmenu.cfa and notrecognised.cfa. When the page index.cfa is requested, all three other pages are downloaded as compound parts of this frame. At the same time web server logs request of index.cfa, /topbar.cfa, 1eftmenu.cfa and notrecognised.cfa into the log file. t is very likely that just one page, for example /notrecognised.cfa, presents the desired information. 3. An advanced cleaning technique Web log cleaning is not as easy as it may appear. Most web mining techniques removes image files with extensions GF and JPG (if analysis does not involve image examining) [8], [51. Files with these extensions do not refer to separate web pages. They are part of some web page and are usually downloaded together with the requested page file. However, web servers monitor and log all requested files, including imagdmedia files. Experimenting with web log data cleaning provides evidence that there is much work to be done in this area. Besides images, which can be removed by most web mining techniques, there are number of other irrelevant files. These pages have a negative impact on the analysis. For example, the administrator might log into a site to make some updates or modifications, the web server will log this action as well. The web server cannot distinguish internal from external user. Table 1 presents a list of file types referred to as irrelevant. This list was gathered from experimental studies analyzing web log data. Table 1. Discovered irrelevant web pages URLfile tlpe GF JPG C0 SWF JS XT css PFR admin Description Picturehmage file Picturehmage file An icon so that when visitors to your site boohmark or add it to their 'yavourites" a nifry little icon shows up instead of that drab default explorer icon Flash -animation files Java script (additional extensions) file Text file for search engines. t usually contains info about the site for robots HTML style sheet, style description, lots of different pages can have the same style description Picture Frames Administration actions The web logs from the advertising company site was used as a source of experimental studies (for details, see Section 5). n the beginning, standard web log mining tools were applied. n the later stage, it was noticed that the web log mining results were not meaningful to the advertising client. Exploring the web site design technology, it was discovered that the web site was created using frames (for details, see Section
3 111 Links Table soum Page Text fleftmermcfa content ciin neknucfa /members/mddex c6n ABOUT US flefhnenu cfa fleknu cfa lawardsmdex cfm hlshop/indexcfm AWARDS BOOKSHOP Aethnenu cfa icareerdmdex ciin CAREERS flehucfa i~scusslodlndex ciin DSCUSSON Aeitmnu cfd aboubxljoin html HOW TO JON Aeknu cfd lpmsonsimddex cfm PA PENSONS Aeknu cfd Aefhnenu cfa kmczslindci cfm /e&tdetulr cfm MEMBERS ONLY MY DETALS heknu cfa inewsiindev cfm NEWS 4 Record to Links Table Page Analvzer <a hrehcontent cfm></a> flethnenu cfa imntent cfin ltopbawfa mdex cfa /userdefinit~on cfa lresourcziindexciin /resowcz/menuciin hkshopmdexciin ltcokshop/menu.ch Page to be Analyed log file and retrieved from the web server. A page analyzer engine processes bodies of retrieved pages further. The analyzer parses HTML source and stores detected link in the links table (see Fig. 3). The procedure is repeated for all web pages in the web log. The links table in Fig. 3 consists of the source information (the first column) pointing from where the visitor of the web site got to the page located in the second contoured column. These are names of pages accessible Page page Request from the source pages. The third column is the text of user friendly web page names presented on the web site. A Web Server nternet similar page retrieving method was presented in [4]. The authors used special web agents to get links from the World Figure 2. Retrieving the source code from web Wide Web for indexing search engines. pages However, here, the links which have 2). As described earlier, frames do not refer to been retrieved are used for cleaning exact pages. t is difficult to detect frames purposes. Moreover, text tags retrieved the HTML code has their role in presentation results. FLTER F page found in link table it is preserved, ovmse -removed Add page as such link emt content cfm /eventsirecession-menu html leventsiconfaence html iservicesicpd-index c h lservicesltraimngicpdiloghome cfm iservicesitraimnglcpd~menu cfm ieventsirecession-menu html iservicesicpd-index cfm Figure 3. Retrieved just those web pages which are accessible from other pages because frame pages do not have characteristic file extension (e.g., like pictures extensions jpg, gif). The below proposed technique consist of two frame recognition stages Retrieving HTML code from web server The first stage of advanced web log filtering consists of retrieving information from the web server. Web log file serve as a source of web pages to be downloaded. n the developed framework, page names are taken from the web 3.2. Filtering Some pages can be found in both web log file and link table (see Fig. 3). However, the origin of the pages in the link table is different. Link table contains only pages the user can click on. However, there are some text tag problems. Sometimes, links can be the image or JavaScript file. No text tag can be retrieved using these links. n these cases, third column in JavaScript object-based scripting language used for visual enhancement.
4 112 Table 2. Top pages of the generated by Webtrends web mining tool, here irrelevant page is indicated as RP, and link page as LP Views Total no Type /ipaccrreers/ 19,760 RP L 17,988 RP /ronrenf. chn 13,453. LP /iaacareet~s/fnrtfile/lisfcute~ories.rfiir 3,492 LP /rmuform.rfm 3,347 RP /secr~~~neil;searchfmrrrcret.cfnr 2,607 RP /ipacareer.r/solash. cfm 2,496 RP /aboutmembers/ 2,312 RP /careers/ 2,296 RP /careers/menu. chn 2,181 RP /iuacureers/lzome mainkiver.cfin 2,091 LP /rad/ 1,886 lrp /ipacureers/~~elconre. chi 1,868 LP /tliunkvou.cfni 1,742 RP /aknree acrion.chn 1,725 RP Lserviceshaiii, crfm 1,440 LP /cod/lonin!. drli 1,434 RP /ser~~ices/~lvnninicmmu rhn 1,413 RP /ctd/add!. chi1 1,406 RP hem ber.~mau/searchouiions. cfin 1,370 LP /neu;s/ 1,353 RP /services/ 1,350 RP /nens/meizu.cfm 1,343 RP /services/trainir~.e/cr~~~~home.cf?z 1,313 LP the Links table may contain a gap. n some other cases, text tags can be completely meaningless, for instance click here, etc. From link pages it is possible to get to the other pages. n other words, user can click on a link page and get to the destination page. Link table s second column contains all possible pages user can get to (click on). Following this, the page is relevant if it exists in the links table. Pages that do not exist in the links table must be removed (see Fig. 2). Using the methodology developed in this research, cleaning covers not just frame pages but all irrelevant pages (for example, pictures, flash animation, etc.) and leaves only essential pages for further analysis and experimental studies. 4. Visualization web log patterns The essence of nformation Visualization is referred to the creation of an internal model or image in the mind of a user. Hence, information visualization is an activity that humankind is engaged in all the time [2]. The scheme of the process of knowledge discovery by Fayyad et al. [7] does not involves information visualization. However, visualization should give a clear view of the rules, classes or recommendations discovered in the final data analysis stage. For example, information presented for a web mining analyst might not be understandable by a business analyst. For example, a technical interpretation of the page index.htm1 is commonly understood by web site developers. The expression index.htm1 might be more easily understood by a business analyst as Homepage. The implemented engine (see Fig. 4) takes technical web page names from the cleaned Links table and replaces them with textual information. 5. Results The web site of the media advertisement company was used to show the effectiveness of the proposed methodology. The web site was created using frame pages. Standard cleaning technology was not effective. For the end-user, the results produced by Webtrends were poor and of limited use (see Table 2). Table shows top pages viewed by visitors to the web site. The first column of the Table 2 represents names of the web pages (views). The second column indicates percentage of a viewed page out of all views. The indication of relevancy of the page was added to the last column by authors (reminder, if it has not link, it is not suitable for mining process). The size of log file contained records gathered over the period from to After applying the proposed cleaning techniques, the size of the data was reduced to records (about 22% of the original data set was left). The results showed that in 72% of the cases used in the study the proposed frame cleaning technique outperformed standard cleaning methodology. After the data was cleaned from irrelevant records, visualization technology was applied and produced the results as can be seen in the last replacing engine stage, see Figure 4. For example, /content.cfm is replaced with text,. /events/recession-menu.html with RECESSON, /events/conference.html with Essential Advice for Budget Planning 2002, /services/cpd-index.cfm with CPD ZONE, /news/index.cfm with NEWS. This modification is pretty attractive and allows the analyst to trace the mined web site immediately. WebTrends is the web traffic analysis softwart: package for web administrators and marketing managers everywhere.
5 113 fieitmenu cfa /lehenu.cfa /leftmenu cfa /lefhnenu.cfa ileftmenu cfa /leftmenu cfa Links Table iawarddiudes cfm ibookshopiindex c h icareerdindex cfm /dscussionjlndex cfm iaboutudjom html ipensionsiindex.cfm /sewwedindex c h /editdetails.ch ABOUT US AWARDS BOOKSHOP CAREERS DSCUSSON HOW TO JON PA PENSONS MEMBERS ONLY MY DETALS NEWS contenwin Replacing technical vemon hy user fnendly interpretation interpretahon is ieventsconference hhnl ieventslconference.html /semces/cpd indexcfm /newsiindes.c& User friendly Web Log RECE~SON Essenhal Advlce fm Budget Planrung 2002" "Essentd Advlce for Budget Planrung 2002" "Essenhal Advice for Budget Planrung 2002" CPD ZONE NEWS Figure 4. User friendly interpretation of web pages 6. Concluding remarks Two new techniques for enhancing web log mining are presented. A novel framework for performing advanced web log data cleaning was presented. The advantages of this technique were enlarged. This new framework was implemented and tested on a real data set of web logs from the advertisement industry. The results were compared with the results produced using standard cleaning techniques. The last stage of data mining is visualization. The engine implemented in this research, automatically retrieves text from HTML sources and presents the results understandable to the domain user. Experiments with the real world problems and real world users showed that the proposed techniques significantly increase the attractiveness and usefulness of web mining results. 7. References Retrieval. n Database and expert systems applications Vienna: EEE Computer Society. [5] Faulstich, L., et al. A Data Mining Analyzing the Navigational Behaviour of Web Users. n Workshop on Machine Learning User Modelling of the ACA'99 nternational Conf Creta, Greece. [6] Faulstich, L.C., et al. WUM: A Tool for Web Utilization Analysis. n EDBT Workshop WebDB' , Spain. [7] Fayyad, U., et al. KDD for Science Data Analysis: ssues and Examples. n Knowledge discovery & data mining [8] Jain, N., et al., Web Mining: Pattern Discovery from World Wide Web Transactions. 1997, University of Minesota: Minneapolis. [9] Pitkow, J., et al. WEBVZ: a tool for World- Wide Web access log analysis. n First nternational World Wide Web Conference. 1994, Switzerland. [l] Balabanovic, M., et al., An Adaptive Agent for Automated Web Browsing. Visual Communication and mage Representation, Vol. 6(4). [2] Banissi, E., nformation Visualization. Encyclopedla of computer science and technology, Vol. 42(27). [3] Cooley, R., et al. Web Mining: nformation and Pattern Discovery on the World Wide Web. n Tools with artificial intelligence [4] Deogun, J.S., et al. Structural Abstractions of Hypertext Documents for Web-Based
Arti Tyagi Sunita Choudhary
Volume 5, Issue 3, March 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Web Usage Mining
More informationResearch and Development of Data Preprocessing in Web Usage Mining
Research and Development of Data Preprocessing in Web Usage Mining Li Chaofeng School of Management, South-Central University for Nationalities,Wuhan 430074, P.R. China Abstract Web Usage Mining is the
More informationANALYSING SERVER LOG FILE USING WEB LOG EXPERT IN WEB DATA MINING
International Journal of Science, Environment and Technology, Vol. 2, No 5, 2013, 1008 1016 ISSN 2278-3687 (O) ANALYSING SERVER LOG FILE USING WEB LOG EXPERT IN WEB DATA MINING 1 V. Jayakumar and 2 Dr.
More informationEnhance Preprocessing Technique Distinct User Identification using Web Log Usage data
Enhance Preprocessing Technique Distinct User Identification using Web Log Usage data Sheetal A. Raiyani 1, Shailendra Jain 2 Dept. of CSE(SS),TIT,Bhopal 1, Dept. of CSE,TIT,Bhopal 2 sheetal.raiyani@gmail.com
More information131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
More informationWEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS
WEB SITE OPTIMIZATION THROUGH MINING USER NAVIGATIONAL PATTERNS Biswajit Biswal Oracle Corporation biswajit.biswal@oracle.com ABSTRACT With the World Wide Web (www) s ubiquity increase and the rapid development
More informationIdentifying the Number of Visitors to improve Website Usability from Educational Institution Web Log Data
Identifying the Number of to improve Website Usability from Educational Institution Web Log Data Arvind K. Sharma Dept. of CSE Jaipur National University, Jaipur, Rajasthan,India P.C. Gupta Dept. of CSI
More informationAdvanced Preprocessing using Distinct User Identification in web log usage data
Advanced Preprocessing using Distinct User Identification in web log usage data Sheetal A. Raiyani 1, Shailendra Jain 2, Ashwin G. Raiyani 3 Department of CSE (Software System), Technocrats Institute of
More informationAN EFFICIENT APPROACH TO PERFORM PRE-PROCESSING
AN EFFIIENT APPROAH TO PERFORM PRE-PROESSING S. Prince Mary Research Scholar, Sathyabama University, hennai- 119 princemary26@gmail.com E. Baburaj Department of omputer Science & Engineering, Sun Engineering
More informationUrchin Demo (12/14/05)
Urchin Demo (12/14/05) General Info / FAQs 1. What is Urchin? Regent has purchased a license for Urchin 5 Web Analytics Software. This software is used to analyze web traffic and produce reports on website
More informationGoogle Analytics for Robust Website Analytics. Deepika Verma, Depanwita Seal, Atul Pandey
1 Google Analytics for Robust Website Analytics Deepika Verma, Depanwita Seal, Atul Pandey 2 Table of Contents I. INTRODUCTION...3 II. Method for obtaining data for web analysis...3 III. Types of metrics
More information15 minutes is not much so I will try to give some crucial guidelines and basic knowledge.
1 Presentation. Good morning ladies and gentlemen, dear colleagues. First of all I would like to thank the committee for this invitation and letting me speak about one of my favourite topics: the internet.
More informationVisualizing e-government Portal and Its Performance in WEBVS
Visualizing e-government Portal and Its Performance in WEBVS Ho Si Meng, Simon Fong Department of Computer and Information Science University of Macau, Macau SAR ccfong@umac.mo Abstract An e-government
More informationWEB SITE DEVELOPMENT WORKSHEET
WEB SITE DEVELOPMENT WORKSHEET Thank you for considering Xymmetrix for your web development needs. The following materials will help us evaluate the size and scope of your project. We appreciate you taking
More informationCOURSE RECOMMENDER SYSTEM IN E-LEARNING
International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand
More informationAn Effective Analysis of Weblog Files to improve Website Performance
An Effective Analysis of Weblog Files to improve Website Performance 1 T.Revathi, 2 M.Praveen Kumar, 3 R.Ravindra Babu, 4 Md.Khaleelur Rahaman, 5 B.Aditya Reddy Department of Information Technology, KL
More informationInternet Advertising Glossary Internet Advertising Glossary
Internet Advertising Glossary Internet Advertising Glossary The Council Advertising Network bring the benefits of national web advertising to your local community. With more and more members joining the
More informationMaking Graphics Interactive
Making Graphics Interactive A Banner Ad or web banner is a form of advertising on the World Wide Web delivered by an ad server. This form of online advertising entails embedding an advertisement into
More informationInternational Journal of Engineering Research-Online A Peer Reviewed International Journal Articles available online http://www.ijoer.
REVIEW ARTICLE ISSN: 2321-7758 UPS EFFICIENT SEARCH ENGINE BASED ON WEB-SNIPPET HIERARCHICAL CLUSTERING MS.MANISHA DESHMUKH, PROF. UMESH KULKARNI Department of Computer Engineering, ARMIET, Department
More informationGuide to Analyzing Feedback from Web Trends
Guide to Analyzing Feedback from Web Trends Where to find the figures to include in the report How many times was the site visited? (General Statistics) What dates and times had peak amounts of traffic?
More informationDigital media glossary
A Ad banner A graphic message or other media used as an advertisement. Ad impression An ad which is served to a user s browser. Ad impression ratio Click-throughs divided by ad impressions. B Banner A
More informationLesson Overview. Getting Started. The Internet WWW
Lesson Overview Getting Started Learning Web Design: Chapter 1 and Chapter 2 What is the Internet? History of the Internet Anatomy of a Web Page What is the Web Made Of? Careers in Web Development Web-Related
More informationA Comparative Study of Different Log Analyzer Tools to Analyze User Behaviors
A Comparative Study of Different Log Analyzer Tools to Analyze User Behaviors S. Bhuvaneswari P.G Student, Department of CSE, A.V.C College of Engineering, Mayiladuthurai, TN, India. bhuvanacse8@gmail.com
More informationMicrosoft Expression Web
Microsoft Expression Web Microsoft Expression Web is the new program from Microsoft to replace Frontpage as a website editing program. While the layout has changed, it still functions much the same as
More informationWorking with RD Web Access in Windows Server 2012
Working with RD Web Access in Windows Server 2012 Introduction to RD Web Access So far in this series we have talked about how to successfully deploy and manage a Microsoft Windows Server 2012 VDI environment.
More informationChapter-1 : Introduction 1 CHAPTER - 1. Introduction
Chapter-1 : Introduction 1 CHAPTER - 1 Introduction This thesis presents design of a new Model of the Meta-Search Engine for getting optimized search results. The focus is on new dimension of internet
More informationANALYSIS OF WEB LOGS AND WEB USER IN WEB MINING
ANALYSIS OF WEB LOGS AND WEB USER IN WEB MINING L.K. Joshila Grace 1, V.Maheswari 2, Dhinaharan Nagamalai 3, 1 Research Scholar, Department of Computer Science and Engineering joshilagracejebin@gmail.com
More informationASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL
International Journal Of Advanced Technology In Engineering And Science Www.Ijates.Com Volume No 03, Special Issue No. 01, February 2015 ISSN (Online): 2348 7550 ASSOCIATION RULE MINING ON WEB LOGS FOR
More informationW3Perl A free logfile analyzer
W3Perl A free logfile analyzer Features Works on Unix / Windows / Mac View last entries based on Perl scripts Web / FTP / Squid / Email servers Session tracking Others log format can be added easily Detailed
More informationWeb Mining Patterns Discovery and Analysis Using Custom-Built Apriori Algorithm
International Journal of Engineering Inventions e-issn: 2278-7461, p-issn: 2319-6491 Volume 2, Issue 5 (March 2013) PP: 16-21 Web Mining Patterns Discovery and Analysis Using Custom-Built Apriori Algorithm
More informationHow To Understand Programming Languages And Programming Languages
Objectives Differentiate between machine and and assembly languages Describe Describe various various ways ways to to develop develop Web Web pages pages including including HTML, HTML, scripting scripting
More informationWeb based training for field technicians can be arranged by calling 888-577-4919 These Documents are required for a successful install:
Software V NO. 1.7 Date 9/06 ROI Configuration Guide Before you begin: Note: It is important before beginning to review all installation documentation and to complete the ROI Network checklist for the
More informationKOINOTITES: A Web Usage Mining Tool for Personalization
KOINOTITES: A Web Usage Mining Tool for Personalization Dimitrios Pierrakos Inst. of Informatics and Telecommunications, dpie@iit.demokritos.gr Georgios Paliouras Inst. of Informatics and Telecommunications,
More informationShort notes on webpage programming languages
Short notes on webpage programming languages What is HTML? HTML is a language for describing web pages. HTML stands for Hyper Text Markup Language HTML is a markup language A markup language is a set of
More informationDreamweaver CS5. Module 2: Website Modification
Dreamweaver CS5 Module 2: Website Modification Dreamweaver CS5 Module 2: Website Modification Last revised: October 31, 2010 Copyrights and Trademarks 2010 Nishikai Consulting, Helen Nishikai Oakland,
More informationExploring Web Access Logs with Correspondence Analysis
Exploring Web Access Logs with Correspondence Analysis Nikos Koutsoupias 1 1 Department of Balkan Studies Aristotle University of Thessaloniki 3rd Km Florina-Niki / 53100 Florina nickk@auth.gr Abstract.
More informationHow Web Browsers Work
144 PART 4 HOW THE WORLD WIDE WEB WORKS CHAPTER 18 How Web Browsers Work 145 LIKE much of the Internet, the World Wide Web operates on a client/server model. You run a web client on your computer called
More informationWebTrends 101 Training
WebTrends 101 Training Mehgan O ConnorO 9/26/2007 Agenda Why Care about WebTrends? Logging in to WebTrends Navigating Your Agency s s Reports Key Terms Useful Reports & Statistics Additional Training 2
More informationA Survey on Preprocessing of Web Log File in Web Usage Mining to Improve the Quality of Data
A Survey on Preprocessing of Web Log File in Web Usage Mining to Improve the Quality of Data R. Lokeshkumar 1, R. Sindhuja 2, Dr. P. Sengottuvelan 3 1 Assistant Professor - (Sr.G), 2 PG Scholar, 3Associate
More informationHow To Analyze Web Server Log Files, Log Files And Log Files Of A Website With A Web Mining Tool
International Journal of Advanced Computer and Mathematical Sciences ISSN 2230-9624. Vol 4, Issue 1, 2013, pp1-8 http://bipublication.com ANALYSIS OF WEB SERVER LOG FILES TO INCREASE THE EFFECTIVENESS
More informationBisecting K-Means for Clustering Web Log data
Bisecting K-Means for Clustering Web Log data Ruchika R. Patil Department of Computer Technology YCCE Nagpur, India Amreen Khan Department of Computer Technology YCCE Nagpur, India ABSTRACT Web usage mining
More informationA SURVEY ON WEB MINING TOOLS
IMPACT: International Journal of Research in Engineering & Technology (IMPACT: IJRET) ISSN(E): 2321-8843; ISSN(P): 2347-4599 Vol. 3, Issue 10, Oct 2015, 27-34 Impact Journals A SURVEY ON WEB MINING TOOLS
More informationUnderstanding Web personalization with Web Usage Mining and its Application: Recommender System
Understanding Web personalization with Web Usage Mining and its Application: Recommender System Manoj Swami 1, Prof. Manasi Kulkarni 2 1 M.Tech (Computer-NIMS), VJTI, Mumbai. 2 Department of Computer Technology,
More informationAnalysis of Server Log by Web Usage Mining for Website Improvement
IJCSI International Journal of Computer Science Issues, Vol., Issue 4, 8, July 2010 1 Analysis of Server Log by Web Usage Mining for Website Improvement Navin Kumar Tyagi 1, A. K. Solanki 2 and Manoj Wadhwa
More informationSAP BusinessObjects Business Intelligence Platform Document Version: 4.1 Support Package 5-2014-11-06. Business Intelligence Launch Pad User Guide
SAP BusinessObjects Business Intelligence Platform Document Version: 4.1 Support Package 5-2014-11-06 Business Intelligence Launch Pad User Guide Table of Contents 1 Document history....7 2 Getting started
More informationPreprocessing and Content/Navigational Pages Identification as Premises for an Extended Web Usage Mining Model Development
Informatica Economică vol. 13, no. 4/2009 168 Preprocessing and Content/Navigational Pages Identification as Premises for an Extended Web Usage Mining Model Development Daniel MICAN, Dan-Andrei SITAR-TAUT
More informationOglethorpe University. CRS410 Internship in Communications. Debra Bryant, Web Content Intern. December 10, 2012
Website Development and Design: Real World Experience Debra Oglethorpe University CRS410 Internship in Communications Debra, Web Content Intern December 10, 2012 Experience Website Development and Design:
More informationVoluntary Product Accessibility Template Blackboard Learn Release 9.1 April 2014 (Published April 30, 2014)
Voluntary Product Accessibility Template Blackboard Learn Release 9.1 April 2014 (Published April 30, 2014) Contents: Introduction Key Improvements VPAT Section 1194.21: Software Applications and Operating
More informationContent Manager User Guide Information Technology Web Services
Content Manager User Guide Information Technology Web Services The login information in this guide is for training purposes only in a test environment. The login information will change and be redistributed
More informationWeb Log Analysis for Identifying the Number of Visitors and their Behavior to Enhance the Accessibility and Usability of Website
Web Log Analysis for Identifying the Number of and their Behavior to Enhance the Accessibility and Usability of Website Navjot Kaur Assistant Professor Department of CSE Punjabi University Patiala Himanshu
More informationWEB LOG EXPLORER CONTROL OF MULTIDIMENSIONAL DYNAMICS OF WEB PAGES
UDC:004.738.52 Original scientific paper WEB LOG EXPLORER CONTROL OF MULTIDIMENSIONAL DYNAMICS OF WEB PAGES Mislav Šimunić Faculty of Tourism and Hospitality Management, Opatija, Rijeka University, Croatia
More informationInternational Journal of Engineering Technology, Management and Applied Sciences. www.ijetmas.com November 2014, Volume 2 Issue 6, ISSN 2349-4476
ERP SYSYTEM Nitika Jain 1 Niriksha 2 1 Student, RKGITW 2 Student, RKGITW Uttar Pradesh Tech. University Uttar Pradesh Tech. University Ghaziabad, U.P., India Ghaziabad, U.P., India ABSTRACT Student ERP
More informationHow to Login Username Password:
How to Login After navigating to the SelecTrucks ATTS Call Tracking & Support Site: www.selectrucksatts.com Select Corporate Link to login for Corporate owned Centers/Locations. Username: Your Email Address
More informationPREPROCESSING OF WEB LOGS
PREPROCESSING OF WEB LOGS Ms. Dipa Dixit Lecturer Fr.CRIT, Vashi Abstract-Today s real world databases are highly susceptible to noisy, missing and inconsistent data due to their typically huge size data
More informationGetting Started with the new VWO
Getting Started with the new VWO TABLE OF CONTENTS What s new in the new VWO... 3 Where to locate features in new VWO... 5 Steps to create a new Campaign... 18 Step # 1: Enter Campaign URLs... 19 Step
More information1. Chat4Support Introduction
1. Chat4Support Introduction Chat4Support is a CodingBest product that helps businesses to improve their sales and customer service on the Internet. Website visitors just only need to click on the chat
More informationANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION
ANALYSIS OF WEBSITE USAGE WITH USER DETAILS USING DATA MINING PATTERN RECOGNITION K.Vinodkumar 1, Kathiresan.V 2, Divya.K 3 1 MPhil scholar, RVS College of Arts and Science, Coimbatore, India. 2 HOD, Dr.SNS
More informationFinancial Trading System using Combination of Textual and Numerical Data
Financial Trading System using Combination of Textual and Numerical Data Shital N. Dange Computer Science Department, Walchand Institute of Rajesh V. Argiddi Assistant Prof. Computer Science Department,
More informationAn Enhanced Framework For Performing Pre- Processing On Web Server Logs
An Enhanced Framework For Performing Pre- Processing On Web Server Logs T.Subha Mastan Rao #1, P.Siva Durga Bhavani #2, M.Revathi #3, N.Kiran Kumar #4,V.Sara #5 # Department of information science and
More informationPre-Processing: Procedure on Web Log File for Web Usage Mining
Pre-Processing: Procedure on Web Log File for Web Usage Mining Shaily Langhnoja 1, Mehul Barot 2, Darshak Mehta 3 1 Student M.E.(C.E.), L.D.R.P. ITR, Gandhinagar, India 2 Asst.Professor, C.E. Dept., L.D.R.P.
More informationIs Your Google Analytics Data Accurate?
Is Your Google Analytics Data Accurate? September 18, 2013 Presented By Amin Shawki Analytics Manager Andy Gibson Digital Marketing Analyst 1. 1 Billion+ pageviews/year in sites analyzed and supported
More informationAn Electronic Journal Management System
An Electronic Journal Management System Hrvoje Bogunović, Edgar Pek, Sven Lončarić and Vedran Mornar Faculty of Electrical Engineering and Computing, University of Zagreb Unska 3, 0000 Zagreb, Croatia
More informationExploitation of Server Log Files of User Behavior in Order to Inform Administrator
Exploitation of Server Log Files of User Behavior in Order to Inform Administrator Hamed Jelodar Computer Department, Islamic Azad University, Science and Research Branch, Bushehr, Iran ABSTRACT All requests
More informationA Survey on Web Mining Tools and Techniques
A Survey on Web Mining Tools and Techniques 1 Sujith Jayaprakash and 2 Balamurugan E. Sujith 1,2 Koforidua Polytechnic, Abstract The ineorable growth on internet in today s world has not only paved way
More informationSTATE OF VERMONT. Secretary of Administration ORIGINAL POLICY ADOPTED BY STC DATE: STATUTORY REFERENCE Policy for Web Look and Feel Requirements
STATE OF VERMONT Agency of Administration STANDARD STC State Technology Collaborative ORIGINAL POLICY ADOPTED BY STC DATE: EFFECTIVE DATE ORIGINAL POLICY NUMBER ASSOCIATED DOCUMENTS Standards for Usability
More informationResponsive web design Are we ready for the new age?
Responsive web design Are we ready for the new age? Nataša Subić, The Higher Education Technical School of Professional Studies in Novi Sad, Serbia, subic@vtsns.edu.rs Tanja Krunić, The Higher Education
More informationWeb Design Competition 2013. College of Computing Science, Department of Information Systems. New Jersey Institute of Technology
COMPETITION PURPOSE The Web is the most transformable invention of our time. This competition features the creation of high-quality, well-designed and original Websites, while seeking to identify and encourage
More informationORACLE APPLICATION EXPRESS 5.0
ORACLE APPLICATION EXPRESS 5.0 Key Features Fully supported nocost feature of the Oracle Database Simple 2-Tier Architecture Develop desktop and mobile applications 100% Browserbased Development and Runtime
More informationAN OVERVIEW OF PREPROCESSING OF WEB LOG FILES FOR WEB USAGE MINING
AN OVERVIEW OF PREPROCESSING OF WEB LOG FILES FOR WEB USAGE MINING N. M. Abo El-Yazeed Demonstrator at High Institute for Management and Computer, Port Said University, Egypt no3man_mohamed@himc.psu.edu.eg
More informationOmniUpdate Training (Advanced OU users level 7+)
(Advanced OU users level 7+) University Web Team The Web Team is a part of the Office of University Communication and Marketing s Creative Services Department. Our responsibility is to build, migrate,
More informationVoluntary Product Accessibility Report
Voluntary Product Accessibility Report Compliance and Remediation Statement for Section 508 of the US Rehabilitation Act for OpenText Content Server 10.5 October 23, 2013 TOGETHER, WE ARE THE CONTENT EXPERTS
More informationMonitoring Pramati Web Server
Monitoring Pramati Web Server 15 Overview This section describes how to monitor Pramati Web Server from the Console. You can monitor information regarding the running Default Server and Virtual Hosts,
More informationCMS Training. Prepared for the Nature Conservancy. March 2012
CMS Training Prepared for the Nature Conservancy March 2012 Session Objectives... 3 Structure and General Functionality... 4 Section Objectives... 4 Six Advantages of using CMS... 4 Basic navigation...
More informationRequirements for Developing WebWorks Help
WebWorks Help 5.0 Originally introduced in 1998, WebWorks Help is an output format that allows online Help to be delivered on multiple platforms and browsers, which makes it easy to publish information
More informationANALYZING OF THE EVOLUTION OF WEB PAGES BY USING A DOMAIN BASED WEB CRAWLER
- 151 - Journal of the Technical University Sofia, branch Plovdiv Fundamental Sciences and Applications, Vol. 16, 2011 International Conference Engineering, Technologies and Systems TechSys 2011 BULGARIA
More informationEvents Forensic Tools for Microsoft Windows
Events Forensic Tools for Microsoft Windows Professional forensic tools Events Forensic Tools for Windows Easy Events Log Management Events Forensic Tools (EFT) is a fast, easy to use and very effective
More informationCommon Online Advertising Terms Provided by ZEDO, Inc.
3rd Party Ad Tag 3rd Party Redirect Action Action Tracking Tag Activity Ad Dimension Ad Hoc Report Ad Network Ad Tag Advanced Report Advertiser Advertiser Summary Report Advertiser Type Allocate per Ad
More informationDevelopment of Framework System for Managing the Big Data from Scientific and Technological Text Archives
Development of Framework System for Managing the Big Data from Scientific and Technological Text Archives Mi-Nyeong Hwang 1, Myunggwon Hwang 1, Ha-Neul Yeom 1,4, Kwang-Young Kim 2, Su-Mi Shin 3, Taehong
More informationInner Classification of Clusters for Online News
Inner Classification of Clusters for Online News Harmandeep Kaur 1, Sheenam Malhotra 2 1 (Computer Science and Engineering Department, Shri Guru Granth Sahib World University Fatehgarh Sahib) 2 (Assistant
More informationMining for Web Engineering
Mining for Engineering A. Venkata Krishna Prasad 1, Prof. S.Ramakrishna 2 1 Associate Professor, Department of Computer Science, MIPGS, Hyderabad 2 Professor, Department of Computer Science, Sri Venkateswara
More informationOffice 888-707-3030. Fax 888-300-3002
Office 888-707-3030 Fax 888-300-3002 1 Reseller Quick-Start Guide Table of Contents Reseller Account Setup Checklist 2 Domain & Name Servers 3 Payment Gateway(s) 4 Dedicated Email Servers 5 Landing Page
More informationOptum Patient Portal. 70 Royal Little Drive. Providence, RI 02904. Copyright 2002-2013 Optum. All rights reserved. Updated: 3/7/13
Optum Patient Portal 70 Royal Little Drive Providence, RI 02904 Copyright 2002-2013 Optum. All rights reserved. Updated: 3/7/13 Table of Contents 1 Patient Portal Activation...1 1.1 Pre-register a Patient...1
More informationMOOCviz 2.0: A Collaborative MOOC Analytics Visualization Platform
MOOCviz 2.0: A Collaborative MOOC Analytics Visualization Platform Preston Thompson Kalyan Veeramachaneni Any Scale Learning for All Computer Science and Artificial Intelligence Laboratory Massachusetts
More informationEnriched Links: A Framework For Improving Web Navigation Using Pop-Up Views
Enriched Links: A Framework For Improving Web Navigation Using Pop-Up Views Gary Geisler Interaction Design Laboratory School of Information and Library Science University of North Carolina at Chapel Hill
More informationMONITORING YOUR WEBSITE WITH GOOGLE ANALYTICS
MONITORING YOUR WEBSITE WITH GOOGLE ANALYTICS How to use Google Analytics to track activity on your website and help get the most out of your website 2 April 2012 Version 1.0 Contents Contents 2 Introduction
More informationDevelopment of strategies to build a Web page
TITLE OF THE SCENARIO Keywords A chi voglio insegnare? Age range and grade of the learners Special characteristics of learners The learning emphasis? Learning subject /field / skills or dimension Specific
More informationWebsite analytics / statistics Monitoring and analysing the impact of web marketing
Website analytics / statistics Monitoring and analysing the impact of web marketing What are website analytics / statistics? Web analytics is the measurement, collection, analysis and reporting of website
More informationWeb Usage mining framework for Data Cleaning and IP address Identification
Web Usage mining framework for Data Cleaning and IP address Identification Priyanka Verma The IIS University, Jaipur Dr. Nishtha Kesswani Central University of Rajasthan, Bandra Sindri, Kishangarh Abstract
More informationHow To Change Your Site On Drupal Cloud On A Pcode On A Microsoft Powerstone On A Macbook Or Ipad (For Free) On A Freebie (For A Free Download) On An Ipad Or Ipa (For
How-to Guide: MIT DLC Drupal Cloud Theme This guide will show you how to take your initial Drupal Cloud site... and turn it into something more like this, using the MIT DLC Drupal Cloud theme. See this
More informationHS Web Design Business and Technology
Scope And Sequence Timeframe Unit Instructional Topics Course Description Web Design students will learn how to create and maintain web pages using the prevailing techniques and software. Students will
More informationContent Manager User Guide Information Technology Web Services
Content Manager User Guide Information Technology Web Services The login information in this guide is for training purposes only in a test environment. The login information will change and be redistributed
More informationWebAdaptor: Designing Adaptive Web Sites Using Data Mining Techniques
From: FLAIRS-01 Proceedings. Copyright 2001, AAAI (www.aaai.org). All rights reserved. WebAdaptor: Designing Adaptive Web Sites Using Data Mining Techniques Howard J. Hamilton, Xuewei Wang, and Y.Y. Yao
More informationHow People Read Books Online: Mining and Visualizing Web Logs for Use Information
How People Read Books Online: Mining and Visualizing Web Logs for Use Information Rong Chen 1, Anne Rose 2, Benjamin B. Bederson 2 1 Department of Computer Science and Technique College of Computer Science,
More informationNovell ZENworks Asset Management 7.5
Novell ZENworks Asset Management 7.5 w w w. n o v e l l. c o m October 2006 USING THE WEB CONSOLE Table Of Contents Getting Started with ZENworks Asset Management Web Console... 1 How to Get Started...
More informationINTERNET MARKETING. SEO Course Syllabus Modules includes: COURSE BROCHURE
AWA offers a wide-ranging yet comprehensive overview into the world of Internet Marketing and Social Networking, examining the most effective methods for utilizing the power of the internet to conduct
More informationKentico Content Management System (CMS
Kentico Content Management System (CMS Table of Contents I. Introduction... 1 II. Log into a Kentico CMS Desk to Edit GC Website... 1 A. Select a Browser (Internet Explorer or Firefox only)... 1 B. Login
More informationSearch Engine Optimisation (SEO) Guide
Search Engine Optimisation (SEO) Guide Search Engine Optimisation (SEO) has two very distinct areas; on site SEO and off site SEO. The first relates to all the tasks that you can carry out on your website
More informationDeveloping Web Browser Recording Tools. Using Server-Side Programming Technology
Developing Web Browser Recording Tools Using Server-Side Programming Technology Chris J. Lu Ph.D. National Library of Medicine NLM, NIH, Bldg. 38A, Rm. 7N-716, 8600 Rockville Pike Bethesda, MD 20894, USA
More informationHow to set up a campaign with Admedo Premium Programmatic Advertising. Log in to the platform with your email address & password at app.admedo.
How to set up a campaign with Admedo Premium Programmatic Advertising Log in to the platform with your email address & password at app..com For further support please email: hi@.com Campaign Delivery Settings
More information