Web Traffic Mining Using a Concurrent Neuro-Fuzzy Approach


Xiaozhe Wang, Ajith Abraham and Kate A. Smith
School of Business Systems, Faculty of Information Technology, Monash University, Clayton, Victoria 3800, Australia
{catherine.wang,ajith.abraham,kate.smith}@infotech.monash.edu.au

Abstract: Web servers play a crucial role in conveying knowledge and information to end users. With the popularity of the WWW, discovering hidden information about users and usage patterns is critical for determining effective marketing strategies, optimising server usage and accommodating future growth. Many of the currently available server analysis tools provide only statistical data without much useful information. Mining useful information becomes a challenging task when the user traffic volume is enormous and keeps growing. In this paper, we propose a concurrent neuro-fuzzy model to extract useful information from the statistical/text data produced by a Web log analyser. We use the cluster information generated by a Self Organising Map (SOM) for data analysis and a Fuzzy Inference System (FIS) to forecast the daily and hourly traffic volume. Empirical results clearly demonstrate that the proposed hybrid technique is efficient and could be extended to other Web environments.

Keywords: Web mining, clustering, self organising map, hybrid, neuro-fuzzy

1. Introduction and Motivation for Research

The World Wide Web (WWW) is growing continuously, with a rapid increase in the volume of information transactions and in the number of requests from Web users around the world. For Web administrators and managers, discovering hidden information about users' access or usage patterns has become a necessity for improving the quality of Web information services.
From the business point of view, knowledge obtained from the usage or access patterns of Web users could be applied directly to the marketing and management of E-business, E-services, E-searching, E-education and so on. However, the statistical data available from normal Web log data files, or even the information provided by Web trackers, can only provide explicit information because of the nature and limitations of the methodology itself. Generally, one could say that the analysis relies on three general sets of information given a current focus of attention: (1) past usage patterns, (2) degree of shared content and (3) inter-memory associative link structures [7]. Computational Web Intelligence (CWI) [1], a recently coined paradigm, aims to improve the quality of intelligence in Web technology [8]. The pattern discovery phase of Web usage mining consists of several steps including statistical analysis, clustering, classification and so on [6][9]. Most existing research focuses on finding patterns, with little effort devoted to detailed pattern analysis. We propose SOM [12] to cluster and discover patterns from the data. The clustered data were then used for different statistical analyses. To make the analysis more intelligent, we also used the clustered data to forecast the daily traffic volume and the hourly page requests. Using a Takagi Sugeno Fuzzy Inference System (TSFIS) [5], we explored the prediction of the average daily traffic volume (1 to 5 days ahead) and the hourly page request traffic in a day (1, 12 and 24 hours ahead).

As a case study, we explored the Web user access patterns of Monash University's Web site. We made use of the statistical data provided by the Analog [16] Web access log file analyser, which is embedded in the University's Web server. The data and text information generated by Analog cover different aspects of the users' access log records; the weekly reports include traffic, types of files accessed, domain summary, operating system used, navigation summary and so on. Figures 1 and 2 illustrate the typical Web traffic patterns of Monash University, showing the daily and hourly volume of traffic (number of requests from different domains and number of page requests) for the week from 14-Jul-2002, 00:13 to 20-Jul-2002, 12:22. For more user access data logs please refer to [10].

Figure 1. Daily Web traffic of Monash University

Figure 2. Hourly Web traffic of Monash University

In a week, over 7 million visitors access data from the University's Web site [11]. Since the data cover different aspects (domains, files accessed, number of daily and hourly accesses, page requests etc.), it is a real challenge to find hidden information or to extract usage patterns. The sheer complexity and volume of the data call for hybrid intelligent systems for information analysis and trend forecasting.

In the subsequent section, we present the structure of the proposed concurrent neuro-fuzzy model for mining Web access patterns. In Section 3, we present the analysis of the clustered Web data using SOM, followed by the modelling of the TSFIS to forecast usage pattern trends in Section 4. Finally, some conclusions and future work are given.

2. Hybrid Neuro-Fuzzy Approach for Web Traffic Mining

The hybrid framework combines a SOM and a FIS operating in a concurrent environment, as illustrated in Figure 3. In a concurrent model, the neural network assists the fuzzy system continuously to determine the required parameters, especially when certain input variables cannot be measured directly. Such combinations do not optimise the fuzzy system but only help to improve the performance of the overall system [18]. Learning takes place only in the neural network, and the fuzzy system remains unchanged during this phase. The pre-processed data (after cleaning and scaling) are fed to the SOM to identify the data clusters. The clustering phase is based on SOM, an unsupervised learning algorithm [12] that accepts input objects described by their features and places them on a two-dimensional (2D) map in such a way that similar objects are placed close together. Referring to Figure 4, data X, Y and Z may be segregated into three clusters according to the SOM algorithm. The clustered data are then used by the Web Usage Data Analyser (WUDA) to discover different patterns and provide useful information to the Web analyst.

Figure 3. Architecture of the concurrent neuro-fuzzy model for Web pattern analysis (blocks: Web log file data, data pre-processing, Self Organising Map, Web usage data clusters, Web Usage Data Analyser, Fuzzy Inference System, Web usage pattern forecast)

The FIS is used to forecast the Web traffic patterns on an hourly and daily basis. FIS is a popular computing framework based on the concepts of fuzzy set theory, fuzzy if-then rules and fuzzy reasoning. The basic structure of the FIS consists of three conceptual components: a rule base, which contains a selection of fuzzy rules; a database, which defines the membership functions used in the fuzzy rules; and a reasoning mechanism, which performs the inference procedure upon the rules and given facts to derive a reasonable output or conclusion. As shown in Figure 4, data X is strongly associated with Cluster 3, but data Y and Z have weak associations with other clusters. For example, data Z is associated with Cluster 1 but can also be considered to have some weak association with Clusters 2 and 3. The degree of association of the data with a particular cluster is modelled as an additional fuzzy variable. We used the Takagi-Sugeno FIS to derive the rule conclusions [5]. The FIS could forecast the hourly and daily Web traffic. An important advantage of the FIS is the interpretability of the developed model in the form of simple if-then rules.

Figure 4. Data association with clusters

3. Data Clustering and Experimental Analysis Using SOM

The data source selected for our approach is the single-site [11] Web traffic data generated by the Analog Web access log file analyser [16]. It is usual practice to embed Web trackers to analyse the user access logs. After browsing through some of the features of the best trackers available [14][16][3][2], it is easy to conclude that, beyond generating statistical data and text, they do not help to find much meaningful information.

Data Pre-processing

In this research, we used the data from 17 February 2002 to 07 July 2002. Selecting the useful data is an important task in the data pre-processing block. After some preliminary analysis, we selected the statistical data comprising domain byte requests, hourly page requests and daily page requests to generate the cluster models for finding Web users' usage patterns. It is also important to remove irrelevant noisy data in order to build a precise model. Further, the datasets were scaled to 0-1.
Besides the two inputs, volume of requests and volume of pages (bytes), we also included an additional input, an index number, to distinguish the time sequence of the data. The most recently accessed data were indexed higher, while the least recently accessed data were placed at the bottom [17].

Data Clustering Using SOM

During working days, there are millions of user requests with different interests from countries all around the world. The huge volume and the dynamic nature are two important features of Web data, and both make it technically challenging to create appropriate Web mining models. SOM allows different data to be grouped together based on similar characteristics.
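Before clustering, the input features described in the pre-processing step (requests, pages and a time index, all scaled to 0-1) could be prepared roughly as in the sketch below. The function names and the sample values are hypothetical, and the sketch assumes that the records are ordered oldest first so that the most recent record receives the highest index.

```python
import numpy as np

def min_max_scale(column):
    """Scale a 1-D array linearly into [0, 1]; a constant column maps to zeros."""
    column = np.asarray(column, dtype=float)
    span = column.max() - column.min()
    if span == 0:
        return np.zeros_like(column)
    return (column - column.min()) / span

def build_features(requests, pages):
    """Stack scaled requests, scaled pages (bytes) and a scaled time index.

    Assumption: records are ordered oldest first, so the newest record
    gets the highest index value.
    """
    scaled_requests = min_max_scale(requests)
    scaled_pages = min_max_scale(pages)
    index = min_max_scale(np.arange(len(scaled_requests)))
    return np.column_stack([scaled_requests, scaled_pages, index])

# Made-up illustrative values, not data from the study.
features = build_features([120, 300, 210, 480], [1.5e6, 4.0e6, 2.2e6, 6.5e6])
```

Each row of `features` is then one input vector for the clustering stage.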

The SOM algorithm for forming the Web usage or access patterns map is given below:

1. Initialise the weights $w_{ij}$, the neighbourhood size $N_m(0)$ and the parameter functions $\alpha(t)$ and $\sigma^2(t)$.
2. Select the training term vector $x_i$ at random for the input layer and calculate the similarity (distance) $d_j$ of this input to the weight $w_j$ of each node $j$:

$$ d_j = \lVert x - w_j \rVert = \sqrt{\sum_{i=1}^{n} (x_i - w_{ij})^2} \qquad (1) $$

3. Select the node with the minimum distance as the winner $m$.
4. Update the weights connecting the input layer to the winning node and its neighbouring nodes by the learning rule

$$ w_{ij}(t+1) = w_{ij}(t) + c\,[x_i - w_{ij}(t)] \qquad (2) $$

where $c = \alpha(t)\exp\!\left(-\lVert r_i - r_m \rVert / \sigma^2(t)\right)$ for all nodes $j$ in $N_m(t)$.
5. Repeat steps 2-4, increasing $t$ by 1 at a time and decreasing the neighbourhood size, $\alpha(t)$ and $\sigma^2(t)$, until the weights are stabilised.
6. Map each term to a node on the SOM.
7. Label each winning node with an associated text term.

A 2D map of Web usage patterns with different clusters is formed after the training process. The related transaction entries are grouped into the same cluster, and the relationship between different clusters is explicitly shown on the map. We used Viscovery SOMine [4] to simulate the SOM. All the records were processed using SOM, and the clustering results were obtained after the unsupervised learning process. We adopted a trial and error approach, comparing the normalised distortion and quantization error to decide the various parameter settings of the SOM algorithm. We finally chose the parameter setting that minimised both the normalised distortion and the quantization error. The obtained 2D cluster map showing five different clusters according to the country of origin (domain) is shown in Figure 5.

Figure 5. Clustering results of daily number of requests according to the domains
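The SOM training steps listed above can be sketched in NumPy as follows. This is an illustrative sketch only, not the Viscovery SOMine implementation used in the study: the map size, the linear decay schedules for alpha(t), sigma^2(t) and the neighbourhood radius, and the reading of r as a node's position on the 2D grid are all assumptions made for the example.

```python
import numpy as np

def train_som(data, rows=4, cols=4, steps=500, alpha0=0.5, sigma0=2.0, seed=0):
    """Train a small 2D SOM; returns (weights, grid) with one row per node."""
    rng = np.random.default_rng(seed)
    data = np.asarray(data, dtype=float)
    # Step 1: random initial weights w_j and fixed 2D grid positions r_j.
    weights = rng.random((rows * cols, data.shape[1]))
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    radius0 = max(rows, cols) / 2.0
    for t in range(steps):
        decay = 1.0 - t / steps                 # assumed linear decay schedule
        alpha = alpha0 * decay                  # learning rate alpha(t)
        sigma2 = (sigma0 * decay) ** 2 + 1e-3   # sigma^2(t), kept positive
        radius = radius0 * decay + 1.0          # neighbourhood size N_m(t)
        # Step 2: pick a random input and compute the distances of Eq. (1).
        x = data[rng.integers(len(data))]
        d = np.sqrt(((x - weights) ** 2).sum(axis=1))
        # Step 3: the winner m is the node with the minimum distance.
        m = int(np.argmin(d))
        # Step 4: update the winner and its neighbours with Eq. (2).
        grid_dist = np.sqrt(((grid - grid[m]) ** 2).sum(axis=1))
        c = alpha * np.exp(-grid_dist / sigma2)
        c[grid_dist > radius] = 0.0             # nodes outside N_m(t) stay unchanged
        weights += c[:, None] * (x - weights)
    return weights, grid

def map_to_nodes(data, weights):
    """Step 6: assign each record to the index of its nearest node."""
    data = np.asarray(data, dtype=float)
    d = np.sqrt(((data[:, None, :] - weights[None, :, :]) ** 2).sum(axis=2))
    return d.argmin(axis=1)

# Made-up inputs already scaled to 0-1 (three features per record).
data = np.random.default_rng(1).random((60, 3))
weights, grid = train_som(data)
nodes = map_to_nodes(data, weights)
```

Grouping adjacent nodes into labelled clusters, as done by the tool used in the study, is a separate step that this sketch does not attempt.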

3.1 WUDA to Discover Domain Patterns

Figure 6 depicts the number of requests allocated to each cluster (5 clusters in total) according to the country (domain) of origin. For clarity, we have used a logarithmic scale on the Y-axis.

Figure 6. WUDA: Daily number of requests according to the domain of origin

As evident from Figure 6, clusters 4 and 5 are separated from the rest: they hold the maximum number of requests, contain very few nodes compared to the others, and are spread along the whole time range. For clusters 1, 2 and 3, the patterns are very similar and are difficult to tell apart. The domain analysis reveals that cluster 5 contains only Australian domains and cluster 4 accounts only for *.com and *.net users. This is because the majority of the requests originated from Australian domains (60%), followed by *.com and *.net users (8%). The remaining 3 clusters were shared by users from different domains depending on the volume of requests and the volume of pages (bytes). As depicted in Figure 7, more useful information is available from clusters 1, 2 and 3 with reference to the time of accessing the Web site.

Figure 7. WUDA: Daily number of requests with reference to the time of access

Even though clusters 1, 2 and 3 have very similar request patterns, the time of access is separated very clearly, as shown in Figure 7. Cluster 2 accounts for the most recent visitors and cluster 3 represents the least recent visitors. Cluster 1 accounts for the users not covered by clusters 2 and 3.

3.2 WUDA to Discover Request Patterns

The training process generated 4 clusters for the hourly number of requests. The developed clusters are illustrated in Figure 8. The hour of day at which the request was made is indicated in each cluster.

Figure 8. Hourly requests clusters using SOM

From the developed clusters depicted in Figure 8, it is very difficult to tell the difference between the clusters, as the requests (according to the hour) are scattered. From Figures 9 (a) and (b) it may be concluded that clusters 2 and 3 have a much higher number of requests and pages (nearly double) than clusters 1 and 4.

Figure 9. WUDA: Comparison of hourly page traffic volume and page requests. (a) Number of hourly page requests; (b) number of hourly requests

Figure 10 depicts the clustering results of the data focusing on the hourly trends of the traffic. Clusters 2 and 3 are mainly responsible for the traffic during office hours (09:00 to 18:00), and clusters 1 and 4 account for the traffic during the University's off-peak hours. It is interesting to note that the access patterns for each hour could be analysed from the cluster results with reasonable classification features.

Figure 10. WUDA: Hourly traffic patterns

3.3 WUDA to Discover Daily Requests Clusters

Due to the dynamic nature of the WWW, it is difficult to understand the daily traffic pattern using conventional Web log analysers. We attempted to cluster the data depending on the total activity for each day of the week, using the volume of daily requests, pages and index value as input features. The training process using SOM generated 7 clusters, and the developed 2D map is shown in Figure 11.

Figure 11. Developed daily request clusters showing the days (2D map of Clusters 1 to 7, with each node labelled by day of the week)

WUDA reveals that the clusters are separated according to the time of access. Each cluster accounted for the requests or accesses during a certain period, as shown in Figure 12. The clusters are ranked as 2, 6, 1, 4, 3, 7 and 5 in descending order of access time. Further analysis of the daily records in each cluster also reveals some interesting patterns, as illustrated in Figure 13. Clusters 3 and 6 account for access records from Saturday and Sunday. Cluster 1 was separated because it only covered the first few weekdays (mostly Monday to Thursday). The biggest group of clusters, consisting of 2, 4, 5 and 7, accounted for the transactions from Monday to Friday.

Figure 12. WUDA: Cluster ranking depending on the time of access

Figure 13. WUDA: Cluster ranking depending on the day of the week

4. Fuzzy Inference Systems

The world of information is surrounded by uncertainty and imprecision. The human reasoning process can handle inexact, uncertain and vague concepts in an appropriate manner. Usually, the human thinking, reasoning and perception process cannot be expressed precisely. These types of experiences can rarely be expressed or measured using statistical or probability theory. Fuzzy logic provides a framework to model uncertainty, the human way of thinking, reasoning and the perception process. Fuzzy if-then rules and fuzzy reasoning are the backbone of fuzzy inference systems, which are the most important modelling tools based on fuzzy set theory. Fuzzy modelling can be pursued using the following steps.

- Select relevant input and output variables. Determine the number of linguistic terms associated with each input/output variable. Also choose the appropriate family of parameterized membership functions, fuzzy operators, reasoning mechanism etc.
- Choose a specific type of fuzzy inference system.
- Design a collection of fuzzy if-then rules (knowledge base).

Figure 14. Takagi Sugeno fuzzy inference system using a min or product as T-norm operator

We made use of the Takagi Sugeno fuzzy inference scheme, in which the conclusion of a fuzzy rule is constituted by a weighted linear combination of the crisp inputs rather than a fuzzy set [5]. A basic Takagi-Sugeno fuzzy inference system is illustrated in Figure 14, and the if-then rules have the following structure:

$$ \text{if } x \text{ is } A_1 \text{ and } y \text{ is } B_1, \text{ then } z_1 = p_1 x + q_1 y + r_1 \qquad (3) $$

where $p_1$, $q_1$ and $r_1$ are linear parameters. A conventional FIS makes use of a model of the expert who is in a position to specify the most important properties of the process. Expert knowledge is often the main source for designing a FIS. According to the performance measure of the problem environment, the membership functions, rule bases and the inference mechanism are to be adapted. Evolutionary computation [19] and neural network learning techniques are used to adapt the various fuzzy parameters. Recently, a combination of evolutionary computation and neural network learning has also been investigated [18]. In this research, we used the Adaptive Neuro-Fuzzy Inference System (ANFIS) [13] framework, based on neural network learning, to fine-tune the rule antecedent parameters, and a least mean squares estimation to adapt the rule consequent parameters of the TSFIS. A step in the learning procedure has two parts. In the first part, the input patterns are propagated and the optimal conclusion parameters are estimated by an iterative least mean square procedure, while the antecedent parameters (membership functions) are assumed to be fixed for the current cycle through the training set. In the second part, the patterns are propagated again, and in this epoch back propagation is used to modify the antecedent parameters, while the conclusion parameters remain fixed. Please refer to [13] for more details.
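To make the rule structure of Eq. (3) and the first part of the learning step concrete, the sketch below evaluates a two-rule first-order Takagi-Sugeno system with the product T-norm and then estimates the consequent parameters with the antecedents held fixed. It is a simplified illustration rather than the ANFIS implementation of [13]: the Gaussian membership functions, the rule parameters and the use of a single batch least-squares solve (`numpy.linalg.lstsq`) in place of the iterative least mean square procedure are all assumptions.

```python
import numpy as np

def gaussian(v, centre, width):
    """Gaussian membership degree of v (an assumed membership shape)."""
    return np.exp(-0.5 * ((v - centre) / width) ** 2)

def firing_strengths(x, y, antecedents):
    """Normalised rule strengths using the product T-norm.

    antecedents: one ((cx, wx), (cy, wy)) pair per rule, i.e. the
    centre and width of the membership functions A_i and B_i.
    """
    w = np.array([gaussian(x, *a) * gaussian(y, *b) for a, b in antecedents])
    return w / w.sum(axis=0)

def ts_infer(x, y, antecedents, consequents):
    """Weighted average of the rule outputs z_i = p_i*x + q_i*y + r_i."""
    wbar = firing_strengths(x, y, antecedents)
    z = np.array([p * x + q * y + r for p, q, r in consequents])
    return (wbar * z).sum(axis=0)

def fit_consequents(x, y, target, antecedents):
    """Estimate (p_i, q_i, r_i) by least squares with fixed antecedents."""
    wbar = firing_strengths(x, y, antecedents)            # (rules, samples)
    design = np.concatenate([wbar * x, wbar * y, wbar], axis=0).T
    theta, *_ = np.linalg.lstsq(design, target, rcond=None)
    n = len(antecedents)
    return [(theta[i], theta[n + i], theta[2 * n + i]) for i in range(n)]

# Hypothetical two-rule system on inputs scaled to 0-1.
antecedents = [((0.0, 0.5), (0.0, 0.5)), ((1.0, 0.5), (1.0, 0.5))]
consequents = [(0.0, 0.0, 1.0), (0.0, 0.0, 3.0)]
midpoint_output = ts_infer(0.5, 0.5, antecedents, consequents)

# Recover the behaviour of a known model from synthetic samples.
rng = np.random.default_rng(0)
xs, ys = rng.random(50), rng.random(50)
true_consequents = [(1.0, 2.0, 0.5), (-1.0, 0.5, 2.0)]
targets = ts_infer(xs, ys, antecedents, true_consequents)
fitted = fit_consequents(xs, ys, targets, antecedents)
predictions = ts_infer(xs, ys, antecedents, fitted)
```

At the midpoint both rules fire equally, so the output is the plain average of the two rule conclusions. The second part of the learning step (back propagation on the antecedent parameters) is not shown.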
Design and Experimentation Results

We used the popular grid partitioning method (clustering) to generate the initial rule base. This partition strategy requires only a small number of membership functions for each input. The technique is illustrated in Figure 15.
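Grid partitioning creates one rule for every combination of linguistic terms across the inputs, so the rule count is the number of terms raised to the number of inputs. The sketch below enumerates such a rule grid and evaluates one rule with trapezoidal membership functions and the min T-norm. The term names, the trapezoid corner values and the rule ordering are hypothetical choices for illustration. With three terms on each of four inputs the grid has 3^4 = 81 cells, which would be consistent with the 81 rules reported in the experiments.

```python
from itertools import product

LABELS = ("small", "medium", "large")

# Hypothetical trapezoid corners (a, b, c, d) for inputs scaled to 0-1.
TERMS = {
    "small": (-1.0, -0.5, 0.2, 0.4),
    "medium": (0.2, 0.4, 0.6, 0.8),
    "large": (0.6, 0.8, 1.5, 2.0),
}

def trapezoid(v, a, b, c, d):
    """Trapezoidal membership: 0 outside (a, d), 1 on [b, c], linear between."""
    if v <= a or v >= d:
        return 0.0
    if v < b:
        return (v - a) / (b - a)
    if v <= c:
        return 1.0
    return (d - v) / (d - c)

def grid_rules(n_inputs):
    """One rule antecedent per combination of terms across the inputs."""
    return list(product(LABELS, repeat=n_inputs))

def firing_strength(inputs, rule):
    """Min T-norm over the membership degrees of each input in its term."""
    return min(trapezoid(v, *TERMS[label]) for v, label in zip(inputs, rule))

rules_2d = grid_rules(2)   # 3 terms on 2 inputs
rules_4d = grid_rules(4)   # 3 terms on 4 inputs
strength = firing_strength((0.5, 0.9), ("medium", "large"))
```

Because the number of rules grows exponentially with the number of inputs, this strategy suits models with only a few inputs, which matches the remark that few membership functions per input are needed.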

Figure 15. Example showing how the 2 dimensional spaces are partitioned using 3 trapezoidal membership functions per input dimension. A simple if-then rule will appear as: if input-1 is medium and input-2 is large, then rule R8 is fired.

Besides the inputs volume of requests, volume of pages (bytes) and index number, we also used the cluster location information provided by the SOM output. The data were re-indexed based on the cluster information. We attempted to develop fuzzy inference models to predict (a few time steps ahead) the Web traffic volume on an hourly and daily basis. We used the data from 17 February 2002 to 30 June 2002 for training and the data from 01 July 2002 to 06 July 2002 for testing and validation purposes.

Daily Traffic Prediction

We used the MATLAB environment to simulate the various experiments. Given the daily traffic volume of a particular day, the developed model could predict the traffic volume up to five days ahead. Three membership functions were assigned to each input variable. The grid based partitioning method generated 81 fuzzy if-then rules, and the rule antecedent/consequent parameters were learned after 0 epochs. We also investigated the daily Web traffic prediction performance without the cluster information input variable. Table 1 summarizes the performance of the fuzzy inference system for training and test data.

Table 1. Training and test performance for Web traffic volume forecast, reported as Root Mean Squared Error (RMSE) for the Fuzzy Inference System with and without cluster information (training and test columns for each), for forecast periods of one day, two days, three days, four days and five days.

Figures 16 (a), (b), (c), (d) and (e) depict the test results for the one day, two days, three days, four days and five days ahead forecasts of daily Web traffic volume.
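The forecasting set-up above (predicting a few steps ahead and scoring with RMSE) can be illustrated with the following minimal sketch. The helper names and the toy series are hypothetical; the sketch only shows how input and target pairs for a given forecast horizon can be formed from a traffic series and how the RMSE of a forecast is computed, not how the FIS itself is trained.

```python
import numpy as np

def make_horizon_pairs(series, horizon):
    """Pair each value with the value observed `horizon` steps later."""
    series = np.asarray(series, dtype=float)
    if horizon < 1 or horizon >= len(series):
        raise ValueError("horizon must be between 1 and len(series) - 1")
    return series[:-horizon], series[horizon:]

def rmse(actual, predicted):
    """Root mean squared error between two equally long sequences."""
    actual = np.asarray(actual, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((actual - predicted) ** 2)))

# Toy series: a two-steps-ahead set-up scored against a naive baseline
# that simply repeats the current value as the forecast.
inputs, targets = make_horizon_pairs([1, 2, 3, 4, 5, 6], horizon=2)
baseline_error = rmse(targets, inputs)
```

A naive baseline of this kind is one possible reference point when comparing models with and without the cluster information input.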

Figure 16 (a)-(e). Test results of daily forecast of Web traffic volume

Hourly Page Request Forecast

Three membership functions were assigned to each input variable. The grid based partitioning method generated 81 fuzzy if-then rules, and the rule antecedent/consequent parameters were learned after 40 epochs. We also investigated the prediction performance for the volume of hourly page requests without the cluster information input variable. Table 2 summarizes the performance of the FIS for training and test data. Figures 17 (a), (b) and (c) illustrate the test results for the 1 hour, 12 hours and 24 hours ahead forecasts of the volume of hourly page requests.

Table 2. Training and test performance for volume of hourly page requests forecast, reported as Root Mean Squared Error (RMSE) for the Fuzzy Inference System with and without cluster information (training and test columns for each), for forecast periods of 1 hour, 12 hours and 24 hours.

Figure 17 (a)-(c). Test results of hourly forecast of volume of page requests

5. Conclusions and Future Work

The discovery of useful knowledge, user information and access patterns allows Web based organisations to predict user access patterns, helps in future developments and maintenance planning, and makes it possible to target more rigorous advertising campaigns at groups of users [1]. This case study on Monash University's Web access patterns reveals the necessity of incorporating computational intelligence techniques for mining useful information. WUDA of the SOM data clusters provided several useful pieces of information related to the user access patterns. The developed FIS could predict the daily Web traffic and hourly page requests within reasonable error limits. Our experimental results also reveal the importance of the cluster information in improving the forecast accuracy of the FIS. These techniques might help website tracker software vendors to provide more useful information to their users.

We relied on the statistical/text data provided by the Analog [16] software embedded in the University's Web server [10]. Analog generates the statistical data by analysing the access data logs. Due to incomplete details, we had to analyse the usage patterns for different aspects separately, which prevented us from linking common information between the different aspects, trends, patterns etc. For example, the domain requests and the daily or hourly requests are all stand-alone information and are not interconnected. Therefore, a direct analysis of the Web access logs might be more helpful. We believe that if the detailed access information could cover different interlinked features, then the usage patterns would be more comprehensive and useful. Even the access or usage patterns for a particular domain within a particular time period could then be analysed and predicted for marketing segment analysis and so on.

In this research, we considered only the Web traffic data during the University's peak working time. Our future research will also incorporate off-peak months (summer semesters) and so on. We also plan to incorporate more data mining techniques and improve the functional aspects of the concurrent neuro-fuzzy approach.

References

[1] Zhang Y.Q. and Lin T.Y., Computational Web Intelligence (CWI): Synergy of Computational Intelligence and Web Technology, 2002 World Congress in Computational Intelligence, IEEE Press, pp.72-7.
[2] WebSTAT Web Traffic Analyser (accessed on 27 July 2002).
[3] Website Tracker (accessed on 27 July 2002).
[4] Viscovery SOMine (accessed on 27 July 2002).
[5] Sugeno M., Industrial Applications of Fuzzy Control, Elsevier Science Pub Co., 1985.
[6] Srivastava J., Cooley R., Deshpande M. and Tan P.N., Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data, SIGKDD Explorations, 1(2).
[7] Pirolli P., Pitkow J. and Rao R., Silk From a Sow's Ear: Extracting Usable Structures from the Web, In Proceedings on Human Factors in Computing Systems (CHI-96).
[8] Pal S.K., Talwar V. and Mitra P., Web Mining in Soft Computing Framework: Relevance, State of the Art and Future Directions, IEEE Transactions on Neural Networks.
[9] Ng A. and Smith K.A., Web usage mining by a self-organizing map, C. Dagli et al. (Eds.), Smart Engineering System Design: Neural Networks, Fuzzy Logic, Evolutionary Programming, Data Mining and Complex Systems, ASME Press, vol. 10.
[10] Monash University Weekly Website User Access Statistics (accessed on 27 July 2002).
[11] Monash University Server Usage Statistics (accessed on 27 July 2002).
[12] Kohonen T., Self-Organizing Maps, Springer Verlag Germany, 1995.
[13] Jang R., Neuro-Fuzzy Modeling: Architectures, Analyses and Applications, PhD Thesis, University of California, Berkeley.
[14] Hitbox Central Web Traffic Analysis (accessed on 27 July 2002).
[15] Cooley R., Srivastava J. and Mobasher B., Web Mining: Information and Pattern Discovery on the World Wide Web, In Proceedings of the 9th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'97).
[16] Analog Website Tracker (accessed on 27 July 2002).
[17] Aggarwal C., Wolf J.L. and Yu P.S., Caching on the World Wide Web, IEEE Trans. on Knowledge and Data Eng., vol. 11, no. 1, Feb.
[18] Abraham A., Neuro-Fuzzy Systems: State-of-the-Art Modeling Techniques, Connectionist Models of Neurons, Learning Processes, and Artificial Intelligence, Springer-Verlag Germany, Jose Mira and Alberto Prieto (Eds.), Granada, Spain.
[19] Abraham A. and Nath B., Evolutionary Design of Fuzzy Control Systems - An Hybrid Approach, In Proceedings of The Sixth International Conference on Control, Automation, Robotics and Vision (CD ROM Proceeding), Wang J.L. (Ed.), Singapore.

Soft Computing Paradigms for Web Access Pattern Analysis

Soft Computing Paradigms for Web Access Pattern Analysis 15 Soft Computing Paradigms for Web Access Pattern Analysis Xiaozhe Wang 1, Ajith Abraham 2 and Kate A. Smith 1 1 School of Business Systems, Faculty of Information Technology, Monash University, Clayton,

More information

Intelligent web traffic mining and analysis

Intelligent web traffic mining and analysis Journal of Network and Computer Applications 28 (2005) 147 165 www.elsevier.com/locate/jnca Intelligent web traffic mining and analysis Xiaozhe Wang a, *, Ajith Abraham b, Kate A. Smith a a School of Business

More information

A Study of Web Log Analysis Using Clustering Techniques

A Study of Web Log Analysis Using Clustering Techniques A Study of Web Log Analysis Using Clustering Techniques Hemanshu Rana 1, Mayank Patel 2 Assistant Professor, Dept of CSE, M.G Institute of Technical Education, Gujarat India 1 Assistant Professor, Dept

More information

Key words: web usage mining, clustering, e-marketing and e-business, business intelligence; hybrid soft computing.

Key words: web usage mining, clustering, e-marketing and e-business, business intelligence; hybrid soft computing. Volume 5, Issue 3, March 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue:

More information

Mobile Phone APP Software Browsing Behavior using Clustering Analysis

Mobile Phone APP Software Browsing Behavior using Clustering Analysis Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis

More information

Real Stock Trading Using Soft Computing Models

Real Stock Trading Using Soft Computing Models Real Stock Trading Using Soft Computing Models Brent Doeksen 1, Ajith Abraham 2, Johnson Thomas 1 and Marcin Paprzycki 1 1 Computer Science Department, Oklahoma State University, OK 74106, USA, 2 School

More information

A Study of Web Traffic Analysis

A Study of Web Traffic Analysis Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology ISSN 2320 088X IJCSMC, Vol. 3, Issue.

More information

Visualization of Breast Cancer Data by SOM Component Planes

Visualization of Breast Cancer Data by SOM Component Planes International Journal of Science and Technology Volume 3 No. 2, February, 2014 Visualization of Breast Cancer Data by SOM Component Planes P.Venkatesan. 1, M.Mullai 2 1 Department of Statistics,NIRT(Indian

More information

NTC Project: S01-PH10 (formerly I01-P10) 1 Forecasting Women s Apparel Sales Using Mathematical Modeling

NTC Project: S01-PH10 (formerly I01-P10) 1 Forecasting Women s Apparel Sales Using Mathematical Modeling 1 Forecasting Women s Apparel Sales Using Mathematical Modeling Celia Frank* 1, Balaji Vemulapalli 1, Les M. Sztandera 2, Amar Raheja 3 1 School of Textiles and Materials Technology 2 Computer Information

More information
