FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT




ANURADHA.T, Assoc. Prof., atadiparty@yahoo.co.in
SRI SAI KRISHNA.A, saikrishna.gjc@gmail.com
SATYATEJ.K, satyatej.koganti@gmail.com
NAGA ANIL KUMAR.G, anil.naidu58@gmail.com

Abstract
Data mining extracts meaningful information from large data sets and uncovers significant relationships among the variables stored in them, drawing on methods from statistics, artificial intelligence, and management. In this paper, a data mining technique known as the association technique is applied to analyze patterns that describe the relationships among the books most frequently taken by students. The students' library records were studied with respect to several factors, such as subject requirements, the number of books issued, and the duration for which each book was held. We recommend that this correlated information be conveyed to the library department so that it can improve issue performance by avoiding delays and maintain more copies of the books in highest demand, according to the current subjects of the different streams of students.

Keywords: frequent pattern; association technique; minimal support.

I. Introduction
The actual data mining task is the automatic or semi-automatic analysis of large quantities of data in order to extract previously unknown interesting patterns such as groups of data records (cluster analysis), unusual records (anomaly detection), and dependencies (association rule mining) [1]. Library management is concerned with managing resources, which mainly include books, manuscripts, and journals, and with providing effective and efficient services to users. Manually managing the books and other resources in a library and keeping track of every book accessed by a user is a tedious job, so technological support is often expected [7].
Further, keeping track of the books issued and returned across the counters, and of the journals, periodicals, and manuscripts consulted by users, requires additional bookkeeping on additional parameters, which is generally not done in a typical library that operates manually. The situation becomes acute when the library procures resources.

II. Data Mining
Data mining techniques operate on large volumes of data to discover hidden patterns and relationships that are helpful in decision making [7]. Data mining software allows users to analyze data from different dimensions, categorize it, and summarize the relationships identified during the mining process [1]. Different data mining techniques are used in various fields such as medicine, statistical analysis, engineering, education, banking, marketing, and sales.

ISSN : 0975-3397 Vol. 3 No. 11 November 2011 3582

A. Frequent Patterns

Frequent itemsets play an essential role in many data mining tasks that try to find interesting patterns in databases, such as association rules, correlations, sequences, episodes, classifiers, and clusters. Of these, the mining of association rules is one of the most popular problems [6]. The original motivation for searching for association rules came from the need to analyze so-called supermarket transaction data, that is, to examine customer behavior in terms of the products purchased [5]. Association rules describe how often items are purchased together.

B. Associations in Data Mining
In data mining, association rule learning is a popular and well-researched method for discovering interesting relations between variables in large databases [5]. Piatetsky-Shapiro describes analyzing and presenting strong rules discovered in databases using different measures of interestingness [3]. Based on the concept of strong rules, Agrawal et al. [6] introduced association rules for discovering regularities between products in large-scale transaction data recorded by point-of-sale (POS) systems in supermarkets. For example, the rule {Onions, Potatoes} => {Burger} found in the sales data of a supermarket would indicate that a customer who buys onions and potatoes together is likely to also buy hamburger meat [3]. Such information can be used as the basis for decisions about marketing activities such as promotional pricing or product placement. In addition to this example from market basket analysis [2], association rules are employed today in many application areas, including Web usage mining, intrusion detection, and bioinformatics.

C. Data Mining in Library Management System
A library is an essential element in the betterment and progress of a student's career. It enables the students of a college or university to stay up to date in their academic subjects.
Mining in an educational environment is called Educational Data Mining. It is concerned with developing new methods to discover knowledge from educational databases, such as library, sports, health, and management records, in order to analyze students' trends and behavior towards particular subjects as examinations approach [1]. Sufficiently detailed information from the library maintenance system helps students achieve quality objectives in their academic curriculum, and the data mining methodology used in library management can help bridge this knowledge gap in the higher education system.

TABLE 1. Sample table showing some of the books issued to students

Student No.   Cryptography and Network Security   ANSI C   DBMS   Data Mining Techniques
1             1                                   1        0      1
2             0                                   1        1      0
3             0                                   0        1      1
4             1                                   0        0      1

In the above table, a 1 indicates that the student has taken the book and a 0 that the student has not. In the same way, about 2000 student records for seven different books are considered.

III. Proposed Model
In a university library, the books issued to each student are determined from the automation records of the university library. The proposed model mainly deals with finding relationships among the books most frequently taken by students in a given semester, subject to the stipulated time period and the maximum number of books issued. In general, books are issued for a period of 15 days, with an option for students to renew them for a further period. At the same time, a book issued to a student can be returned

on any of these 15 days. A student can take a maximum of four different books within that period. The proposed model predicts the books most often chosen by students, based on class automation as well as system information from the department. We considered about 2000 student records, and the books considered were the important books required by the students for that particular semester.

Initially, we calculated the count of each of the seven books taken individually. The books whose count was less than the minimum support count were then eliminated from further processing. The counts of the books were as follows:

1. Computer Networks : 1098
2. Compiler Design : 965
3. Data Mining : 1065
4. Operating Systems : 688
5. Web designing : 257
6. Java : 219
7. Software Engineering : 896

From the above data, the counts of both Web designing and Java are less than the minimum support count of 300, so they are eliminated from further processing of frequent patterns.

Fig.1. Representation of individual books issued to students

For the frequent patterns, we consider the remaining five books and apply the data mining technique to them. For each pair of books, we calculated how many of the 2000 students had taken both books together. After calculating the count of each pair, the process was similar to the one above: the frequent 2 patterns whose count was less than the minimum support count were eliminated. The counts of the various frequent 2 patterns are as follows:

1. COMPUTER NETWORKS and COMPILER DESIGN = 760
2. COMPUTER NETWORKS and DATA MINING = 870
3. COMPUTER NETWORKS and OPERATING SYSTEMS = 543
4. COMPUTER NETWORKS and SOFTWARE ENGINEERING = 396
5. COMPILER DESIGN and DATA MINING = 783
6. COMPILER DESIGN and OPERATING SYSTEMS = 287
7. COMPILER DESIGN and SOFTWARE ENGINEERING = 246
8. DATA MINING and OPERATING SYSTEMS = 340
9. DATA MINING and SOFTWARE ENGINEERING = 233
10. OPERATING SYSTEMS and SOFTWARE ENGINEERING = 269

The combinations COMPILER DESIGN and OPERATING SYSTEMS, COMPILER DESIGN and SOFTWARE ENGINEERING, DATA MINING and SOFTWARE ENGINEERING, and OPERATING SYSTEMS and SOFTWARE ENGINEERING are not considered further because their counts are below the minimum support.
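The two counting-and-pruning passes described above can be sketched in Python as follows. This is a minimal illustration rather than the authors' implementation: the helper names `count_itemsets` and `prune` and the short column labels are hypothetical, the four 0/1 rows are the sample students of Table 1, and the book counts, pair counts, and minimum support count of 300 are copied from the text.

```python
from itertools import combinations

MIN_SUPPORT = 300  # minimum support count stated in the text


def count_itemsets(rows, books, size):
    """Count, for every combination of `size` books, the 0/1 rows that contain all of them."""
    counts = {}
    for combo in combinations(range(len(books)), size):
        key = tuple(books[i] for i in combo)
        counts[key] = sum(all(row[i] == 1 for i in combo) for row in rows)
    return counts


def prune(counts, min_support=MIN_SUPPORT):
    """Keep only the entries whose count reaches the minimum support."""
    return {key: n for key, n in counts.items() if n >= min_support}


# Counting on the four sample rows of Table 1 (column labels are shorthand).
table1_books = ["CNS", "C", "DBMS", "DMT"]
table1_rows = [
    [1, 1, 0, 1],
    [0, 1, 1, 0],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
]
toy_pair_counts = count_itemsets(table1_rows, table1_books, 2)

# Pruning on the counts reported for the 2000 student records.
single_counts = {
    "Computer Networks": 1098,
    "Compiler Design": 965,
    "Data Mining": 1065,
    "Operating Systems": 688,
    "Web designing": 257,
    "Java": 219,
    "Software Engineering": 896,
}
pair_counts = {
    ("Computer Networks", "Compiler Design"): 760,
    ("Computer Networks", "Data Mining"): 870,
    ("Computer Networks", "Operating Systems"): 543,
    ("Computer Networks", "Software Engineering"): 396,
    ("Compiler Design", "Data Mining"): 783,
    ("Compiler Design", "Operating Systems"): 287,
    ("Compiler Design", "Software Engineering"): 246,
    ("Data Mining", "Operating Systems"): 340,
    ("Data Mining", "Software Engineering"): 233,
    ("Operating Systems", "Software Engineering"): 269,
}

frequent_books = prune(single_counts)
frequent_pairs = prune(pair_counts)

for pair, n in sorted(frequent_pairs.items(), key=lambda kv: -kv[1]):
    print(pair, n)
```

On the reported counts, `prune` drops Web designing and Java in the first pass and keeps six of the ten pairs in the second, which matches the eliminations described above.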

Fig.2. Representation of Frequent 2 pattern books

Finally, for the frequent 3 patterns, only 2 different sets are obtained after considering the minimum support and minimum confidence values: (COMPUTER NETWORKS, COMPILER DESIGN, DATA MINING) and (COMPUTER NETWORKS, DATA MINING, OPERATING SYSTEMS).

Fig.3. Representation of Frequent 3 pattern books

IV. Results
On applying the data mining techniques to the 2000 available records, we obtained the frequent patterns (CN, CD, Data Mining) and (CN, Data Mining, OS). We can conclude that, among 2000 students, 356 students took all three books of (CN, CD, Data Mining) and 320 students took all three books of (CN, Data Mining, OS). No frequent 4 pattern book set is obtained, as the count of each combination is less than the required minimum support value.

V. Conclusion
Applying association techniques to library data in this way helps the library management buy and maintain the books that are most in demand and hold them in larger numbers than the books students prefer less. This helps the maximum number of students obtain the books in the frequent patterns and also supports efficient retrieval of books. A user can retrieve the books he or she needs more quickly, which saves a lot of time compared with searching for the various books individually.
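The step from frequent 2 patterns to frequent 3 patterns in Section III can be illustrated with the usual Apriori-style candidate check: a set of three books can only be frequent if each of its pairs is frequent. The Python sketch below is an illustration under stated assumptions, not the authors' code. The helpers `generate_candidates` and `confidence` are hypothetical, the abbreviations OS and SE stand for Operating Systems and Software Engineering, and the inputs are the six pairs that met the minimum support count of 300 together with the reported count of 356 for (CN, CD, Data Mining). The minimum confidence value is not stated in the paper, so the confidences are computed but not compared against a threshold.

```python
from itertools import combinations


def generate_candidates(frequent, size):
    """Return itemsets of `size` whose every (size - 1)-subset is in `frequent`."""
    items = sorted(set().union(*frequent))
    return [
        frozenset(combo)
        for combo in combinations(items, size)
        if all(frozenset(sub) in frequent for sub in combinations(combo, size - 1))
    ]


def confidence(count_all, count_antecedent):
    """Confidence of X => Y: count(X and Y together) / count(X)."""
    return count_all / count_antecedent


# Pairs that met the minimum support count of 300, with their reported counts.
pair_counts = {
    frozenset({"CN", "CD"}): 760,
    frozenset({"CN", "DM"}): 870,
    frozenset({"CN", "OS"}): 543,
    frozenset({"CN", "SE"}): 396,
    frozenset({"CD", "DM"}): 783,
    frozenset({"DM", "OS"}): 340,
}

triples = generate_candidates(set(pair_counts), 3)
quads = generate_candidates(set(triples), 4)

# Rules derived from (CN, CD, DM), whose reported count is 356.
count_cn_cd_dm = 356
rule_confidences = {
    "{CN, DM} => {CD}": confidence(count_cn_cd_dm, pair_counts[frozenset({"CN", "DM"})]),
    "{CN, CD} => {DM}": confidence(count_cn_cd_dm, pair_counts[frozenset({"CN", "CD"})]),
    "{CD, DM} => {CN}": confidence(count_cn_cd_dm, pair_counts[frozenset({"CD", "DM"})]),
}

print([sorted(t) for t in triples])
print(quads)
for rule, value in rule_confidences.items():
    print(rule, round(value, 3))
```

With these inputs the check yields exactly the two 3-book sets named in Section III. No 4-book candidate survives, because Compiler Design and Operating Systems do not form a frequent pair.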

VI. Future Work
We have applied the data mining technique to support the proper functioning of library management. The work can be extended from the 7 books considered here to many more books and to the records of many thousands of students, so that the management of the most required books and their efficient retrieval can be carried out successfully. Similarly, with more books, the books can be divided by department and these techniques applied to each group, which would benefit the students of all departments.

References
[1] Anitha, V.; Sunitha Jeevan, C.; Kala, S. (2011): Library Management System Using Association Rule Mining. International Journal of Biotech Trends and Technology, May to June issue.
[2] Data on Market Basket analysis from Wikipedia. http://en.wikipedia.org/wiki/market_basket
[3] Fayyad, U.M.; Piatetsky-Shapiro, G.; Smyth, P. From Data Mining to Knowledge Discovery: An Overview. In Advances in Knowledge Discovery and Data Mining, eds. U.M. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, AAAI Press/The MIT Press, Menlo Park, CA, 1996, pp. 1-34.
[4] Frawley, W.J.; Piatetsky-Shapiro, G.; Matheus, C.J. Knowledge Discovery in Databases. AAAI/MIT Press, 1991.
[5] Khattak, A.M.; Khan, A.M.; Sungyoung, L. (2010): Analyzing Association Rule Mining and Clustering on Sales Day Data with XLMiner and Weka. International Journal of Database Theory and Application, Vol. 3, No. 1, March 2010.
[6] Agrawal, R.; Imielinski, T.; Swami, A. Mining Association Rules between Sets of Items in Large Databases. In Proc. of the ACM SIGMOD 1993 Int'l Conf. on Management of Data, pages 207-216, Washington D.C., USA, 1993.
[7] Shaeela, A.; Tasleem, M.; Ahsan, R.S.; Inayat Khan, M. (2010): Data Mining Model for Higher Education System. European Journal of Scientific Research, Vol. 43, No. 1 (2010), pp. 24-29.
Authors Profile
[1] T.ANURADHA Assoc. Prof. and doing research in DATA BASE MANAGEMENT STUDIES in KL UNIVERSITY, GUNTUR
[2] A.SRI SAI KRISHNA Student at, studying IVth year B.Tech in
[3] K.SATYATEJ Student at, studying IVth year B.Tech in
[4] G.NAGA ANIL KUMAR Student at, studying IVth year B.Tech in