Data Processing on Database Management Systems with Fuzzy Query




İrfan Şimşek 1 and Vedat Topuz 2

1 MSc, Sultançiftliği Primary School, Çekmeköy, 34788, Istanbul, Turkey
Ph.: (+90) 216 312 13 81; Fax: (+90) 216 429 29 10
irfansimsek@gmail.com
2 Asst. Prof. Dr., Marmara University Vocational School of Technical Sciences, Göztepe, 34722, Istanbul, Turkey
Ph.: (+90) 216 418 25 04; Fax: (+90) 216 418 25 05
vtopuz@marmara.edu.tr

M. Graña Romay et al. (Eds.): HAIS 2010, Part I, LNAI 6076, pp. 170-177, 2010. © Springer-Verlag Berlin Heidelberg 2010

Abstract. In this study, a fuzzy query tool (SQLf) for non-fuzzy database management systems was developed, and sample fuzzy queries were run on real data with it. The performance of SQLf was tested with data on Marmara University students' food grant applications. The data were collected in a MySQL database through a web form in which the students described their social and economic conditions for the food grant request. The form consists of questions which have fuzzy and crisp answers. The main purpose of the fuzzy query is to determine the students who deserve the grant. SQLf easily found the students eligible for the grant through predefined fuzzy values. The fuzzy query tool (SQLf) could also be used easily with other database systems such as Oracle and SQL Server.

Keywords: Fuzzy logic, fuzzy query, database.

1 Introduction

Database management systems have made significant progress in terms of functionality and performance since they were first designed in the 1960s. However, the query systems of relational database management systems, which are widespread today, are based on two-valued logic. In this logic, a record either meets the criteria or does not, so a query produces sets whose boundaries are sharp. This contradicts our natural way of thinking, because in daily life we are unable to differentiate some objects so sharply. For example, a person does not suddenly become short or tall because of a difference of a couple of millimeters [1-6]. Using fuzzy logic to describe such situations simplifies the query and yields a more accurate report. Today's database management systems are advanced in terms of performance and functionality, and almost all of them have their own high-level query systems. However, these query systems work with precise values or value intervals [7-9].

The fuzzy set theory, proposed by L.A. Zadeh, aims at processing indefinite and vague information. In other words, the concept of fuzziness refers to the state of ambiguity which stems from the lack of certainty. Fuzzy logic and fuzzy set theory play an important role in representing vague knowledge, and almost all of our expressions in daily language contain fuzziness (cold-hot, rich-poor, short-long, etc.) [10-13]. Ambiguity plays an important role in human thinking, especially in communication, inference, and in identifying and abstracting figures; this is where the importance of fuzzy theory appears. When we wish to make the user interfaces through which we communicate with machines more human-oriented, fuzzy theory becomes an effective tool [14].

The fuzzy query gives us the ability to evaluate imprecise data and to use expressions such as "old" or "rich", which do not imply certain quantities. If what we search for does not exist, the fuzzy query returns the nearest data. This is a very beneficial feature, especially if we do not have absolute information or the information we have is not quantitative [9, 15-17].

2 SQLf Fuzzy Query Software

The SQLf software was developed with reference to existing software for making fuzzy queries on database management systems, such as SummarySQL and FuzzyQuery. Figure 1 shows the relationship between the browser, PHP and the database server in the SQLf software. The task of the SQLf software is to make both classical and fuzzy queries on non-fuzzy database systems and to report the results. The software was written in the PHP programming language. It is available on the web at http://www.fuzzyquery.com.

Fig. 1. Relationship between the SQLf Software's Browser, PHP and Database Server

The components of the software are as follows:

1. The graphical interface that interacts with the user
2. Making connection settings to the database management system
3. Defining the criteria necessary for the query (Criteria Definition)
4. Defining fuzzy sets for table fields (Fuzzy Sets)
5. Monitoring the impact of hedges on the current fuzzy sets (Hedges)
6. Creating precise and fuzzy queries from the defined criteria (Query Design)
7. Controlling the queries created and determining the desired fields on the result table (Query Control)
8. Displaying the result table and the query statistics after running the controlled query according to the desired fields (Query Run)

The software is composed of two main sections, as shown in Figure 2, namely the fuzzification module and the query module.

Fig. 2. Main Parts of the System (fuzzification module; query module with query interpreter and query processor)

The fuzzification module: Since the database table on which the query is made is not fuzzy, a fuzzification operation is needed first. To this end, the user is shown the fields of the desired table and can then define fuzzy sets for any of them. There is no restriction on which fields the user can fuzzify, although fuzzification is generally applied to fields containing quantitative data. Fuzzy sets are defined as sets of pairs of elements and degrees of membership. So that the definitions can be reused, a set database was formed in which all the entered information is stored.

The query module: The query processor steps in after the defined queries have been controlled. The query processor, whose general structure is shown in Figure 3, finds the matching degree of each record and produces a report accordingly. Unlike in classical processing, the matching degree of a record is not either 0 or 1, but a number between 0 and 1.

Fig. 3. Essence of Database Querying (flowchart: database record and query; matching; matching degree; acceptable, i.e. above the threshold? If yes, report; if no, neglect the record)

3 Example: Marmara University Food Grant

The performance of SQLf was tested with Marmara University students' food grant data. The students filled in a web form describing their social and economic positions for the food grant request. This form consists of questions which have fuzzy and crisp answers. SQLf easily found the students eligible for grants with predefined fuzzy values. The fuzzy query tool (SQLf) was designed to work not only with this database, but also with other databases.

3.1 The Assessment Table

The Food Grant Database consists of three tables, namely student information, family information and contact information. Since our aim is to find the students who deserve the food grant, we conduct the assessment on the family information table, which contains the student's living conditions, the state of the family, and the other grants received. Table 1 shows the field names, field types and other features of the fields of the family information table, that is, the fields which will be assessed and the kind of information they contain.

Table 1. Family Information Table

Field Name    Field Type    Empty  Default  Explanation
Id            int(11)       No              Student ID
stofpa        tinyint(4)    No              State of parents (1-3) as a numerical value
numch         int(11)       No              Number of children in the family
numchattsch   int(11)       No              Number of children attending school
fathocc       tinyint(11)   No              Father's occupation: (1) Private, (2) Self-Employed, (3) Public, (4) Unemployed
mothocc       tinyint(11)   No              Mother's occupation
stfamhou      tinyint(4)    No              State of the house in which the family stays (1-4) as a numerical value
netinc        decimal(10)   No              Sum of the family's net income
scho1         varchar(20)   Yes    NULL     Names of the scholarships that the student receives, if any (scho1 to scho3)
scho2         varchar(20)   Yes    NULL
scho3         varchar(20)   Yes    NULL
noscho        tinyint(6)    No              Whether the student receives a scholarship from another institution, as 1 or 0
sthouse       tinyint(4)    No              State of the house in which the student currently stays (1-6) as a numerical value

3.2 Defining Fuzzy Sets

Before the criteria are defined, the fuzzy sets needed for the fuzzy criteria must be constituted. The fuzzy sets are stored in the fsql_fsets table in the MySQL database to be used later. The fuzzy sets constituted for our application sample are shown in Table 2.

Table 2. Food Grant Fuzzy Sets Table

Field          fsname             fsetform         fsalpha                  fmin             fmax
(field name)   (fuzzy set name)   (set form)       (alpha cut coefficient)  (minimum data)   (maximum data)
netinc         poor               decline          0                        0                1000
stofpa         bad                l.increasing     0                        1                3
numchattsch    very               growth           0                        0                20
numch          very               growth           0                        0                20
fathocc        bad                growth           0                        1                4
mothocc        bad                growth           0                        1                4
stfamhou       bad                decline          0                        1                4
sthouse        bad                decline          0                        1                6

3.3 Preparing the Criteria

After the fuzzy sets have been constituted, the criteria should be prepared in order to use these fuzzy sets in our query. The criteria are divided into two categories, namely the precise qualitative expressions and the fuzzy qualitative expressions. The processing steps for the fields for which criteria will be prepared are as follows:

1. The Criteria Definition section should be visited.
2. The relevant field should be selected from the fields section.
3. Since fuzzy qualitative expressions will be constituted, the fuzzy operator (@) should be selected from the operators section.
4. We do not need to select any hedge for our application sample. Thus, the expression <none> should be selected from the Hedges section.
5. From the value section, the set which we constituted earlier in the Fuzzy Sets section should be selected.

Figure 4 shows the fuzzy qualitative expressions prepared for the application sample. In total, four fuzzy qualitative expressions have been prepared.

Fig. 4. Criteria Definition Sections
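
The paper does not give the formulas behind the fsetform values of Table 2. As a minimal sketch, assuming that "decline" is a membership function falling linearly from 1 at fmin to 0 at fmax and that "growth" rises linearly over the same interval, a criterion such as netinc @ poor could be evaluated as below. The function names, the dictionary layout and the linear shapes are illustrative assumptions, not the SQLf implementation.

```python
def decline(x, fmin, fmax):
    """Membership falls linearly from 1 at fmin to 0 at fmax (assumed shape)."""
    if x <= fmin:
        return 1.0
    if x >= fmax:
        return 0.0
    return (fmax - x) / (fmax - fmin)


def growth(x, fmin, fmax):
    """Membership rises linearly from 0 at fmin to 1 at fmax (assumed shape)."""
    if x <= fmin:
        return 0.0
    if x >= fmax:
        return 1.0
    return (x - fmin) / (fmax - fmin)


# A few rows of Table 2 as (shape, fmin, fmax), keyed by (field, fsname).
# The key layout is hypothetical; SQLf stores these in the fsql_fsets table.
FUZZY_SETS = {
    ("netinc", "poor"): (decline, 0, 1000),
    ("numch", "very"): (growth, 0, 20),
    ("stfamhou", "bad"): (decline, 1, 4),
}


def membership(field, fsname, value):
    """Degree to which `value` of `field` belongs to the named fuzzy set."""
    shape, fmin, fmax = FUZZY_SETS[(field, fsname)]
    return shape(value, fmin, fmax)


# netinc @ poor for a family with a net income of 265
print(membership("netinc", "poor", 265))  # 0.735
```

Under these assumptions a lower income gives a higher degree of "poor", which matches the intent of the netinc row in Table 2; the actual SQLf shapes may differ.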

3.4 Constituting the Queries from the Prepared Criteria

Complex fuzzy qualitative expressions are constituted by connecting the simple fuzzy qualitative expressions prepared in the Criteria Definition section with AND or OR in the Create Query section. Figure 5 shows the complex fuzzy qualitative expressions constituted for the application sample.

Fig. 5. Query Design Sections

The query sentence can be either a simple single sentence or a complex sentence consisting of several simple sentences connected with AND or OR. For a complex sentence, the matching degree of each sub-sentence is calculated for each record, and the overall matching degree is obtained from these. The records whose matching degrees are above a defined lower limit are written to the output. The sentence or its sub-sentences need not be fuzzy. In that case, operators such as =, >, >=, <, <= and constant values are used in the query instead of hedges and fuzzy sets.

Fig. 6. Query Control Sections
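
The paper does not state which operators SQLf uses to combine the matching degrees of sub-sentences. A minimal sketch of the procedure in Section 3.4, assuming the standard min/max operators of fuzzy logic for AND/OR and treating a crisp sub-sentence as a degree of exactly 0 or 1, could look like this. All names and the sample degrees are hypothetical.

```python
def fuzzy_and(*degrees):
    """AND of matching degrees, assuming the standard minimum operator."""
    return min(degrees)


def fuzzy_or(*degrees):
    """OR of matching degrees, assuming the standard maximum operator."""
    return max(degrees)


def crisp(condition):
    """A non-fuzzy sub-sentence matches fully (1.0) or not at all (0.0)."""
    return 1.0 if condition else 0.0


def run_query(records, matcher, threshold):
    """Keep records whose overall degree exceeds the threshold, best first."""
    scored = [(matcher(record), record) for record in records]
    accepted = [(deg, rec) for deg, rec in scored if deg > threshold]
    return sorted(accepted, key=lambda pair: pair[0], reverse=True)


# Hypothetical records with already computed sub-sentence degrees.
records = [
    {"id": 1, "poor": 0.9, "bad_house": 0.6, "noscho": 0},
    {"id": 2, "poor": 0.1, "bad_house": 0.8, "noscho": 0},
    {"id": 3, "poor": 0.5, "bad_house": 0.3, "noscho": 1},
]


def matcher(r):
    # (poor AND bad_house) AND noscho = 0, with the last part crisp
    return fuzzy_and(r["poor"], r["bad_house"], crisp(r["noscho"] == 0))


result = run_query(records, matcher, threshold=0.2)
print([(deg, rec["id"]) for deg, rec in result])  # [(0.6, 1)]
```

Record 2 is dropped because its overall degree (0.1) is below the threshold, and record 3 because its crisp sub-sentence fails. Other operator pairs, such as product and probabilistic sum, would slot into the same structure.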

3.5 Controlling the Queries

Figure 6 shows the Query Control section. This section should be visited in order to control the SQL and SQLf expressions which appear after the expressions connected in the Query Design section have been approved. In this section, the user not only controls the expressions but also defines the settings for the result report. The boundary value is also determined in this section. We defined it as 0.2 in our application sample, which means that records whose degrees of membership are below 0.2 are ignored in the report.

3.6 Running the Queries and the Result Table

The results are obtained from the Run Query section. After the query, 88 out of 3645 people are listed. The µ(grant) field shows the degree to which each student, identified by student ID, deserves the grant according to the information used in the fuzzy query. µ(grant) takes values between 0 and 1, and a value of 1 means that the student fully deserves the grant. A part of the result table is presented in Table 3.

Table 3. Result Table

µ(grant)  ID    sthouse  stfamhou  numch  numchattsch  netinc
0.86      2282  1        1         3      3            265
0.82      2301  1        1         3      3            300
0.778     2610  2        1         4      4            170
0.777     1704  2        2         5      5            200
0.776     1590  2        2         3      3            245

4 Conclusion

This paper proposes a fuzzy query language (fuzzy relational calculus and fuzzy relational algebra) based on relational database query languages. As an application of fuzzy set theory and fuzzy logic, an interface was developed which makes it possible to query any relational database with query sentences similar to those used in daily language. Complex fuzzy query sentences including hedges and crisp values can be constituted. The efficiency of the application is shown with the student food grant problem, an example of a relational database which has both crisp and fuzzy fields. In such a case it is not easy to say who deserves the food grant. Therefore, the degree to which each applicant deserves the food grant was found as a fuzzy membership value. Consequently, the developed application could be used to query any relational database which has crisp or fuzzy fields.
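
One way such a fuzzy layer could sit on top of a non-fuzzy relational database is to expose a membership function to the SQL engine itself. The sketch below is an illustration only: it swaps in SQLite (via Python's standard sqlite3 module) for the MySQL/PHP setup of the paper, assumes a linearly decreasing "poor" set on netinc between 0 and 1000, and uses a hypothetical two-column table. The first two rows take their ID and netinc from Table 3; the third is made up. The resulting degrees cover a single criterion, so they are not the µ(grant) values of Table 3.

```python
import sqlite3


def poor(netinc):
    """Assumed linear 'poor' set: 1 at income 0, falling to 0 at income 1000."""
    return max(0.0, min(1.0, (1000 - netinc) / 1000))


conn = sqlite3.connect(":memory:")
# Register the membership function so that SQL statements can call poor(...).
conn.create_function("poor", 1, poor)

conn.execute("CREATE TABLE family (id INTEGER PRIMARY KEY, netinc REAL)")
conn.executemany(
    "INSERT INTO family VALUES (?, ?)",
    [(2282, 265), (2301, 300), (9001, 950)],  # 9001 is a hypothetical row
)

# Crisp SQL engine, fuzzy criterion: keep rows above the 0.2 boundary value.
rows = conn.execute(
    "SELECT id, poor(netinc) AS mu FROM family "
    "WHERE poor(netinc) > ? ORDER BY mu DESC",
    (0.2,),
).fetchall()

for student_id, mu in rows:
    print(student_id, round(mu, 3))
```

The hypothetical row 9001 has a degree of 0.05 and is therefore filtered out by the 0.2 boundary value, in the same way that low-degree records are omitted from the report in Section 3.5.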

References

1. Mutlu, T.: A Fuzzy Query Tool For Non-Fuzzy Databases, Master Thesis, Istanbul Technical University Information Sciences Institute, Istanbul (1996)
2. Bahadır, A.: Flexible Querying in Standard Database Systems With Fuzzy Set Approach, Master Thesis, Istanbul Technical University Information Sciences Institute, Istanbul (1999)
3. Andersen, T., Christiansen, H., Larsen, H.L.: Flexible Query Answering System, pp. 45-61, 187-209, 247-277. Kluwer Academic Publishers, Boston (1997)
4. Zadeh, L.A., Kacprzyk, J.: Fuzzy Logic for the Management of Uncertainty, pp. 645-672. Wiley, New York (1992)
5. Kacprzyk, J., Ziolkowski, A.: Database Queries with Fuzzy Linguistic Quantifiers. IEEE Transactions on Systems, Man and Cybernetics SMC-16(3), 474-478 (1986)
6. Takahashi, Y.: A Fuzzy Query Language for Relational Databases. IEEE Transactions on Systems, Man and Cybernetics 21(6), 1576-1579 (1991)
7. Rasmussen, D., Yager, R.R.: SummarySQL - A Fuzzy Tool For Data Mining. Intelligent Data Analysis 1(1-4), 49-58 (1997)
8. Rasmani, K.A., Shen, Q.: A Data-Driven Fuzzy Rule-Based Approach for Student Academic Performance Evaluation. Applied Intelligence 23(3), 305-319 (2006)
9. Zadeh, L.A.: Knowledge Representation in Fuzzy Logic. IEEE Transactions on Knowledge and Data Engineering 1(1), 89-100 (1989)
10. Klir, G.J., Yuan, B.: Fuzzy Sets and Fuzzy Logic: Theory and Applications, pp. 379-388. Prentice Hall, New Jersey (1995)
11. Tanaka, K.: An Introduction to Fuzzy Logic for Practical Applications, pp. 68-75. Springer, New Jersey (1996)
12. Ross, T.J.: Fuzzy Logic with Engineering Applications, pp. 52-75. McGraw Hill Inc, New York (2004)
13. Kosko, B.: Fuzzy Engineering, pp. 18-24. Prentice Hall, New Jersey (1997)
14. Zongmin, M.: Fuzzy Database Modeling of Imprecise and Uncertain Engineering Information, pp. 137-155. Springer, New York (2006)
15. Zimmermann, H.J.: Fuzzy Sets, Decision Making, and Expert Systems, pp. 125-134. Kluwer Academic Publishers, Boston (1987)
16. Terano, T., Asai, K., Sugeno, M.: Fuzzy Systems Theory and Its Applications. Academic Press, San Diego (1992)
17. Şen, O.N.: Oracle SQL, SQL*PLUS, PL/SQL and Database Management, Beta Impression Publication Distributor, Istanbul, pp. 85-90 (2000)