Rough Sets and Fuzzy Rough Sets: Models and Applications



Similar documents
ROUGH SETS AND DATA MINING. Zdzisław Pawlak

An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015

FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM

Big Data with Rough Set Using Map- Reduce

Data Mining and Soft Computing. Francisco Herrera

Prototype-based classification by fuzzification of cases

Fuzzy Logic -based Pre-processing for Fuzzy Association Rule Mining

Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation

Meta-learning. Synonyms. Definition. Characteristics

Section 5 shows comparison between CRSA and DRSA. Finally, Section 6 concludes the paper. <ε then STOP, otherwise return. to step 2.

DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES

Manjeet Kaur Bhullar, Kiranbir Kaur Department of CSE, GNDU, Amritsar, Punjab, India

Optimization under fuzzy if-then rules

Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies

Experiments in Web Page Classification for Semantic Web

Performance Study on Data Discretization Techniques Using Nutrition Dataset

Introduction to Learning & Decision Trees

Data Mining: A Preprocessing Engine

Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms

AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM

Impact of Boolean factorization as preprocessing methods for classification of Boolean data

Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup

EFFICIENT DATA PRE-PROCESSING FOR DATA MINING

Linguistic Preference Modeling: Foundation Models and New Trends. Extended Abstract

A Novel Feature Selection Method Based on an Integrated Data Envelopment Analysis and Entropy Mode

USING THE AGGLOMERATIVE METHOD OF HIERARCHICAL CLUSTERING AS A DATA MINING TOOL IN CAPITAL MARKET 1. Vera Marinova Boncheva

A Rough Set View on Bayes Theorem

Roulette Sampling for Cost-Sensitive Learning

A FUZZY LOGIC APPROACH FOR SALES FORECASTING

A Two-Step Method for Clustering Mixed Categroical and Numeric Data

Data Mining based on Rough Set and Decision Tree Optimization

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Introduction to Data Mining Techniques

Probabilistic Rough Set Approximations

The Optimality of Naive Bayes

CSP Scheduling on basis of Priority of Specific Service using Cloud Broker

Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification

ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION

Random forest algorithm in big data environment

Subject Description Form

Project Management Efficiency A Fuzzy Logic Approach

ISSN: ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 3, Issue 3, September 2013

Three Perspectives of Data Mining

Using Rough Sets to predict insolvency of Spanish non-life insurance companies

FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS

A FUZZY CLUSTERING ENSEMBLE APPROACH FOR CATEGORICAL DATA

Comparative Analysis of EM Clustering Algorithm and Density Based Clustering Algorithm Using WEKA tool.

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

High-dimensional labeled data analysis with Gabriel graphs

A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE

Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing and Developing Classifier

DATA PREPARATION FOR DATA MINING

Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin

Reference Books. Data Mining. Supervised vs. Unsupervised Learning. Classification: Definition. Classification k-nearest neighbors

Applied Mathematical Sciences, Vol. 7, 2013, no. 112, HIKARI Ltd,

Predicting the Risk of Heart Attacks using Neural Network and Decision Tree

Association Rule Mining with Fuzzy Logic: an Overview

International Journal of Advance Research in Computer Science and Management Studies

Introducing diversity among the models of multi-label classification ensemble

Meta-learning via Search Combined with Parameter Optimization.

Research on Trust Management Strategies in Cloud Computing Environment

A Three-Way Decision Approach to Spam Filtering

Social Media Mining. Data Mining Essentials

A Survey on Outlier Detection Techniques for Credit Card Fraud Detection

Marcin Szczuka The University of Warsaw Poland

Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods

Knowledge Based Descriptive Neural Networks

A Novel Approach for Heart Disease Diagnosis using Data Mining and Fuzzy Logic

Rule based Classification of BSE Stock Data with Data Mining

An Evaluation of Neural Networks Approaches used for Software Effort Estimation

DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support

An Analysis of Missing Data Treatment Methods and Their Application to Health Care Dataset

CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing

Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing

How To Use Neural Networks In Data Mining

Research Article EFFICIENT TECHNIQUES TO DEAL WITH BIG DATA CLASSIFICATION PROBLEMS G.Somasekhar 1 *, Dr. K.

Analysis of Various Techniques to Handling Missing Value in Dataset Rajnik L. Vaishnav a, Dr. K. M. Patel b a

Enhanced data mining analysis in higher educational system using rough set theory

Self Organizing Maps for Visualization of Categories

Grid Density Clustering Algorithm

Web Mining Seminar CSE 450. Spring 2008 MWF 11:10 12:00pm Maginnes 113

Web Document Clustering

An Overview of Knowledge Discovery Database and Data mining Techniques

Optimization of Fuzzy Inventory Models under Fuzzy Demand and Fuzzy Lead Time

Extracting Fuzzy Rules from Data for Function Approximation and Pattern Classification

The Research of Data Mining Based on Neural Networks

A Survey on Parallel Method for Rough Set using MapReduce Technique for Data Mining

Transcription:

Rough Sets and Fuzzy Rough Sets: Models and Applications Chris Cornelis Department of Applied Mathematics and Computer Science, Ghent University, Belgium XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 1/47 Introduction Lotfi Zadeh (Baku, Feb. 4, 1921) Zdzisław Pawlak (Łodz, 1926 Warsaw, 2006) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 2/47

Introduction Fuzzy Sets (1965) Designed for dealing with gradual information Rough Sets (1982) Designed for dealing with incomplete information XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 3/47 Introduction Fuzzy Rough Sets (1990) Didier Dubois & Henri Prade XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 4/47

Introduction http://www.roughsets.org Rough Set Database System (RSDS): 3882 publications (941 in journals, 2187 in proceedings) International conferences RSCTC: Rough Sets and Current Trends in Computing Japan (2006), USA (2008), Poland (2010) RSKT: Rough Sets and Knowledge Technology China (2008), Australia (2009), China (2010) RSFDGrC: Rough Sets, Fuzzy Sets, Data mining and Granular Computing Canada (2005,2007), India (2009) TRS: Transactions on Rough Sets (LNCS, Springer) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 5/47 Introduction Rough set publications in Information Sciences, Fuzzy Sets and Systems and Int. Journal of Approximate Reasoning XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 6/47

Overview Introduction Rough Sets (RS) Pawlak s model and generalizations Application: feature selection Fuzzy Rough Sets (FRS) Implication/t-norm based model Vaguely quantified rough set model Applications in data analysis Software Conclusion XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 7/47 Rough set theory Goal: to approximate a concept C using 1 a set A X of examples of C XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 8/47

Rough set theory Goal: to approximate a concept C using 1 a set A X of examples of C 2 an equivalence relation R in X XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 9/47 Lower Approximation y R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 10/47

Upper Approximation y R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 11/47 Rough Set (R A, R A) y R A y R A [y] R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 12/47

Boundary region y R A y R A [y] R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 13/47 Rough sets: application domains Machine learning Supervised learning, e.g. feature selection and rule induction Unsupervised learning, e.g. rough clustering Data warehousing Information retrieval Multiple Criteria Decision Making Semantic Web XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 14/47

Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 15/47 Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 16/47

Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject Diploma(x i ) = Diploma(x j ) (x i, x j ) R Experience(x i ) = Experience(x j ) Spanish(x i ) = Spanish(x j ) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 16/47 Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject (x 1, x 6 ) R,x 1 A, x 6 A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 17/47

Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject R A = {x 2, x 3 } XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 18/47 Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject R A = {x 1, x 2, x 3, x 6 } XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 19/47

Rough set feature selection Data reduction method Dependent only on the data itself Reduct: minimal feature subset such that objects discernibility is preserved Decision reduct: minimal feature subset such that objects in different classes can still be discerned XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 20/47 Example: finding a decision reduct Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 21/47

Example: finding a decision reduct Applicant Experience Spanish Decision x 1 Medium Yes Accept x 2 High No Accept x 3 High Yes Accept x 4 High No Reject x 5 Low Yes Reject x 6 Medium Yes Reject x 7 Low No Reject {Experience,Spanish} is no decision reduct XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 22/47 Example: finding a decision reduct Applicant Diploma Experience Decision x 1 MSc Medium Accept x 2 MSc High Accept x 3 MSc High Accept x 4 MBA High Reject x 5 MCE Low Reject x 6 MSc Medium Reject x 7 MCE Low Reject {Diploma,Experience} is a decision reduct XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 23/47

Finding decision reducts Theorem (Skowron and Rauszer, 1992) Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } and a decision attribute d. The decision reducts of (X, A {d}) are the prime implicants of the boolean function f(a 1,...,a m) = { O ij 1 j < i n and O ij } O ij = { if d(xi ) = d(x j ) {a A a(x i ) a(x j )} otherwise XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 24/47 Finding decision reducts Theorem (Skowron and Rauszer, 1992) Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } and a decision attribute d. The decision reducts of (X, A {d}) are the prime implicants of the boolean function f(a 1,...,a m) = { O ij 1 j < i n and O ij } O ij = { if d(xi ) = d(x j ) {a A a(x i ) a(x j )} otherwise Problem of finding all (decision) reducts is NP-complete Solution: heuristic approaches for finding (approximate) decision reducts XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 24/47

Positive region Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } and a set of decision classes C. For B A, Positive region: R B = {(x, y) X 2 ( a B)(a(x) = a(y))} POS B = C C R B C XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 25/47 Degree of dependency Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } and a set of decision classes C. For B A, R B = {(x, y) X 2 ( a B)(a(x) = a(y))} Positive region: POS B = C C R B C Degree of dependency: γ B = POS B X Theorem B is a decision reduct if γ B = γ A and γ B < γ B for all B B. XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 26/47

Heuristic search Goal: to find a subset B A such that γ B is maximal B is minimal Greedy approaches (hillclimbing) More complex heuristics: genetic algorithms, ant colony optimization, XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 27/47 Generalizations of Pawlak rough sets The definition of lower and upper approximation may be weakened Variable Precision Rough Sets (Ziarko, 1993): given 1 u > l 0, y R A [y] R A [y] R y R A [y] R A [y] R u > l If u = 1 and l = 0, Pawlak s approximations are recovered Intuition: introduce noise tolerance into approximations XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 28/47

Generalizations of Pawlak rough sets The requirement that R is an equivalence relation may be weakened Reflexive + transitive: dominance based rough sets (Greco, Matarazzo and Słowiński, 2001) MCDM Reflexive + symmetric: tolerance rough sets E.g. proximity-based (x, y) R d(x, y) α XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 29/47 Overview Introduction Rough Sets (RS) Pawlak s model and generalizations Application: feature selection Fuzzy Rough Sets (FRS) Implication/t-norm based model Vaguely quantified rough set model Applications in data analysis Software Conclusion XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 30/47

Fuzzy rough sets: motivation Indiscernibility may be gradual rather than binary a 1 a 2 a 3 a 4 a 5 a 6 a 7 a 8 d x 1 1 101 50 15 36 24.2 0.526 26 0 x 2 8 176 90 34 300 33.7 0.467 58 1 x 3 7 150 66 42 342 34.7 0.718 42 0 x 4 7 187 68 39 304 37.7 0.254 41 1 x 5 0 100 88 60 110 46.8 0.962 31 0 x 6 0 105 64 41 142 41.5 0.173 22 0 x 7 1 95 66 13 38 19.6 0.334 25 0 (Diabetes dataset partim, UCI Machine Learning Repository) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 31/47 Fuzzy rough sets: motivation Indiscernibility may be gradual rather than binary a 1 a 2 a 3 a 4 a 5 a 6 a 7 a 8 d x 1 1 101 50 15 36 24.2 0.526 26 0 x 2 8 176 90 34 300 33.7 0.467 58 1 x 3 7 150 66 42 342 34.7 0.718 42 0 x 4 7 187 68 39 304 37.7 0.254 41 1 x 5 0 100 88 60 110 46.8 0.962 31 0 x 6 0 105 64 41 142 41.5 0.173 22 0 x 7 1 95 66 13 38 19.6 0.334 25 0 (Diabetes dataset partim, UCI Machine Learning Repository) Allow that R is a fuzzy relation XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 31/47

Fuzzy rough sets: motivation Concepts may be fuzzy rather than crisp a 1 a 2 a 3 a 4 a 5 a 6 a 7 a 8 d x 1 0.0351 95 2.68 0 0.4161 7.853 33.2 5.118 48.5 x 2 0.0837 45 3.44 0 0.437 7.185 38.9 4.567 34.9 x 3 0.1061 30 4.93 0 0.428 6.095 65.1 6.336 20.1 x 4 0.0883 12.5 7.87 0 0.524 6.012 66.6 5.561 22.9 x 5 1.4139 0 19.58 1 0.871 6.129 96.0 1.749 17.0 x 6 2.1492 0 19.58 0 0.871 5.709 98.5 1.623 19.4 x 7 3.3211 0 19.58 1 0.871 5.403 100 1.322 13.4 (Housing dataset partim, UCI Machine Learning Repository) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 32/47 Fuzzy rough sets: motivation Concepts may be fuzzy rather than crisp a 1 a 2 a 3 a 4 a 5 a 6 a 7 a 8 d x 1 0.0351 95 2.68 0 0.4161 7.853 33.2 5.118 48.5 x 2 0.0837 45 3.44 0 0.437 7.185 38.9 4.567 34.9 x 3 0.1061 30 4.93 0 0.428 6.095 65.1 6.336 20.1 x 4 0.0883 12.5 7.87 0 0.524 6.012 66.6 5.561 22.9 x 5 1.4139 0 19.58 1 0.871 6.129 96.0 1.749 17.0 x 6 2.1492 0 19.58 0 0.871 5.709 98.5 1.623 19.4 x 7 3.3211 0 19.58 1 0.871 5.403 100 1.322 13.4 (Housing dataset partim, UCI Machine Learning Repository) Allow that A is a fuzzy set XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 32/47

Rough set (R A, R A) y R A y R A [y] R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 33/47 Rough set (R A, R A) y R A ( x X)((x, y) R x A) y R A ( x X)((x, y) R x A) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 34/47

Fuzzy rough set (R A, R A) (R A)(y) = inf I(R(x, y), A(x)) x X (R A)(y) = sup T (R(x, y), A(x)) x X I(x, y) = max(1 x, y), T (x, y) = min(x, y) (Dubois and Prade, 1990) S-, R- and QL-implications (Radzikowska and Kerre, 2002) If A and R are crisp, we retrieve Pawlak s approximations XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 35/47 Vaguely Quantified Rough Sets Principle: soften the quantifiers inside the definitions of lower and upper approximation y belongs to the lower approximation of A iff Pawlak: all elements of [y] R belong to A VPRS: at least a fraction u of [y] R belongs to A VQRS: most elements of [y] R belong to A y belongs to the upper approximation of A iff Pawlak: at least one element of [y] R belongs to A VPRS: more than a fraction l of [y] R belongs to A VQRS: some elements of [y] R belong to A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 36/47

Vaguely Quantified Rough Sets y belongs to the lower approximation of A iff most elements of [y] R belong to A y belongs to the upper approximation of A iff some elements of [y] R belong to A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 37/47 Vaguely Quantified Rough Sets ( ) [y]r A R A(y) = Q u [y] R ( ) [y]r A R A(y) = Q l [y] R (Cornelis, De Cock and Radzikowska, 2007) If R and A are crisp, Pawlak s approximations are NOT retrieved VQRS uses cardinality-based inclusion/overlap measures, while classical FRS uses logic-based measures XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 38/47

Fuzzy-rough feature selection Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } a fuzzy tolerance relation R B for any B A a set of decision classes C Positive region: ( ) POS B (x) = R B C (x) C C Degree of dependency: γ B = POS B X = POS B (x) x X X XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 39/47 Fuzzy-rough feature selection Definition (Jensen and Shen, 2007) B is a decision reduct if γ B = γ A and γ B < γ B for all B B. Heuristic approaches to find a subset B A such that γ B is maximal B is minimal Other extensions of decision reducts have been considered in e.g. (Cornelis, Jensen, Hurtado and Ślȩzak, 2010) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 40/47

Fuzzy-rough K-nearest neighbours Goal: classification of test object y given training data T K nearest neighbours in T determine y s membership to lower and upper approximation of each class Class with highest membership is chosen (Jensen and Cornelis, 2008) (1) GetNearestNeighbours(y,K) (2) µ 1 (y) 0, µ 2 (y) 0, Class (3) C C (4) if ((R C)(y) µ 1 (y) (R C)(y) µ 2 (y)) (5) Class C (6) µ 1 (y) (R C)(y), µ 2 (y) (R C)(y) (7) output Class XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 41/47 QuickRules Goal: generate fuzzy classification rules using minimum number of attributes Integrates feature selection and rule induction Decision reduct is obtained by a hillclimbing search On the fly, decision rules are generated for fully covered training objects (Jensen, Cornelis and Shen, 2009) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 42/47

Fuzzy-rough data analysis in practice Several fuzzy-rough feature selection and classification methods have been ported to WEKA and are available at Richard Jensen s homepage http://users.aber.ac.uk/rkj/home/ XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 43/47 Conclusion Fuzzy sets model gradual information Rough sets model incomplete information They are highly complementary soft computing paradigms They have many applications, in particular in data analysis (Fuzzy) rough sets raise many research challenges, both practical and theoretical XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 44/47

Bibliography D. Chen, E. Tsang, S. Zhao, An approach of attributes reduction based on fuzzy rough sets, Proc. IEEE Int. Conf. on Systems, Man, and Cybernetics, 2007, pp. 486 491. D. Chen, E. Tsang, S. Zhao, Attribute reduction based on fuzzy rough sets, Proc. Int. Conf. on Rough Sets and Intelligent Systems Paradigms, 2007, pp. 73 89. C. Cornelis, M. De Cock, A. Radzikowska, Vaguely quantified rough sets, Proceedings of 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing (RSFDGrC2007), Lecture Notes in Artificial Intelligence 4482, 2007, pp. 87 94. C. Cornelis, M. De Cock, A.M. Radzikowska, Fuzzy rough sets: from theory into practice, Handbook of Granular Computing (W. Pedrycz, A. Skowron, V. Kreinovich, eds.), John Wiley and Sons, 2008, pp. 533 552. R. Jensen, C. Cornelis, A new approach to fuzzy-rough nearest neighbour classification, Proceedings of the 6th International Conference on Rough Sets and Current Trends in Computing (RSCTC 2008), 2008, pp. 310-319 C. Cornelis, R. Jensen, G. Hurtado Martín D. Ślȩzak, Attribute selection with fuzzy decision reducts, Information Sciences 180(2) (2010) 209 224. M. De Cock, C. Cornelis, E.E. Kerre, Fuzzy rough sets: the forgotten step, IEEE Transactions on Fuzzy Systems 15(1) (2007) 137 153. R. Jensen, Q. Shen, Fuzzy-rough sets assisted attribute selection, IEEE Transactions on Fuzzy Systems 15(1) (2007) 73 89. R. Jensen, Q. Shen, New approaches to fuzzy-rough feature selection, IEEE Transactions on Fuzzy Systems 17(4) (2009) 824 838. R. Jensen, C. Cornelis, Q. Shen, Hybrid fuzzy-rough rule induction and feature selection, Proceedings of the 18th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2009), 2009, pp. 1151-1156. XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 45/47 Bibliography Z. Pawlak, Rough Sets, International Journal of Computer and Information Sciences 11(5) (1982) 341 356. Z. Pawlak, Rough Sets Theoretical Aspects of Reasoning about Data, Kluwer Academic Publishers, Dordrecht, Netherlands, 1991. A.M. Radzikowska, E.E. Kerre, A comparative study of fuzzy rough sets, Fuzzy Sets and Systems 126 (2002) 137 156. A. Skowron, C. Rauszer, The Discernibility Matrices and Functions in Information Systems, Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory (R. Słowiński, ed.), Kluwer Academic Publishers, Dordrecht, Netherlands, 1992, pp. 331 362. J. Stepaniuk, Tolerance Information Granules, Monitoring, Security, and Rescue Techniques in Multiagent Systems. Advances in Soft Computing, Springer, 2005, pp. 305 316. E.C.C. Tsang, D.G. Chen, D.S. Yeung, X.Z. Wang, J.W.T Lee, attributes reduction using fuzzy rough sets, IEEE Transactions on Fuzzy Systems 16(5) (2008) 1130 1141. I.H. Witten, E. Frank, Data Mining: Practical machine learning tools and techniques, 2nd Edition, Morgan Kaufmann, San Francisco, 2005. M. Yang, S. Chen, X. Yang, A novel approach of rough set-based attribute reduction using fuzzy discernibility matrix, Proc. 4th Int. Conf. on Fuzzy Systems and Knowledge Discovery, 2007, pp. 96 101. S. Zhao, E.C.C. Tsang, On fuzzy approximation operators in attribute reduction with fuzzy rough sets, Information Sciences 178(16), (2007) 3163 3176. XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 46/47

Para terminar Gracias por su atención! Preguntas? (en inglés, por favor;-)) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 47/47