Rough Sets and Fuzzy Rough Sets: Models and Applications
|
|
|
- Reynold Randall
- 10 years ago
- Views:
Transcription
1 Rough Sets and Fuzzy Rough Sets: Models and Applications Chris Cornelis Department of Applied Mathematics and Computer Science, Ghent University, Belgium XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 1/47 Introduction Lotfi Zadeh (Baku, Feb. 4, 1921) Zdzisław Pawlak (Łodz, 1926 Warsaw, 2006) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 2/47
2 Introduction Fuzzy Sets (1965) Designed for dealing with gradual information Rough Sets (1982) Designed for dealing with incomplete information XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 3/47 Introduction Fuzzy Rough Sets (1990) Didier Dubois & Henri Prade XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 4/47
3 Introduction Rough Set Database System (RSDS): 3882 publications (941 in journals, 2187 in proceedings) International conferences RSCTC: Rough Sets and Current Trends in Computing Japan (2006), USA (2008), Poland (2010) RSKT: Rough Sets and Knowledge Technology China (2008), Australia (2009), China (2010) RSFDGrC: Rough Sets, Fuzzy Sets, Data mining and Granular Computing Canada (2005,2007), India (2009) TRS: Transactions on Rough Sets (LNCS, Springer) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 5/47 Introduction Rough set publications in Information Sciences, Fuzzy Sets and Systems and Int. Journal of Approximate Reasoning XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 6/47
4 Overview Introduction Rough Sets (RS) Pawlak s model and generalizations Application: feature selection Fuzzy Rough Sets (FRS) Implication/t-norm based model Vaguely quantified rough set model Applications in data analysis Software Conclusion XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 7/47 Rough set theory Goal: to approximate a concept C using 1 a set A X of examples of C XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 8/47
5 Rough set theory Goal: to approximate a concept C using 1 a set A X of examples of C 2 an equivalence relation R in X XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 9/47 Lower Approximation y R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 10/47
6 Upper Approximation y R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 11/47 Rough Set (R A, R A) y R A y R A [y] R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 12/47
7 Boundary region y R A y R A [y] R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 13/47 Rough sets: application domains Machine learning Supervised learning, e.g. feature selection and rule induction Unsupervised learning, e.g. rough clustering Data warehousing Information retrieval Multiple Criteria Decision Making Semantic Web XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 14/47
8 Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 15/47 Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 16/47
9 Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject Diploma(x i ) = Diploma(x j ) (x i, x j ) R Experience(x i ) = Experience(x j ) Spanish(x i ) = Spanish(x j ) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 16/47 Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject (x 1, x 6 ) R,x 1 A, x 6 A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 17/47
10 Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject R A = {x 2, x 3 } XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 18/47 Example: data analysis Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject R A = {x 1, x 2, x 3, x 6 } XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 19/47
11 Rough set feature selection Data reduction method Dependent only on the data itself Reduct: minimal feature subset such that objects discernibility is preserved Decision reduct: minimal feature subset such that objects in different classes can still be discerned XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 20/47 Example: finding a decision reduct Applicant Diploma Experience Spanish Decision x 1 MSc Medium Yes Accept x 2 MSc High No Accept x 3 MSc High Yes Accept x 4 MBA High No Reject x 5 MCE Low Yes Reject x 6 MSc Medium Yes Reject x 7 MCE Low No Reject XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 21/47
12 Example: finding a decision reduct Applicant Experience Spanish Decision x 1 Medium Yes Accept x 2 High No Accept x 3 High Yes Accept x 4 High No Reject x 5 Low Yes Reject x 6 Medium Yes Reject x 7 Low No Reject {Experience,Spanish} is no decision reduct XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 22/47 Example: finding a decision reduct Applicant Diploma Experience Decision x 1 MSc Medium Accept x 2 MSc High Accept x 3 MSc High Accept x 4 MBA High Reject x 5 MCE Low Reject x 6 MSc Medium Reject x 7 MCE Low Reject {Diploma,Experience} is a decision reduct XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 23/47
13 Finding decision reducts Theorem (Skowron and Rauszer, 1992) Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } and a decision attribute d. The decision reducts of (X, A {d}) are the prime implicants of the boolean function f(a 1,...,a m) = { O ij 1 j < i n and O ij } O ij = { if d(xi ) = d(x j ) {a A a(x i ) a(x j )} otherwise XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 24/47 Finding decision reducts Theorem (Skowron and Rauszer, 1992) Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } and a decision attribute d. The decision reducts of (X, A {d}) are the prime implicants of the boolean function f(a 1,...,a m) = { O ij 1 j < i n and O ij } O ij = { if d(xi ) = d(x j ) {a A a(x i ) a(x j )} otherwise Problem of finding all (decision) reducts is NP-complete Solution: heuristic approaches for finding (approximate) decision reducts XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 24/47
14 Positive region Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } and a set of decision classes C. For B A, Positive region: R B = {(x, y) X 2 ( a B)(a(x) = a(y))} POS B = C C R B C XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 25/47 Degree of dependency Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } and a set of decision classes C. For B A, R B = {(x, y) X 2 ( a B)(a(x) = a(y))} Positive region: POS B = C C R B C Degree of dependency: γ B = POS B X Theorem B is a decision reduct if γ B = γ A and γ B < γ B for all B B. XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 26/47
15 Heuristic search Goal: to find a subset B A such that γ B is maximal B is minimal Greedy approaches (hillclimbing) More complex heuristics: genetic algorithms, ant colony optimization, XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 27/47 Generalizations of Pawlak rough sets The definition of lower and upper approximation may be weakened Variable Precision Rough Sets (Ziarko, 1993): given 1 u > l 0, y R A [y] R A [y] R y R A [y] R A [y] R u > l If u = 1 and l = 0, Pawlak s approximations are recovered Intuition: introduce noise tolerance into approximations XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 28/47
16 Generalizations of Pawlak rough sets The requirement that R is an equivalence relation may be weakened Reflexive + transitive: dominance based rough sets (Greco, Matarazzo and Słowiński, 2001) MCDM Reflexive + symmetric: tolerance rough sets E.g. proximity-based (x, y) R d(x, y) α XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 29/47 Overview Introduction Rough Sets (RS) Pawlak s model and generalizations Application: feature selection Fuzzy Rough Sets (FRS) Implication/t-norm based model Vaguely quantified rough set model Applications in data analysis Software Conclusion XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 30/47
17 Fuzzy rough sets: motivation Indiscernibility may be gradual rather than binary a 1 a 2 a 3 a 4 a 5 a 6 a 7 a 8 d x x x x x x x (Diabetes dataset partim, UCI Machine Learning Repository) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 31/47 Fuzzy rough sets: motivation Indiscernibility may be gradual rather than binary a 1 a 2 a 3 a 4 a 5 a 6 a 7 a 8 d x x x x x x x (Diabetes dataset partim, UCI Machine Learning Repository) Allow that R is a fuzzy relation XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 31/47
18 Fuzzy rough sets: motivation Concepts may be fuzzy rather than crisp a 1 a 2 a 3 a 4 a 5 a 6 a 7 a 8 d x x x x x x x (Housing dataset partim, UCI Machine Learning Repository) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 32/47 Fuzzy rough sets: motivation Concepts may be fuzzy rather than crisp a 1 a 2 a 3 a 4 a 5 a 6 a 7 a 8 d x x x x x x x (Housing dataset partim, UCI Machine Learning Repository) Allow that A is a fuzzy set XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 32/47
19 Rough set (R A, R A) y R A y R A [y] R A [y] R A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 33/47 Rough set (R A, R A) y R A ( x X)((x, y) R x A) y R A ( x X)((x, y) R x A) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 34/47
20 Fuzzy rough set (R A, R A) (R A)(y) = inf I(R(x, y), A(x)) x X (R A)(y) = sup T (R(x, y), A(x)) x X I(x, y) = max(1 x, y), T (x, y) = min(x, y) (Dubois and Prade, 1990) S-, R- and QL-implications (Radzikowska and Kerre, 2002) If A and R are crisp, we retrieve Pawlak s approximations XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 35/47 Vaguely Quantified Rough Sets Principle: soften the quantifiers inside the definitions of lower and upper approximation y belongs to the lower approximation of A iff Pawlak: all elements of [y] R belong to A VPRS: at least a fraction u of [y] R belongs to A VQRS: most elements of [y] R belong to A y belongs to the upper approximation of A iff Pawlak: at least one element of [y] R belongs to A VPRS: more than a fraction l of [y] R belongs to A VQRS: some elements of [y] R belong to A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 36/47
21 Vaguely Quantified Rough Sets y belongs to the lower approximation of A iff most elements of [y] R belong to A y belongs to the upper approximation of A iff some elements of [y] R belong to A XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 37/47 Vaguely Quantified Rough Sets ( ) [y]r A R A(y) = Q u [y] R ( ) [y]r A R A(y) = Q l [y] R (Cornelis, De Cock and Radzikowska, 2007) If R and A are crisp, Pawlak s approximations are NOT retrieved VQRS uses cardinality-based inclusion/overlap measures, while classical FRS uses logic-based measures XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 38/47
22 Fuzzy-rough feature selection Given a set of objects X = {x 1,...,x n }, a set of conditional attributes A = {a 1,...,a m } a fuzzy tolerance relation R B for any B A a set of decision classes C Positive region: ( ) POS B (x) = R B C (x) C C Degree of dependency: γ B = POS B X = POS B (x) x X X XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 39/47 Fuzzy-rough feature selection Definition (Jensen and Shen, 2007) B is a decision reduct if γ B = γ A and γ B < γ B for all B B. Heuristic approaches to find a subset B A such that γ B is maximal B is minimal Other extensions of decision reducts have been considered in e.g. (Cornelis, Jensen, Hurtado and Ślȩzak, 2010) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 40/47
23 Fuzzy-rough K-nearest neighbours Goal: classification of test object y given training data T K nearest neighbours in T determine y s membership to lower and upper approximation of each class Class with highest membership is chosen (Jensen and Cornelis, 2008) (1) GetNearestNeighbours(y,K) (2) µ 1 (y) 0, µ 2 (y) 0, Class (3) C C (4) if ((R C)(y) µ 1 (y) (R C)(y) µ 2 (y)) (5) Class C (6) µ 1 (y) (R C)(y), µ 2 (y) (R C)(y) (7) output Class XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 41/47 QuickRules Goal: generate fuzzy classification rules using minimum number of attributes Integrates feature selection and rule induction Decision reduct is obtained by a hillclimbing search On the fly, decision rules are generated for fully covered training objects (Jensen, Cornelis and Shen, 2009) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 42/47
24 Fuzzy-rough data analysis in practice Several fuzzy-rough feature selection and classification methods have been ported to WEKA and are available at Richard Jensen s homepage XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 43/47 Conclusion Fuzzy sets model gradual information Rough sets model incomplete information They are highly complementary soft computing paradigms They have many applications, in particular in data analysis (Fuzzy) rough sets raise many research challenges, both practical and theoretical XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 44/47
25 Bibliography D. Chen, E. Tsang, S. Zhao, An approach of attributes reduction based on fuzzy rough sets, Proc. IEEE Int. Conf. on Systems, Man, and Cybernetics, 2007, pp D. Chen, E. Tsang, S. Zhao, Attribute reduction based on fuzzy rough sets, Proc. Int. Conf. on Rough Sets and Intelligent Systems Paradigms, 2007, pp C. Cornelis, M. De Cock, A. Radzikowska, Vaguely quantified rough sets, Proceedings of 11th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing (RSFDGrC2007), Lecture Notes in Artificial Intelligence 4482, 2007, pp C. Cornelis, M. De Cock, A.M. Radzikowska, Fuzzy rough sets: from theory into practice, Handbook of Granular Computing (W. Pedrycz, A. Skowron, V. Kreinovich, eds.), John Wiley and Sons, 2008, pp R. Jensen, C. Cornelis, A new approach to fuzzy-rough nearest neighbour classification, Proceedings of the 6th International Conference on Rough Sets and Current Trends in Computing (RSCTC 2008), 2008, pp C. Cornelis, R. Jensen, G. Hurtado Martín D. Ślȩzak, Attribute selection with fuzzy decision reducts, Information Sciences 180(2) (2010) M. De Cock, C. Cornelis, E.E. Kerre, Fuzzy rough sets: the forgotten step, IEEE Transactions on Fuzzy Systems 15(1) (2007) R. Jensen, Q. Shen, Fuzzy-rough sets assisted attribute selection, IEEE Transactions on Fuzzy Systems 15(1) (2007) R. Jensen, Q. Shen, New approaches to fuzzy-rough feature selection, IEEE Transactions on Fuzzy Systems 17(4) (2009) R. Jensen, C. Cornelis, Q. Shen, Hybrid fuzzy-rough rule induction and feature selection, Proceedings of the 18th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2009), 2009, pp XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 45/47 Bibliography Z. Pawlak, Rough Sets, International Journal of Computer and Information Sciences 11(5) (1982) Z. Pawlak, Rough Sets Theoretical Aspects of Reasoning about Data, Kluwer Academic Publishers, Dordrecht, Netherlands, A.M. Radzikowska, E.E. Kerre, A comparative study of fuzzy rough sets, Fuzzy Sets and Systems 126 (2002) A. Skowron, C. Rauszer, The Discernibility Matrices and Functions in Information Systems, Intelligent Decision Support: Handbook of Applications and Advances of the Rough Sets Theory (R. Słowiński, ed.), Kluwer Academic Publishers, Dordrecht, Netherlands, 1992, pp J. Stepaniuk, Tolerance Information Granules, Monitoring, Security, and Rescue Techniques in Multiagent Systems. Advances in Soft Computing, Springer, 2005, pp E.C.C. Tsang, D.G. Chen, D.S. Yeung, X.Z. Wang, J.W.T Lee, attributes reduction using fuzzy rough sets, IEEE Transactions on Fuzzy Systems 16(5) (2008) I.H. Witten, E. Frank, Data Mining: Practical machine learning tools and techniques, 2nd Edition, Morgan Kaufmann, San Francisco, M. Yang, S. Chen, X. Yang, A novel approach of rough set-based attribute reduction using fuzzy discernibility matrix, Proc. 4th Int. Conf. on Fuzzy Systems and Knowledge Discovery, 2007, pp S. Zhao, E.C.C. Tsang, On fuzzy approximation operators in attribute reduction with fuzzy rough sets, Information Sciences 178(16), (2007) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 46/47
26 Para terminar Gracias por su atención! Preguntas? (en inglés, por favor;-)) XV Congreso Español sobre Tecnologías y Lógica Fuzzy Rough Sets and Fuzzy Rough Sets 47/47
ROUGH SETS AND DATA MINING. Zdzisław Pawlak
ROUGH SETS AND DATA MINING Zdzisław Pawlak Institute of Theoretical and Applied Informatics, Polish Academy of Sciences, ul. altycka 5, 44 100 Gliwice, Poland ASTRACT The paper gives basic ideas of rough
An Introduction to Data Mining. Big Data World. Related Fields and Disciplines. What is Data Mining? 2/12/2015
An Introduction to Data Mining for Wind Power Management Spring 2015 Big Data World Every minute: Google receives over 4 million search queries Facebook users share almost 2.5 million pieces of content
FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT MINING SYSTEM
International Journal of Innovative Computing, Information and Control ICIC International c 0 ISSN 34-48 Volume 8, Number 8, August 0 pp. 4 FUZZY CLUSTERING ANALYSIS OF DATA MINING: APPLICATION TO AN ACCIDENT
Big Data with Rough Set Using Map- Reduce
Big Data with Rough Set Using Map- Reduce Mr.G.Lenin 1, Mr. A. Raj Ganesh 2, Mr. S. Vanarasan 3 Assistant Professor, Department of CSE, Podhigai College of Engineering & Technology, Tirupattur, Tamilnadu,
Data Mining and Soft Computing. Francisco Herrera
Francisco Herrera Research Group on Soft Computing and Information Intelligent Systems (SCI 2 S) Dept. of Computer Science and A.I. University of Granada, Spain Email: [email protected] http://sci2s.ugr.es
Prototype-based classification by fuzzification of cases
Prototype-based classification by fuzzification of cases Parisa KordJamshidi Dep.Telecommunications and Information Processing Ghent university [email protected] Bernard De Baets Dep. Applied Mathematics
Fuzzy Logic -based Pre-processing for Fuzzy Association Rule Mining
Fuzzy Logic -based Pre-processing for Fuzzy Association Rule Mining by Ashish Mangalampalli, Vikram Pudi Report No: IIIT/TR/2008/127 Centre for Data Engineering International Institute of Information Technology
Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation
Pattern Recognition 40 (2007) 3509 352 www.elsevier.com/locate/pr Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation Qinghua Hu, Zongxia Xie, Daren Yu Harbin Institute
Meta-learning. Synonyms. Definition. Characteristics
Meta-learning Włodzisław Duch, Department of Informatics, Nicolaus Copernicus University, Poland, School of Computer Engineering, Nanyang Technological University, Singapore [email protected] (or search
Section 5 shows comparison between CRSA and DRSA. Finally, Section 6 concludes the paper. <ε then STOP, otherwise return. to step 2.
The Online Journal on Computer Science Information Technology (OJCSIT) Vol. () No. () Dominance-based rough set approach in business intelligence S.M Aboelnaga, H.M Abdalkader R.Hussein Information System
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES Vijayalakshmi Mahanra Rao 1, Yashwant Prasad Singh 2 Multimedia University, Cyberjaya, MALAYSIA 1 [email protected]
Manjeet Kaur Bhullar, Kiranbir Kaur Department of CSE, GNDU, Amritsar, Punjab, India
Volume 5, Issue 6, June 2015 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Multiple Pheromone
Optimization under fuzzy if-then rules
Optimization under fuzzy if-then rules Christer Carlsson [email protected] Robert Fullér [email protected] Abstract The aim of this paper is to introduce a novel statement of fuzzy mathematical programming
Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies
Volume 2, Issue 12, December 2014 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com
Experiments in Web Page Classification for Semantic Web
Experiments in Web Page Classification for Semantic Web Asad Satti, Nick Cercone, Vlado Kešelj Faculty of Computer Science, Dalhousie University E-mail: {rashid,nick,vlado}@cs.dal.ca Abstract We address
Performance Study on Data Discretization Techniques Using Nutrition Dataset
2009 International Symposium on Computing, Communication, and Control (ISCCC 2009) Proc.of CSIT vol.1 (2011) (2011) IACSIT Press, Singapore Performance Study on Data Discretization Techniques Using Nutrition
Introduction to Learning & Decision Trees
Artificial Intelligence: Representation and Problem Solving 5-38 April 0, 2007 Introduction to Learning & Decision Trees Learning and Decision Trees to learning What is learning? - more than just memorizing
Data Mining: A Preprocessing Engine
Journal of Computer Science 2 (9): 735-739, 2006 ISSN 1549-3636 2005 Science Publications Data Mining: A Preprocessing Engine Luai Al Shalabi, Zyad Shaaban and Basel Kasasbeh Applied Science University,
Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms
Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms Y.Y. Yao, Y. Zhao, R.B. Maguire Department of Computer Science, University of Regina Regina,
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM
AUTO CLAIM FRAUD DETECTION USING MULTI CLASSIFIER SYSTEM ABSTRACT Luis Alexandre Rodrigues and Nizam Omar Department of Electrical Engineering, Mackenzie Presbiterian University, Brazil, São Paulo [email protected],[email protected]
Impact of Boolean factorization as preprocessing methods for classification of Boolean data
Impact of Boolean factorization as preprocessing methods for classification of Boolean data Radim Belohlavek, Jan Outrata, Martin Trnecka Data Analysis and Modeling Lab (DAMOL) Dept. Computer Science,
Detection. Perspective. Network Anomaly. Bhattacharyya. Jugal. A Machine Learning »C) Dhruba Kumar. Kumar KaKta. CRC Press J Taylor & Francis Croup
Network Anomaly Detection A Machine Learning Perspective Dhruba Kumar Bhattacharyya Jugal Kumar KaKta»C) CRC Press J Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint of the Taylor
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College
Linguistic Preference Modeling: Foundation Models and New Trends. Extended Abstract
Linguistic Preference Modeling: Foundation Models and New Trends F. Herrera, E. Herrera-Viedma Dept. of Computer Science and Artificial Intelligence University of Granada, 18071 - Granada, Spain e-mail:
A Novel Feature Selection Method Based on an Integrated Data Envelopment Analysis and Entropy Mode
A Novel Feature Selection Method Based on an Integrated Data Envelopment Analysis and Entropy Mode Seyed Mojtaba Hosseini Bamakan, Peyman Gholami RESEARCH CENTRE OF FICTITIOUS ECONOMY & DATA SCIENCE UNIVERSITY
USING THE AGGLOMERATIVE METHOD OF HIERARCHICAL CLUSTERING AS A DATA MINING TOOL IN CAPITAL MARKET 1. Vera Marinova Boncheva
382 [7] Reznik, A, Kussul, N., Sokolov, A.: Identification of user activity using neural networks. Cybernetics and computer techniques, vol. 123 (1999) 70 79. (in Russian) [8] Kussul, N., et al. : Multi-Agent
A Rough Set View on Bayes Theorem
A Rough Set View on Bayes Theorem Zdzisław Pawlak* University of Information Technology and Management, ul. Newelska 6, 01 447 Warsaw, Poland Rough set theory offers new perspective on Bayes theorem. The
Roulette Sampling for Cost-Sensitive Learning
Roulette Sampling for Cost-Sensitive Learning Victor S. Sheng and Charles X. Ling Department of Computer Science, University of Western Ontario, London, Ontario, Canada N6A 5B7 {ssheng,cling}@csd.uwo.ca
A FUZZY LOGIC APPROACH FOR SALES FORECASTING
A FUZZY LOGIC APPROACH FOR SALES FORECASTING ABSTRACT Sales forecasting proved to be very important in marketing where managers need to learn from historical data. Many methods have become available for
A Two-Step Method for Clustering Mixed Categroical and Numeric Data
Tamkang Journal of Science and Engineering, Vol. 13, No. 1, pp. 11 19 (2010) 11 A Two-Step Method for Clustering Mixed Categroical and Numeric Data Ming-Yi Shih*, Jar-Wen Jheng and Lien-Fu Lai Department
Data Mining based on Rough Set and Decision Tree Optimization
Data Mining based on Rough Set and Decision Tree Optimization College of Information Engineering, North China University of Water Resources and Electric Power, China, [email protected] Abstract This paper
Index Contents Page No. Introduction . Data Mining & Knowledge Discovery
Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
Introduction to Data Mining Techniques
Introduction to Data Mining Techniques Dr. Rajni Jain 1 Introduction The last decade has experienced a revolution in information availability and exchange via the internet. In the same spirit, more and
Probabilistic Rough Set Approximations
Probabilistic Rough Set Approximations Yiyu (Y.Y.) Yao Department of Computer Science University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: [email protected] Abstract Probabilistic approaches
The Optimality of Naive Bayes
The Optimality of Naive Bayes Harry Zhang Faculty of Computer Science University of New Brunswick Fredericton, New Brunswick, Canada email: hzhang@unbca E3B 5A3 Abstract Naive Bayes is one of the most
CSP Scheduling on basis of Priority of Specific Service using Cloud Broker
Research Article International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347-5161 2014 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet CSP
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati [email protected], [email protected]
ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION
ISSN 9 X INFORMATION TECHNOLOGY AND CONTROL, 00, Vol., No.A ON INTEGRATING UNSUPERVISED AND SUPERVISED CLASSIFICATION FOR CREDIT RISK EVALUATION Danuta Zakrzewska Institute of Computer Science, Technical
Random forest algorithm in big data environment
Random forest algorithm in big data environment Yingchun Liu * School of Economics and Management, Beihang University, Beijing 100191, China Received 1 September 2014, www.cmnt.lv Abstract Random forest
Subject Description Form
Subject Description Form Subject Code Subject Title COMP417 Data Warehousing and Data Mining Techniques in Business and Commerce Credit Value 3 Level 4 Pre-requisite / Co-requisite/ Exclusion Objectives
Project Management Efficiency A Fuzzy Logic Approach
Project Management Efficiency A Fuzzy Logic Approach Vinay Kumar Nassa, Sri Krishan Yadav Abstract Fuzzy logic is a relatively new technique for solving engineering control problems. This technique can
ISSN: 2277-3754 ISO 9001:2008 Certified International Journal of Engineering and Innovative Technology (IJEIT) Volume 3, Issue 3, September 2013
Performance Appraisal using Fuzzy Evaluation Methodology Nisha Macwan 1, Dr.Priti Srinivas Sajja 2 Assistant Professor, SEMCOM 1 and Professor, Department of Computer Science 2 Abstract Performance is
Three Perspectives of Data Mining
Three Perspectives of Data Mining Zhi-Hua Zhou * National Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093, China Abstract This paper reviews three recent books on data mining
Using Rough Sets to predict insolvency of Spanish non-life insurance companies
Using Rough Sets to predict insolvency of Spanish non-life insurance companies M.J. Segovia-Vargas a, J.A. Gil-Fana a, A. Heras-Martínez a, J.L. Vilar-Zanón a, A. Sanchis-Arellano b a Departamento de Economía
FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS
FRAUD DETECTION IN ELECTRIC POWER DISTRIBUTION NETWORKS USING AN ANN-BASED KNOWLEDGE-DISCOVERY PROCESS Breno C. Costa, Bruno. L. A. Alberto, André M. Portela, W. Maduro, Esdras O. Eler PDITec, Belo Horizonte,
A FUZZY CLUSTERING ENSEMBLE APPROACH FOR CATEGORICAL DATA
International Journal of scientific research and management (IJSRM) Volume 1 Issue 6 Pages 327-331 2013 Website: www.ijsrm.in ISSN (e): 2321-3418 A FUZZY CLUSTERING ENSEMBLE APPROACH FOR CATEGORICAL DATA
Comparative Analysis of EM Clustering Algorithm and Density Based Clustering Algorithm Using WEKA tool.
International Journal of Engineering Research and Development e-issn: 2278-067X, p-issn: 2278-800X, www.ijerd.com Volume 9, Issue 8 (January 2014), PP. 19-24 Comparative Analysis of EM Clustering Algorithm
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
High-dimensional labeled data analysis with Gabriel graphs
High-dimensional labeled data analysis with Gabriel graphs Michaël Aupetit CEA - DAM Département Analyse Surveillance Environnement BP 12-91680 - Bruyères-Le-Châtel, France Abstract. We propose the use
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE
A NEW DECISION TREE METHOD FOR DATA MINING IN MEDICINE Kasra Madadipouya 1 1 Department of Computing and Science, Asia Pacific University of Technology & Innovation ABSTRACT Today, enormous amount of data
Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing and Developing E-mail Classifier
International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878, Volume-1, Issue-6, January 2013 Artificial Neural Network, Decision Tree and Statistical Techniques Applied for Designing
DATA PREPARATION FOR DATA MINING
Applied Artificial Intelligence, 17:375 381, 2003 Copyright # 2003 Taylor & Francis 0883-9514/03 $12.00 +.00 DOI: 10.1080/08839510390219264 u DATA PREPARATION FOR DATA MINING SHICHAO ZHANG and CHENGQI
Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin
Data Mining for Customer Service Support Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin Traditional Hotline Services Problem Traditional Customer Service Support (manufacturing)
Reference Books. Data Mining. Supervised vs. Unsupervised Learning. Classification: Definition. Classification k-nearest neighbors
Classification k-nearest neighbors Data Mining Dr. Engin YILDIZTEPE Reference Books Han, J., Kamber, M., Pei, J., (2011). Data Mining: Concepts and Techniques. Third edition. San Francisco: Morgan Kaufmann
Applied Mathematical Sciences, Vol. 7, 2013, no. 112, 5591-5597 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2013.
Applied Mathematical Sciences, Vol. 7, 2013, no. 112, 5591-5597 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2013.38457 Accuracy Rate of Predictive Models in Credit Screening Anirut Suebsing
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
Association Rule Mining with Fuzzy Logic: an Overview
Association Rule Mining with Fuzzy Logic: an Overview Anand V. Saurkar 1, S. A. Gode 2 1 Dept. of Computer Science & Engineering, Datta Meghe Institute of Engineering, Technology and Research, Sawangi
International Journal of Advance Research in Computer Science and Management Studies
Volume 2, Issue 12, December 2014 ISSN: 2321 7782 (Online) International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online
Introducing diversity among the models of multi-label classification ensemble
Introducing diversity among the models of multi-label classification ensemble Lena Chekina, Lior Rokach and Bracha Shapira Ben-Gurion University of the Negev Dept. of Information Systems Engineering and
Meta-learning via Search Combined with Parameter Optimization.
Meta-learning via Search Combined with Parameter Optimization. Włodzisław Duch and Karol Grudziński Department of Informatics, Nicholas Copernicus University, Grudziądzka 5, 87-100 Toruń, Poland. www.phys.uni.torun.pl/kmk
Research on Trust Management Strategies in Cloud Computing Environment
Journal of Computational Information Systems 8: 4 (2012) 1757 1763 Available at http://www.jofcis.com Research on Trust Management Strategies in Cloud Computing Environment Wenjuan LI 1,2,, Lingdi PING
A Three-Way Decision Approach to Email Spam Filtering
A Three-Way Decision Approach to Email Spam Filtering Bing Zhou, Yiyu Yao, and Jigang Luo Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 {zhou200b,yyao,luo226}@cs.uregina.ca
Social Media Mining. Data Mining Essentials
Introduction Data production rate has been increased dramatically (Big Data) and we are able store much more data than before E.g., purchase data, social media data, mobile phone data Businesses and customers
A Survey on Outlier Detection Techniques for Credit Card Fraud Detection
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 16, Issue 2, Ver. VI (Mar-Apr. 2014), PP 44-48 A Survey on Outlier Detection Techniques for Credit Card Fraud
Marcin Szczuka The University of Warsaw Poland
Marcin Szczuka The University of Warsaw Poland Why KDD? We are drowning in the sea of data, but what we really want is knowledge PROBLEM: How to retrieve useful information (knowledge) from massive data
Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods
Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods Jerzy B laszczyński 1, Krzysztof Dembczyński 1, Wojciech Kot lowski 1, and Mariusz Paw lowski 2 1 Institute of Computing
Knowledge Based Descriptive Neural Networks
Knowledge Based Descriptive Neural Networks J. T. Yao Department of Computer Science, University or Regina Regina, Saskachewan, CANADA S4S 0A2 Email: [email protected] Abstract This paper presents a
A Novel Approach for Heart Disease Diagnosis using Data Mining and Fuzzy Logic
A Novel Approach for Heart Disease Diagnosis using Data Mining and Fuzzy Logic Nidhi Bhatla GNDEC, Ludhiana, India Kiran Jyoti GNDEC, Ludhiana, India ABSTRACT Cardiovascular disease is a term used to describe
Rule based Classification of BSE Stock Data with Data Mining
International Journal of Information Sciences and Application. ISSN 0974-2255 Volume 4, Number 1 (2012), pp. 1-9 International Research Publication House http://www.irphouse.com Rule based Classification
An Evaluation of Neural Networks Approaches used for Software Effort Estimation
Proc. of Int. Conf. on Multimedia Processing, Communication and Info. Tech., MPCIT An Evaluation of Neural Networks Approaches used for Software Effort Estimation B.V. Ajay Prakash 1, D.V.Ashoka 2, V.N.
DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support
DMDSS: Data Mining Based Decision Support System to Integrate Data Mining and Decision Support Rok Rupnik, Matjaž Kukar, Marko Bajec, Marjan Krisper University of Ljubljana, Faculty of Computer and Information
An Analysis of Missing Data Treatment Methods and Their Application to Health Care Dataset
P P P Health An Analysis of Missing Data Treatment Methods and Their Application to Health Care Dataset Peng Liu 1, Elia El-Darzi 2, Lei Lei 1, Christos Vasilakis 2, Panagiotis Chountas 2, and Wei Huang
CS Master Level Courses and Areas COURSE DESCRIPTIONS. CSCI 521 Real-Time Systems. CSCI 522 High Performance Computing
CS Master Level Courses and Areas The graduate courses offered may change over time, in response to new developments in computer science and the interests of faculty and students; the list of graduate
Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing
www.ijcsi.org 198 Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing Lilian Sing oei 1 and Jiayang Wang 2 1 School of Information Science and Engineering, Central South University
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
Research Article www.ijptonline.com EFFICIENT TECHNIQUES TO DEAL WITH BIG DATA CLASSIFICATION PROBLEMS G.Somasekhar 1 *, Dr. K.
ISSN: 0975-766X CODEN: IJPTFI Available Online through Research Article www.ijptonline.com EFFICIENT TECHNIQUES TO DEAL WITH BIG DATA CLASSIFICATION PROBLEMS G.Somasekhar 1 *, Dr. K.Karthikeyan 2 1 Research
Analysis of Various Techniques to Handling Missing Value in Dataset Rajnik L. Vaishnav a, Dr. K. M. Patel b a
Available online at www.ijiere.com International Journal of Innovative and Emerging Research in Engineering e-issn: 2394-3343 e-issn: 2394-5494 Analysis of Various Techniques to Handling Missing Value
Enhanced data mining analysis in higher educational system using rough set theory
African Journal of Mathematics and Computer Science Research Vol. 2(9), pp. 184-188, October, 2009 Available online at http://www.academicjournals.org/ajmcsr ISSN 2006-9731 2009 Academic Journals Review
Self Organizing Maps for Visualization of Categories
Self Organizing Maps for Visualization of Categories Julian Szymański 1 and Włodzisław Duch 2,3 1 Department of Computer Systems Architecture, Gdańsk University of Technology, Poland, [email protected]
Grid Density Clustering Algorithm
Grid Density Clustering Algorithm Amandeep Kaur Mann 1, Navneet Kaur 2, Scholar, M.Tech (CSE), RIMT, Mandi Gobindgarh, Punjab, India 1 Assistant Professor (CSE), RIMT, Mandi Gobindgarh, Punjab, India 2
Web Mining Seminar CSE 450. Spring 2008 MWF 11:10 12:00pm Maginnes 113
CSE 450 Web Mining Seminar Spring 2008 MWF 11:10 12:00pm Maginnes 113 Instructor: Dr. Brian D. Davison Dept. of Computer Science & Engineering Lehigh University [email protected] http://www.cse.lehigh.edu/~brian/course/webmining/
Web Document Clustering
Web Document Clustering Lab Project based on the MDL clustering suite http://www.cs.ccsu.edu/~markov/mdlclustering/ Zdravko Markov Computer Science Department Central Connecticut State University New Britain,
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Optimization of Fuzzy Inventory Models under Fuzzy Demand and Fuzzy Lead Time
Tamsui Oxford Journal of Management Sciences, Vol. 0, No. (-6) Optimization of Fuzzy Inventory Models under Fuzzy Demand and Fuzzy Lead Time Chih-Hsun Hsieh (Received September 9, 00; Revised October,
Extracting Fuzzy Rules from Data for Function Approximation and Pattern Classification
Extracting Fuzzy Rules from Data for Function Approximation and Pattern Classification Chapter 9 in Fuzzy Information Engineering: A Guided Tour of Applications, ed. D. Dubois, H. Prade, and R. Yager,
The Research of Data Mining Based on Neural Networks
2011 International Conference on Computer Science and Information Technology (ICCSIT 2011) IPCSIT vol. 51 (2012) (2012) IACSIT Press, Singapore DOI: 10.7763/IPCSIT.2012.V51.09 The Research of Data Mining
A Survey on Parallel Method for Rough Set using MapReduce Technique for Data Mining
www.ijecs.in International Journal Of Engineering And Computer Science ISSN: 2319-7242 Volume 4 Issue 9 Sep 2015, Page No. 14160-14163 A Survey on Parallel Method for Rough Set using MapReduce Technique
