the beginning of a run of a GA a large population of random chromosomes is created. Chromosome models depend on the case.
|
|
- Jemima Holt
- 7 years ago
- Views:
Transcription
1 Comparison of Document Similarity in Information Retrieval System by Different Formulation Poltak Sihombing 1, Abdullah Embong 2, Putra Sumari 3 1, 2,3 School of Computer Science, Universiti Sains Malaysia, Penang, Malaysia 1 poltakhombing@yahoo.com ; 2 ae@cs.usm.my; 3 putras@cs.usm.my ABSTRACT In this paper we are going to implement Horng & Yeh s formulation in Information Retrieval System, (IRS) and to compare it with the Jaccard s formulation and Dice s formulation. In the previous research, we have developed the Jaccard and Dice s formulation in a prototype called the Journal Browser. Each technique has been implemented in IRS using Genetic Algorithm (GA). The objective of GA was to find a set of documents which best fit the searcher s needs. In this study, an evaluation function for the fitness of each chromosome was selected based on Horng & Yeh s score. This score is formulated to measure the relationship of the query with some documents in a database. To initialize a population of the queries, we need first to decide the number of genes for each individual and the total number of chromosomes (popsize) in the initial population. GA is basically based on natural biological evolution theory. The parent solution (chromosome) with the higher level of fitness has a bigger similarity percentage of documents, while those with lower level of fitness have less similarity percentage of documents. By the similarity percentage of documents, the user can choose the most relevant document from the database. KEYWORDS Database, information, retrieval, document, similarity, genetic algorithm. 1. Introduction The goal of any IRS is to help a user to locate the most similar documents that have the potential to satisfy the user information needs. To solve this problem, researchers have implemented several methods such as inverted index, Boolean querying, knowledge-based, neural network, probabilistic retrieval and machine learning approach [1]. A newer paradigm, generally considered to be the machine learning approach has attracted the attention of researchers in artificial intelligence, computer science, and other functional disciplines such as engineering, medicine, and business. In contrast to performance systems which acquire knowledge from human experts, machine learning systems acquire knowledge automatically from examples, i.e., from data source. The most frequently used techniques include symbolic, inductive learning algorithms, multiple-layered, feed-forward neural networks such as back propagation networks, and evolution-based GA[2] [3]. The GA was used because of its ability to determine document similarity by keywords competition. A keyword represents a gene (a bit pattern), a document's list of keywords represents individuals (a bit string), and a collection of documents initially judged relevant by a user represents the initial population. [4]. In this research we are interested to investigate the use of GA to determine a document similarity by using Horng and Yeh s fitness measure and to implement this technique in a prototype of Journal Browser. We have also implemented the formulation of Jaccard and formulation of Dice in the previous research. 2. Using GA for IRS The literature study revealed several implementations of genetic algorithms in information retrieval. For example, Gordon[5] proposed genetic algorithms based on approach of document indexing. Competing document descriptions (keywords) are associated with a document and altered over time by using genetic mutation and crossover operators. In his design, a keyword represents a gene (a bit pattern), a document's list of keywords represents individuals (a bit string), and a collection of documents initially judged relevant by a user represents the initial population. Based on a Jaccard's formulation (fitness measure), the initial population evolved through generations and eventually 1
2 converged to an optimal (improved) population - a set of keywords which best described the documents. Genetic Algorithm (GA) is a part of evolutionary computing, which is rapidly growing area of artificial intelligence. GA is inspired by Darwin s theory of evolution. Simply said, problems are solved by an evolutionary process resulting in a best (fittest) solution [6]. A GA maintains a population of individuals, P (t) = x 1, x n at iteration t. Each individual represents a potential solution to the problem at hand and is implemented as some data structure S. Each solution x i is evaluated to give some measure of fitness. Then a new population at iteration t+1 is formed by selecting the fitter individuals. Some members of the new population undergo transformation by means of genetic operators to form new solutions. There are unary transformations m i (mutation type), which create new individuals by a small change in a single individual and higher order transformations c j (crossover type), which create new individuals by combining parts from several (two or more) individuals. The crossover and mutation are the most important parts of the genetic algorithm [6]. The performance is influenced mainly by these two operators. The genetic algorithm was executed in the following steps: a. Encoding of a Chromosome Algorithm begins with a set of solutions (represented by chromosomes) called population. Solutions from one population are taken and used to form a new population. This is motivated by the expectation that the new population will be better than the old one. Solutions which are then selected to form new solutions (offspring) are selected according to their fitness [8]. In IRS the keywords used in the set of userselected documents were first identified to represent the underlying bit strings for the initial population. A chromosome is formed by gene which represents bit (0 and 1). Each bit represents the same unique keyword throughout the complete GA process. When a keyword is present in a document, the bit is set to 1; otherwise it is set to 0. Each document could then be represented in terms of a sequence of 0s and 1s which is called a chromosome model. At the beginning of a run of a GA a large population of random chromosomes is created. Chromosome models depend on the case. b. Crossover. This is simply the chance that two chromosomes will swap their bits. Crossover is performed by selecting a random gene along the length of the chromosomes and swapping all the genes after that point [7]. E.g. Given two chromosomes A: B: Choose a random bit along the length, say at position 9, and swap all the bits after that point, so the above become: c. Mutation A : B : After a crossover is performed, mutation takes place. Mutation is intended to prevent falling of all solutions in the population into a local optimum of the solved problem. Mutation operation randomly changes the offspring resulted from crossover. In case of binary encoding we can switch a few randomly chosen bits from 1 to 0 or from 0 to 1. Mutation can be illustrated as follows [7]: Original offspring Original offspring Mutated offspring Mutated offspring The technique of mutation (as well as crossover) depends mainly on the encoding of chromosomes. For example when we are encoding permutations, mutation could be performed as an exchange of two genes. d. Determination Of Population Determination of population on next generation is based on the fitness score. The higher level of fitness has a bigger probability to reproduce, while those with lower level of fitness have less probability to reproduce. Usually, this probability is selected by Roulette wheel selection method. In the next generation, all population is evaluated to determine whether they have reached the expected solution [7]. 2
3 3. Implementation In the previous research before we have implemented the Jaccard formulation and also the Dice formulation. In Jaccard s formulation, for 2 queries, the similarity value was found to be 25.00%. In Dice s formulation, for 2 queries, the similarity value was found to be 27.27%. For 3 queries, in Jaccard s formulation the similarity value was found to be 20.00%. In Dice s formulation, for 3 queries, the similarity value was found to be 27.39% [16]. 3.1 Jaccard Formulation Jaccard s score is formulated below: #( X Y ) = #( X Y ) Where #(S) showing number of element in S. For example [16]: S = {a, b, c, d, e, f, g, h, i, j} If X = {a, b, e, g, h, i, j} and Y = {b, c, d, f, g, j} then X Y X Y therefore = { b, g, j} #( X Y ) = #( X Y ) 3.2 Dice Formulation = { a, b, c, d, e, f, g, h, i, j} 3 10 = 0.3 Dice s score is formulated below: From this formula it can be seen that if X is equal to Y then Dice s score is equal to 1, meaning that document X is precisely the same as document Y, although this case is very difficult to be found in the database[17]. 3.3 Horng & Yeh Formulation In this research, Horng &Yeh formulation has been implemented in a form of a prototype of Journal Browser as shown in Fig.1, menu detail in Fig. 2 and Copy Files in Fig. 3. Horng & Yeh formulation as shown below: Where D is total of document retrieval and r (d) is a function of relevant document, d is set to 1 if a keyword present in document, otherwise it is to 0. The formulation of Horng and Yeh represented the value of similarity measure in Genetic Algorithm. The fitness with a higher score reflects a higher probability similarity of document. Application of this technique will facilitate searching and retrieval of the required document from one or more databases based on the similarity level [13] [14]. 4. The Experimental Result and its Analysis Some of the example results of document similarity are shown in appendix 1. Several queries have been given with different number of document request. The result shows that documents similarity levels are consistent even though the percentage of similarity may change. This means that the order of the document resulted from the queries may changes but most of the documents resulted in the top choice remain. When more documents were used as the source of the queries, the document similarity level may decrease because the fitness value decreases. For 2 queries, 3 queries and 4 queries, the similarity value can be seen as shown in Table 1, Table 2, Table 3, and Fig. 4. Table 1. Comparison of Percentage Similarity (2 Queries) Method Percentage Jaccard Dice Horng & Yeh Doc Doc Doc Doc Doc
4 Fig.1. Journal Browser Fig. 2. Menu Detail Fig. 3. Menu Copy Files 4
5 Table 2. Comparison of Percentage Similarity (3 Queries) Method Percentage Jaccard Dice Horng & Yeh Doc Doc Doc Doc Doc Table 3. Comparison of Percentage Similarity (4 Queries) Method Percentage Jaccard Dice Horng & Yeh Doc Doc Doc Doc Doc The experimental result, it is observed that if the sum of document as a query is increased, not only the sum of generation is increase but also the sum of mutation is increase as shown in Table 4 and Table 5. Table 4. Comparison of Generation Sum of Generation Sum of Jaccard Dice Horng & Yeh Query 2 Queries Queries Queries Table 5. Comparison of Mutation Sum of Mutation Sum of Jaccard Dice Horng & Yeh Query 2 Queries Queries Queries Percentage Comparison Of Document Similarity Jaccard Dice Horng & Yeh Formulation Percentage Doc. 1 Doc. 2 Doc. 3 Doc. 4 Doc. 5 Fig. 4. Comparison of Percentage Similarity The experimental result of document similarity by different formulation shows that the higher percentage of document similarity is a Dice formulation. In Dice s formulation was found to be 27.39%, Horng &Yeh s formulation was found to be 25.00%, and Jaccard s formulation was found to be 20.00% as shown in Table 2 and Fig. 4. S um O f G eneration Comparison Of Generation Jaccard Dice Horng & Yeh Formulation Fig. 5. Comparison of Generation 2 Queries 3 Queries 4 Queries It is observed that by increasing the sum generation, it can not be guaranteed that the similarity level of the retrieved document is also increased as shown in Fig. 5. 5
6 5. Conclusion and Future Work What we have learned from this is that we can draw some conclusions about overall trends in the technique of probability in document similarity by different formulation. It is observed that if the sum of query is increased, the percentage of similarity of the document retrieved may increase or decrease as shown above. But if the sum of the queries are increased the sum of crossover, mutation and generation may also increase. We can conclude that by increasing the sum of crossover, mutation and generation, it can not be guaranteed that the similarity level of the retrieved document is also increased. In a future study, we are going to use other method, and compare the result with the other method which has been developed. References [1]. Chen, Hinchey, Machine Learning for Information Retrieval: Neural Networks, Symbolic Learning, and Genetic Algorithms, Artificial Intelligence Lab, Eller College of Management, The University of Arizona, tml [2]. J. R. Quinlan. Discovering rules by induction from large collections of examples. In Expert Systems in the Micro-electronic Age, Pages , Michie, D., Editor, Edinburgh University Press, Edinburgh, Scotland, [3]. D. E. Rumelhart, G. E. Hinton, and R. J. Williams. Learning internal representations by error propagation. In Parallel Distributed Processing, pages , D. E. Rumelhart, J. L. McClelland, and the PDP Research Group, Editors, The MIT Press, Cambridge, MA, [4]. R.K.Belew.Adaptive information retrieval. In Proceedings of the Twelfth Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval, pages 11-20, Cambridge, MA, June 25-28, [5]. M. Gordon. Probabilistic and genetic algorithms for document retrieval. Communications of the ACM, 31(10): , October [6]. Hsiung, Sam, An Introduction to Genetic Algorithms, generation5, 2000, p, [7]. Anonym, Genetic Algorithm Tutorial, ai-junkie, [8]. Marek Obitko, Genetic Algorithms, [9]. D. E. Goldberg. Genetic and evolutionary algorithms come of age. Communications of the ACM, 37(3): , March [10]. H. Chen and K. J. Lynch. Automatic construction of networks of concepts characterizing document databases.ieee Transactions on Systems, Man and Cybernetics, 22(5): , September/October [11]. H. Chen, K. J. Lynch, K. Basu, and D. T. Ng. Generating, integrating, and activating thesauri for concept-based document retrieval. IEEE EXPERT, Special Series on Artificial Intelligence in Text-based Information Systems, 8(2):25-34, April [12]. H. Chen and J. Kim. GANNET: a machine learning approach to document retrieval. Journal of Management Information Systems, 11(3):7-41, Winter [13]. Cristina Lopez-Pujalte, Vicente P. Guerrero- Bote. Order-Based Fitness Functions for Genetic Algorithms Applied to Relevance Feedback. Journal of the American Society for Information Science and Technology- January 15, [14]. M. D. Gordon. User-based document clustering by redescribing subject descriptions with a genetic algorithm. Journal of the American Society for Information Science, 42(5): , June [15]. V. V. Raghavan and B. Agarwal. Optimal determination of user-oriented clusters: An application for the reproductive plan. In Proceedings of the Second International Conference on Genetic Algorithms and Their Applications, pages , Cambridge, MA, July [16]. Poltak Sihombing, Abdullah Embong and Putra Sumari. A Technique Of Probability In Document Similarity Comparison In Information Retrieval System. In Proceedings of the IMT-GT Conference. Parapat, Indonesia. June [17]. Poltak Sihombing, Abdullah Embong and Putra Sumari. Application of Genetic Algorithm to Determine a Document Similarity Level in IRS. In Proceedings of the Malaysian Software Engineering Conference 05 (MySec05). Penang, Malaysia. Des
7 APPENDIX 1. HORNG & YEH METHOD FOR 2 QUERIES ================= Information Retrieval Report 3/12/ :03:47 AM ================= Dokumen Yang Dipilih 1. Relevant Rule Discovery by Language Bias for Semantic Query Optimization Keywords: Database, Language, Bias, Generator, Knowledge, Discovery, Rule, Semantic, Query, Optimization 2. Using Rough Set Theory for Automatic Data Analysis Keywords: Database, Analysis, Rough Set, data, automatic ========== Himpunan Kata Kunci Keseluruhan ========== Database, Language, Bias, Generator, Knowledge, Discovery, Rule, Semantic, Query, Optimization, Analysis, Rough Set, data, automatic Generasi ke-1 Populasi Awal : [ ] [ ] Populasi Sesudah Seleksi : Populasi Sesudah Persilangan : Proses Mutasi : Populasi Sesudah Mutasi : ===== Generasi Akhir ===== Solusi : Kata kunci : Database, Analysis, Rough Set, data, automatic Metode yang digunakan: Horng and Yeh Lama Proses: 0 detik THE RETRIEVAL RESULT FOR 2 QUERIES: (22.22%)A New Approach to Active Rule Analysis and Reorganization (12.50%)Data Partitioning for Incremental Data Mining (12.50%)New Institutions for Doing Science From Databases to Open Source Biology (12.50%)Bias Generator for View Discovery in Deductive Databases (11.11%)Data Classification Techniques for Cancer Dataset HORNG & YEH METHOD FOR 3 QUERIES ================= Information Retrieval Report 3/12/ :03:47 AM ================= Dokumen Yang Dipilih 1. Relevant Rule Discovery by Language Bias for Semantic Query Optimization Keywords : Database, Language, Bias, Generator, Knowledge, Discovery, Rule, Semantic, Query, Optimization 2. Using Rough Set Theory for Automatic Data Analysis Keywords : Database, Analysis, Rough Set, data, automatic 3. A Comparative Study of Techniques to Handle Missing Values in the Classification Task of Data Mining Keywords : Data Mining, value, missing, attribute ========== Himpunan Kata Kunci Keseluruhan ========== Database, Language, Bias, Generator, Knowledge, Discovery, Rule, Semantic, Query, Optimization, Analysis, Rough Set, data, automatic, Data Mining, value, missing, attribute Generasi ke-1 Populasi Awal : [ ] [ ] [ ] 7
8 Populasi Sesudah Seleksi : Populasi Sesudah Persilangan : Proses Mutasi : Populasi Sesudah Mutasi : Generasi ke-2 Populasi Awal : [ ] [ ] [ ] Populasi Sesudah Seleksi : Proses Persilangan : Kromosom 1 dengan kromosom 3, posisi 15 Populasi Sesudah Persilangan : Proses Mutasi : Populasi Sesudah Mutasi : Generasi ke-3 Populasi Awal : [ ] [ ] [ ] Populasi Sesudah Seleksi : Proses Persilangan : Kromosom 3 dengan kromosom 1, posisi 13 Populasi Sesudah Persilangan : Proses Mutasi : Populasi Sesudah Mutasi : ===== Generasi Akhir ===== Solusi : Kata kunci : Database, Language, Bias, Generator, Knowledge, Discovery, Rule, Semantic, Query, Optimization Metode yang digunakan: Horng and Yeh Lama Proses: 1 detik THE RETRIEVAL RESULT FOR 3 QUERIES: (27.27%) Bias Generator for View Discovery in Deductive Databases (23.08%) Knowledge Acquisition for Viewbased Query Processing (23.08%) A New Approach to Active Rule Analysis and Reorganization (23.08%) Knowledge Discovery for Trigger Conflict Resolution (16.67%) SUT Filter: A System for Data Preparation to Support Knowledge Discovery 8
Keywords: Information Retrieval, Vector Space Model, Database, Similarity Measure, Genetic Algorithm.
Volume 3, Issue 8, August 2013 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Effective Information
More informationEvolutionary SAT Solver (ESS)
Ninth LACCEI Latin American and Caribbean Conference (LACCEI 2011), Engineering for a Smart Planet, Innovation, Information Technology and Computational Tools for Sustainable Development, August 3-5, 2011,
More informationISSN: 2319-5967 ISO 9001:2008 Certified International Journal of Engineering Science and Innovative Technology (IJESIT) Volume 2, Issue 3, May 2013
Transistor Level Fault Finding in VLSI Circuits using Genetic Algorithm Lalit A. Patel, Sarman K. Hadia CSPIT, CHARUSAT, Changa., CSPIT, CHARUSAT, Changa Abstract This paper presents, genetic based algorithm
More informationA Genetic Algorithm Processor Based on Redundant Binary Numbers (GAPBRBN)
ISSN: 2278 1323 All Rights Reserved 2014 IJARCET 3910 A Genetic Algorithm Processor Based on Redundant Binary Numbers (GAPBRBN) Miss: KIRTI JOSHI Abstract A Genetic Algorithm (GA) is an intelligent search
More informationA Robust Method for Solving Transcendental Equations
www.ijcsi.org 413 A Robust Method for Solving Transcendental Equations Md. Golam Moazzam, Amita Chakraborty and Md. Al-Amin Bhuiyan Department of Computer Science and Engineering, Jahangirnagar University,
More informationA Novel Framework for Personalized Web Search
A Novel Framework for Personalized Web Search Aditi Sharan a, * Mayank Saini a a School of Computer and Systems Sciences, Jawaharlal Nehru University, New Delhi-67, India Abstract One hundred users, one
More informationComparison of Major Domination Schemes for Diploid Binary Genetic Algorithms in Dynamic Environments
Comparison of Maor Domination Schemes for Diploid Binary Genetic Algorithms in Dynamic Environments A. Sima UYAR and A. Emre HARMANCI Istanbul Technical University Computer Engineering Department Maslak
More informationDECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES Vijayalakshmi Mahanra Rao 1, Yashwant Prasad Singh 2 Multimedia University, Cyberjaya, MALAYSIA 1 lakshmi.mahanra@gmail.com
More informationGA as a Data Optimization Tool for Predictive Analytics
GA as a Data Optimization Tool for Predictive Analytics Chandra.J 1, Dr.Nachamai.M 2,Dr.Anitha.S.Pillai 3 1Assistant Professor, Department of computer Science, Christ University, Bangalore,India, chandra.j@christunivesity.in
More informationModified Version of Roulette Selection for Evolution Algorithms - the Fan Selection
Modified Version of Roulette Selection for Evolution Algorithms - the Fan Selection Adam S lowik, Micha l Bia lko Department of Electronic, Technical University of Koszalin, ul. Śniadeckich 2, 75-453 Koszalin,
More informationMemory Allocation Technique for Segregated Free List Based on Genetic Algorithm
Journal of Al-Nahrain University Vol.15 (2), June, 2012, pp.161-168 Science Memory Allocation Technique for Segregated Free List Based on Genetic Algorithm Manal F. Younis Computer Department, College
More informationCellular Automaton: The Roulette Wheel and the Landscape Effect
Cellular Automaton: The Roulette Wheel and the Landscape Effect Ioan Hălălae Faculty of Engineering, Eftimie Murgu University, Traian Vuia Square 1-4, 385 Reşiţa, Romania Phone: +40 255 210227, Fax: +40
More informationCollege of information technology Department of software
University of Babylon Undergraduate: third class College of information technology Department of software Subj.: Application of AI lecture notes/2011-2012 ***************************************************************************
More informationGenetic Algorithms and Sudoku
Genetic Algorithms and Sudoku Dr. John M. Weiss Department of Mathematics and Computer Science South Dakota School of Mines and Technology (SDSM&T) Rapid City, SD 57701-3995 john.weiss@sdsmt.edu MICS 2009
More informationAlpha Cut based Novel Selection for Genetic Algorithm
Alpha Cut based Novel for Genetic Algorithm Rakesh Kumar Professor Girdhar Gopal Research Scholar Rajesh Kumar Assistant Professor ABSTRACT Genetic algorithm (GA) has several genetic operators that can
More informationA Parallel Processor for Distributed Genetic Algorithm with Redundant Binary Number
A Parallel Processor for Distributed Genetic Algorithm with Redundant Binary Number 1 Tomohiro KAMIMURA, 2 Akinori KANASUGI 1 Department of Electronics, Tokyo Denki University, 07ee055@ms.dendai.ac.jp
More informationD A T A M I N I N G C L A S S I F I C A T I O N
D A T A M I N I N G C L A S S I F I C A T I O N FABRICIO VOZNIKA LEO NARDO VIA NA INTRODUCTION Nowadays there is huge amount of data being collected and stored in databases everywhere across the globe.
More informationNumerical Research on Distributed Genetic Algorithm with Redundant
Numerical Research on Distributed Genetic Algorithm with Redundant Binary Number 1 Sayori Seto, 2 Akinori Kanasugi 1,2 Graduate School of Engineering, Tokyo Denki University, Japan 10kme41@ms.dendai.ac.jp,
More informationIntroduction To Genetic Algorithms
1 Introduction To Genetic Algorithms Dr. Rajib Kumar Bhattacharjya Department of Civil Engineering IIT Guwahati Email: rkbc@iitg.ernet.in References 2 D. E. Goldberg, Genetic Algorithm In Search, Optimization
More informationLab 4: 26 th March 2012. Exercise 1: Evolutionary algorithms
Lab 4: 26 th March 2012 Exercise 1: Evolutionary algorithms 1. Found a problem where EAs would certainly perform very poorly compared to alternative approaches. Explain why. Suppose that we want to find
More informationVolume 3, Issue 2, February 2015 International Journal of Advance Research in Computer Science and Management Studies
Volume 3, Issue 2, February 2015 International Journal of Advance Research in Computer Science and Management Studies Research Article / Survey Paper / Case Study Available online at: www.ijarcsms.com
More informationDesign of Web Ranking Module using Genetic Algorithm
Design of Web Ranking Module using Genetic Algorithm Vikas Thada Research Scholar Dr.K.N.M. University Newai, India Vivek Jaglan, Ph.D Asst.Prof(CSE),ASET Amity University Gurgaon, India ABSTRACT Crawling
More informationA Non-Linear Schema Theorem for Genetic Algorithms
A Non-Linear Schema Theorem for Genetic Algorithms William A Greene Computer Science Department University of New Orleans New Orleans, LA 70148 bill@csunoedu 504-280-6755 Abstract We generalize Holland
More informationEstimation of the COCOMO Model Parameters Using Genetic Algorithms for NASA Software Projects
Journal of Computer Science 2 (2): 118-123, 2006 ISSN 1549-3636 2006 Science Publications Estimation of the COCOMO Model Parameters Using Genetic Algorithms for NASA Software Projects Alaa F. Sheta Computers
More informationHolland s GA Schema Theorem
Holland s GA Schema Theorem v Objective provide a formal model for the effectiveness of the GA search process. v In the following we will first approach the problem through the framework formalized by
More informationSocial Business Intelligence Text Search System
Social Business Intelligence Text Search System Sagar Ligade ME Computer Engineering. Pune Institute of Computer Technology Pune, India ABSTRACT Today the search engine plays the important role in the
More informationHow To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
More informationComparative Study: ACO and EC for TSP
Comparative Study: ACO and EC for TSP Urszula Boryczka 1 and Rafa l Skinderowicz 1 and Damian Świstowski1 1 University of Silesia, Institute of Computer Science, Sosnowiec, Poland, e-mail: uboryczk@us.edu.pl
More informationOriginal Article Efficient Genetic Algorithm on Linear Programming Problem for Fittest Chromosomes
International Archive of Applied Sciences and Technology Volume 3 [2] June 2012: 47-57 ISSN: 0976-4828 Society of Education, India Website: www.soeagra.com/iaast/iaast.htm Original Article Efficient Genetic
More informationA Review of Data Mining Techniques
Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,
More informationSelection Procedures for Module Discovery: Exploring Evolutionary Algorithms for Cognitive Science
Selection Procedures for Module Discovery: Exploring Evolutionary Algorithms for Cognitive Science Janet Wiles (j.wiles@csee.uq.edu.au) Ruth Schulz (ruth@csee.uq.edu.au) Scott Bolland (scottb@csee.uq.edu.au)
More information14.10.2014. Overview. Swarms in nature. Fish, birds, ants, termites, Introduction to swarm intelligence principles Particle Swarm Optimization (PSO)
Overview Kyrre Glette kyrrehg@ifi INF3490 Swarm Intelligence Particle Swarm Optimization Introduction to swarm intelligence principles Particle Swarm Optimization (PSO) 3 Swarms in nature Fish, birds,
More informationManagement Science Letters
Management Science Letters 4 (2014) 905 912 Contents lists available at GrowingScience Management Science Letters homepage: www.growingscience.com/msl Measuring customer loyalty using an extended RFM and
More informationStock price prediction using genetic algorithms and evolution strategies
Stock price prediction using genetic algorithms and evolution strategies Ganesh Bonde Institute of Artificial Intelligence University Of Georgia Athens,GA-30601 Email: ganesh84@uga.edu Rasheed Khaled Institute
More informationNew Modifications of Selection Operator in Genetic Algorithms for the Traveling Salesman Problem
New Modifications of Selection Operator in Genetic Algorithms for the Traveling Salesman Problem Radovic, Marija; and Milutinovic, Veljko Abstract One of the algorithms used for solving Traveling Salesman
More informationGenetic Algorithm Evolution of Cellular Automata Rules for Complex Binary Sequence Prediction
Brill Academic Publishers P.O. Box 9000, 2300 PA Leiden, The Netherlands Lecture Series on Computer and Computational Sciences Volume 1, 2005, pp. 1-6 Genetic Algorithm Evolution of Cellular Automata Rules
More informationInternational Journal of Software and Web Sciences (IJSWS) www.iasir.net
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) ISSN (Print): 2279-0063 ISSN (Online): 2279-0071 International
More informationGenetic Algorithms. Rule Discovery. Data Mining
Genetic Algorithms for Rule Discovery in Data Mining Magnus Erik Hvass Pedersen (971055) Daimi, University of Aarhus, October 2003 1 Introduction The purpose of this document is to verify attendance of
More informationFART Neural Network based Probabilistic Motif Discovery in Unaligned Biological Sequences
FART Neural Network based Probabilistic Motif Discovery in Unaligned Biological Sequences M. Hemalatha, P. Ranjit Jeba Thangaiah and K. Vivekanandan, Member IEEE Abstract Finding Motif in bio-sequences
More informationAsexual Versus Sexual Reproduction in Genetic Algorithms 1
Asexual Versus Sexual Reproduction in Genetic Algorithms Wendy Ann Deslauriers (wendyd@alumni.princeton.edu) Institute of Cognitive Science,Room 22, Dunton Tower Carleton University, 25 Colonel By Drive
More informationEvolutionary Algorithms using Evolutionary Algorithms
Evolutionary Algorithms using Evolutionary Algorithms Neeraja Ganesan, Sowmya Ravi & Vandita Raju S.I.E.S GST, Mumbai, India E-mail: neer.ganesan@gmail.com, Abstract A subset of AI is, evolutionary algorithm
More informationMultiple Layer Perceptron Training Using Genetic Algorithms
Multiple Layer Perceptron Training Using Genetic Algorithms Udo Seiffert University of South Australia, Adelaide Knowledge-Based Intelligent Engineering Systems Centre (KES) Mawson Lakes, 5095, Adelaide,
More informationA Detailed Study on Information Retrieval using Genetic Algorithm
Journal of Industrial and Intelligent Information Vol. 1, No. 3, September 2013 A Detailed Study on Information Retrieval using Genetic Algorithm Md. Abu Kausar Department of Computer & System Sciences
More informationEvolutionary Prefetching and Caching in an Independent Storage Units Model
Evolutionary Prefetching and Caching in an Independent Units Model Athena Vakali Department of Informatics Aristotle University of Thessaloniki, Greece E-mail: avakali@csdauthgr Abstract Modern applications
More informationUSING GENETIC ALGORITHM IN NETWORK SECURITY
USING GENETIC ALGORITHM IN NETWORK SECURITY Ehab Talal Abdel-Ra'of Bader 1 & Hebah H. O. Nasereddin 2 1 Amman Arab University. 2 Middle East University, P.O. Box: 144378, Code 11814, Amman-Jordan Email:
More informationResearch RESOURCE SCHEDULING OF CONSTRUCTION PROJECTS USING GENETIC ALGORITHM Devikamalam. J 1, Jane Helena. H 2. Address for Correspondence
Devikamalam, et al., International Journal of Advanced Engineering Technology Research Paper RESOURCE SCHEDULING OF CONSTRUCTION PROJECTS USING GENETIC ALGORITHM Devikamalam. J 1, Jane Helena. H 2 Address
More informationA Service Revenue-oriented Task Scheduling Model of Cloud Computing
Journal of Information & Computational Science 10:10 (2013) 3153 3161 July 1, 2013 Available at http://www.joics.com A Service Revenue-oriented Task Scheduling Model of Cloud Computing Jianguang Deng a,b,,
More informationVol. 35, No. 3, Sept 30,2000 ملخص تعتبر الخوارزمات الجينية واحدة من أفضل طرق البحث من ناحية األداء. فبالرغم من أن استخدام هذه الطريقة ال يعطي الحل
AIN SHAMS UNIVERSITY FACULTY OF ENGINEERING Vol. 35, No. 3, Sept 30,2000 SCIENTIFIC BULLETIN Received on : 3/9/2000 Accepted on: 28/9/2000 pp : 337-348 GENETIC ALGORITHMS AND ITS USE WITH BACK- PROPAGATION
More informationSTUDY ON APPLICATION OF GENETIC ALGORITHM IN CONSTRUCTION RESOURCE LEVELLING
STUDY ON APPLICATION OF GENETIC ALGORITHM IN CONSTRUCTION RESOURCE LEVELLING N.Satheesh Kumar 1,R.Raj Kumar 2 PG Student, Department of Civil Engineering, Kongu Engineering College, Perundurai, Tamilnadu,India
More informationA SURVEY ON GENETIC ALGORITHM FOR INTRUSION DETECTION SYSTEM
A SURVEY ON GENETIC ALGORITHM FOR INTRUSION DETECTION SYSTEM MS. DIMPI K PATEL Department of Computer Science and Engineering, Hasmukh Goswami college of Engineering, Ahmedabad, Gujarat ABSTRACT The Internet
More informationCHAPTER 6 GENETIC ALGORITHM OPTIMIZED FUZZY CONTROLLED MOBILE ROBOT
77 CHAPTER 6 GENETIC ALGORITHM OPTIMIZED FUZZY CONTROLLED MOBILE ROBOT 6.1 INTRODUCTION The idea of evolutionary computing was introduced by (Ingo Rechenberg 1971) in his work Evolutionary strategies.
More informationAbout the Author. The Role of Artificial Intelligence in Software Engineering. Brief History of AI. Introduction 2/27/2013
About the Author The Role of Artificial Intelligence in Software Engineering By: Mark Harman Presented by: Jacob Lear Mark Harman is a Professor of Software Engineering at University College London Director
More informationData Mining Using Neural Genetic Approach: A Review
RESEARCH ARTICLE OPEN ACCESS Data Mining Using Neural Genetic Approach: A Review Parvez Rahi 1, Bhumika Gupta 2, Sombir Singh Bisht 3 ( 1, 2,3 Dept. of Computer Science & Engineering, G. B. Pant Engineering
More informationOptimizing Machine Allocation in Semiconductor Manufacturing Capacity Planning using. Bio-Inspired Approaches
Optimizing Machine Allocation in Semiconductor Manufacturing Capacity Planning using Umi Kalsom Yusof School of Computer Sciences Universiti Sains Malaysia 11800 USM, Penang, Malaysia umiyusof@cs.usm.my
More informationHorizontal Partitioning by Predicate Abstraction and its Application to Data Warehouse Design
Horizontal Partitioning by Predicate Abstraction and its Application to Data Warehouse Design Aleksandar Dimovski 1, Goran Velinov 2, and Dragan Sahpaski 2 1 Faculty of Information-Communication Technologies,
More informationEFFICIENT DATA PRE-PROCESSING FOR DATA MINING
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College
More informationEvolutionary Detection of Rules for Text Categorization. Application to Spam Filtering
Advances in Intelligent Systems and Technologies Proceedings ECIT2004 - Third European Conference on Intelligent Systems and Technologies Iasi, Romania, July 21-23, 2004 Evolutionary Detection of Rules
More informationGenetic algorithms for changing environments
Genetic algorithms for changing environments John J. Grefenstette Navy Center for Applied Research in Artificial Intelligence, Naval Research Laboratory, Washington, DC 375, USA gref@aic.nrl.navy.mil Abstract
More informationPassive Microwave Remote Sensing for Sea Ice Thickness Retrieval Using Neural Network and Genetic Algorithm
Progress In Electromagnetics Research Symposium, Beijing, China, March 23 27, 2009 229 Passive Microwave Remote Sensing for Sea Ice Thickness Retrieval Using Neural Network and Genetic Algorithm H. J.
More informationGenetic Algorithm Performance with Different Selection Strategies in Solving TSP
Proceedings of the World Congress on Engineering Vol II WCE, July 6-8,, London, U.K. Genetic Algorithm Performance with Different Selection Strategies in Solving TSP Noraini Mohd Razali, John Geraghty
More informationThe Applications of Genetic Algorithms in Stock Market Data Mining Optimisation
The Applications of Genetic Algorithms in Stock Market Data Mining Optimisation Li Lin, Longbing Cao, Jiaqi Wang, Chengqi Zhang Faculty of Information Technology, University of Technology, Sydney, NSW
More informationGenetic Algorithm for Event Scheduling System
Genetic Algorithm for Event Scheduling 1 Abd. Samad Hasan Basari, 2 Siti Musliza Jalal, 3 Burairah Hussin and 4 Nabihah Mohd Isa Faculty of Information and Communication Technology UTeM, Hang Tuah Jaya,
More informationGENETIC ALGORITHM FORECASTING FOR TELECOMMUNICATIONS PRODUCTS
1 GENETIC ALGORITHM FORECASTING FOR TELECOMMUNICATIONS PRODUCTS STEPHEN D. SLOAN, RAYMOND W. SAW, JAMES J. SLUSS, JR., MONTE P. TULL, AND JOSEPH P. HAVLICEK School of Electrical & Computer Engineering
More informationA Fast Computational Genetic Algorithm for Economic Load Dispatch
A Fast Computational Genetic Algorithm for Economic Load Dispatch M.Sailaja Kumari 1, M.Sydulu 2 Email: 1 Sailaja_matam@Yahoo.com 1, 2 Department of Electrical Engineering National Institute of Technology,
More informationGenetic Algorithms commonly used selection, replacement, and variation operators Fernando Lobo University of Algarve
Genetic Algorithms commonly used selection, replacement, and variation operators Fernando Lobo University of Algarve Outline Selection methods Replacement methods Variation operators Selection Methods
More informationFeature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier D.Nithya a, *, V.Suganya b,1, R.Saranya Irudaya Mary c,1 Abstract - This paper presents,
More informationSanjeev Kumar. contribute
RESEARCH ISSUES IN DATAA MINING Sanjeev Kumar I.A.S.R.I., Library Avenue, Pusa, New Delhi-110012 sanjeevk@iasri.res.in 1. Introduction The field of data mining and knowledgee discovery is emerging as a
More informationEffect of Using Neural Networks in GA-Based School Timetabling
Effect of Using Neural Networks in GA-Based School Timetabling JANIS ZUTERS Department of Computer Science University of Latvia Raina bulv. 19, Riga, LV-1050 LATVIA janis.zuters@lu.lv Abstract: - The school
More informationE-mail Spam Filtering Using Genetic Algorithm: A Deeper Analysis
ISSN:0975-9646 Mandeep hodhary et al, / (IJSIT) International Journal of omputer Science and Information Technologies, Vol. 6 (5), 205, 4266-4270 E-mail Spam Filtering Using Genetic Algorithm: A Deeper
More informationSolving Banana (Rosenbrock) Function Based on Fitness Function
Available online at www.worldscientificnews.com WSN 6 (2015) 41-56 EISSN 2392-2192 Solving Banana (Rosenbrock) Function Based on Fitness Function Lubna Zaghlul Bashir 1,a, Rajaa Salih Mohammed Hasan 2,b
More informationAn Automated Guided Model For Integrating News Into Stock Trading Strategies Pallavi Parshuram Katke 1, Ass.Prof. B.R.Solunke 2
www.ijecs.in International Journal Of Engineering And Computer Science ISSN:2319-7242 Volume 4 Issue - 12 December, 2015 Page No. 15312-15316 An Automated Guided Model For Integrating News Into Stock Trading
More informationApplying Design Patterns in Distributing a Genetic Algorithm Application
Applying Design Patterns in Distributing a Genetic Algorithm Application Nick Burns Mike Bradley Mei-Ling L. Liu California Polytechnic State University Computer Science Department San Luis Obispo, CA
More informationCHAPTER 3 SECURITY CONSTRAINED OPTIMAL SHORT-TERM HYDROTHERMAL SCHEDULING
60 CHAPTER 3 SECURITY CONSTRAINED OPTIMAL SHORT-TERM HYDROTHERMAL SCHEDULING 3.1 INTRODUCTION Optimal short-term hydrothermal scheduling of power systems aims at determining optimal hydro and thermal generations
More informationTowards Heuristic Web Services Composition Using Immune Algorithm
Towards Heuristic Web Services Composition Using Immune Algorithm Jiuyun Xu School of Computer & Communication Engineering China University of Petroleum xujiuyun@ieee.org Stephan Reiff-Marganiec Department
More informationA Binary Model on the Basis of Imperialist Competitive Algorithm in Order to Solve the Problem of Knapsack 1-0
212 International Conference on System Engineering and Modeling (ICSEM 212) IPCSIT vol. 34 (212) (212) IACSIT Press, Singapore A Binary Model on the Basis of Imperialist Competitive Algorithm in Order
More informationEffects of Symbiotic Evolution in Genetic Algorithms for Job-Shop Scheduling
Proceedings of the th Hawaii International Conference on System Sciences - 00 Effects of Symbiotic Evolution in Genetic Algorithms for Job-Shop Scheduling Yasuhiro Tsujimura Yuichiro Mafune Mitsuo Gen
More informationResearch Article Service Composition Optimization Using Differential Evolution and Opposition-based Learning
Research Journal of Applied Sciences, Engineering and Technology 11(2): 229-234, 2015 ISSN: 2040-7459; e-issn: 2040-7467 2015 Maxwell Scientific Publication Corp. Submitted: May 20, 2015 Accepted: June
More informationUsing Genetic Algorithms in Secured Business Intelligence Mobile Applications
Informatica Economică vol. 15, no. 1/2011 69 Using Genetic Algorithms in Secured Business Intelligence Mobile Applications Silvia TRIF Academy of Economic Studies, Bucharest, Romania silviatrif@gmail.com
More informationA Review And Evaluations Of Shortest Path Algorithms
A Review And Evaluations Of Shortest Path Algorithms Kairanbay Magzhan, Hajar Mat Jani Abstract: Nowadays, in computer networks, the routing is based on the shortest path problem. This will help in minimizing
More informationHybrid Evolution of Heterogeneous Neural Networks
Hybrid Evolution of Heterogeneous Neural Networks 01001110 01100101 01110101 01110010 01101111 01101110 01101111 01110110 01100001 00100000 01110011 01101011 01110101 01110000 01101001 01101110 01100001
More informationOptimal Tuning of PID Controller Using Meta Heuristic Approach
International Journal of Electronic and Electrical Engineering. ISSN 0974-2174, Volume 7, Number 2 (2014), pp. 171-176 International Research Publication House http://www.irphouse.com Optimal Tuning of
More informationKeywords revenue management, yield management, genetic algorithm, airline reservation
Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Revenue Management
More informationA Survey on Intrusion Detection System with Data Mining Techniques
A Survey on Intrusion Detection System with Data Mining Techniques Ms. Ruth D 1, Mrs. Lovelin Ponn Felciah M 2 1 M.Phil Scholar, Department of Computer Science, Bishop Heber College (Autonomous), Trichirappalli,
More informationImproving Decision Making and Managing Knowledge
Improving Decision Making and Managing Knowledge Decision Making and Information Systems Information Requirements of Key Decision-Making Groups in a Firm Senior managers, middle managers, operational managers,
More informationMaster's projects at ITMO University. Daniil Chivilikhin PhD Student @ ITMO University
Master's projects at ITMO University Daniil Chivilikhin PhD Student @ ITMO University General information Guidance from our lab's researchers Publishable results 2 Research areas Research at ITMO Evolutionary
More informationCLOUD DATABASE ROUTE SCHEDULING USING COMBANATION OF PARTICLE SWARM OPTIMIZATION AND GENETIC ALGORITHM
CLOUD DATABASE ROUTE SCHEDULING USING COMBANATION OF PARTICLE SWARM OPTIMIZATION AND GENETIC ALGORITHM *Shabnam Ghasemi 1 and Mohammad Kalantari 2 1 Deparment of Computer Engineering, Islamic Azad University,
More informationHOST SCHEDULING ALGORITHM USING GENETIC ALGORITHM IN CLOUD COMPUTING ENVIRONMENT
International Journal of Research in Engineering & Technology (IJRET) Vol. 1, Issue 1, June 2013, 7-12 Impact Journals HOST SCHEDULING ALGORITHM USING GENETIC ALGORITHM IN CLOUD COMPUTING ENVIRONMENT TARUN
More informationl 8 r 3 l 9 r 1 l 3 l 7 l 1 l 6 l 5 l 10 l 2 l 4 r 2
Heuristic Algorithms for the Terminal Assignment Problem Sami Khuri Teresa Chiu Department of Mathematics and Computer Science San Jose State University One Washington Square San Jose, CA 95192-0103 khuri@jupiter.sjsu.edu
More informationConstrained Classification of Large Imbalanced Data by Logistic Regression and Genetic Algorithm
Constrained Classification of Large Imbalanced Data by Logistic Regression and Genetic Algorithm Martin Hlosta, Rostislav Stríž, Jan Kupčík, Jaroslav Zendulka, and Tomáš Hruška A. Imbalanced Data Classification
More informationAn Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
More informationRepresentation of Electronic Mail Filtering Profiles: A User Study
Representation of Electronic Mail Filtering Profiles: A User Study Michael J. Pazzani Department of Information and Computer Science University of California, Irvine Irvine, CA 92697 +1 949 824 5888 pazzani@ics.uci.edu
More informationInternational Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS) www.iasir.net
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
More informationPLAANN as a Classification Tool for Customer Intelligence in Banking
PLAANN as a Classification Tool for Customer Intelligence in Banking EUNITE World Competition in domain of Intelligent Technologies The Research Report Ireneusz Czarnowski and Piotr Jedrzejowicz Department
More informationIntroduction to Data Mining Techniques
Introduction to Data Mining Techniques Dr. Rajni Jain 1 Introduction The last decade has experienced a revolution in information availability and exchange via the internet. In the same spirit, more and
More informationOptimizing Testing Efficiency with Error-Prone Path Identification and Genetic Algorithms
Optimizing Testing Efficiency with Error-Prone Path Identification and Genetic Algorithms James R. Birt Renate Sitte Griffith University, School of Information Technology Gold Coast Campus PBM 50 Gold
More informationA survey on Data Mining based Intrusion Detection Systems
International Journal of Computer Networks and Communications Security VOL. 2, NO. 12, DECEMBER 2014, 485 490 Available online at: www.ijcncs.org ISSN 2308-9830 A survey on Data Mining based Intrusion
More informationRandomly Encryption Using Genetic Algorithm
Randomly Encryption Using Genetic Algorithm ALI JASSIM MOHAMED ALI Department of physics, College of Science, Al-Mustansiriyah University, Baghdad, Iraq. SUMMARY In this research work a genetic algorithm
More informationPractical Applications of Evolutionary Computation to Financial Engineering
Hitoshi Iba and Claus C. Aranha Practical Applications of Evolutionary Computation to Financial Engineering Robust Techniques for Forecasting, Trading and Hedging 4Q Springer Contents 1 Introduction to
More informationA New Method for Traffic Forecasting Based on the Data Mining Technology with Artificial Intelligent Algorithms
Research Journal of Applied Sciences, Engineering and Technology 5(12): 3417-3422, 213 ISSN: 24-7459; e-issn: 24-7467 Maxwell Scientific Organization, 213 Submitted: October 17, 212 Accepted: November
More informationAn evolutionary learning spam filter system
An evolutionary learning spam filter system Catalin Stoean 1, Ruxandra Gorunescu 2, Mike Preuss 3, D. Dumitrescu 4 1 University of Craiova, Romania, catalin.stoean@inf.ucv.ro 2 University of Craiova, Romania,
More information