Prediction of Software Development Modication Eort Enhanced by a Genetic Algorithm
|
|
|
- Ashlyn Thornton
- 9 years ago
- Views:
Transcription
1 Prediction of Software Development Modication Eort Enhanced by a Genetic Algorithm Gerg Balogh, Ádám Zoltán Végh, and Árpád Beszédes Department of Software Engineering University of Szeged, Szeged, Hungary {geryxyz,azvegh,beszedes}@inf.u-szeged.hu Abstract. During the planning, development, and maintenance of software projects one of the main challenges is to accurately predict the modication cost of a particular piece of code. Several methods are traditionally applied for this purpose and many of them are based on static code investigation. We experimented with a combined use of product and process attributes (metrics) to improve cost prediction, and we applied machine learning to this end. The method depends on several important parameters which can signicantly inuence the success of the learning model. In the present work, we overview the usage of search based methods (one genetic algorithm in particular) to calibrate these parameters. For the rst set of experiments four industrial projects were analysed, and the accuracy of the predictions was compared to previous results. We found that by calibrating the parameters using search based methods we could achieve signicant improvement in the overall eciency of the prediction, from about 50% to 70% (F-measure). Keywords: software development, eort estimation, modication eort, genetic algorithm 1 Introduction One of the tasks in software cost estimation, especially in the evolution phase, is to predict the cost (required eort) for modifying a piece of code. A possible approach for such modication eort prediction is to use various software attributes from historical development data and from the current version of the software. The attributes can be expressed in the form of software metrics, both product and process. Product metrics are calculated by preforming the static analysis of the software (a simple example is the logical lines of code), while process metrics can represent time-related quantities collected during project implementation (for example, the Net Development Time of the modications). The metrics can then be used, with the help of a model, to predict the modication costs. Numerous articles were written about experiences using process or product metrics [1,2], but researches using both are rare. Our previous works tried to predict the cost of future modications by applying both product metrics calculated from the source code and process metrics collected during the development [3]. We compared the results when using only one type of metrics with the case when both kinds of metrics had been used. We applied machine learning to create the model for the prediction. In an advanced model we used a more complex metric for expressing the modication eort, which was the aggregation of dierent kinds of modications like creation, deletion, and type change. To express the modication eort in a single value, we used dierent parameters (weights) for the dierent kinds of modications. The choice of these parameters was crucial for the accuracy of prediction and their calibration was not simple. In the present work, we
2 report on our early experiences in applying search based methods to determine these parameters (a basic genetic algorithm (ga) was used for this purpose). A typical improvement of as much as 20 percentage points was achieved in the combined prediction accuracy (F-measure on the precision and recall) when comparing the model with initial parameters to the one obtained after running the search method. We summarize the experiments with the following research questions: rq. 1 Is it possible to dene a better model for software development eort prediction than the one used in paper [ 3,4]? In particular, we seek for a better way to express modication eort. rq. 2 Can the ga be used to improve the precision of the estimation of software development eort compared to the results in paper [3,4]? In particular, what rate of improvement can be obtained after calibrating the crucial parameters of the cost metric and re-applying them to the model? To achieve the mentioned goals we implemented a framework which is capable to collect and aggregate product and process metrics from various sources including the source code and the integrated development environment. The framework detects the modications between revisions, and tries to predict the eort of further changes. 2 Overview The overall process is shown in Fig. 1. The experiment starts with a measurement phase where the used data is collected from various sources: the metrics about the evolution of the software, the source code from the svn version controlling system, and the metrics estimations which were given by the project manager. This phase has two main tasks; to collect and calculate the process and product metrics, and to detect and group the modications of the source code between the revisions. The metrics and the modication groups are sent to the ga which prepares a population of individual entities. During the initial set-up of the population and the evolution steps, two metrics are calculated: Typed Modication (TMod), which was dened as the weighted count of modications between two revisions; and Modication Eort (Meff), the ratio of TMod and the net development time of these modications. Afterwards the prediction model is tested and its weighted F-measure value is used as tness to rank the individuals in the population and select the best entities for breeding. As the nal step of the evolution cycle, the new weights of the modication are calculated and the model is updated. When the precision reaches an appropriate value, the ga stops and a new, enhanced model is built using the weights of the best entity in the nal population. This Meff prediction model is the output of the execution of the framework. Data was collected during the experiment from about 800 revision, about 75 days long. Both r&d and industrial projects were analysed and the source code was written in Java language using the Java ee 6 virtual machine and the Seam 2 framework. Altogether 2200 records were collected as learning set. 3 Modication Eort Prediction Process and product metrics were used together per source le basis as separated entities. Then, the appropriate cost function was calculated from this data. The metrics used as predictors were the followings: (i) Logical Lines Of Code (lloc) (ii) c&k metric suit dened by Chidamber and Kemerer[2] (iii) Estimated development time of a task, aggregated into 3 groups: short, medium,
3 Fig. 1. Overview of the experiment long. (tt) (iv) Developer's Experience in Programming (dep) (v) Number of File Access (nfa) (vi) Number of Developers of File (ndf) (vii) Development Time (dt) 1 To extend our previous framework, a new metric called Modication Eort ( Meff) was dened which was calculated as the following: rst, the modications were grouped, based on the target entity (e.g.: method) and the preformed action (e.g.: creation) [5]. The weighted count of the pre viously dened modications was called Typed Modication (TMod), this expresses the dierent amount of developer's eort used in the modications. The ratio of the net development time with the TMod is the Meff metric. The weight was dened on the basis of the per groups of modication. We established a single rule for each disjunct group which determined the contained relation. The following modication groups were measured. ˆ class ž creation ž deletion ž accessibility change ˆ method ž creation ž deletion ž accessibility change ž prototype change ž return type change ž size change ˆ data member ž creation ž deletion ž accessibility change ž type change Instead of the Level of Modication Complexity (lmc) dened as the ratio of the Eectively Changed Lines Of Code (ecloc) for the next change of the le/class and dt, the net development time between two revisions [3,4] Meff was used as the target function, which was equally labelled with: low, medium and high values. The Weka framework [6] machine learning and the 10-fold cross-validation utility were used to evaluate our model. The weight sum of the F-measure was chosen as the tness value which is the harmonic mean of precision and recall. 4 Genetic Algorithm sbse was used to ne tune the weights of the modication. The initial weight-vector was set by our developer experience. The algorithm was iterated on the basis of previously set scenario. Our assumption was that the genetic algorithm should converge to the suitable weights, which should provide a more accurate estimation [7]. 1 detailed in paper [3]
4 As previously described, a weighted sum of the count of modication group was used as the target function. A genetic algorithm (ga) was used to ne tune the weights of each group. The individuals identied by its chromosome, which is a vector over the real numbers with the same dimension. In the model each chromosome represents a weight-vector, and every element determines the weight of a single modication group. The tness value is calculated for each individual by evaluating the model with the weights dened in that particular individual. The nal goal of the ga is to improve the precision of the model. For classication problems, the F-measure value can give a reliable approximation of the accuracy. The base model was not enhanced with the ga, but it was evaluated using the F-measure. Thus the F-measure was chosen to be the tness value of the ga. An evolution step starts with the breeding process which consists of two steps, rst, the ga selects the two best entities with its tness value. The crossover operator will apply only this pair. Every call of the crossover operator produces exactly one child. The algorithm repeats the operation to produce more than one child. We used a uniform crossover logic. During the crossover the algorithm iterates via the elements of the chromosome (vector) and randomly chooses an element from one of the two parents. Every element has the same chance to be copied into the child's chromosome [8]. The chromosome of the children is subject to mutation. A lower limit and an upper limit were determined for the weights of the groups. During the mutation some elements (weights) of the chromosome change. The algorithm gets the half of the distance between the limits and the current selected weight and sets the current value either to the lower or to the upper half point. This way, the two limits are never exceeded. Then, the child is included in the population. The individuals with the worst tness value are killed (removed from the population) to maintain the size of the population, this way the current evolution step is completed and the algorithm proceeds to the next generation [9,10]. The above mentioned ga parameters and their values are shown in Tab. 1. Table 1. ga parameters initial mutation rate 100% mutation rate 50% mutation lower limit 0.5 mutation upper limit 100 birth count 2 child per evolution step crossover rate 2 crossover per evolution step population size 200 individuals generation count 50 generation 5 Preliminary Results As shown in Fig. 2, the tness value of the prediction grows in every case. Tab. 2 shows the same tness values. As it can be seen in Tab. 3, the average grows with about 18 percent. It is also relevant that in the worst case our model proves to be better with about 16 percent. With these pieces of information the two research questions can be answered.
5 Fig. 2. Fitness values of prediction per project Table 2. Fitness values of prediction per project Project1 Project2 Project3 Project4 base model 59,8000% 47,0000% 44,2000% 45,3000% enhanced model 75,2630% 66,6425% 67,4835% 60,2791% Table 3. Comparison of models worst best average median base experiment 44,2000% 59,8000% 49,0750% 46,1500% enhanced model 60,2791% 75,2630% 67,4170% 67,0630% dierence 16,0791% 15,4630% 18,3420% 20,9130% rq. 1 Yes, it is possible. Our model gives a better estimation from the beginning of the evolution and the population average tness value is higher in every generation. rq. 2 Yes, the use of the ga can improve precision. With this simple ga implementation a signi cant improvement was reached. The weight of groups was aggregated from all four projects and weighted with the size of the learning set. Two aspects were created to examine the validity of the weights calculated by the ga. Fig. 3. Aggregated weights of groups
6 These aspects are shown in Fig. 5. The values can be interpreted as the importance of a mod ication, i.e. how much gain will be achieved by applying the modication. The diagram on the left shows a by action aggregation. As it can be seen, the creation and deletion are more important than the type and visibility changes. On the right side, a subject based aggregation can be seen. The most important modication was applied on the method elements which included the method body modications as well. 6 Conclusion and Plans In this paper, we present two new metrics and a new procedure with which we can increase the eectiveness of our modication cost prediction method based on product and process metrics and machine learning. Our previous results have been improved with the introduction of new metrics (TMod, Meff) used as target function, and by using a genetic algorithm to calibrate certain crucial parameters required by the model. We were able to increase the success of the prediction model signicantly. We manually investigated the nal parameter values produced by the ga. These parameters seem to be valid based on our own developer experience, but further analysis will be needed to validate the results. In the future, we plan to prepare a new set of parameters by collecting developer experience within our team. We plan to use a questionnaire and compare the results with the automatically calculated ones, which do not use the bias of developer experiment. We also plan to repeat the experiment on a bigger dataset and apply the method to achieve dierent goals like comparing projects or developer productivity. We would like to ne tune our ga with the testing of some other crossover operator or tness value. References 1. Mockus, A., Weiss, D.M., Zhang, P.: Understanding and predicting eort in software projects. In: Proceedings of the 25th International Conference on Software Engineering, IEEE Computer Society (2003) Chidamber, S.R., Kemerer, C.F.: A metrics suite for object oriented design. IEEE Transactions on Software Engineering 20(6) (1994) Tóth, G., Végh, A.Z., Beszédes, A., Gyimóthy, T.: Adding Process Metrics to Enhance Modication Complexity Prediction. In: Proceedings of the 19th IEEE International Conference on Program Com prehension (ICPC'11), Ieee (June 2011) Tóth, G., Végh, A.Z., Beszédes, A., Schrettner, L., Gergely, T., Gyimóthy, T.: Adjusting Eort Esti mation Using Micro-Productivity Proles. In: Proceedings of the 12th Symposium on Programming Languages and Software Tools (SPLST'11). (2011) Abdi, M.K., Lounis, H., Sahraoui, H.: Using Coupling Metrics for Change Impact Analysis in Objec t-oriented Systems. In: Proceedings of the 10th ECOOP Workshop on Quantitative Approaches in ObjectOriented Software Engineering QAOOSE 06. (2006) Holmes, G., Donkin, A., Witten, I.: WEKA: a machine learning workbench. In: Proceedings of ANZIIS '94 - Australian New Zealnd Intelligent Information Systems Conference, IEEE Cavicchio, D.: Adaptive search using simulated evolution. (1970) 8. Syswerda, G.: Uniform Crossover in Genetic Algorithms. (1989) Melanie, M.: An introduction to genetic algorithms. Cambridge, Massachusetts London, England, Fifth (1999) 10. Sivanandam, S.: Introduction to genetic algorithms. (2007)
Introduction to Logistic Regression
OpenStax-CNX module: m42090 1 Introduction to Logistic Regression Dan Calderon This work is produced by OpenStax-CNX and licensed under the Creative Commons Attribution License 3.0 Abstract Gives introduction
An intelligent stock trading decision support system through integration of genetic algorithm based fuzzy neural network and articial neural network
Fuzzy Sets and Systems 118 (2001) 21 45 www.elsevier.com/locate/fss An intelligent stock trading decision support system through integration of genetic algorithm based fuzzy neural network and articial
1. Introduction MINING AND TRACKING EVOLVING WEB USER TRENDS FROM LARGE WEB SERVER LOGS. Basheer Hawwash and Olfa Nasraoui
MINING AND TRACKING EVOLVING WEB USER TRENDS FROM LARGE WEB SERVER LOGS Basheer Hawwash and Olfa Nasraoui Knowledge Discovery and Web Mining Lab Dept. of Computer Engineering and Computer Science University
A Parallel Processor for Distributed Genetic Algorithm with Redundant Binary Number
A Parallel Processor for Distributed Genetic Algorithm with Redundant Binary Number 1 Tomohiro KAMIMURA, 2 Akinori KANASUGI 1 Department of Electronics, Tokyo Denki University, [email protected]
A Hybrid Tabu Search Method for Assembly Line Balancing
Proceedings of the 7th WSEAS International Conference on Simulation, Modelling and Optimization, Beijing, China, September 15-17, 2007 443 A Hybrid Tabu Search Method for Assembly Line Balancing SUPAPORN
Evaluation of Students' Modeling and Programming Skills
Evaluation of Students' Modeling and Programming Skills Birgit Demuth, Sebastian Götz, Harry Sneed, and Uwe Schmidt Technische Universität Dresden Faculty of Computer Science Abstract. In winter semester
Numerical Research on Distributed Genetic Algorithm with Redundant
Numerical Research on Distributed Genetic Algorithm with Redundant Binary Number 1 Sayori Seto, 2 Akinori Kanasugi 1,2 Graduate School of Engineering, Tokyo Denki University, Japan [email protected],
Background knowledge-enrichment for bottom clauses improving.
Background knowledge-enrichment for bottom clauses improving. Orlando Muñoz Texzocotetla and René MacKinney-Romero Departamento de Ingeniería Eléctrica Universidad Autónoma Metropolitana México D.F. 09340,
Goldberg, D. E. (1989). Genetic algorithms in search, optimization, and machine learning. Reading, MA:
is another objective that the GA could optimize. The approach used here is also adaptable. On any particular project, the designer can congure the GA to focus on optimizing certain constraints (such as
Classification of Learners Using Linear Regression
Proceedings of the Federated Conference on Computer Science and Information Systems pp. 717 721 ISBN 978-83-60810-22-4 Classification of Learners Using Linear Regression Marian Cristian Mihăescu Software
Predicting Critical Problems from Execution Logs of a Large-Scale Software System
Predicting Critical Problems from Execution Logs of a Large-Scale Software System Árpád Beszédes, Lajos Jenő Fülöp and Tibor Gyimóthy Department of Software Engineering, University of Szeged Árpád tér
Optimization of Preventive Maintenance Scheduling in Processing Plants
18 th European Symposium on Computer Aided Process Engineering ESCAPE 18 Bertrand Braunschweig and Xavier Joulia (Editors) 2008 Elsevier B.V./Ltd. All rights reserved. Optimization of Preventive Maintenance
arxiv:1506.04135v1 [cs.ir] 12 Jun 2015
Reducing offline evaluation bias of collaborative filtering algorithms Arnaud de Myttenaere 1,2, Boris Golden 1, Bénédicte Le Grand 3 & Fabrice Rossi 2 arxiv:1506.04135v1 [cs.ir] 12 Jun 2015 1 - Viadeo
Alpha Cut based Novel Selection for Genetic Algorithm
Alpha Cut based Novel for Genetic Algorithm Rakesh Kumar Professor Girdhar Gopal Research Scholar Rajesh Kumar Assistant Professor ABSTRACT Genetic algorithm (GA) has several genetic operators that can
Open Source Software: How Can Design Metrics Facilitate Architecture Recovery?
Open Source Software: How Can Design Metrics Facilitate Architecture Recovery? Eleni Constantinou 1, George Kakarontzas 2, and Ioannis Stamelos 1 1 Computer Science Department Aristotle University of Thessaloniki
Oscillations of the Sending Window in Compound TCP
Oscillations of the Sending Window in Compound TCP Alberto Blanc 1, Denis Collange 1, and Konstantin Avrachenkov 2 1 Orange Labs, 905 rue Albert Einstein, 06921 Sophia Antipolis, France 2 I.N.R.I.A. 2004
interactive automatic (rules) automatic (patterns) interactive REP ENVIRONMENT KERNEL
AN APPROACH TO SOFTWARE CHANGE MANAGEMENT SUPPORT Jun Han Peninsula School of Computing and Information Technology Monash University, McMahons Road, Frankston, Vic 3199, Australia phone: +61 3 99044604,
Data quality in Accounting Information Systems
Data quality in Accounting Information Systems Comparing Several Data Mining Techniques Erjon Zoto Department of Statistics and Applied Informatics Faculty of Economy, University of Tirana Tirana, Albania
Contents. Dedication List of Figures List of Tables. Acknowledgments
Contents Dedication List of Figures List of Tables Foreword Preface Acknowledgments v xiii xvii xix xxi xxv Part I Concepts and Techniques 1. INTRODUCTION 3 1 The Quest for Knowledge 3 2 Problem Description
New Modifications of Selection Operator in Genetic Algorithms for the Traveling Salesman Problem
New Modifications of Selection Operator in Genetic Algorithms for the Traveling Salesman Problem Radovic, Marija; and Milutinovic, Veljko Abstract One of the algorithms used for solving Traveling Salesman
Memory Allocation Technique for Segregated Free List Based on Genetic Algorithm
Journal of Al-Nahrain University Vol.15 (2), June, 2012, pp.161-168 Science Memory Allocation Technique for Segregated Free List Based on Genetic Algorithm Manal F. Younis Computer Department, College
Genetic Algorithms and Sudoku
Genetic Algorithms and Sudoku Dr. John M. Weiss Department of Mathematics and Computer Science South Dakota School of Mines and Technology (SDSM&T) Rapid City, SD 57701-3995 [email protected] MICS 2009
Extension of Decision Tree Algorithm for Stream Data Mining Using Real Data
Fifth International Workshop on Computational Intelligence & Applications IEEE SMC Hiroshima Chapter, Hiroshima University, Japan, November 10, 11 & 12, 2009 Extension of Decision Tree Algorithm for Stream
Experiments in Web Page Classification for Semantic Web
Experiments in Web Page Classification for Semantic Web Asad Satti, Nick Cercone, Vlado Kešelj Faculty of Computer Science, Dalhousie University E-mail: {rashid,nick,vlado}@cs.dal.ca Abstract We address
A Review And Evaluations Of Shortest Path Algorithms
A Review And Evaluations Of Shortest Path Algorithms Kairanbay Magzhan, Hajar Mat Jani Abstract: Nowadays, in computer networks, the routing is based on the shortest path problem. This will help in minimizing
The «SQALE» Analysis Model An analysis model compliant with the representation condition for assessing the Quality of Software Source Code
The «SQALE» Analysis Model An analysis model compliant with the representation condition for assessing the Quality of Software Source Code Jean-Louis Letouzey DNV IT Global Services Arcueil, France [email protected]
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES
DECISION TREE INDUCTION FOR FINANCIAL FRAUD DETECTION USING ENSEMBLE LEARNING TECHNIQUES Vijayalakshmi Mahanra Rao 1, Yashwant Prasad Singh 2 Multimedia University, Cyberjaya, MALAYSIA 1 [email protected]
Classification of Titanic Passenger Data and Chances of Surviving the Disaster Data Mining with Weka and Kaggle Competition Data
Proceedings of Student-Faculty Research Day, CSIS, Pace University, May 2 nd, 2014 Classification of Titanic Passenger Data and Chances of Surviving the Disaster Data Mining with Weka and Kaggle Competition
Modified Version of Roulette Selection for Evolution Algorithms - the Fan Selection
Modified Version of Roulette Selection for Evolution Algorithms - the Fan Selection Adam S lowik, Micha l Bia lko Department of Electronic, Technical University of Koszalin, ul. Śniadeckich 2, 75-453 Koszalin,
Choosing the Best Classification Performance Metric for Wrapper-based Software Metric Selection for Defect Prediction
Choosing the Best Classification Performance Metric for Wrapper-based Software Metric Selection for Defect Prediction Huanjing Wang Western Kentucky University [email protected] Taghi M. Khoshgoftaar
Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis
Towards SoMEST Combining Social Media Monitoring with Event Extraction and Timeline Analysis Yue Dai, Ernest Arendarenko, Tuomo Kakkonen, Ding Liao School of Computing University of Eastern Finland {yvedai,
ISSN: 2319-5967 ISO 9001:2008 Certified International Journal of Engineering Science and Innovative Technology (IJESIT) Volume 2, Issue 3, May 2013
Transistor Level Fault Finding in VLSI Circuits using Genetic Algorithm Lalit A. Patel, Sarman K. Hadia CSPIT, CHARUSAT, Changa., CSPIT, CHARUSAT, Changa Abstract This paper presents, genetic based algorithm
Keywords revenue management, yield management, genetic algorithm, airline reservation
Volume 4, Issue 1, January 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Revenue Management
A Robust Method for Solving Transcendental Equations
www.ijcsi.org 413 A Robust Method for Solving Transcendental Equations Md. Golam Moazzam, Amita Chakraborty and Md. Al-Amin Bhuiyan Department of Computer Science and Engineering, Jahangirnagar University,
PLAANN as a Classification Tool for Customer Intelligence in Banking
PLAANN as a Classification Tool for Customer Intelligence in Banking EUNITE World Competition in domain of Intelligent Technologies The Research Report Ireneusz Czarnowski and Piotr Jedrzejowicz Department
1 INTRODUCTION TO SYSTEM ANALYSIS AND DESIGN
1 INTRODUCTION TO SYSTEM ANALYSIS AND DESIGN 1.1 INTRODUCTION Systems are created to solve problems. One can think of the systems approach as an organized way of dealing with a problem. In this dynamic
Three types of messages: A, B, C. Assume A is the oldest type, and C is the most recent type.
Chronological Sampling for Email Filtering Ching-Lung Fu 2, Daniel Silver 1, and James Blustein 2 1 Acadia University, Wolfville, Nova Scotia, Canada 2 Dalhousie University, Halifax, Nova Scotia, Canada
Introduction to Data Mining
Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association
Chapter 1: Key Concepts of Programming and Software Engineering
Chapter 1: Key Concepts of Programming and Software Engineering Software Engineering Coding without a solution design increases debugging time - known fact! A team of programmers for a large software development
CLOUD DATABASE ROUTE SCHEDULING USING COMBANATION OF PARTICLE SWARM OPTIMIZATION AND GENETIC ALGORITHM
CLOUD DATABASE ROUTE SCHEDULING USING COMBANATION OF PARTICLE SWARM OPTIMIZATION AND GENETIC ALGORITHM *Shabnam Ghasemi 1 and Mohammad Kalantari 2 1 Deparment of Computer Engineering, Islamic Azad University,
A Comparison of Genotype Representations to Acquire Stock Trading Strategy Using Genetic Algorithms
2009 International Conference on Adaptive and Intelligent Systems A Comparison of Genotype Representations to Acquire Stock Trading Strategy Using Genetic Algorithms Kazuhiro Matsui Dept. of Computer Science
Architecture bits. (Chromosome) (Evolved chromosome) Downloading. Downloading PLD. GA operation Architecture bits
A Pattern Recognition System Using Evolvable Hardware Masaya Iwata 1 Isamu Kajitani 2 Hitoshi Yamada 2 Hitoshi Iba 1 Tetsuya Higuchi 1 1 1-1-4,Umezono,Tsukuba,Ibaraki,305,Japan Electrotechnical Laboratory
L25: Ensemble learning
L25: Ensemble learning Introduction Methods for constructing ensembles Combination strategies Stacked generalization Mixtures of experts Bagging Boosting CSCE 666 Pattern Analysis Ricardo Gutierrez-Osuna
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification
Performance Analysis of Naive Bayes and J48 Classification Algorithm for Data Classification Tina R. Patil, Mrs. S. S. Sherekar Sant Gadgebaba Amravati University, Amravati [email protected], [email protected]
Predictive Analytics using Genetic Algorithm for Efficient Supply Chain Inventory Optimization
182 IJCSNS International Journal of Computer Science and Network Security, VOL.10 No.3, March 2010 Predictive Analytics using Genetic Algorithm for Efficient Supply Chain Inventory Optimization P.Radhakrishnan
Empirical study of Software Quality Evaluation in Agile Methodology Using Traditional Metrics
Empirical study of Software Quality Evaluation in Agile Methodology Using Traditional Metrics Kumi Jinzenji NTT Software Innovation Canter NTT Corporation Tokyo, Japan [email protected] Takashi
Content-Based Discovery of Twitter Influencers
Content-Based Discovery of Twitter Influencers Chiara Francalanci, Irma Metra Department of Electronics, Information and Bioengineering Polytechnic of Milan, Italy [email protected] [email protected]
A Non-Linear Schema Theorem for Genetic Algorithms
A Non-Linear Schema Theorem for Genetic Algorithms William A Greene Computer Science Department University of New Orleans New Orleans, LA 70148 bill@csunoedu 504-280-6755 Abstract We generalize Holland
Runtime Hardware Reconfiguration using Machine Learning
Runtime Hardware Reconfiguration using Machine Learning Tanmay Gangwani University of Illinois, Urbana-Champaign [email protected] Abstract Tailoring the machine hardware to varying needs of the software
EXTENDED ANGEL: KNOWLEDGE-BASED APPROACH FOR LOC AND EFFORT ESTIMATION FOR MULTIMEDIA PROJECTS IN MEDICAL DOMAIN
EXTENDED ANGEL: KNOWLEDGE-BASED APPROACH FOR LOC AND EFFORT ESTIMATION FOR MULTIMEDIA PROJECTS IN MEDICAL DOMAIN Sridhar S Associate Professor, Department of Information Science and Technology, Anna University,
Path Selection Methods for Localized Quality of Service Routing
Path Selection Methods for Localized Quality of Service Routing Xin Yuan and Arif Saifee Department of Computer Science, Florida State University, Tallahassee, FL Abstract Localized Quality of Service
Predictive Model Representation and Comparison: Towards Data and Predictive Models Governance
Predictive Model Representation and Comparison Towards Data and Predictive Models Governance Mokhairi Makhtar, Daniel C. Neagu, Mick Ridley School of Computing, Informatics and Media, University of Bradford
Overview. Evaluation Connectionist and Statistical Language Processing. Test and Validation Set. Training and Test Set
Overview Evaluation Connectionist and Statistical Language Processing Frank Keller [email protected] Computerlinguistik Universität des Saarlandes training set, validation set, test set holdout, stratification
A Genetic Algorithm Approach for Solving a Flexible Job Shop Scheduling Problem
A Genetic Algorithm Approach for Solving a Flexible Job Shop Scheduling Problem Sayedmohammadreza Vaghefinezhad 1, Kuan Yew Wong 2 1 Department of Manufacturing & Industrial Engineering, Faculty of Mechanical
SYSTEMS AND SOFTWARE REQUIREMENTS SPECIFICATION (SSRS) TEMPLATE. Version A.4, January 2014 FOREWORD DOCUMENT CONVENTIONS
SYSTEMS AND SOFTWARE REQUIREMENTS SPECIFICATION (SSRS) TEMPLATE Version A.4, January 2014 FOREWORD This document was written to provide software development projects with a template for generating a System
GA as a Data Optimization Tool for Predictive Analytics
GA as a Data Optimization Tool for Predictive Analytics Chandra.J 1, Dr.Nachamai.M 2,Dr.Anitha.S.Pillai 3 1Assistant Professor, Department of computer Science, Christ University, Bangalore,India, [email protected]
Proposal and Analysis of Stock Trading System Using Genetic Algorithm and Stock Back Test System
Proposal and Analysis of Stock Trading System Using Genetic Algorithm and Stock Back Test System Abstract: In recent years, many brokerage firms and hedge funds use a trading system based on financial
A Service Revenue-oriented Task Scheduling Model of Cloud Computing
Journal of Information & Computational Science 10:10 (2013) 3153 3161 July 1, 2013 Available at http://www.joics.com A Service Revenue-oriented Task Scheduling Model of Cloud Computing Jianguang Deng a,b,,
W6.B.1. FAQs CS535 BIG DATA W6.B.3. 4. If the distance of the point is additionally less than the tight distance T 2, remove it from the original set
http://wwwcscolostateedu/~cs535 W6B W6B2 CS535 BIG DAA FAQs Please prepare for the last minute rush Store your output files safely Partial score will be given for the output from less than 50GB input Computer
Business Intelligence Integration. Joel Da Costa, Takudzwa Mabande, Richard Migwalla Antoine Bagula, Joseph Balikuddembe
Business Intelligence Integration Joel Da Costa, Takudzwa Mabande, Richard Migwalla Antoine Bagula, Joseph Balikuddembe Project Description Business Intelligence (BI) is the practice of using computer
Genetic Algorithm. Based on Darwinian Paradigm. Intrinsically a robust search and optimization mechanism. Conceptual Algorithm
24 Genetic Algorithm Based on Darwinian Paradigm Reproduction Competition Survive Selection Intrinsically a robust search and optimization mechanism Slide -47 - Conceptual Algorithm Slide -48 - 25 Genetic
Data Mining - Evaluation of Classifiers
Data Mining - Evaluation of Classifiers Lecturer: JERZY STEFANOWSKI Institute of Computing Sciences Poznan University of Technology Poznan, Poland Lecture 4 SE Master Course 2008/2009 revised for 2010
Requirements. Version Control History. Implementation Technology. Software Process. Developers
If Your Version Control System Could Talk... Thomas Ball Jung-Min Kim Adam A. Porter Harvey P. Siy Bell Laboratories Dept. of Computer Sciences Bell Laboratories Lucent Technologies University of Maryland
Management Science Letters
Management Science Letters 4 (2014) 905 912 Contents lists available at GrowingScience Management Science Letters homepage: www.growingscience.com/msl Measuring customer loyalty using an extended RFM and
A Novel Binary Particle Swarm Optimization
Proceedings of the 5th Mediterranean Conference on T33- A Novel Binary Particle Swarm Optimization Motaba Ahmadieh Khanesar, Member, IEEE, Mohammad Teshnehlab and Mahdi Aliyari Shoorehdeli K. N. Toosi
Quality prediction model for object oriented software using UML metrics
THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS TECHNICAL REPORT OF IEICE. UML Quality prediction model for object oriented software using UML metrics CAMARGO CRUZ ANA ERIKA and KOICHIRO
Applying Design Patterns in Distributing a Genetic Algorithm Application
Applying Design Patterns in Distributing a Genetic Algorithm Application Nick Burns Mike Bradley Mei-Ling L. Liu California Polytechnic State University Computer Science Department San Luis Obispo, CA
Optimization of sampling strata with the SamplingStrata package
Optimization of sampling strata with the SamplingStrata package Package version 1.1 Giulio Barcaroli January 12, 2016 Abstract In stratified random sampling the problem of determining the optimal size
Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods
Mining Direct Marketing Data by Ensembles of Weak Learners and Rough Set Methods Jerzy B laszczyński 1, Krzysztof Dembczyński 1, Wojciech Kot lowski 1, and Mariusz Paw lowski 2 1 Institute of Computing
Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm
1 Enhancing the SNR of the Fiber Optic Rotation Sensor using the LMS Algorithm Hani Mehrpouyan, Student Member, IEEE, Department of Electrical and Computer Engineering Queen s University, Kingston, Ontario,
Cross-Validation. Synonyms Rotation estimation
Comp. by: BVijayalakshmiGalleys0000875816 Date:6/11/08 Time:19:52:53 Stage:First Proof C PAYAM REFAEILZADEH, LEI TANG, HUAN LIU Arizona State University Synonyms Rotation estimation Definition is a statistical
Introduction To Genetic Algorithms
1 Introduction To Genetic Algorithms Dr. Rajib Kumar Bhattacharjya Department of Civil Engineering IIT Guwahati Email: [email protected] References 2 D. E. Goldberg, Genetic Algorithm In Search, Optimization
High-Mix Low-Volume Flow Shop Manufacturing System Scheduling
Proceedings of the 14th IAC Symposium on Information Control Problems in Manufacturing, May 23-25, 2012 High-Mix Low-Volume low Shop Manufacturing System Scheduling Juraj Svancara, Zdenka Kralova Institute
Investigation of Adherence Degree of Agile Requirements Engineering Practices in Non-Agile Software Development Organizations
Investigation of Adherence Degree of Agile Requirements Engineering Practices in Non-Agile Software Development Organizations Mennatallah H. Ibrahim Department of Computers and Information Sciences Institute
An Introduction to. Metrics. used during. Software Development
An Introduction to Metrics used during Software Development Life Cycle www.softwaretestinggenius.com Page 1 of 10 Define the Metric Objectives You can t control what you can t measure. This is a quote
Evaluation & Validation: Credibility: Evaluating what has been learned
Evaluation & Validation: Credibility: Evaluating what has been learned How predictive is a learned model? How can we evaluate a model Test the model Statistical tests Considerations in evaluating a Model
Keywords Data mining, Classification Algorithm, Decision tree, J48, Random forest, Random tree, LMT, WEKA 3.7. Fig.1. Data mining techniques.
International Journal of Emerging Research in Management &Technology Research Article October 2015 Comparative Study of Various Decision Tree Classification Algorithm Using WEKA Purva Sewaiwar, Kamal Kant
How To Understand The Impact Of A Computer On Organization
International Journal of Research in Engineering & Technology (IJRET) Vol. 1, Issue 1, June 2013, 1-6 Impact Journals IMPACT OF COMPUTER ON ORGANIZATION A. D. BHOSALE 1 & MARATHE DAGADU MITHARAM 2 1 Department
Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control
Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control Andre BERGMANN Salzgitter Mannesmann Forschung GmbH; Duisburg, Germany Phone: +49 203 9993154, Fax: +49 203 9993234;
A Brief Study of the Nurse Scheduling Problem (NSP)
A Brief Study of the Nurse Scheduling Problem (NSP) Lizzy Augustine, Morgan Faer, Andreas Kavountzis, Reema Patel Submitted Tuesday December 15, 2009 0. Introduction and Background Our interest in the
Introducing diversity among the models of multi-label classification ensemble
Introducing diversity among the models of multi-label classification ensemble Lena Chekina, Lior Rokach and Bracha Shapira Ben-Gurion University of the Negev Dept. of Information Systems Engineering and
Program Visualization for Programming Education Case of Jeliot 3
Program Visualization for Programming Education Case of Jeliot 3 Roman Bednarik, Andrés Moreno, Niko Myller Department of Computer Science University of Joensuu [email protected] Abstract:
A Sarsa based Autonomous Stock Trading Agent
A Sarsa based Autonomous Stock Trading Agent Achal Augustine The University of Texas at Austin Department of Computer Science Austin, TX 78712 USA [email protected] Abstract This paper describes an autonomous
Numerical Algorithms for Predicting Sports Results
Numerical Algorithms for Predicting Sports Results by Jack David Blundell, 1 School of Computing, Faculty of Engineering ABSTRACT Numerical models can help predict the outcome of sporting events. The features
A simple algorithm with no simple verication
A simple algorithm with no simple verication Laszlo Csirmaz Central European University Abstract The correctness of a simple sorting algorithm is resented, which algorithm is \evidently wrong" at the rst
Prediction of Stock Performance Using Analytical Techniques
136 JOURNAL OF EMERGING TECHNOLOGIES IN WEB INTELLIGENCE, VOL. 5, NO. 2, MAY 2013 Prediction of Stock Performance Using Analytical Techniques Carol Hargreaves Institute of Systems Science National University
A Genetic Algorithm-Evolved 3D Point Cloud Descriptor
A Genetic Algorithm-Evolved 3D Point Cloud Descriptor Dominik Wȩgrzyn and Luís A. Alexandre IT - Instituto de Telecomunicações Dept. of Computer Science, Univ. Beira Interior, 6200-001 Covilhã, Portugal
