Machine Learning-based Software Testing: Towards a Classification Framework
|
|
|
- Lorraine Gibbs
- 9 years ago
- Views:
Transcription
1 Machine Learning-based Software Testing: Towards a Classification Framework Mahdi Noorian 1, Ebrahim Bagheri 1,2, and Wheichang Du 1 University of New Brunswick, Fredericton, Canada 1 Athabasca University, Edmonton, Canada 2 [email protected], [email protected], [email protected] Abstract Software Testing (ST) processes attempt to verify and validate the capability of a software system to meet its required attributes and functionality. As software systems become more complex, the need for automated software testing methods emerges. Machine Learning (ML) techniques have shown to be quite useful for this automation process. Various works have been presented in the junction of ML and ST areas. The lack of general guidelines for applying appropriate learning methods for software testing purposes is our major motivation in this current paper. In this paper, we introduce a classification framework which can help to systematically review research work in the ML and ST domains. The proposed framework dimensions are defined using major characteristics of existing software testing and machine learning methods. Our framework can be used to effectively construct a concrete set of guidelines for choosing the most appropriate learning method and applying it to a distinct stage of the software testing life-cycle for automation purposes. I. INTRODUCTION Software Testing (ST) is an investigation process which attempts to validate and verify the alignment of a software system s attributes and functionality with its intended goals. Software testing is a labour intensive and costly process and as mentioned in [2], [3], a testing process may require up to 50% of the development resources. Due to this fact, automated testing approaches are desired to reduce this cost and time. Besides, automation can significantly enhance the testing process performance. Hence, for future software systems development, steps need to be taken towards the development of automated testing strategies [4]. Several interesting attempts have already been made for automating the software testing process. Machine Learning (ML) as a sub domain of AI [12] is widely used in various stages of the software development life-cycle [19], especially for automating software testing processes [5]. In [1], [17], evolutionary algorithms have been employed for automating test case generation. In [16], Artificial Neural Networks (ANN) have been used to build a model for estimating the effectiveness of the generated test cases. Briand et al. in [8] has proposed a method based on the C4.5 decision tree algorithm for predicting potential bugs in a software system and localizing the bugs in order to reduce the debugging process time. All research works show that the employment of machine learning techniques is a promising approach for automating testing processes. But, still some major questions remain that need to be addressed and further investigated such as: What types of machine learning methods can be effective in different aspects of software testing? What are the strengths and weaknesses of each learning method in the context of testing? How can we determine the proper type of learning method for the stages of a testing process? Where are the critical points in a software testing process in which ML can positively contribute? The general purpose of our ongoing research is to provide a specific set of guidelines for automating software testing processes with the aid of machine leaning techniques. To come up with such set of guidelines, it is required to systematically analyse the current research work and find answers to some of the abovementioned questions. In the first step, we propose a framework which can be used to classify the current research work in the conjunction of ML and ST. In addition, to support our framework, some works are reviewed. This classification framework can assist ST practitioners to analyse and understand the emerging applications of the ML-ST domain. Such structured classification framework can be useful for defining and constructing a set of guidelines for automating software testing processes. The reminder of this paper is as follows. In Section II, the proposed classification framework is presented. In addition, the dimensions of the framework are discussed in detail. Section III is devoted to presenting some work in the area of ML and ST. In Section IV we discuss how this set of representative work can be classified according to our proposed framework. We conclude the paper with conclusions and direction for future work in Section V. II. THE PROPOSED FRAMEWORK FOR CLASSIFICATION During the past decade, several works have been introduced based on machine learning techniques where learning algorithms have been applied to the various stages of software testing. In order to clarify the stand point of ML in the area of ST, we propose a classification framework. In this section, we take a look at the proposed framework and its dimensions. A. The Framework s Dimensions The most important factors that can be used to distinguish the current research work are represented in Fig. 1. In its highest layer, the framework consists of two main categories
2 TABLE I TESTING GENERAL ACTIVITY DIMENTION Sub-dimension Possible value A: Test Planning Testing cost estimation [9] Test case prioritization [1] B: Test Case Management Test case design [1] Test case refinement [6] Test case evaluation [16] Fault localization [18] C: Debugging Bug prioritization [18] Fault prediction [15] (Testing and ML). According to the defined categories, six main dimensions and some sub-dimensions are obtainable in the classification framework. The framework s dimensions are as follows: Dimension 1-Testing approach: According to the older testing terminology [2], we define the testing approach dimension with three possible values: black-box, white-box, and graybox [2]. Based on the black-box approach, testing can be performed using the external description of a software system such as the software specification. In a white-box approach, the internal properties of a software system like source code can be used for testing purposes. The grey-box approach is a combination of the two, which considers both internal and external properties at the same time. Dimension 2-Testing general activity: In the second dimension, the standard testing process life-cycle [13] inspired us to define the a) test planning, b) test case management, and c) debugging sub-dimensions. With respect to the testing process life-cycle, we found out that these three phases are critical and that automation can effectively assist them in order to reduce the cost and time of the whole testing process. The possible activities that can be automated based on ML in a, b, and c are presented in Table I. In the test planning sub-dimension, testing cost estimation can help test managers to predict testing process cost and time and provide good testing plan to manage the testing process efficiently. Test case management includes several tasks such as test case prioritization, which intends to prioritize the test input space in terms of test case effectiveness; test case design, which intends to generate high quality test cases; test case refinement, which intends to map the current specification of a software system to the existing test cases in order to reuse the available test cases; and test case evaluation which intends to measure the quality of the generated test cases. In the Debugging sub-dimension, fault localization can help to find the exact location of the program that is defected. In addition, bug prioritization intends to prioritize the revealed faults based on their severities; later test engineer can focus on more critical faults accordingly. Fault prediction can assist test engineers in the debugging stage, in the sense that potential faults for a given program are predicted. Dimension 3-Testing level: Software development process includes a range of stages form requirement analysis to implementation [13]. The development process goes further even after the implementation stage and concentrates on software maintenance. In Dimension 3, we have employed the following widely accepted testing levels in order to classify existing work: acceptance testing, system testing, integration testing, module testing, unit testing, and regression testing, which refer to requirement analysis, architectural design, subsystem design, detailed design, implementation, and maintenance in the software development stages, respectively. Dimension 4-Learning technique: In the machine learning area, various types of learning methods have been introduced and each of them has its own specific characteristics. We adopt Mitchel [12] learning method classification and define learning technique dimension values as follows: Decision Trees (DT), Artificial Neural Networks (ANN), Genetic Algorithms (GA), Bayesian Learning (BL), Instance Base Learning (IBL), Clustering, and Hybrid methods, which can be combination of several other learning methods. Dimension 5-Learning property: Each learning method has its own specific characteristics. This dimension is devoted to evaluating learning methods properties in various aspects. To do so, three sub-dimensions; Training data properties, Supervision, Time generalization are defined. The Training data properties evaluate the properties of a data set. The gathered data for learning process can be small or large in terms of its quantity. Data can be noisy or accurate in terms of errors that might exist in a data set. In addition, learning can be supervised or unsupervised; therefore, the Supervision sub-dimension is defined. For a given target function the time generalization sub-dimension exposes how its generalization is; which can be either eager (at learning phase) or lazy (at classification phase). In terms of online and offline learning, also time generalization shows how a learning system updates its approximation of the target function after the initial training data are used. The last sub-dimension, automation degree, is responsible for addressing the learning system in terms of the degree of automation. The designed system can be fully or partially automatic. Dimension 6-Elements learned: In each learning system for software testing automation, various types of data can be used to build the target function. The training data can be collected in different stages of the software testing process or software development life-cycle. The learning elements could be software metrics, software specification, CFG (control flow graph), call graph, test case, execution data, failure reports, and/or coverage data. The elements learned dimension is defined to distinguish the type of learning element that being used for an individual learning model in the testing process. Fig. 1 shows the framework dimensions, their subdimensions and some possible values for them. The main dimensions are shown with ellipse and numbered from 1 to 6. The sub-dimensions are represented with ellipse as well. In addition, the possible values of each dimension are shown with rectangles. III. THE EXAMPLES OF ML IN SOFTWARE TESTING In this section, some research work in the area of ML and ST has been selected for supporting the proposed classification framework. In Section IV, we discuss how this work can be classified with proposed framework. Note that, Dimension
3 Fig. 1. Classification framework dimensions. 2 has been chosen as the main dimension for classification purposes. We have tried to select and present at least one work according to sub-dimensions a, b, and c. In the following three subsections, we have chosen and briefly introduced three interesting works (due to space limitation) that have employed machine learning techniques for test planning, test case management, and debugging, respectively. Later, we will show how these three works that fall under different aspects of software testing can be effectively described using our proposed classification framework. In our future longer publication, we will comprehensively classify the relevant related literature within our proposed framework. The following three works are only introduced to show the potential of our framework. A. Test planning In [9], machine learning techniques have been used to identify the main factors of software testing where they have been employed to predict the testing process time. Understanding the important factors of software under the test can help test managers to prepare a good test plan. The developed methodology consists of four phases; 1) Database formation; 2) Data collection; 3) Classification of software and 4) Analysing the results. In the first phase, databases of 25 software systems were constructed. The sample software were collected from various sources with different attributes. The second phase is devoted to real data extraction form software systems. According to the predefined factors and for a given software system, values were extracted for each individual factor. The list of determined factors included code complexity measures, measures of programmer and tester experience, testing time and a number of other attributes. The complete attributes list includes twenty-seven attributes. In the third phase, COB- WEB/3 [11] was used to classify the software systems and build a model to predict the cost of testing for the new software system. COBWEB/3 builds a decision tree. In the last phase, the classification results need to be analyzed to get specific time prediction. B. Test Case Management In the context of software evolution, sometimes the available test cases need to be refined to address the current test requirements of the software system. The process of test case refinement is called test case re-engineering [6]. In [6], [7], a semi-automatic methodology based on a machine learning approach is provided to explore the limitations and find the possible redundancies of the available test cases and then refine them for future usage. As illustrated in Fig. 2, the MELBA (MachinE Learning based refinement of BlAck-box test specification) methodology is an iterative process and it consists of five main activities. In the Activity 1, test cases are transformed into the abstract level using the Category-Partition (CP) strategy [14], so they would be ready to be used by the machine learning method in Activity 2. The CP method helps to model the input domain
4 Fig. 2. Overview of the MELBA process [6]. and as discussed in [14], CP requires defining the sets of categories and choices. The categories are the properties of the input and environment parameters of the software system and the choices are the possible values for each category. The abstraction process is conducted using the defined categories and choices. An abstract test case shows an output equivalence class and the pairs of (category, choice). In Activity 2, the machine learning algorithm (C4.5 decision tree) is used to learn the rules for classifying the abstracted test cases. Activity 3 is the important part of MELBA, which is devoted to analysing the learnt rules. The results of the analyzed tree could lead to determine the problems that cause the redundant test cases or the needs for adding new test cases (Activity 4). In addition, the learnt rules may indicate that CP needs to be updated (Activity 5). This iterative process will continue until no more problems could be identified in the trees. C. Debugging In software debugging, fault localization refers to process of finding the exact locations of the program that are defected and contain faults. In [18], the Back-Propagation (BP) neural network method [10] is used to effectively pinpoint the program faults. According to proposed approach, first the BP neural network is trained with both the coverage data and the execution result, then the trained network is used to predict and rank the suspicious statements of the code, in terms of its potential to have faults. The proposed method for fault localization consists of the following steps: 1) Building up the dataset: In order to create dataset, it is required to run the program with several test cases and collect the coverage data for each execution. Beside this, the execution results are also need to be collected. This coverage data (coverage vectors) and its execution results can be used for training phase. 2) Training of the BP neural network: In this stage BP neural network is built with k input-layer neurons, three hidden-layer neurons, and one output-layer neuron. The sigmoid function is used as the transfer function. In order to perform training process, coverage vector of each test cases t 1...t n is used as input data and their corresponding r 1...r n testing result is used as expected output. 3) Prioritizing the defected statements in program: the trained network can be used for prioritization purpose. Assume, we have a set of virtual test cases v 1...v n which they cover statements s 1...s n respectively. It means that, the execution of virtual test case v i (i=1...m) covers only s i statement. With respect to this assumption, we can say that if the execution of test case v i fails, then the probability that s i is defected is high. Here, the trained network can be used to predict the expected output for each virtual test case (r vi ). The value of r vi is between 0 and 1. The larger the value of r vi, the more probable s i is defected. Hence, the statements s 1...s m can be ranked based on r v1...r vm values in descending order. At the end, the bugs can be identified by examining statements one by one form top of the list. IV. CLASSIFICATION BASED ON PROPOSED FRAMEWORK In Section II, we introduced a classification framework. There, we discussed the framework s dimensions, subdimensions, and some possible values of them. This framework can be used to classify and highlight the main features that need to be considered in the junction of ML and ST area. Following that, in Section III, we presented some representative work in the ML and ST domains. In this section, we show how the proposed framework can be applied on the three sample presented works. Table II summarizes the results of this classification for each individual work that we have reviewed. In this table, the columns show the dimensions and the related sub-dimensions of the framework and the rows represent the research work that we reviewed. In addition, each cell in the junction of a column and a row indicates the value(s) of that dimension for the corresponding research work. To clarify, from Table II it can be understood that the automation in [18] has been done in the debugging stage of testing process and the task type was fault localization, which automatically finds the suspicious statements in the source codes. In addition, in terms of testing level, this work has been conducted at the unit testing level. With respect to the testing approach dimension, this work was carried out based on the white-box approach, because the statements of the source code are considered for testing purposes. From the ML perspective it can be derived that this work has benefited from the ANN learning algorithm which is a supervised learning method. Furthermore, the elements which were used as input in the learning process were coverage data. For the work presented in [9], automation has been performed in the test planning stage whose task type is testing cost estimation. The DT (Decision Tree) type learning algorithm was used to build a model for estimating testing process cost. From the automation degree point of view, this work proposed a semi-automatic approach, since in some parts of this method we can see the need for test engineer s intervention. Furthermore, software metrics have been employed as input for the learning process. In this work, the learning task has been conducted using a small size of dataset, which can considerably decrease the learning process time. Finally, in [6], [7], testing general activity is addressed by test case management value, which means, for automation purposes the learning method was applied in the test management stage in the software testing life-cycle. The task type in this work was test case refinement. The proposed method in
5 this work, in terms of testing level, can be categorised in the regression testing level, because this work focused on reusing the existing test cases for new updated version of the current software system. With regards to the testing approach, this work used the black-box method, since it considers only the software specification for testing purposes. From the machine leaning point of view, this work employed a semi-automatic approach using DT- based learning technique. The dataset size was quite small and the elements which have been used to build dataset were test cases and software specification documents. Table II and the above classification show how our proposed framework is useful in categorizing and explaining work in the intersection of ML and ST. V. CONCLUSION AND FUTURE WORK Testing is critical task in the software development process and considerably imposes cost and time restrictions on the development process. Therefore, automating the testing process can significantly increase testing process performance. In the context of software testing, different types of data can be collected in various stages of testing. Machine learning techniques can be used to find patterns in this data and use them for automation purposes. In this paper, we have introduced a framework for classifying the current research works in ML and ST. Some sample works of ML in ST have been reviewed and appropriately classified according to the proposed framework. The proposed framework provides us with the opportunity to systematically investigate and extract the prominent information from existing research works in ML and ST. Considering the introduced framework, our main attempt for future research is reviewing and classifying all current work in the area of ML and ST. The collected information from classification process will assist us to construct the sets of concrete guidelines for applying ML methods in the ST process. These guidelines will help test engineers to choose the most efficient and appropriate learning method for automation of a target testing stage. Our initial probe has shown that our classification framework is quite strong in providing the means to capture various aspects of work in ML and ST. REFERENCES [1] Moataz A. Ahmed and Irman Hermadi. Ga-based multiple paths test data generator. Comput. Oper. Res., 35: , October [2] Paul Ammann and Jeff Offutt. Introduction to software testing. Cambridge University Press, [3] Boris Beizer. Software testing techniques (2nd ed.). Van Nostrand Reinhold Co., New York, NY, USA, [4] Antonia Bertolino. Software testing research: Achievements, challenges, dreams. In 2007 Future of Software Engineering, FOSE 07, pages , Washington, DC, USA, IEEE Computer Society. [5] Lionel C. Briand. Novel applications of machine learning in software testing. Quality Software, International Conference on, 0:3 10, [6] Lionel C. Briand, Yvan Labiche, and Zaheer Bawar. Using machine learning to refine black-box test specifications and test suites. In Proceedings of the 2008 The Eighth International Conference on Quality Software, Washington, DC, USA, IEEE Computer Society. [7] Lionel C. Briand, Yvan Labiche, Zaheer Bawar, and Nadia Traldi Spido. Using machine learning to refine category-partition test specifications and test suites. Inf. Softw. Technol., 51, November [8] Lionel C. Briand, Yvan Labiche, and Xuetao Liu. Using machine learning to support debugging with tarantula. In Proceedings of the The 18th IEEE International Symposium on Software Reliability, ISSRE 07, pages , Washington, DC, USA, IEEE Computer Society. [9] Thomas J. Cheatham, Jungsoon P. Yoo, and Nancy J. Wahl. Software testing: a machine learning experiment. In Proceedings of the 1995 ACM 23rd annual conference on Computer science, CSC 95, pages , New York, NY, USA, ACM. [10] Laurene Fausett, editor. Fundamentals of neural networks: architectures, algorithms, and applications. Prentice-Hall, Inc., Upper Saddle River, NJ, USA, [11] Kathi Een Mckumcr, Kevin Thompson, Uncl As, Kathleen Mckusick, and Kevin Thompson. Cobweb/3: A portable implementation, [12] Tom M. Mitchell. Machine learning. McGraw Hill series in computer science. McGraw-Hill, [13] Glenford J. Myers and Corey Sandler. The Art of Software Testing. John Wiley & Sons, [14] T. J. Ostrand and M. J. Balcer. The category-partition method for specifying and generating fuctional tests. Commun. ACM, 31: , June [15] Susan A. Sherer. Software fault prediction. Journal of Systems and Software, 29(2):97 105, [16] A. von Mayrhauser, C. Anderson, and R. Mraz. Using a neural network to predict test case effectiveness. In Aerospace Applications Conference, Proceedings., 1995 IEEE, number 0, pages vol.2, February [17] Joachim Wegener, Andre Baresel, and Harmen Sthamer. Evolutionary test environment for automatic structural testing. Information and Software Technology, 43(14): , [18] W. Eric Wong and Yu Qi. Bp neural network-based effective fault localization. International Journal of Software Engineering and Knowledge Engineering, 19(4): , [19] Du Zhang and Jeffrey Tsai. Machine learning and software engineering. Software Quality Journal, 11:87 119, /A:
Applications of Machine Learning in Software Testing. Lionel C. Briand Simula Research Laboratory and University of Oslo
Applications of Machine Learning in Software Testing Lionel C. Briand Simula Research Laboratory and University of Oslo Acknowledgments Yvan labiche Xutao Liu Zaheer Bawar Kambiz Frounchi March 2008 2
PREDICTIVE TECHNIQUES IN SOFTWARE ENGINEERING : APPLICATION IN SOFTWARE TESTING
PREDICTIVE TECHNIQUES IN SOFTWARE ENGINEERING : APPLICATION IN SOFTWARE TESTING Jelber Sayyad Shirabad Lionel C. Briand, Yvan Labiche, Zaheer Bawar Presented By : Faezeh R.Sadeghi Overview Introduction
Test Case Design Using Classification Trees and the Classification-Tree Editor CTE
Quality Week 1995 Test Case Design Using Classification Trees and the Classification-Tree Editor CTE Matthias Grochtmann Joachim Wegener Klaus Grimm Daimler-Benz AG Research and Technology Alt-Moabit 96a
Performance Based Evaluation of New Software Testing Using Artificial Neural Network
Performance Based Evaluation of New Software Testing Using Artificial Neural Network Jogi John 1, Mangesh Wanjari 2 1 Priyadarshini College of Engineering, Nagpur, Maharashtra, India 2 Shri Ramdeobaba
Fuzzy Cognitive Map for Software Testing Using Artificial Intelligence Techniques
Fuzzy ognitive Map for Software Testing Using Artificial Intelligence Techniques Deane Larkman 1, Masoud Mohammadian 1, Bala Balachandran 1, Ric Jentzsch 2 1 Faculty of Information Science and Engineering,
Comparison of K-means and Backpropagation Data Mining Algorithms
Comparison of K-means and Backpropagation Data Mining Algorithms Nitu Mathuriya, Dr. Ashish Bansal Abstract Data mining has got more and more mature as a field of basic research in computer science and
Test Case Design Using Classification Trees
STAR 94, 8-12 May 1994, Washington, D.C. Test Case Design Using Classification Trees Matthias Grochtmann Daimler-Benz AG Forschung und Technik Alt-Moabit 91b D-10559 Berlin, Germany Tel: +49 30 39 982-229
Impact of Feature Selection on the Performance of Wireless Intrusion Detection Systems
2009 International Conference on Computer Engineering and Applications IPCSIT vol.2 (2011) (2011) IACSIT Press, Singapore Impact of Feature Selection on the Performance of ireless Intrusion Detection Systems
How To Use Neural Networks In Data Mining
International Journal of Electronics and Computer Science Engineering 1449 Available Online at www.ijecse.org ISSN- 2277-1956 Neural Networks in Data Mining Priyanka Gaur Department of Information and
International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014
RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer
NEURAL NETWORKS IN DATA MINING
NEURAL NETWORKS IN DATA MINING 1 DR. YASHPAL SINGH, 2 ALOK SINGH CHAUHAN 1 Reader, Bundelkhand Institute of Engineering & Technology, Jhansi, India 2 Lecturer, United Institute of Management, Allahabad,
Web Application Regression Testing: A Session Based Test Case Prioritization Approach
Web Application Regression Testing: A Session Based Test Case Prioritization Approach Mojtaba Raeisi Nejad Dobuneh 1, Dayang Norhayati Abang Jawawi 2, Mohammad V. Malakooti 3 Faculty and Head of Department
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
SURVIVABILITY ANALYSIS OF PEDIATRIC LEUKAEMIC PATIENTS USING NEURAL NETWORK APPROACH
330 SURVIVABILITY ANALYSIS OF PEDIATRIC LEUKAEMIC PATIENTS USING NEURAL NETWORK APPROACH T. M. D.Saumya 1, T. Rupasinghe 2 and P. Abeysinghe 3 1 Department of Industrial Management, University of Kelaniya,
Network Machine Learning Research Group. Intended status: Informational October 19, 2015 Expires: April 21, 2016
Network Machine Learning Research Group S. Jiang Internet-Draft Huawei Technologies Co., Ltd Intended status: Informational October 19, 2015 Expires: April 21, 2016 Abstract Network Machine Learning draft-jiang-nmlrg-network-machine-learning-00
Intelligent and Automated Software Testing Methods Classification
Intelligent and Automated Software Testing Methods Classification Seyed Reza Shahamiri Department of Software Engineering Faculty of Computer Science and Information s University Teknologi Malaysia (UTM)
International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS) www.iasir.net
International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational
Regression Testing Based on Comparing Fault Detection by multi criteria before prioritization and after prioritization
Regression Testing Based on Comparing Fault Detection by multi criteria before prioritization and after prioritization KanwalpreetKaur #, Satwinder Singh * #Research Scholar, Dept of Computer Science and
Process Models and Metrics
Process Models and Metrics PROCESS MODELS AND METRICS These models and metrics capture information about the processes being performed We can model and measure the definition of the process process performers
DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM
INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM M. Mayilvaganan 1, S. Aparna 2 1 Associate
DATA PREPARATION FOR DATA MINING
Applied Artificial Intelligence, 17:375 381, 2003 Copyright # 2003 Taylor & Francis 0883-9514/03 $12.00 +.00 DOI: 10.1080/08839510390219264 u DATA PREPARATION FOR DATA MINING SHICHAO ZHANG and CHENGQI
Data quality in Accounting Information Systems
Data quality in Accounting Information Systems Comparing Several Data Mining Techniques Erjon Zoto Department of Statistics and Applied Informatics Faculty of Economy, University of Tirana Tirana, Albania
Comparing Methods to Identify Defect Reports in a Change Management Database
Comparing Methods to Identify Defect Reports in a Change Management Database Elaine J. Weyuker, Thomas J. Ostrand AT&T Labs - Research 180 Park Avenue Florham Park, NJ 07932 (weyuker,ostrand)@research.att.com
Effect of Using Neural Networks in GA-Based School Timetabling
Effect of Using Neural Networks in GA-Based School Timetabling JANIS ZUTERS Department of Computer Science University of Latvia Raina bulv. 19, Riga, LV-1050 LATVIA [email protected] Abstract: - The school
A Content based Spam Filtering Using Optical Back Propagation Technique
A Content based Spam Filtering Using Optical Back Propagation Technique Sarab M. Hameed 1, Noor Alhuda J. Mohammed 2 Department of Computer Science, College of Science, University of Baghdad - Iraq ABSTRACT
A New Approach For Estimating Software Effort Using RBFN Network
IJCSNS International Journal of Computer Science and Network Security, VOL.8 No.7, July 008 37 A New Approach For Estimating Software Using RBFN Network Ch. Satyananda Reddy, P. Sankara Rao, KVSVN Raju,
Security Risk Assessment of Software Architecture
Security Risk Assessment of Software Architecture Fadi HajSaid 1, Yousef Hassouneh 2, Hany Ammar 1, 1 West Virginia University, USA; 2 Birzeit University, Palestine [email protected], [email protected],
Certified Tester. Advanced Level Overview
Version 2012 Copyright Notice This document may be copied in its entirety, or extracts made, if the source is acknowledged. Copyright (hereinafter called ISTQB ). Advanced Level Working Group: Mike Smith
An Agent-Based Concept for Problem Management Systems to Enhance Reliability
An Agent-Based Concept for Problem Management Systems to Enhance Reliability H. Wang, N. Jazdi, P. Goehner A defective component in an industrial automation system affects only a limited number of sub
Predicting Critical Problems from Execution Logs of a Large-Scale Software System
Predicting Critical Problems from Execution Logs of a Large-Scale Software System Árpád Beszédes, Lajos Jenő Fülöp and Tibor Gyimóthy Department of Software Engineering, University of Szeged Árpád tér
Finding Execution Faults in Dynamic Web Application
International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 4, Number 5 (2014), pp. 445-452 International Research Publications House http://www. irphouse.com /ijict.htm Finding
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier
Feature Selection using Integer and Binary coded Genetic Algorithm to improve the performance of SVM Classifier D.Nithya a, *, V.Suganya b,1, R.Saranya Irudaya Mary c,1 Abstract - This paper presents,
ISTQB Certified Tester. Foundation Level. Sample Exam 1
ISTQB Certified Tester Foundation Level Version 2015 American Copyright Notice This document may be copied in its entirety, or extracts made, if the source is acknowledged. #1 When test cases are designed
Data Mining Part 5. Prediction
Data Mining Part 5. Prediction 5.1 Spring 2010 Instructor: Dr. Masoud Yaghini Outline Classification vs. Numeric Prediction Prediction Process Data Preparation Comparing Prediction Methods References Classification
Data Mining for Customer Service Support. Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin
Data Mining for Customer Service Support Senioritis Seminar Presentation Megan Boice Jay Carter Nick Linke KC Tobin Traditional Hotline Services Problem Traditional Customer Service Support (manufacturing)
SOFTWARE TESTING STRATEGY APPROACH ON SOURCE CODE APPLYING CONDITIONAL COVERAGE METHOD
SOFTWARE TESTING STRATEGY APPROACH ON SOURCE CODE APPLYING CONDITIONAL COVERAGE METHOD Jaya Srivastaval 1 and Twinkle Dwivedi 2 1 Department of Computer Science & Engineering, Shri Ramswaroop Memorial
How To Understand Software Engineering
PESIT Bangalore South Campus Department of MCA SOFTWARE ENGINEERING 1. GENERAL INFORMATION Academic Year: JULY-NOV 2015 Semester(s):III Title Code Duration (hrs) SOFTWARE ENGINEERING 13MCA33 Lectures 52Hrs
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING
EFFICIENT DATA PRE-PROCESSING FOR DATA MINING USING NEURAL NETWORKS JothiKumar.R 1, Sivabalan.R.V 2 1 Research scholar, Noorul Islam University, Nagercoil, India Assistant Professor, Adhiparasakthi College
A HOLISTIC FRAMEWORK FOR KNOWLEDGE MANAGEMENT
A HOLISTIC FRAMEWORK FOR KNOWLEDGE MANAGEMENT Dr. Shamsul Chowdhury, Roosevelt University, [email protected] ABSTRACT Knowledge management refers to the set of processes developed in an organization
How To Test A Web Based Application Automatically
A General Framework for Testing Web-Based Applications Saeed Abrishami, Mohsen Kahani Computer Engineering Department, Ferdowsi University of Mashhad [email protected] r, [email protected] Abstract Software
An Anomaly-Based Method for DDoS Attacks Detection using RBF Neural Networks
2011 International Conference on Network and Electronics Engineering IPCSIT vol.11 (2011) (2011) IACSIT Press, Singapore An Anomaly-Based Method for DDoS Attacks Detection using RBF Neural Networks Reyhaneh
Neural Networks and Support Vector Machines
INF5390 - Kunstig intelligens Neural Networks and Support Vector Machines Roar Fjellheim INF5390-13 Neural Networks and SVM 1 Outline Neural networks Perceptrons Neural networks Support vector machines
COMPARATIVE STUDY OF SOFTWARE TESTING TOOLS ON THE BASIS OF SOFTWARE TESTING METHODOLOGIES
International Journal of Advance Research In Science And Engineering http://www.ijarse.com COMPARATIVE STUDY OF SOFTWARE TESTING TOOLS ON THE BASIS OF SOFTWARE TESTING METHODOLOGIES 1 Lav Kumar Dixit,
Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with Panchromatic Textural Features
Remote Sensing and Geoinformation Lena Halounová, Editor not only for Scientific Cooperation EARSeL, 2011 Multiscale Object-Based Classification of Satellite Images Merging Multispectral Information with
Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies
Text Mining Approach for Big Data Analysis Using Clustering and Classification Methodologies Somesh S Chavadi 1, Dr. Asha T 2 1 PG Student, 2 Professor, Department of Computer Science and Engineering,
ANALYSIS OF OPEN SOURCE DEFECT TRACKING TOOLS FOR USE IN DEFECT ESTIMATION
ANALYSIS OF OPEN SOURCE DEFECT TRACKING TOOLS FOR USE IN DEFECT ESTIMATION Catherine V. Stringfellow, Dileep Potnuri Department of Computer Science Midwestern State University Wichita Falls, TX U.S.A.
An Introduction to Neural Networks
An Introduction to Vincent Cheung Kevin Cannons Signal & Data Compression Laboratory Electrical & Computer Engineering University of Manitoba Winnipeg, Manitoba, Canada Advisor: Dr. W. Kinsner May 27,
Implementation of hybrid software architecture for Artificial Intelligence System
IJCSNS International Journal of Computer Science and Network Security, VOL.7 No.1, January 2007 35 Implementation of hybrid software architecture for Artificial Intelligence System B.Vinayagasundaram and
A Study of Web Log Analysis Using Clustering Techniques
A Study of Web Log Analysis Using Clustering Techniques Hemanshu Rana 1, Mayank Patel 2 Assistant Professor, Dept of CSE, M.G Institute of Technical Education, Gujarat India 1 Assistant Professor, Dept
IDENTIFIC ATION OF SOFTWARE EROSION USING LOGISTIC REGRESSION
http:// IDENTIFIC ATION OF SOFTWARE EROSION USING LOGISTIC REGRESSION Harinder Kaur 1, Raveen Bajwa 2 1 PG Student., CSE., Baba Banda Singh Bahadur Engg. College, Fatehgarh Sahib, (India) 2 Asstt. Prof.,
Software Defect Prediction Modeling
Software Defect Prediction Modeling Burak Turhan Department of Computer Engineering, Bogazici University [email protected] Abstract Defect predictors are helpful tools for project managers and developers.
Component visualization methods for large legacy software in C/C++
Annales Mathematicae et Informaticae 44 (2015) pp. 23 33 http://ami.ektf.hu Component visualization methods for large legacy software in C/C++ Máté Cserép a, Dániel Krupp b a Eötvös Loránd University [email protected]
Technical versus non-technical skills in test automation
Technical versus non-technical skills in test automation Dorothy Graham Software Testing Consultant [email protected] SUMMARY In this paper, I discuss the role of the testers and test automators
Numerical Research on Distributed Genetic Algorithm with Redundant
Numerical Research on Distributed Genetic Algorithm with Redundant Binary Number 1 Sayori Seto, 2 Akinori Kanasugi 1,2 Graduate School of Engineering, Tokyo Denki University, Japan [email protected],
Confirmation Bias as a Human Aspect in Software Engineering
Confirmation Bias as a Human Aspect in Software Engineering Gul Calikli, PhD Data Science Laboratory, Department of Mechanical and Industrial Engineering, Ryerson University Why Human Aspects in Software
Genetic Algorithm Based Test Data Generator
Genetic Algorithm Based Test Data Generator Irman Hermadi Department of Information and Computer Science King Fahd University of Petroleum & Minerals KFUPM Box # 868, Dhahran 31261, Saudi Arabia [email protected]
Karunya University Dept. of Information Technology
PART A Questions 1. Mention any two software process models. 2. Define risk management. 3. What is a module? 4. What do you mean by requirement process? 5. Define integration testing. 6. State the main
D A T A M I N I N G C L A S S I F I C A T I O N
D A T A M I N I N G C L A S S I F I C A T I O N FABRICIO VOZNIKA LEO NARDO VIA NA INTRODUCTION Nowadays there is huge amount of data being collected and stored in databases everywhere across the globe.
Pattern Recognition Using Feature Based Die-Map Clusteringin the Semiconductor Manufacturing Process
Pattern Recognition Using Feature Based Die-Map Clusteringin the Semiconductor Manufacturing Process Seung Hwan Park, Cheng-Sool Park, Jun Seok Kim, Youngji Yoo, Daewoong An, Jun-Geol Baek Abstract Depending
An Overview of Knowledge Discovery Database and Data mining Techniques
An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,
Facilitating Business Process Discovery using Email Analysis
Facilitating Business Process Discovery using Email Analysis Matin Mavaddat [email protected] Stewart Green Stewart.Green Ian Beeson Ian.Beeson Jin Sa Jin.Sa Abstract Extracting business process
A New Method for Traffic Forecasting Based on the Data Mining Technology with Artificial Intelligent Algorithms
Research Journal of Applied Sciences, Engineering and Technology 5(12): 3417-3422, 213 ISSN: 24-7459; e-issn: 24-7467 Maxwell Scientific Organization, 213 Submitted: October 17, 212 Accepted: November
8. Machine Learning Applied Artificial Intelligence
8. Machine Learning Applied Artificial Intelligence Prof. Dr. Bernhard Humm Faculty of Computer Science Hochschule Darmstadt University of Applied Sciences 1 Retrospective Natural Language Processing Name
Reference Books. Data Mining. Supervised vs. Unsupervised Learning. Classification: Definition. Classification k-nearest neighbors
Classification k-nearest neighbors Data Mining Dr. Engin YILDIZTEPE Reference Books Han, J., Kamber, M., Pei, J., (2011). Data Mining: Concepts and Techniques. Third edition. San Francisco: Morgan Kaufmann
Latest Research and Development on Software Testing Techniques and Tools
General Article International Journal of Current Engineering and Technology E-ISSN 2277 4106, P-ISSN 2347-5161 2014 INPRESSCO, All Rights Reserved Available at http://inpressco.com/category/ijcet Rasneet
Keywords : Data Warehouse, Data Warehouse Testing, Lifecycle based Testing
Volume 4, Issue 12, December 2014 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com A Lifecycle
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree
Predicting the Risk of Heart Attacks using Neural Network and Decision Tree S.Florence 1, N.G.Bhuvaneswari Amma 2, G.Annapoorani 3, K.Malathi 4 PG Scholar, Indian Institute of Information Technology, Srirangam,
How To Develop Software
Software Engineering Prof. N.L. Sarda Computer Science & Engineering Indian Institute of Technology, Bombay Lecture-4 Overview of Phases (Part - II) We studied the problem definition phase, with which
A Framework of Model-Driven Web Application Testing
A Framework of Model-Driven Web Application Testing Nuo Li, Qin-qin Ma, Ji Wu, Mao-zhong Jin, Chao Liu Software Engineering Institute, School of Computer Science and Engineering, Beihang University, China
An Agile Methodology Based Model for Change- Oriented Software Engineering
An Agile Methodology Based Model for Change- Oriented Software Engineering Naresh Kumar Nagwani, Pradeep Singh Department of Computer Sc. & Engg. National Institute of Technology, Raipur [email protected],
A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks
A Systemic Artificial Intelligence (AI) Approach to Difficult Text Analytics Tasks Text Analytics World, Boston, 2013 Lars Hard, CTO Agenda Difficult text analytics tasks Feature extraction Bio-inspired
A Survey on Outlier Detection Techniques for Credit Card Fraud Detection
IOSR Journal of Computer Engineering (IOSR-JCE) e-issn: 2278-0661, p- ISSN: 2278-8727Volume 16, Issue 2, Ver. VI (Mar-Apr. 2014), PP 44-48 A Survey on Outlier Detection Techniques for Credit Card Fraud
Efficient Agent Based Testing Framework for Web Applications
International Journal of Scientific & Engineering Research, Volume 3, Issue 2, February-2012 1 Efficient Agent Based Testing Framework for Web Applications Ms.Neha Saluja Prof.Amit Kanskar Abstract- Now
Master's projects at ITMO University. Daniil Chivilikhin PhD Student @ ITMO University
Master's projects at ITMO University Daniil Chivilikhin PhD Student @ ITMO University General information Guidance from our lab's researchers Publishable results 2 Research areas Research at ITMO Evolutionary
Choosing the Best Classification Performance Metric for Wrapper-based Software Metric Selection for Defect Prediction
Choosing the Best Classification Performance Metric for Wrapper-based Software Metric Selection for Defect Prediction Huanjing Wang Western Kentucky University [email protected] Taghi M. Khoshgoftaar
American International Journal of Research in Science, Technology, Engineering & Mathematics
American International Journal of Research in Science, Technology, Engineering & Mathematics Available online at http://www.iasir.net ISSN (Print): 2328-349, ISSN (Online): 2328-3580, ISSN (CD-ROM): 2328-3629
AUTOMATION OF ENERGY DEMAND FORECASTING. Sanzad Siddique, B.S.
AUTOMATION OF ENERGY DEMAND FORECASTING by Sanzad Siddique, B.S. A Thesis submitted to the Faculty of the Graduate School, Marquette University, in Partial Fulfillment of the Requirements for the Degree
Improving Decision Making and Managing Knowledge
Improving Decision Making and Managing Knowledge Decision Making and Information Systems Information Requirements of Key Decision-Making Groups in a Firm Senior managers, middle managers, operational managers,
Data Mining and Neural Networks in Stata
Data Mining and Neural Networks in Stata 2 nd Italian Stata Users Group Meeting Milano, 10 October 2005 Mario Lucchini e Maurizo Pisati Università di Milano-Bicocca [email protected] [email protected]
Software Defect Prediction for Quality Improvement Using Hybrid Approach
Software Defect Prediction for Quality Improvement Using Hybrid Approach 1 Pooja Paramshetti, 2 D. A. Phalke D.Y. Patil College of Engineering, Akurdi, Pune. Savitribai Phule Pune University ABSTRACT In
Basic Testing Concepts and Terminology
T-76.5613 Software Testing and Quality Assurance Lecture 2, 13.9.2006 Basic Testing Concepts and Terminology Juha Itkonen SoberIT Contents Realities and principles of Testing terminology and basic concepts
Domain Classification of Technical Terms Using the Web
Systems and Computers in Japan, Vol. 38, No. 14, 2007 Translated from Denshi Joho Tsushin Gakkai Ronbunshi, Vol. J89-D, No. 11, November 2006, pp. 2470 2482 Domain Classification of Technical Terms Using
Business Intelligence. Data Mining and Optimization for Decision Making
Brochure More information from http://www.researchandmarkets.com/reports/2325743/ Business Intelligence. Data Mining and Optimization for Decision Making Description: Business intelligence is a broad category
Data Mining Algorithms Part 1. Dejan Sarka
Data Mining Algorithms Part 1 Dejan Sarka Join the conversation on Twitter: @DevWeek #DW2015 Instructor Bio Dejan Sarka ([email protected]) 30 years of experience SQL Server MVP, MCT, 13 books 7+ courses
Utilizing Domain-Specific Modelling for Software Testing
Utilizing Domain-Specific Modelling for Software Testing Olli-Pekka Puolitaival, Teemu Kanstrén VTT Technical Research Centre of Finland Oulu, Finland {olli-pekka.puolitaival, teemu.kanstren}@vtt.fi Abstract
Classification of Engineering Consultancy Firms Using Self-Organizing Maps: A Scientific Approach
International Journal of Civil & Environmental Engineering IJCEE-IJENS Vol:13 No:03 46 Classification of Engineering Consultancy Firms Using Self-Organizing Maps: A Scientific Approach Mansour N. Jadid
Software Development Cost and Time Forecasting Using a High Performance Artificial Neural Network Model
Software Development Cost and Time Forecasting Using a High Performance Artificial Neural Network Model Iman Attarzadeh and Siew Hock Ow Department of Software Engineering Faculty of Computer Science &
Mobile Phone APP Software Browsing Behavior using Clustering Analysis
Proceedings of the 2014 International Conference on Industrial Engineering and Operations Management Bali, Indonesia, January 7 9, 2014 Mobile Phone APP Software Browsing Behavior using Clustering Analysis
131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10
1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom
Development of a Network Configuration Management System Using Artificial Neural Networks
Development of a Network Configuration Management System Using Artificial Neural Networks R. Ramiah*, E. Gemikonakli, and O. Gemikonakli** *MSc Computer Network Management, **Tutor and Head of Department
