Praseeda Manoj Department of Computer Science Muscat College, Sultanate of Oman

Size: px
Start display at page:

Download "Praseeda Manoj Department of Computer Science Muscat College, Sultanate of Oman"

Transcription

1 International Journal of Electronics and Computer Science Engineering 290 Available Online at ISSN Analysis of Grid Based Distributed Data Mining System for Service Oriented Frameworks Praseeda Manoj Department of Computer Science Muscat College, Sultanate of Oman Abstract- Distribution of data and computation allows for solving larger problems and execute applications that are distributed in nature. A Grid is a distributed computing infrastructure that enables to manage large amount of data and run business applications supporting consumers and end users. The Grid can play a significant role in providing an effective computational infrastructure that enables coordinated resource sharing within dynamic organizations. There have been several systems proposed to build distributed data mining. This paper analyses different grid based distributed data mining applications which help to give an overview of how Grid computing can be used to support distributed data mining. In addition, the synergy between data mining and grid technology is also discussed. This concept is implemented in Weka4WS, a framework that extends the widely used open source Weka toolkit to support distributed data mining on WSRF-enabled Grids. Weka4WS adopts the WSRF technology for running remote data mining algorithms and managing distributed computations. Keywords Data mining, Grid, DDM, WSRF I. INTRODUCTION Every organization that has embraced the concept of a data warehouse believes that data mining is a distinct part of its future. The major reason that data mining has attracted a great deal of attention in the information industry in recent years is due to the wide availability of huge amounts of data and the imminent need for turning such data into useful information and knowledge. Data mining is the process of analyzing data from different perspectives and summarizing it into useful information. Technically, the data mining process finds the correlations and patterns existing among several fields in a large relational database. Traditional on-line transaction processing systems, OLTPs, have contributed substantially to the evolution and wide acceptance of relational technology as a major tool for efficient storage, retrieval and management of large amounts of data, but are not good at delivering meaningful analysis in return. This is where Data Mining or Knowledge Discovery in database (KDD) has obvious benefits for any enterprise. Using a combination of techniques, including statistical analysis, multidimensional analysis, intelligent agents, and data visualization, KDD can discover highly useful informative patterns within the data that can be used to develop predictive models of behavior. Data Mining or KDD is the nontrivial extraction of implicit, previously unknown, potentially useful and understandable patterns information from data. II. AIM AND OBJECTIVES The aim of this paper is to analyse the advantages of WSRF enabled grid computing compared to OGSI grid computing and discussion of applications like WekaG, DataMiningGrid and Weka4WS. This paper investigates the synergy between data mining and grid technology using Globus toolkit4.0, a grid computing framework. It describes how the two paradigms data mining and grid technology- can benefit from each other. This paper also describes Weka4WS, a framework that extends the widely used open source Weka toolkit to support distributed data mining on WSRF-enabled Grids. Weka4WS adopts the WSRF technology for running remote data mining algorithms and managing distributed computations. The Weka4WS user interface supports the execution of both local and remote data mining tasks. A performance analysis of Weka4WS for executing distributed data mining tasks in different network scenarios is presented. III. DISTRIBUTED DATA MINING AND GRIDS Data mining and knowledge discovery can benefit from the use of DDM techniques to improve mining performance of huge data or distributed data. Although there are many efficient algorithms and techniques for mining centralized

2 Analysis of Grid Based Distributed Data Mining System for Service Oriented Frameworks 291 data sets, it's inefficient or incapable to deal with huge data sets or distributed data sets. There are two main reasons to choose DDM. The first one is that data is very large. If data is too large, it's hard to store it at a single site, or it's inefficient or incapable to mine such large data at a single site. In such cases, data may be decomposed into some parts that are distributed at different sites. Then we perform the data mining operations for each site. At the end, the mining results of each site are combined to gain global results. This will optimize centralized data mining since the work load is distributed among the sites.the second reason is that we need to deal with inherent distributed datasets. In fact, various wired and wireless networks such as internet, intranets, local area networks, wireless networks etc. produce many distributed resources of data. These distributed data need to be mined to gain global patterns, models or knowledge. The straightforward solution is to transfer all data to a central site, where data mining is done. However, even if we have enough capacity to handle the data storage and data mining at a central site, it may be too expensive to transfer the local data sets to the central site. On the other hand, the privacy issue is playing an important role in the emerging distributed data. The distributed data sets may not be transferred because of privacy, security or autonomy of the data sets. Therefore, DDM is an effective and scalable solution for mining huge and distributed data sets in distributed computing environments. In recent years, DDM has attracted a lot of attention among the fields of research and applications. Many techniques and systems of DDM have been proposed. However, the DDM problems such as heterogeneous data, complex data, security, privacy and autonomy of local databases, network topology and transmission scheme, still bother us. As the grid is becoming a well-accepted computing infrastructure in science and industry, it is necessary to provide general data mining services, algorithms and applications that help analysts, scientists, organizations and professionals to leverage grid capacity in supporting high performance distributed computing for solving their data mining problem in a distributed way. IV. FEATURES OF WSRF ON WEB SERVICES WSRF is a family of technical specifications concerned with the creation, addressing, inspection and lifetime management of resources using Web services. A Web service is a software component that can be accessed by remote entities using standard internet protocols such as HTTP. The capabilities offered by a service are defined using the Web Services Description Language (WSDL), an XML-based formalism that allows to define the operations exposed by a Web Service, as well as specifying the input and output messages that must be exchanged to invoke such operations. The set of operations and associated messages constitute the interface of a service. An important feature of Web services is the independence of the service interface from the implementation of the operations. To invoke a Web service, a remote entity needs to know only its WSDL interface, without worrying about the actual programming language used to implement its operations. This allows to couple in an easy way distributed software components implemented using different languages and running on heterogeneous platforms. Web Services in Grid computing are used as uniform interfaces for accessing remote resources and composing distributed applications, independently from their location and specific implementation. The so-called Open Grid Services Architecture (OGSA) defines an architectural model for Grid systems in which distributed resources and applications are modeled as web services that interact with each other using Internet-based standards. WSRF implements the OGSA philosophy by defining a set of Web service standards for the implementation of Grid systems. WSRF mainly focuses on managing stateful resources using Web services. The combination of a stateful resource with a Web service is termed WS-Resource. The possibility to define a state associated to a Web service is the most important difference between WSRF-compliant Web services and pre-wsrf ones. This is a key feature in implementing Grid systems, because Grid applications can be composed by multiple long-running processes, whose state needs to be accessed and monitored to control the overall execution. In this context, WS-Resources provide a standard way to represent, advertise, and access properties associated to processes as required by complex Grid applications.

3 IJECSE, Volume2, Number 1 Praseeda Manoj et sl. 292 V. OVERVIEW OF EXISTING DATA MINING APPLICATIONS ON THE GRID The following applications are aimed to adapt the toolkit Weka to a Grid environment using WSRF technology. i. WekaG This application uses open architectures such as OGSI and the Globus Toolkit3.0. As it is an extension of the open source Weka tool it can be further extended with data mining techniques and algorithms when needed. WekaG also implements authorization access to resources, combined with the security measurements of the Globus toolkit. WekaG implements a vertical architecture called Data mining Grid Architecture (DMGA), which is based on the data mining phases: preprocessing, data mining and post-processing. The application implements client/server architecture. The server side is responsible for a set of grid services that implement the different data mining algorithms and data mining phases. The client side interacts with the server and provides a user interface which is integrated in the Weka interface. WekaG is implemented to include the following features: coupling data sources, authorization access to resources, discovery based on metadata, planning and scheduling tasks and identifying the available and appropriate resources. It does not support OLAP and there is little known about the scalability of the application. ii. DataMiningGrid The DataMiningGrid system is developed for generic and sector independent data mining interfaces and tools to be exploited on the grid. This system composed the following requirements: massive and distributed data, distributed operations, data privacy and security, user friendliness and resource identification and metadata. The main objectives of the project are the development of grid interfaces that could be used by data mining tools, a user friendly workflow editor for configuration, text mining and ontology learning services, a test bed with some demonstrator applications and the last objective is to develop all of this with emerging grid standards. To join the DataMiningGridtestbed the user need linux machine or windows machine on which core GT4 services will have to be installed. iii. Weka4WS Weka4WS allowing the execution of all its data mining algorithms on remote Grid nodes. To enable remote invocation, the data mining algorithms provided by the Weka library are exposed as a Web Service, which can be easily deployed on the available Grid nodes. The architecture of Weka4WS includes three kinds of nodes: storage nodes, which contain the datasets to be mined; compute nodes, on which remote data mining algorithms are run; user nodes, which are the local machines of users. Remote execution is managed using basic WSRF mechanisms like state management, notifications, etc. while the Globus Toolkit 4 services are used for standard Grid functionalities, such as security and file transfer. Weka4WS can only handle a dataset contained by a single storage node. This dataset is then transferred to computing nodes to be mined. If data are considerably large this transfer will cause high communication overhead. Figure 1 - Local task and Remote task execution

4 Analysis of Grid Based Distributed Data Mining System for Service Oriented Frameworks 293 VI. ARCHITECTURE OF WSRF ENABLED GRID This paper describes the more efficient DDM technique, WEKA4WS, a WSRF enabled grid. Weka4WS, a framework that extends the widely used Weka toolkit for supporting distributed data mining on Grid environments. Weka provides a large collection of machine learning algorithms written in Java for data pre-processing, classification, clustering, association rules, and visualization, which can be invoked through a common graphical user interface. In Weka, the overall data mining process takes place on a single machine, since the algorithms can be executed only locally. The goal of Weka4WS is to extend Weka to support remote execution of the data mining algorithms. In such a way, distributed data mining tasks can be concurrently executed on decentralized Grid nodes by exploiting data distribution and improving application performance. Weka4WS is an application that extends Weka to perform data mining tasks on WSRF enabled grids. The first prototype of Weka4WS has been developed using the Java WSRF library provided by GT4. The goal of Weka4WS is to support remote execution of data mining algorithms in such a way that distributed data mining tasks can be concurrently executed on decentralized nodes on the grid, exploiting data distribution and improving performance. Each task is managed by a single thread and therefore a user can start multiple tasks in parallel, taking full advantage of the grid environment. Weka4WS leveraging the OGSA and WSRF standards, will provide a distributed data mining open service middleware by which users can design higher level distributed data mining services that cover the main steps of the KDD process and offer typical distributed data mining patterns. Figure 2 : Weka4WS architecture. VII. CONCLUSION With the advancement of information technology, increasingly complex and resource-demanding applications have become possible. As a result, even larger-scale problems are projected and in many areas so-called grand challenge problems are being tackled. These problems put an even greater demand on the underlying computing resources. A large number of applications that need many resources is modern data mining applications in science, engineering and other areas. Grid technology is an answer to the increasing demand for affordable large-scale computing resources. The grid technology and the complex nature of data mining applications have led to a new relation of data mining and grid. A data mining grid enables data mining applications and provides a comprehensive solution for affordable highperformance resources satisfying the needs of large-scale data mining problems. Mining grid data could be understood as a methodology that could help to address the complex issues involved in running and maintaining large grid computing environments.

5 IJECSE, Volume2, Number 1 Praseeda Manoj et sl. 294 To support complex data-mining applications, grid environments must provide adaptive data management and data analysis tools and techniques through the offer of resources, services and decentralized data access mechanisms. This paper discussed the importance of grid computing in distributed data mining. Grid can offer an effective infrastructure for managing data mining and knowledge discovery applications. It can represent in a near future an effective infrastructure for managing very large data sources and providing high-level mechanisms for extracting valuable knowledge from them. To solve this class of tasks, advanced tools and services for knowledge discovery are vital. In this paper, advanced tool like Weka4WS systems is described. In the next years the Grid will be used as a platform for implementing and deploying geographically distributed knowledge discovery and knowledge management services and applications. Weka4WS adopts the emerging Web Services Resource Framework (WSRF) for remotely running data mining algorithms and composing distributed knowledge discovery applications that integrate data, tools, and resources available from dispersed sites through the SOA paradigm. This paper described the architecture of Weka4WS by exploiting the WSRF library provided by Globus Toolkit 4. Weka4WS provides an effective way to perform compute-intensive distributed data analysis on large-scale Grid environments. The Weka4WS Web services can be directly invoked within adhoc programs to implement applications that coordinate the invocation of multiple data mining services in a distributed scenario. Thus, a distributed data mining application can be composed by several tasks that execute on multiple Grid nodes in parallel and/or in sequence. VIII. FUTURE DEVELOPMENT The importance of high-performance data mining is going to be considered a real added value. Grid can offer an effective infrastructure for deploying data mining and knowledge discovery applications. The future use of the Grid is mainly related to its ability embody many of those properties and to manage world-wide complex distributed applications. Among those, knowledge-based applications are a major goal. To reach this goal, the Grid needs to evolve towards an open decentralized infrastructure based on interoperable high-level services that make use of knowledge both in providing resources and in giving results to end users. Software technologies for the implementation and deployment of knowledge Grids as discussed in this paper will provide important elements to build up knowledge-based applications on a local Grid or on a World Wide Grid. These models, techniques, and tools can provide the basic components for developing Grid based complex systems such as distributed knowledge management systems providing pervasive access, adaptively and high performance for virtual organizations in science, engineering and industry that need to produce knowledge-based applications. In future, Weka Knowledge Flow environment can support the visual design of distributed data mining applications which will be able to handle different storage nodes using Globus Toolkit 5.This will allow users to design and execute complex data mining applications on the Grid in a simple and effective way. IX. REFERENCES [1] Data Mining Techniques in Grid Computing Environments Editor Werner Dubitzky. University of Ulster, UK, by John Wiley & Sons, Ltd. [2] Data Mining Practical Machine Learning Tools and Techniques Third Edition By Ian H. Witten, Eibe Frank & Mark A. Hall. [3] Meta-learning in Grid-based Data Mining Systems, Moez Ben Haj Hmida and Yahya Slimani IJCNC Vol.2, No.5, September 2010 [4] Schuster, A., Wolff. R. Trock, D.: A High-Performance Distributed Algorithm for Mining Association Rules. In: Third IEEE International Conference on Data Mining, Florida, USA (2003) [5] M. Cannataro, A. Congiusta, C. Mastroianni, A. Pugliese, D. Talia, P. Trunfio, Grid-Based Data Mining and Knowledge Discovery, Intelligent Technologies for Information Analysis, N. Zhong and J. Liu (eds.), Springer-Verlag, chapt. 2 (2004), pp [6] K. Czajkowski et al., The WS-Resource Framework Version ibm.com/developerworks/library/ws-resource/wswsrf. pdf. [7] Maarten Altorf - Data mining on grids- Universiteit Leiden August 007. [8] H. Kargupta and C. Kamath and P. Chan, Distributed and Parallel Data Mining: Emergence,Growth, and Future Directions, In: Advances in Distributed and Parallel Knowledge Discovery, AAAI/MIT Press, pp , (2000).

6 Analysis of Grid Based Distributed Data Mining System for Service Oriented Frameworks 295 [9] Cannataro, M., Congiusta, A., Pugliese, A., Talia, D. and Trunfio, P. (2004b), Distributed data mining on grids: services, tools, and applications, IEEE Transactions on Systems, Man, and Cybernetics: Part B 34 (6), [10] Foster. What is the Grid? A Three Point Checklist, July [11] Mastroianni, C., Talia, D. and Trunfio, P. (2004), Metadata for managing grid resources in data mining applications, Journal of Grid Computing 2 (1), [12] H. Witten and E. Frank. Data Mining: Practical machine learning tools with Java implementations. Morgan Kaufmann, [13] Data Mining Techniques, First Edition by Arun K. Pujari. [14] Erwin, D. W. and Snelling, D. F. (2001), UNICORE: a grid computing environment, in International Conference on Parallel and Distributed Computing (Euro-Par 01), Vol of LNCS, Springer, Manchester, UK, pp [15] Congiusta, A., Talia, D. and Trunfio, P. (2007), Distributed data mining services leveraging WSRF,Future Generation Computer Systems 23 (1), [16] [17] K. Czajkowski et al., The WS-Resource Framework Version ibm.com/developerworks/library/ws-resource/wswsrf. pdf. [18] accessed on 14-Dec-2012.

Grid-based Distributed Data Mining Systems, Algorithms and Services

Grid-based Distributed Data Mining Systems, Algorithms and Services Grid-based Distributed Data Mining Systems, Algorithms and Services Domenico Talia Abstract Distribution of data and computation allows for solving larger problems and execute applications that are distributed

More information

Distributed Framework for Data Mining As a Service on Private Cloud

Distributed Framework for Data Mining As a Service on Private Cloud RESEARCH ARTICLE OPEN ACCESS Distributed Framework for Data Mining As a Service on Private Cloud Shraddha Masih *, Sanjay Tanwani** *Research Scholar & Associate Professor, School of Computer Science &

More information

Data mining on grids.

Data mining on grids. Data mining on grids. Maarten Altorf maltorf@yahoo.com Universiteit Leiden August 2007 Contents 1 Introduction 3 2 Data mining 4 2.1 KDD.................................. 4 2.2 Data mining tasks..........................

More information

KNOWLEDGE GRID An Architecture for Distributed Knowledge Discovery

KNOWLEDGE GRID An Architecture for Distributed Knowledge Discovery KNOWLEDGE GRID An Architecture for Distributed Knowledge Discovery Mario Cannataro 1 and Domenico Talia 2 1 ICAR-CNR 2 DEIS Via P. Bucci, Cubo 41-C University of Calabria 87036 Rende (CS) Via P. Bucci,

More information

1 Mobile Data Mining on Small

1 Mobile Data Mining on Small 1 Mobile Data Mining on Small Devices Through Web Services Domenico Talia and Paolo Trunfio DEIS, University of Calabria Via Pietro Bucci 41C 87036 Rende (CS), Italy 1.1 INTRODUCTION Analysis of data is

More information

Big Data Mining Services and Knowledge Discovery Applications on Clouds

Big Data Mining Services and Knowledge Discovery Applications on Clouds Big Data Mining Services and Knowledge Discovery Applications on Clouds Domenico Talia DIMES, Università della Calabria & DtoK Lab Italy talia@dimes.unical.it Data Availability or Data Deluge? Some decades

More information

Collaborative & Integrated Network & Systems Management: Management Using Grid Technologies

Collaborative & Integrated Network & Systems Management: Management Using Grid Technologies 2011 International Conference on Computer Communication and Management Proc.of CSIT vol.5 (2011) (2011) IACSIT Press, Singapore Collaborative & Integrated Network & Systems Management: Management Using

More information

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014 RESEARCH ARTICLE OPEN ACCESS A Survey of Data Mining: Concepts with Applications and its Future Scope Dr. Zubair Khan 1, Ashish Kumar 2, Sunny Kumar 3 M.Tech Research Scholar 2. Department of Computer

More information

Using Mining@Home for Distributed Ensemble Learning

Using Mining@Home for Distributed Ensemble Learning Using Mining@Home for Distributed Ensemble Learning Eugenio Cesario 1, Carlo Mastroianni 1, and Domenico Talia 1,2 1 ICAR-CNR, Italy {cesario,mastroianni}@icar.cnr.it 2 University of Calabria, Italy talia@deis.unical.it

More information

Data Mining Solutions for the Business Environment

Data Mining Solutions for the Business Environment Database Systems Journal vol. IV, no. 4/2013 21 Data Mining Solutions for the Business Environment Ruxandra PETRE University of Economic Studies, Bucharest, Romania ruxandra_stefania.petre@yahoo.com Over

More information

Writing Grid Service Using GT3 Core. Dec, 2003. Abstract

Writing Grid Service Using GT3 Core. Dec, 2003. Abstract Writing Grid Service Using GT3 Core Dec, 2003 Long Wang wangling@mail.utexas.edu Department of Electrical & Computer Engineering The University of Texas at Austin James C. Browne browne@cs.utexas.edu Department

More information

Grid based Integration of Real-Time Value-at-Risk (VaR) Services. Abstract

Grid based Integration of Real-Time Value-at-Risk (VaR) Services. Abstract Grid based Integration of Real-Time Value-at-Risk (VaR) s Paul Donachy Daniel Stødle Terrence J harmer Ron H Perrott Belfast e-science Centre www.qub.ac.uk/escience Brian Conlon Gavan Corr First Derivatives

More information

Grid Computing Vs. Cloud Computing

Grid Computing Vs. Cloud Computing International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 3, Number 6 (2013), pp. 577-582 International Research Publications House http://www. irphouse.com /ijict.htm Grid

More information

SPATIAL DATA CLASSIFICATION AND DATA MINING

SPATIAL DATA CLASSIFICATION AND DATA MINING , pp.-40-44. Available online at http://www. bioinfo. in/contents. php?id=42 SPATIAL DATA CLASSIFICATION AND DATA MINING RATHI J.B. * AND PATIL A.D. Department of Computer Science & Engineering, Jawaharlal

More information

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

131-1. Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10 1/10 131-1 Adding New Level in KDD to Make the Web Usage Mining More Efficient Mohammad Ala a AL_Hamami PHD Student, Lecturer m_ah_1@yahoocom Soukaena Hassan Hashem PHD Student, Lecturer soukaena_hassan@yahoocom

More information

A Survey Study on Monitoring Service for Grid

A Survey Study on Monitoring Service for Grid A Survey Study on Monitoring Service for Grid Erkang You erkyou@indiana.edu ABSTRACT Grid is a distributed system that integrates heterogeneous systems into a single transparent computer, aiming to provide

More information

GSiB: PSE Infrastructure for Dynamic Service-oriented Grid Applications

GSiB: PSE Infrastructure for Dynamic Service-oriented Grid Applications GSiB: PSE Infrastructure for Dynamic Service-oriented Grid Applications Yan Huang Department of Computer Science Cardiff University PO Box 916 Cardiff CF24 3XF United Kingdom Yan.Huang@cs.cardiff.ac.uk

More information

Load balancing in SOAJA (Service Oriented Java Adaptive Applications)

Load balancing in SOAJA (Service Oriented Java Adaptive Applications) Load balancing in SOAJA (Service Oriented Java Adaptive Applications) Richard Olejnik Université des Sciences et Technologies de Lille Laboratoire d Informatique Fondamentale de Lille (LIFL UMR CNRS 8022)

More information

Manifest for Big Data Pig, Hive & Jaql

Manifest for Big Data Pig, Hive & Jaql Manifest for Big Data Pig, Hive & Jaql Ajay Chotrani, Priyanka Punjabi, Prachi Ratnani, Rupali Hande Final Year Student, Dept. of Computer Engineering, V.E.S.I.T, Mumbai, India Faculty, Computer Engineering,

More information

GENERIC DATA ACCESS AND INTEGRATION SERVICE FOR DISTRIBUTED COMPUTING ENVIRONMENT

GENERIC DATA ACCESS AND INTEGRATION SERVICE FOR DISTRIBUTED COMPUTING ENVIRONMENT GENERIC DATA ACCESS AND INTEGRATION SERVICE FOR DISTRIBUTED COMPUTING ENVIRONMENT Hemant Mehta 1, Priyesh Kanungo 2 and Manohar Chandwani 3 1 School of Computer Science, Devi Ahilya University, Indore,

More information

Data Mining System, Functionalities and Applications: A Radical Review

Data Mining System, Functionalities and Applications: A Radical Review Data Mining System, Functionalities and Applications: A Radical Review Dr. Poonam Chaudhary System Programmer, Kurukshetra University, Kurukshetra Abstract: Data Mining is the process of locating potentially

More information

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware

Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware Analyses on functional capabilities of BizTalk Server, Oracle BPEL Process Manger and WebSphere Process Server for applications in Grid middleware R. Goranova University of Sofia St. Kliment Ohridski,

More information

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington

GEOG 482/582 : GIS Data Management. Lesson 10: Enterprise GIS Data Management Strategies GEOG 482/582 / My Course / University of Washington GEOG 482/582 : GIS Data Management Lesson 10: Enterprise GIS Data Management Strategies Overview Learning Objective Questions: 1. What are challenges for multi-user database environments? 2. What is Enterprise

More information

GRID COMPUTING Techniques and Applications BARRY WILKINSON

GRID COMPUTING Techniques and Applications BARRY WILKINSON GRID COMPUTING Techniques and Applications BARRY WILKINSON Contents Preface About the Author CHAPTER 1 INTRODUCTION TO GRID COMPUTING 1 1.1 Grid Computing Concept 1 1.2 History of Distributed Computing

More information

Data Warehousing and OLAP Technology for Knowledge Discovery

Data Warehousing and OLAP Technology for Knowledge Discovery 542 Data Warehousing and OLAP Technology for Knowledge Discovery Aparajita Suman Abstract Since time immemorial, libraries have been generating services using the knowledge stored in various repositories

More information

XML Data Integration in OGSA Grids

XML Data Integration in OGSA Grids XML Data Integration in OGSA Grids Carmela Comito and Domenico Talia University of Calabria Italy comito@si.deis.unical.it Outline Introduction Data Integration and Grids The XMAP Data Integration Framework

More information

CLEVER: a CLoud-Enabled Virtual EnviRonment

CLEVER: a CLoud-Enabled Virtual EnviRonment CLEVER: a CLoud-Enabled Virtual EnviRonment Francesco Tusa Maurizio Paone Massimo Villari Antonio Puliafito {ftusa,mpaone,mvillari,apuliafito}@unime.it Università degli Studi di Messina, Dipartimento di

More information

CHAPTER 1 INTRODUCTION

CHAPTER 1 INTRODUCTION 1 CHAPTER 1 INTRODUCTION Exploration is a process of discovery. In the database exploration process, an analyst executes a sequence of transformations over a collection of data structures to discover useful

More information

An Overview of Knowledge Discovery Database and Data mining Techniques

An Overview of Knowledge Discovery Database and Data mining Techniques An Overview of Knowledge Discovery Database and Data mining Techniques Priyadharsini.C 1, Dr. Antony Selvadoss Thanamani 2 M.Phil, Department of Computer Science, NGM College, Pollachi, Coimbatore, Tamilnadu,

More information

Cluster, Grid, Cloud Concepts

Cluster, Grid, Cloud Concepts Cluster, Grid, Cloud Concepts Kalaiselvan.K Contents Section 1: Cluster Section 2: Grid Section 3: Cloud Cluster An Overview Need for a Cluster Cluster categorizations A computer cluster is a group of

More information

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets

The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets The Data Grid: Towards an Architecture for Distributed Management and Analysis of Large Scientific Datasets!! Large data collections appear in many scientific domains like climate studies.!! Users and

More information

Introduction to Service Oriented Architectures (SOA)

Introduction to Service Oriented Architectures (SOA) Introduction to Service Oriented Architectures (SOA) Responsible Institutions: ETHZ (Concept) ETHZ (Overall) ETHZ (Revision) http://www.eu-orchestra.org - Version from: 26.10.2007 1 Content 1. Introduction

More information

A standards-based approach to application integration

A standards-based approach to application integration A standards-based approach to application integration An introduction to IBM s WebSphere ESB product Jim MacNair Senior Consulting IT Specialist Macnair@us.ibm.com Copyright IBM Corporation 2005. All rights

More information

Principles and Foundations of Web Services: An Holistic View (Technologies, Business Drivers, Models, Architectures and Standards)

Principles and Foundations of Web Services: An Holistic View (Technologies, Business Drivers, Models, Architectures and Standards) Principles and Foundations of Web Services: An Holistic View (Technologies, Business Drivers, Models, Architectures and Standards) Michael P. Papazoglou (INFOLAB/CRISM, Tilburg University, The Netherlands)

More information

A Cloud Framework for Parameter Sweeping Data Mining Applications

A Cloud Framework for Parameter Sweeping Data Mining Applications A Cloud Framework for Parameter Sweeping Data Mining Applications Fabrizio Marozzo DEIS, University of Calabria Rende (CS), Italy Email: fmarozzo@deis.unical.it Domenico Talia ICAR-CNR DEIS, University

More information

Scientific versus Business Workflows

Scientific versus Business Workflows 2 Scientific versus Business Workflows Roger Barga and Dennis Gannon The formal concept of a workflow has existed in the business world for a long time. An entire industry of tools and technology devoted

More information

CMiS: A Cloud Computing Based Management Information System

CMiS: A Cloud Computing Based Management Information System International Journal of Information and Computation Technology. ISSN 0974-2239 Volume 4, Number 1 (2014), pp. 15-20 International Research Publications House http://www. irphouse.com /ijict.htm CMiS:

More information

Data Grids. Lidan Wang April 5, 2007

Data Grids. Lidan Wang April 5, 2007 Data Grids Lidan Wang April 5, 2007 Outline Data-intensive applications Challenges in data access, integration and management in Grid setting Grid services for these data-intensive application Architectural

More information

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Course 803401 DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Oman College of Management and Technology Course 803401 DSS Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization CS/MIS Department Information Sharing

More information

A Review of Data Mining Techniques

A Review of Data Mining Techniques Available Online at www.ijcsmc.com International Journal of Computer Science and Mobile Computing A Monthly Journal of Computer Science and Information Technology IJCSMC, Vol. 3, Issue. 4, April 2014,

More information

Introduction to Data Mining

Introduction to Data Mining Introduction to Data Mining Jay Urbain Credits: Nazli Goharian & David Grossman @ IIT Outline Introduction Data Pre-processing Data Mining Algorithms Naïve Bayes Decision Tree Neural Network Association

More information

Syllabus. HMI 7437: Data Warehousing and Data/Text Mining for Healthcare

Syllabus. HMI 7437: Data Warehousing and Data/Text Mining for Healthcare Syllabus HMI 7437: Data Warehousing and Data/Text Mining for Healthcare 1. Instructor Illhoi Yoo, Ph.D Office: 404 Clark Hall Email: muteaching@gmail.com Office hours: TBA Classroom: TBA Class hours: TBA

More information

Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University. Xu Liang ** University of California, Berkeley

Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University. Xu Liang ** University of California, Berkeley P1.1 AN INTEGRATED DATA MANAGEMENT, RETRIEVAL AND VISUALIZATION SYSTEM FOR EARTH SCIENCE DATASETS Zhenping Liu *, Yao Liang * Virginia Polytechnic Institute and State University Xu Liang ** University

More information

Six Strategies for Building High Performance SOA Applications

Six Strategies for Building High Performance SOA Applications Six Strategies for Building High Performance SOA Applications Uwe Breitenbücher, Oliver Kopp, Frank Leymann, Michael Reiter, Dieter Roller, and Tobias Unger University of Stuttgart, Institute of Architecture

More information

Introduction. A. Bellaachia Page: 1

Introduction. A. Bellaachia Page: 1 Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.

More information

Fair Scheduling Algorithm with Dynamic Load Balancing Using In Grid Computing

Fair Scheduling Algorithm with Dynamic Load Balancing Using In Grid Computing Research Inventy: International Journal Of Engineering And Science Vol.2, Issue 10 (April 2013), Pp 53-57 Issn(e): 2278-4721, Issn(p):2319-6483, Www.Researchinventy.Com Fair Scheduling Algorithm with Dynamic

More information

Data Mining. Knowledge Discovery, Data Warehousing and Machine Learning Final remarks. Lecturer: JERZY STEFANOWSKI

Data Mining. Knowledge Discovery, Data Warehousing and Machine Learning Final remarks. Lecturer: JERZY STEFANOWSKI Data Mining Knowledge Discovery, Data Warehousing and Machine Learning Final remarks Lecturer: JERZY STEFANOWSKI Email: Jerzy.Stefanowski@cs.put.poznan.pl Data Mining a step in A KDD Process Data mining:

More information

Information Systems and Technologies in Organizations

Information Systems and Technologies in Organizations Information Systems and Technologies in Organizations Information System One that collects, processes, stores, analyzes, and disseminates information for a specific purpose Is school register an information

More information

16th International Conference on Control Systems and Computer Science (CSCS16 07)

16th International Conference on Control Systems and Computer Science (CSCS16 07) 16th International Conference on Control Systems and Computer Science (CSCS16 07) TOWARDS AN IO INTENSIVE GRID APPLICATION INSTRUMENTATION IN MEDIOGRID Dacian Tudor 1, Florin Pop 2, Valentin Cristea 2,

More information

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control

Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control Developing Scalable Smart Grid Infrastructure to Enable Secure Transmission System Control EP/K006487/1 UK PI: Prof Gareth Taylor (BU) China PI: Prof Yong-Hua Song (THU) Consortium UK Members: Brunel University

More information

The Prophecy-Prototype of Prediction modeling tool

The Prophecy-Prototype of Prediction modeling tool The Prophecy-Prototype of Prediction modeling tool Ms. Ashwini Dalvi 1, Ms. Dhvni K.Shah 2, Ms. Rujul B.Desai 3, Ms. Shraddha M.Vora 4, Mr. Vaibhav G.Tailor 5 Department of Information Technology, Mumbai

More information

An approach to grid scheduling by using Condor-G Matchmaking mechanism

An approach to grid scheduling by using Condor-G Matchmaking mechanism An approach to grid scheduling by using Condor-G Matchmaking mechanism E. Imamagic, B. Radic, D. Dobrenic University Computing Centre, University of Zagreb, Croatia {emir.imamagic, branimir.radic, dobrisa.dobrenic}@srce.hr

More information

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM.

DATA MINING TECHNOLOGY. Keywords: data mining, data warehouse, knowledge discovery, OLAP, OLAM. DATA MINING TECHNOLOGY Georgiana Marin 1 Abstract In terms of data processing, classical statistical models are restrictive; it requires hypotheses, the knowledge and experience of specialists, equations,

More information

A Grid Architecture for Manufacturing Database System

A Grid Architecture for Manufacturing Database System Database Systems Journal vol. II, no. 2/2011 23 A Grid Architecture for Manufacturing Database System Laurentiu CIOVICĂ, Constantin Daniel AVRAM Economic Informatics Department, Academy of Economic Studies

More information

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4

Concepts and Architecture of the Grid. Summary of Grid 2, Chapter 4 Concepts and Architecture of the Grid Summary of Grid 2, Chapter 4 Concepts of Grid Mantra: Coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations Allows

More information

Using ESB and BPEL for evolving healthcare systems towards SOA

Using ESB and BPEL for evolving healthcare systems towards SOA ehealth Beyond the Horizon Get IT There S.K. Andersen et al. (Eds.) IOS Press, 2008 2008 Organizing Committee of MIE 2008. All rights reserved. 747 Using ESB and BPEL for evolving healthcare systems towards

More information

Classic Grid Architecture

Classic Grid Architecture Peer-to to-peer Grids Classic Grid Architecture Resources Database Database Netsolve Collaboration Composition Content Access Computing Security Middle Tier Brokers Service Providers Middle Tier becomes

More information

Data Mining for Data Cloud and Compute Cloud

Data Mining for Data Cloud and Compute Cloud Data Mining for Data Cloud and Compute Cloud Prof. Uzma Ali 1, Prof. Punam Khandar 2 Assistant Professor, Dept. Of Computer Application, SRCOEM, Nagpur, India 1 Assistant Professor, Dept. Of Computer Application,

More information

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova

Using the Grid for the interactive workflow management in biomedicine. Andrea Schenone BIOLAB DIST University of Genova Using the Grid for the interactive workflow management in biomedicine Andrea Schenone BIOLAB DIST University of Genova overview background requirements solution case study results background A multilevel

More information

KNOWLEDGE BASE DATA MINING FOR BUSINESS INTELLIGENCE

KNOWLEDGE BASE DATA MINING FOR BUSINESS INTELLIGENCE KNOWLEDGE BASE DATA MINING FOR BUSINESS INTELLIGENCE Dr. Ruchira Bhargava 1 and Yogesh Kumar Jakhar 2 1 Associate Professor, Department of Computer Science, Shri JagdishPrasad Jhabarmal Tibrewala University,

More information

Mobile Cloud Computing: Paradigms and Challenges 移 动 云 计 算 : 模 式 与 挑 战

Mobile Cloud Computing: Paradigms and Challenges 移 动 云 计 算 : 模 式 与 挑 战 Mobile Cloud Computing: Paradigms and Challenges 移 动 云 计 算 : 模 式 与 挑 战 Jiannong Cao Internet & Mobile Computing Lab Department of Computing Hong Kong Polytechnic University Email: csjcao@comp.polyu.edu.hk

More information

IBM Solutions Grid for Business Partners Helping IBM Business Partners to Grid-enable applications for the next phase of e-business on demand

IBM Solutions Grid for Business Partners Helping IBM Business Partners to Grid-enable applications for the next phase of e-business on demand PartnerWorld Developers IBM Solutions Grid for Business Partners Helping IBM Business Partners to Grid-enable applications for the next phase of e-business on demand 2 Introducing the IBM Solutions Grid

More information

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper

IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper IEEE International Conference on Computing, Analytics and Security Trends CAST-2016 (19 21 December, 2016) Call for Paper CAST-2015 provides an opportunity for researchers, academicians, scientists and

More information

Service Oriented Architecture

Service Oriented Architecture Service Oriented Architecture Charlie Abela Department of Artificial Intelligence charlie.abela@um.edu.mt Last Lecture Web Ontology Language Problems? CSA 3210 Service Oriented Architecture 2 Lecture Outline

More information

A QoS-aware Method for Web Services Discovery

A QoS-aware Method for Web Services Discovery Journal of Geographic Information System, 2010, 2, 40-44 doi:10.4236/jgis.2010.21008 Published Online January 2010 (http://www.scirp.org/journal/jgis) A QoS-aware Method for Web Services Discovery Bian

More information

Data Mining Governance for Service Oriented Architecture

Data Mining Governance for Service Oriented Architecture Data Mining Governance for Service Oriented Architecture Ali Beklen Software Group IBM Turkey Istanbul, TURKEY alibek@tr.ibm.com Turgay Tugay Bilgin Dept. of Computer Engineering Maltepe University Istanbul,

More information

An innovative, open-standards solution for Konnex interoperability with other domotic middlewares

An innovative, open-standards solution for Konnex interoperability with other domotic middlewares An innovative, open-standards solution for Konnex interoperability with other domotic middlewares Vittorio Miori, Luca Tarrini, Maurizio Manca, Gabriele Tolomei Italian National Research Council (C.N.R.),

More information

Use of Data Mining in the field of Library and Information Science : An Overview

Use of Data Mining in the field of Library and Information Science : An Overview 512 Use of Data Mining in the field of Library and Information Science : An Overview Roopesh K Dwivedi R P Bajpai Abstract Data Mining refers to the extraction or Mining knowledge from large amount of

More information

Next Generation Intelligent, Mobile, Widely Distributed Applications: Solving Tomorrow s Software Development Challenges Today

Next Generation Intelligent, Mobile, Widely Distributed Applications: Solving Tomorrow s Software Development Challenges Today Next Generation Intelligent, Mobile, Widely Distributed Applications: Solving Tomorrow s Software Development Challenges Today By Bob DeAnna, Chief Technology Officer, Recursion Software We are on the

More information

Data Quality Mining: Employing Classifiers for Assuring consistent Datasets

Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Data Quality Mining: Employing Classifiers for Assuring consistent Datasets Fabian Grüning Carl von Ossietzky Universität Oldenburg, Germany, fabian.gruening@informatik.uni-oldenburg.de Abstract: Independent

More information

Bibliography. University of Applied Sciences Fulda, Prof. Dr. S. Groß

Bibliography. University of Applied Sciences Fulda, Prof. Dr. S. Groß Slide III Bibliography 1) Abbas, A.: Grid Computing - A Practical Guide to Technology and Applications. Charles River Media, 2004. http://www.charlesriver.com/titles/gridcomputing.html 2) Berman, F.; et

More information

Grid Scheduling Dictionary of Terms and Keywords

Grid Scheduling Dictionary of Terms and Keywords Grid Scheduling Dictionary Working Group M. Roehrig, Sandia National Laboratories W. Ziegler, Fraunhofer-Institute for Algorithms and Scientific Computing Document: Category: Informational June 2002 Status

More information

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam ECLT 5810 E-Commerce Data Mining Techniques - Introduction Prof. Wai Lam Data Opportunities Business infrastructure have improved the ability to collect data Virtually every aspect of business is now open

More information

Methodology of performance evaluation of integrated service systems with timeout control scheme

Methodology of performance evaluation of integrated service systems with timeout control scheme Methodology of performance evaluation of integrated service systems with timeout control scheme Akira Kawaguchi and Hiroshi Yamada NTT Service Integration Laboratories, NTT Corporation 9-11, Midori-cho

More information

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL

ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL International Journal Of Advanced Technology In Engineering And Science Www.Ijates.Com Volume No 03, Special Issue No. 01, February 2015 ISSN (Online): 2348 7550 ASSOCIATION RULE MINING ON WEB LOGS FOR

More information

Service Oriented Distributed Manager for Grid System

Service Oriented Distributed Manager for Grid System Service Oriented Distributed Manager for Grid System Entisar S. Alkayal Faculty of Computing and Information Technology King Abdul Aziz University Jeddah, Saudi Arabia entisar_alkayal@hotmail.com Abstract

More information

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence

Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Emerging Technologies Shaping the Future of Data Warehouses & Business Intelligence Service Oriented Architecture SOA and Web Services John O Brien President and Executive Architect Zukeran Technologies

More information

BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL

BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL The Fifth International Conference on e-learning (elearning-2014), 22-23 September 2014, Belgrade, Serbia BOOSTING - A METHOD FOR IMPROVING THE ACCURACY OF PREDICTIVE MODEL SNJEŽANA MILINKOVIĆ University

More information

Distributed Systems and Recent Innovations: Challenges and Benefits

Distributed Systems and Recent Innovations: Challenges and Benefits Distributed Systems and Recent Innovations: Challenges and Benefits 1. Introduction Krishna Nadiminti, Marcos Dias de Assunção, and Rajkumar Buyya Grid Computing and Distributed Systems Laboratory Department

More information

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

Chapter 5. Warehousing, Data Acquisition, Data. Visualization Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization 5-1 Learning Objectives

More information

Self-organized Multi-agent System for Service Management in the Next Generation Networks

Self-organized Multi-agent System for Service Management in the Next Generation Networks PROCEEDINGS OF THE WORKSHOP ON APPLICATIONS OF SOFTWARE AGENTS ISBN 978-86-7031-188-6, pp. 18-24, 2011 Self-organized Multi-agent System for Service Management in the Next Generation Networks Mario Kusek

More information

A Quick Introduction to SOA

A Quick Introduction to SOA Software Engineering Competence Center TUTORIAL A Quick Introduction to SOA Mahmoud Mohamed AbdAllah Senior R&D Engineer-SECC mmabdallah@itida.gov.eg Waseim Hashem Mahjoub Senior R&D Engineer-SECC Copyright

More information

A Case Study in Integrated Quality Assurance for Performance Management Systems

A Case Study in Integrated Quality Assurance for Performance Management Systems A Case Study in Integrated Quality Assurance for Performance Management Systems Liam Peyton, Bo Zhan, Bernard Stepien School of Information Technology and Engineering, University of Ottawa, 800 King Edward

More information

TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY

TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY Danubianu Mirela Stefan cel Mare University of Suceava Faculty of Electrical Engineering andcomputer Science 13 Universitatii Street, Suceava

More information

CHAPTER 7 SUMMARY AND CONCLUSION

CHAPTER 7 SUMMARY AND CONCLUSION 179 CHAPTER 7 SUMMARY AND CONCLUSION This chapter summarizes our research achievements and conclude this thesis with discussions and interesting avenues for future exploration. The thesis describes a novel

More information

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization Turban, Aronson, and Liang Decision Support Systems and Intelligent Systems, Seventh Edition Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

More information

Web Service Based Data Management for Grid Applications

Web Service Based Data Management for Grid Applications Web Service Based Data Management for Grid Applications T. Boehm Zuse-Institute Berlin (ZIB), Berlin, Germany Abstract Web Services play an important role in providing an interface between end user applications

More information

Open DMIX - Data Integration and Exploration Services for Data Grids, Data Web and Knowledge Grid Applications

Open DMIX - Data Integration and Exploration Services for Data Grids, Data Web and Knowledge Grid Applications Open DMIX - Data Integration and Exploration Services for Data Grids, Data Web and Knowledge Grid Applications Robert L. Grossman, Yunhong Gu, Dave Hanley, Xinwei Hong and Gokulnath Rao Laboratory for

More information

01219211 Software Development Training Camp 1 (0-3) Prerequisite : 01204214 Program development skill enhancement camp, at least 48 person-hours.

01219211 Software Development Training Camp 1 (0-3) Prerequisite : 01204214 Program development skill enhancement camp, at least 48 person-hours. (International Program) 01219141 Object-Oriented Modeling and Programming 3 (3-0) Object concepts, object-oriented design and analysis, object-oriented analysis relating to developing conceptual models

More information

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) (See also General Regulations) Any publication based on work approved for a higher degree should contain a reference to

More information

COURSE RECOMMENDER SYSTEM IN E-LEARNING

COURSE RECOMMENDER SYSTEM IN E-LEARNING International Journal of Computer Science and Communication Vol. 3, No. 1, January-June 2012, pp. 159-164 COURSE RECOMMENDER SYSTEM IN E-LEARNING Sunita B Aher 1, Lobo L.M.R.J. 2 1 M.E. (CSE)-II, Walchand

More information

Cloud Analytics. A Path Towards Next Generation Affordable BI. P. Radha Krishna and Kishore Indukuri Varma

Cloud Analytics. A Path Towards Next Generation Affordable BI. P. Radha Krishna and Kishore Indukuri Varma Cloud Analytics A Path Towards Next Generation Affordable BI P. Radha Krishna and Kishore Indukuri Varma Abstract Technology innovation and its adoption are two critical successful factors for any business/organization.

More information

Retrieval on the Grid Results from the European Project GRACE (Grid Search and Categorization Engine)

Retrieval on the Grid Results from the European Project GRACE (Grid Search and Categorization Engine) Retrieval on the Grid Results from the European Project GRACE (Grid Search and Categorization Engine) FRANK SCHOLZE, WERNER STEPHAN Stuttgart University Library Main parts of this paper are based on the

More information

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc])

REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) 305 REGULATIONS FOR THE DEGREE OF MASTER OF SCIENCE IN COMPUTER SCIENCE (MSc[CompSc]) (See also General Regulations) Any publication based on work approved for a higher degree should contain a reference

More information

DATA MINING AND WAREHOUSING CONCEPTS

DATA MINING AND WAREHOUSING CONCEPTS CHAPTER 1 DATA MINING AND WAREHOUSING CONCEPTS 1.1 INTRODUCTION The past couple of decades have seen a dramatic increase in the amount of information or data being stored in electronic format. This accumulation

More information

Cloud Computing and Software Agents: Towards Cloud Intelligent Services

Cloud Computing and Software Agents: Towards Cloud Intelligent Services Cloud Computing and Software Agents: Towards Cloud Intelligent Services Domenico Talia ICAR-CNR & University of Calabria Rende, Italy talia@deis.unical.it Abstract Cloud computing systems provide large-scale

More information

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining

BUSINESS INTELLIGENCE. Keywords: business intelligence, architecture, concepts, dashboards, ETL, data mining BUSINESS INTELLIGENCE Bogdan Mohor Dumitrita 1 Abstract A Business Intelligence (BI)-driven approach can be very effective in implementing business transformation programs within an enterprise framework.

More information

jeti: A Tool for Remote Tool Integration

jeti: A Tool for Remote Tool Integration jeti: A Tool for Remote Tool Integration Tiziana Margaria 1, Ralf Nagel 2, and Bernhard Steffen 2 1 Service Engineering for Distributed Systems, Institute for Informatics, University of Göttingen, Germany

More information

Pentaho Data Mining Last Modified on January 22, 2007

Pentaho Data Mining Last Modified on January 22, 2007 Pentaho Data Mining Copyright 2007 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For the latest information, please visit our web site at www.pentaho.org

More information