A Novel Approach of Query Optimization for Distributed Database Systems
|
|
|
- Julianna Berry
- 10 years ago
- Views:
Transcription
1 307 A Novel Approach of Query Optimization for Distributed Database Systems Deepak Sukheja 1, Umesh Kumar Singh 2 1: Priyatam Institute of Technology and Management Indore , India. 2: Institute of Computer Science, Vikram University Ujjain, India Abstract Query optimization in distributed databases explicitly needed in many aspects of the optimization process, often making it imperative for the optimizer to consult underlying data sources while doing cost based optimization. This is not only increases the cost of optimization, but also changes the trade-offs involved in the optimization process significantly. The leading cost in this optimization process is the cost of costing that traditionally has been considered insignificant. The optimizer can only afford a few rounds of messages to the under- lying data sources and he nce the optimization techniques in this environment must be geared toward gathering all the required cost information with minimal communication. In this paper, we explore the design and s earch space for a que ry optimizer in distributed environment and de monstrate the need for this optimization approach in various aspects of the optimization process. This work present minimumcommunication cost query cost variants of various query optimization techniques, and di scuss tradeoffs in their performance in the present development. We have implemented a nov el optimization approach in the distributed database environment, somewhat unexpectedly, indicate that a simple two-phase optimization scheme performs fairly well as long as the physical database design is known to the optimizer, though more determined algorithms are required. Keyword: Query Optimization, Query Optimization approach, Query processing in DDBMS Introduction The need for distributed database services has increased dramatically in present working environment. Within enterprises, IT infrastructures are often decentralized as a result of mergers, acquisitions, and specialized corporate applications, resulting in deployment of large distributed databases. Possibly more significantly, the Internet and intranet have enabled new enterprise ventures including Business-to-Business Net Markets (or Hubs) [1, 2], who s business hinges on federating thousands of decentralized catalogs and other databases. In general, distributed database technology has been the subject of multiple research thrusts, including schema integration [3, 4], data transformation [2], as well as distributed query processing and optimization. The query optimization work goes back as far as the early distributed database systems (R*, SDD-1, Distributed Ingres [5, 7]), and the lot of research s recently has been focused on l inking data sources of various capabilities and cost models [8]. However, query optimization in the broad distributed environment presents peculiarities that change the trade-offs in the optimization process quite significantly. Distributed query processors need to consider three basic requirements: Need of Query Processing: I n a large scale distributed system, both data access and
2 308 computation can be carried out at various sites. For global efficiency, it is beneficial to consider assigning portions of a query plan in arbitrary distributed ways. In fact, this has been one of the major motivations for development of distributed database systems. Need of Cost Factors: In a c entralized DBMS, query execution cost is a single dimensional factor measured in conceptual units. In a distributed database, costs must be dividing into multiple dimensions under the control of single logical database. One proposal for a universal cost metric is hard currency, but typically there are other costs that are valuable to expose orthogonally, including response time, data freshness, and accuracy of computations [9, 10]. Need of Cost Estimation: This work is motivated by the necessity of the cost estimation in the part of the query optimizer from the optimization process. Apart from of the number of cost dimensions factor, a c entralized optimizer cannot accurately estimate the costs of operations at many autonomous sites. Z. G. Ives and A. Tomasic perposed middleware systems in [8, 11] address this problem by involving site specific wrappers in the optimization process, but they do not consider the cost of communicating with these wrappers. This cost is not significant in these systems because the wrappers typically reside in the same address space as the optimizer. But in general, the execution costs may also depend on transient system issues including inter communication cost between two sites [9]. Therefore co st estimation process must be distributed in a manner reflective of the query processing, with cost estimates being provided by the sites that would be doing the work. However, to the best of our knowledge, complete cost estimation, which requires the optimizer to communicate with the sites merely to find the cost of an operation, has not been studied before. In such a scenario, communication may become the dominant cost in the query optimization process. The high cost of costing raises a number of new design challenges, and adds additional factors to the complexity of distributed query optimization. Contributions of the Paper In this paper, consider a large space of distributed query optimizer design alternatives and argu the need for taking into consideration the high cost of optimization in heterogeneous environment. Accordingly, its present minimum communication cost factors of various well-known optimization approaches and presents an effective algorithms for query optimization in the distributed database environments. 2. Architecture and Problem Definition For query optimization purposes, the most relevant parts of the system are the query optimizer in the middleware, and the bidders at the fundamental sites (Figure 1). As in a cen tralized database system, the query optimizer could use a variety of different optimization algorithms, but the heterogeneous and distributed database system requires that the cost estimates be made by the original data sources or by the bidders. T he optimizer and the bidder communicate through use of two mechanisms: (1) Request for Bid (RFB) that the optimizer uses to request cost of an operation, and (2) Bid through which a bidder makes cost estimates. Fig1. System Architecture The Distributed Query Optimization Problem The distributed query optimization problem is to find an execution plan for a user specified query that satisfies an optimization goal provided by the user, this goal may be a function of many variables, including response time, total execution cost, accuracy and staleness of the data. For simplicity,
3 309 we concentrate on t wo of these factors, response time and total execution cost, though it is fairly easy to extend these to include other factors, assuming they can be easily estimated. Since we assume that the only information we have about the costs of operations is through the interface to the bidders, the optimization problem has to be restated as optimizing over the cost information exported by the bidders. Before describing the adaptations of the known query optimization algorithms to take into account the high cost of optimization, we will discuss two important issues that affect the optimization cost in this framework significantly. Query Optimization Approach To discuss the optimization technique and algorithm in this paper, it will make the following assumptions: Accurate Statistics: We assume that statistics regarding the cardinalities and the selectivity are available. This information can be collected through standard protocols that allow querying the host database about statistics, or by caching statistics from before query executions. Communication Costs: We assume that communication costs remain roughly constant for the duration of optimization and execution of the query, and that the optimizer can estimate the communication costs incurred in data transfer between any two sites involved in the query. No Pipelining Across Sites: We assume that there is no pipelining of data among query operators across sites. In general, all optimization algorithms break into three steps: Step 1: Choose subplans that require cost estimates and prepare the requests for bids. Step 2: Send messages to the bidders requesting costs. Step 3: Calculate the costs for plans/subplans. If possible, decide on an execution plan for the query, other- wise, repeat steps 2 and 3. Clearly we should try to minimize the number of repetitions of steps 2 a nd 3, s ince step 2 i nvolves expensive communication. The proposed algorithm has tried to minimize the retrieval of large datasets. The cost measure that we have considered is the size of the retrieved result sets. The proposed algorithm searches all possible plans for the query, using top down approach and the principle of optimality to cut subplans as early as possible. However the algorithm is finish it w orks in reasonable time in exponential nature and it is guaranteed to find the optimal plan for executing the query. Proposed optimization algorithms break into foure steps and these steps are as follows: Step 1: Catalog all feasible joins and multi-joins. (A feasible relation is defined as either a b ase relation or an intermediate relation that can be generated without Cartesian product; a feasible join is defined to be a join of two or more feasible relations that does not involve a Cartesian product). Step 2: Create bid requests for the joins and compute step 1 for scans on the base tables. Step 3: Request costs from the bidders for these join and scan operations. If input relations are intermediate table in that case for each join, only request the cost of performing that individual join and assuming that the input relations have already been computed. Step 4: Calculate the costs for plans/subplans recursively using classical dynamic programming (partial order dynamic programming if multidimensional costs are desired) and find the optimal plan for the query. Consider a banking enterprise (given in the text book) having the following relation schemas: branch (branch_name,branch_city,assets) customer ( customer_name, customer_street, customer_city) loan (loan_number,branch_name,amount) borrower (customer_name,loan_number) account (account_number, branch_name, balance) depositor (customer_name,account_number) We could distribute the tables among 3 sites: Site 1: branch Site 2: customer, borrower, depositor Site 3: loan, account There will be a central data dictionary which will contain the information regarding which tables are in which sites and the schema definition of the tables. Now consider a query: select customer_name, loan_number, amount from borrower, loanl where borrower.loan_number = Loan..loan_number and branch_name = Perryridge and amount > 1200;
4 310 select customer_name, borrower.loan_number,amount from borrower,loan where borrower.loan_number = loan.loan_number and branch_name = Perryridge and amount > 1200; Middle layer R1 Fig 3. R2 Site 1 Site 3 Site 2 Compute R1 X R2 branch loan, account customer, borrower, depositor Reslt set 1 Result set 2 Fig 2. Step 1: Look at the From clause and refer catalog for the central data dictionary to determine which sites are to be considered for this particular query. Create n number of SubSelect, SubFrom and SubWhere lists where n: no. of sites Distribute the SelectItems into the SubSelect(n) lists, FromItems into the SubFrom(n) lists. Step 2: Pick each item from the Where-Items list. If that clause belongs completely to a site, then include that clause into the corresponding SubWhere list. If a clause does not belong completely to one site then put that clause into a new list Final Where. Parse that clause to get the operands. Look for attributes of tables in individual operands. Find the sites to which these attributes belong to. If these attributes are not included in the respective Sub Select then include them. This step is one of the key steps in this algorithm. It is not intuitive and the example would help us understand it. Compute R1 X R2 Middle layer Select customer_name, borrower.loan_number from borrower Site 2 Select customer_name, borrower.loan number Site 3 Result set 1 Result set 2 Site 2 Site 3
5 311 Fig 4. Step 3: Using the SubSelect, SubFrom and SubWhere lists generate the subquery. FinalSelect = SelectItems FinalFrom = {R1,R2..,Rn) where n : no of sites and rn stands for the result set obtained from the nth site. Generate the final query from the FinalSelect, FinalFrom and FinalWhere lists. Execuation Plan: SUBQUERY[1] =NIL SUBQUERY[2] : SELECT CUSTOMER_NAME,B.LOAN_NUMBER FROM BORROWER ; SUBQUERY[3] : SELECT AMOUNT, L.LOAN_NUMBER FROM LOAN L WHERE BRANCH_NAME = PERRYRIDGE AND AMOUNT>1200; FINALQUERY : SELECT CUSTOMER_NAME,B.LOAN_NUMBER, AMOUNT FROM R2,R3 WHERE B.LOAN_NUMBER = LOAN.LOAN_NUMBER Where R1, R2 are result sets obtained from sites 2 and 3 respectively. select customer_name, borrower.loan_number,amount from borrower,loan where borrower.loan_number = loan.loan_number and branch_name = Perryridge and amount > 1200; Compute R1 X R2 Figure 5. Step 4: Execute the sub queries at the respective sites parallel (intra-query parallelism) and the final query at the middle layer. Conclusion A presented approach of query optimization is very useful for distributed database systems. To computations of cost from the optimization process, the optimizer must consult the data sources involved in an operation to find the cost of that operation. The mentioned analytical process indicate that, in many cases, especially when the physical database design is known to the optimizer, this query optimization algorithm works very well. In absence of such information, more aggressive optimization techniques must be used. References [1]Net market makers inc [2] L. Knight. the e-market maker revolution, dataquest inc [3] M. Lenzerini, and S.B. Navathe. A comparative analysis of methodologies for database scheme integration. ACM Computing Surveys, 15(4): [4R. J. Miller, L. M. Haas, and M. Hernández, Schema Mapping as Query. Discovery, in VLDB, 2000, pp
6 312 [5] L. M. Haas, P. G. Selinger, E. Bertino, D. Daniels, B. G. Lindsay, G. M.. Lohman, Y. Masunaga, C. Mohan, P. Ng, P. F. Wilms, and R. A. Yost. R*: A research project on distributed relational dbms. IEEE Database Eng. Bull.,. 5(4): [7] P. A. Bernstein, N. Goodman, E. Wong, C. L. Reeve, and J. B. R.. Jr., Query processing in a s ystem for distributed databases (SDD-1),. TODS, vol. 6, no. 4, [15] R. S. Epstein, M. Stonebraker,... Databases Where Relations Are Hash Partitioned, TODS, vol. 16, no. 2, pp [8] Z. G. Ives, D. Florescu, M. Friedman, A. Y. Levy, and D. S.Weld. An adaptive query execution system for data integration.in SIGMOD, [9] R. Avnur, J.M. Hellerstein, B. Lo, C. Olston, B. Raman, V. Raman, T. Roth, and K. Wylie, "CONTROL: Continuous Output and Navigation Technology with Refinement On- Line", in Proc. SIGMOD Conference, 1998, pp [10] C.Olston and J. Widom. Offering a precisionperformance tradeoff for aggregation queries over replicated data. vldb.org/conf /2000/pp [11] A. Tomasic, R. Amouroux, P. Bonnet, O. Kapitskaia, H. Naacke, and L. Raschid. The distributed information search component (DISCO) and the world wide web. In IEEE 1998, Volume: 10 Issue: 5 pp
Simple SQL Queries (3)
Simple SQL Queries (3) What Have We Learned about SQL? Basic SELECT-FROM-WHERE structure Selecting from multiple tables String operations Set operations Aggregate functions and group-by queries Null values
SUBQUERIES AND VIEWS. CS121: Introduction to Relational Database Systems Fall 2015 Lecture 6
SUBQUERIES AND VIEWS CS121: Introduction to Relational Database Systems Fall 2015 Lecture 6 String Comparisons and GROUP BY 2! Last time, introduced many advanced features of SQL, including GROUP BY! Recall:
Comp 5311 Database Management Systems. 3. Structured Query Language 1
Comp 5311 Database Management Systems 3. Structured Query Language 1 1 Aspects of SQL Most common Query Language used in all commercial systems Discussion is based on the SQL92 Standard. Commercial products
Data Mining and Database Systems: Where is the Intersection?
Data Mining and Database Systems: Where is the Intersection? Surajit Chaudhuri Microsoft Research Email: [email protected] 1 Introduction The promise of decision support systems is to exploit enterprise
Advance DBMS. Structured Query Language (SQL)
Structured Query Language (SQL) Introduction Commercial database systems use more user friendly language to specify the queries. SQL is the most influential commercially marketed product language. Other
Introduction to SQL (3.1-3.4)
CSL 451 Introduction to Database Systems Introduction to SQL (3.1-3.4) Department of Computer Science and Engineering Indian Institute of Technology Ropar Narayanan (CK) Chatapuram Krishnan! Summary Parts
You can use command show database; to check if bank has been created.
Banking Example primary key is underlined) branch branch-name, branch-city, assets) customer customer-name, customer-street, customer-city) account account-number, branch-name, balance) loan loan-number,
SODDA A SERVICE-ORIENTED DISTRIBUTED DATABASE ARCHITECTURE
SODDA A SERVICE-ORIENTED DISTRIBUTED DATABASE ARCHITECTURE Breno Mansur Rabelo Centro EData Universidade do Estado de Minas Gerais, Belo Horizonte, MG, Brazil [email protected] Clodoveu Augusto Davis
Chapter 6: Integrity Constraints
Chapter 6: Integrity Constraints Domain Constraints Referential Integrity Assertions Triggers Functional Dependencies Database Systems Concepts 6.1 Silberschatz, Korth and Sudarshan c 1997 Domain Constraints
BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT
BUILDING BLOCKS OF DATAWAREHOUSE G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT 1 Data Warehouse Subject Oriented Organized around major subjects, such as customer, product, sales. Focusing on
A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System
A Multidatabase System as 4-Tiered Client-Server Distributed Heterogeneous Database System Mohammad Ghulam Ali Academic Post Graduate Studies and Research Indian Institute of Technology, Kharagpur Kharagpur,
Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner
24 Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner Rekha S. Nyaykhor M. Tech, Dept. Of CSE, Priyadarshini Bhagwati College of Engineering, Nagpur, India
Preparing Data Sets for the Data Mining Analysis using the Most Efficient Horizontal Aggregation Method in SQL
Preparing Data Sets for the Data Mining Analysis using the Most Efficient Horizontal Aggregation Method in SQL Jasna S MTech Student TKM College of engineering Kollam Manu J Pillai Assistant Professor
Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661, ISBN: 2278-8727 Volume 6, Issue 5 (Nov. - Dec. 2012), PP 36-41 Horizontal Aggregations in SQL to Prepare Data Sets for Data Mining Analysis
Outline. Mariposa: A wide-area distributed database. Outline. Motivation. Outline. (wrong) Assumptions in Distributed DBMS
Mariposa: A wide-area distributed database Presentation: Shahed 7. Experiment and Conclusion Discussion: Dutch 2 Motivation 1) Build a wide-area Distributed database system 2) Apply principles of economics
Query Optimization Approach in SQL to prepare Data Sets for Data Mining Analysis
Query Optimization Approach in SQL to prepare Data Sets for Data Mining Analysis Rajesh Reddy Muley 1, Sravani Achanta 2, Prof.S.V.Achutha Rao 3 1 pursuing M.Tech(CSE), Vikas College of Engineering and
B2.2-R3: INTRODUCTION TO DATABASE MANAGEMENT SYSTEMS
B2.2-R3: INTRODUCTION TO DATABASE MANAGEMENT SYSTEMS NOTE: 1. There are TWO PARTS in this Module/Paper. PART ONE contains FOUR questions and PART TWO contains FIVE questions. 2. PART ONE is to be answered
A Knowledge Management Framework Using Business Intelligence Solutions
www.ijcsi.org 102 A Knowledge Management Framework Using Business Intelligence Solutions Marwa Gadu 1 and Prof. Dr. Nashaat El-Khameesy 2 1 Computer and Information Systems Department, Sadat Academy For
Lection 3-4 WAREHOUSING
Lection 3-4 DATA WAREHOUSING Learning Objectives Understand d the basic definitions iti and concepts of data warehouses Understand data warehousing architectures Describe the processes used in developing
Dataset Preparation and Indexing for Data Mining Analysis Using Horizontal Aggregations
Dataset Preparation and Indexing for Data Mining Analysis Using Horizontal Aggregations Binomol George, Ambily Balaram Abstract To analyze data efficiently, data mining systems are widely using datasets
Chapter 1: Introduction
Chapter 1: Introduction Database System Concepts, 5th Ed. See www.db book.com for conditions on re use Chapter 1: Introduction Purpose of Database Systems View of Data Database Languages Relational Databases
A Critical Review of Data Warehouse
Global Journal of Business Management and Information Technology. Volume 1, Number 2 (2011), pp. 95-103 Research India Publications http://www.ripublication.com A Critical Review of Data Warehouse Sachin
PartJoin: An Efficient Storage and Query Execution for Data Warehouses
PartJoin: An Efficient Storage and Query Execution for Data Warehouses Ladjel Bellatreche 1, Michel Schneider 2, Mukesh Mohania 3, and Bharat Bhargava 4 1 IMERIR, Perpignan, FRANCE [email protected] 2
TECHNIQUES FOR DATA REPLICATION ON DISTRIBUTED DATABASES
Constantin Brâncuşi University of Târgu Jiu ENGINEERING FACULTY SCIENTIFIC CONFERENCE 13 th edition with international participation November 07-08, 2008 Târgu Jiu TECHNIQUES FOR DATA REPLICATION ON DISTRIBUTED
Lesson 8: Introduction to Databases E-R Data Modeling
Lesson 8: Introduction to Databases E-R Data Modeling Contents Introduction to Databases Abstraction, Schemas, and Views Data Models Database Management System (DBMS) Components Entity Relationship Data
Efficient Query Optimization for Distributed Join in Database Federation
Efficient Query Optimization for Distributed Join in Database Federation by Di Wang A Thesis Submitted to the Faculty of the Worcester Polytechnic Institute In partial fulfillment of the requirements for
How Can Data Sources Specify Their Security Needs to a Data Warehouse?
How Can Data Sources Specify Their Security Needs to a Data Warehouse? Arnon Rosenthal The MITRE Corporation [email protected] Edward Sciore Boston College (and MITRE) [email protected] Abstract In current warehouse
A Novel Cloud Computing Data Fragmentation Service Design for Distributed Systems
A Novel Cloud Computing Data Fragmentation Service Design for Distributed Systems Ismail Hababeh School of Computer Engineering and Information Technology, German-Jordanian University Amman, Jordan Abstract-
Elena Baralis, Silvia Chiusano Politecnico di Torino. Pag. 1. Query optimization. DBMS Architecture. Query optimizer. Query optimizer.
DBMS Architecture INSTRUCTION OPTIMIZER Database Management Systems MANAGEMENT OF ACCESS METHODS BUFFER MANAGER CONCURRENCY CONTROL RELIABILITY MANAGEMENT Index Files Data Files System Catalog BASE It
A Survey on Data Warehouse Architecture
A Survey on Data Warehouse Architecture Rajiv Senapati 1, D.Anil Kumar 2 1 Assistant Professor, Department of IT, G.I.E.T, Gunupur, India 2 Associate Professor, Department of CSE, G.I.E.T, Gunupur, India
MTCache: Mid-Tier Database Caching for SQL Server
MTCache: Mid-Tier Database Caching for SQL Server Per-Åke Larson Jonathan Goldstein Microsoft {palarson,jongold}@microsoft.com Hongfei Guo University of Wisconsin [email protected] Jingren Zhou Columbia
Load Distribution in Large Scale Network Monitoring Infrastructures
Load Distribution in Large Scale Network Monitoring Infrastructures Josep Sanjuàs-Cuxart, Pere Barlet-Ros, Gianluca Iannaccone, and Josep Solé-Pareta Universitat Politècnica de Catalunya (UPC) {jsanjuas,pbarlet,pareta}@ac.upc.edu
Data Warehouse Snowflake Design and Performance Considerations in Business Analytics
Journal of Advances in Information Technology Vol. 6, No. 4, November 2015 Data Warehouse Snowflake Design and Performance Considerations in Business Analytics Jiangping Wang and Janet L. Kourik Walker
Integrating Heterogeneous Data Sources Using XML
Integrating Heterogeneous Data Sources Using XML 1 Yogesh R.Rochlani, 2 Prof. A.R. Itkikar 1 Department of Computer Science & Engineering Sipna COET, SGBAU, Amravati (MH), India 2 Department of Computer
BUILDING OLAP TOOLS OVER LARGE DATABASES
BUILDING OLAP TOOLS OVER LARGE DATABASES Rui Oliveira, Jorge Bernardino ISEC Instituto Superior de Engenharia de Coimbra, Polytechnic Institute of Coimbra Quinta da Nora, Rua Pedro Nunes, P-3030-199 Coimbra,
Data W a Ware r house house and and OLAP Week 5 1
Data Warehouse and OLAP Week 5 1 Midterm I Friday, March 4 Scope Homework assignments 1 4 Open book Team Homework Assignment #7 Read pp. 121 139, 146 150 of the text book. Do Examples 3.8, 3.10 and Exercise
Basic Concepts. Chapter A: Network Model. Cont.) Data-Structure Diagrams (Cont( Data-Structure Diagrams. General Relationships. The DBTG CODASYL Model
Chapter A: Network Model Basic Concepts Basic Concepts Data-Structure Diagrams The DBTG CODASYL Model DBTG Data-Retrieval Facility DBTG Update Facility DBTG Set-Processing Facility Mapping of Networks
Techniques to Produce Good Web Service Compositions in The Semantic Grid
Techniques to Produce Good Web Service Compositions in The Semantic Grid Eduardo Blanco Universidad Simón Bolívar, Departamento de Computación y Tecnología de la Información, Apartado 89000, Caracas 1080-A,
Creation of Datasets for Data Mining Analysis by Using Horizontal Aggregation in SQL
International Journal of Computer Applications in Engineering Sciences [VOL III, ISSUE I, MARCH 2013] [ISSN: 2231-4946] Creation of Datasets for Data Mining Analysis by Using Horizontal Aggregation in
Report on the Dagstuhl Seminar Data Quality on the Web
Report on the Dagstuhl Seminar Data Quality on the Web Michael Gertz M. Tamer Özsu Gunter Saake Kai-Uwe Sattler U of California at Davis, U.S.A. U of Waterloo, Canada U of Magdeburg, Germany TU Ilmenau,
2 Associating Facts with Time
TEMPORAL DATABASES Richard Thomas Snodgrass A temporal database (see Temporal Database) contains time-varying data. Time is an important aspect of all real-world phenomena. Events occur at specific points
Implementation of P2P Reputation Management Using Distributed Identities and Decentralized Recommendation Chains
Implementation of P2P Reputation Management Using Distributed Identities and Decentralized Recommendation Chains P.Satheesh Associate professor Dept of Computer Science and Engineering MVGR college of
Chapter 1: Introduction
Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition Language Data Manipulation Language Transaction Management Storage Management Database Administrator Database
INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS
INTEGRATION OF XML DATA IN PEER-TO-PEER E-COMMERCE APPLICATIONS Tadeusz Pankowski 1,2 1 Institute of Control and Information Engineering Poznan University of Technology Pl. M.S.-Curie 5, 60-965 Poznan
We know how to query a database using SQL. A set of tables and their schemas are given Data are properly loaded
E-R Diagram Database Development We know how to query a database using SQL A set of tables and their schemas are given Data are properly loaded But, how can we develop appropriate tables and their schema
ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU
Computer Science 14 (2) 2013 http://dx.doi.org/10.7494/csci.2013.14.2.243 Marcin Pietroń Pawe l Russek Kazimierz Wiatr ACCELERATING SELECT WHERE AND SELECT JOIN QUERIES ON A GPU Abstract This paper presents
Chapter 1: Introduction. Database Management System (DBMS)
Chapter 1: Introduction Purpose of Database Systems View of Data Data Models Data Definition Language Data Manipulation Language Transaction Management Storage Management Database Administrator Database
EFFECTIVE DATA RECOVERY FOR CONSTRUCTIVE CLOUD PLATFORM
INTERNATIONAL JOURNAL OF REVIEWS ON RECENT ELECTRONICS AND COMPUTER SCIENCE EFFECTIVE DATA RECOVERY FOR CONSTRUCTIVE CLOUD PLATFORM Macha Arun 1, B.Ravi Kumar 2 1 M.Tech Student, Dept of CSE, Holy Mary
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data
An Analysis on Density Based Clustering of Multi Dimensional Spatial Data K. Mumtaz 1 Assistant Professor, Department of MCA Vivekanandha Institute of Information and Management Studies, Tiruchengode,
Chapter 14: Query Optimization
Chapter 14: Query Optimization Database System Concepts 5 th Ed. See www.db-book.com for conditions on re-use Chapter 14: Query Optimization Introduction Transformation of Relational Expressions Catalog
Turkish Journal of Engineering, Science and Technology
Turkish Journal of Engineering, Science and Technology 03 (2014) 106-110 Turkish Journal of Engineering, Science and Technology journal homepage: www.tujest.com Integrating Data Warehouse with OLAP Server
Associate Professor, Department of CSE, Shri Vishnu Engineering College for Women, Andhra Pradesh, India 2
Volume 6, Issue 3, March 2016 ISSN: 2277 128X International Journal of Advanced Research in Computer Science and Software Engineering Research Paper Available online at: www.ijarcsse.com Special Issue
QOS Based Web Service Ranking Using Fuzzy C-means Clusters
Research Journal of Applied Sciences, Engineering and Technology 10(9): 1045-1050, 2015 ISSN: 2040-7459; e-issn: 2040-7467 Maxwell Scientific Organization, 2015 Submitted: March 19, 2015 Accepted: April
Udai Shankar 2 Deptt. of Computer Sc. & Engineering Madan Mohan Malaviya Engineering College, Gorakhpur, India
A Protocol for Concurrency Control in Real-Time Replicated Databases System Ashish Srivastava 1 College, Gorakhpur. India Udai Shankar 2 College, Gorakhpur, India Sanjay Kumar Tiwari 3 College, Gorakhpur,
chapater 7 : Distributed Database Management Systems
chapater 7 : Distributed Database Management Systems Distributed Database Management System When an organization is geographically dispersed, it may choose to store its databases on a central database
SQL Query Evaluation. Winter 2006-2007 Lecture 23
SQL Query Evaluation Winter 2006-2007 Lecture 23 SQL Query Processing Databases go through three steps: Parse SQL into an execution plan Optimize the execution plan Evaluate the optimized plan Execution
Grid Data Integration based on Schema-mapping
Grid Data Integration based on Schema-mapping Carmela Comito and Domenico Talia DEIS, University of Calabria, Via P. Bucci 41 c, 87036 Rende, Italy {ccomito, talia}@deis.unical.it http://www.deis.unical.it/
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM
A REVIEW PAPER ON THE HADOOP DISTRIBUTED FILE SYSTEM Sneha D.Borkar 1, Prof.Chaitali S.Surtakar 2 Student of B.E., Information Technology, J.D.I.E.T, [email protected] Assistant Professor, Information
AN OVERVIEW OF DISTRIBUTED DATABASE MANAGEMENT
AN OVERVIEW OF DISTRIBUTED DATABASE MANAGEMENT BY AYSE YASEMIN SEYDIM CSE 8343 - DISTRIBUTED OPERATING SYSTEMS FALL 1998 TERM PROJECT TABLE OF CONTENTS INTRODUCTION...2 1. WHAT IS A DISTRIBUTED DATABASE
A QoS-Aware Web Service Selection Based on Clustering
International Journal of Scientific and Research Publications, Volume 4, Issue 2, February 2014 1 A QoS-Aware Web Service Selection Based on Clustering R.Karthiban PG scholar, Computer Science and Engineering,
The Study on Data Warehouse Design and Usage
International Journal of Scientific and Research Publications, Volume 3, Issue 3, March 2013 1 The Study on Data Warehouse Design and Usage Mr. Dishek Mankad 1, Mr. Preyash Dholakia 2 1 M.C.A., B.R.Patel
Index Terms : Load rebalance, distributed file systems, clouds, movement cost, load imbalance, chunk.
Load Rebalancing for Distributed File Systems in Clouds. Smita Salunkhe, S. S. Sannakki Department of Computer Science and Engineering KLS Gogte Institute of Technology, Belgaum, Karnataka, India Affiliated
Constraint-based Query Distribution Framework for an Integrated Global Schema
Constraint-based Query Distribution Framework for an Integrated Global Schema Ahmad Kamran Malik 1, Muhammad Abdul Qadir 1, Nadeem Iftikhar 2, and Muhammad Usman 3 1 Muhammad Ali Jinnah University, Islamabad,
SQL is capable in manipulating relational data SQL is not good for many other tasks
Embedded SQL SQL Is Not for All SQL is capable in manipulating relational data SQL is not good for many other tasks Control structures: loops, conditional branches, Advanced data structures: trees, arrays,
Web-Based Genomic Information Integration with Gene Ontology
Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, [email protected] Abstract. Despite the dramatic growth of online genomic
XML Data Integration in OGSA Grids
XML Data Integration in OGSA Grids Carmela Comito and Domenico Talia University of Calabria Italy [email protected] Outline Introduction Data Integration and Grids The XMAP Data Integration Framework
TOP-DOWN APPROACH PROCESS BUILT ON CONCEPTUAL DESIGN TO PHYSICAL DESIGN USING LIS, GCS SCHEMA
TOP-DOWN APPROACH PROCESS BUILT ON CONCEPTUAL DESIGN TO PHYSICAL DESIGN USING LIS, GCS SCHEMA Ajay B. Gadicha 1, A. S. Alvi 2, Vijay B. Gadicha 3, S. M. Zaki 4 1&4 Deptt. of Information Technology, P.
Fig. 3. PostgreSQL subsystems
Development of a Parallel DBMS on the Basis of PostgreSQL C. S. Pan [email protected] South Ural State University Abstract. The paper describes the architecture and the design of PargreSQL parallel database
DATABASE MANAGEMENT SYSTEMS
CHAPTER DATABASE MANAGEMENT SYSTEMS This chapter reintroduces the term database in a more technical sense than it has been used up to now. Data is one of the most valuable assets held by most organizations.
SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL
SEARCH ENGINE WITH PARALLEL PROCESSING AND INCREMENTAL K-MEANS FOR FAST SEARCH AND RETRIEVAL Krishna Kiran Kattamuri 1 and Rupa Chiramdasu 2 Department of Computer Science Engineering, VVIT, Guntur, India
ORGANIZATIONAL KNOWLEDGE MAPPING BASED ON LIBRARY INFORMATION SYSTEM
ORGANIZATIONAL KNOWLEDGE MAPPING BASED ON LIBRARY INFORMATION SYSTEM IRANDOC CASE STUDY Ammar Jalalimanesh a,*, Elaheh Homayounvala a a Information engineering department, Iranian Research Institute for
Log Mining Based on Hadoop s Map and Reduce Technique
Log Mining Based on Hadoop s Map and Reduce Technique ABSTRACT: Anuja Pandit Department of Computer Science, [email protected] Amruta Deshpande Department of Computer Science, [email protected]
... Foreword... 17. ... Preface... 19
... Foreword... 17... Preface... 19 PART I... SAP's Enterprise Information Management Strategy and Portfolio... 25 1... Introducing Enterprise Information Management... 27 1.1... Defining Enterprise Information
DEVELOPMENT OF HASH TABLE BASED WEB-READY DATA MINING ENGINE
DEVELOPMENT OF HASH TABLE BASED WEB-READY DATA MINING ENGINE SK MD OBAIDULLAH Department of Computer Science & Engineering, Aliah University, Saltlake, Sector-V, Kol-900091, West Bengal, India [email protected]
A Design and implementation of a data warehouse for research administration universities
A Design and implementation of a data warehouse for research administration universities André Flory 1, Pierre Soupirot 2, and Anne Tchounikine 3 1 CRI : Centre de Ressources Informatiques INSA de Lyon
Chapter 3. Database Environment - Objectives. Multi-user DBMS Architectures. Teleprocessing. File-Server
Chapter 3 Database Architectures and the Web Transparencies Database Environment - Objectives The meaning of the client server architecture and the advantages of this type of architecture for a DBMS. The
Big Data Begets Big Database Theory
Big Data Begets Big Database Theory Dan Suciu University of Washington 1 Motivation Industry analysts describe Big Data in terms of three V s: volume, velocity, variety. The data is too big to process
DLDB: Extending Relational Databases to Support Semantic Web Queries
DLDB: Extending Relational Databases to Support Semantic Web Queries Zhengxiang Pan (Lehigh University, USA [email protected]) Jeff Heflin (Lehigh University, USA [email protected]) Abstract: We
E-Governance in Higher Education: Concept and Role of Data Warehousing Techniques
E-Governance in Higher Education: Concept and Role of Data Warehousing Techniques Prateek Bhanti Asst. Professor, FASC, MITS Deemed University, Lakshmangarh-332311, Sikar, Rajasthan, INDIA Urmani Kaushal
Fuzzy Multi-Join and Top-K Query Model for Search-As-You-Type in Multiple Tables
Fuzzy Multi-Join and Top-K Query Model for Search-As-You-Type in Multiple Tables 1 M.Naveena, 2 S.Sangeetha 1 M.E-CSE, 2 AP-CSE V.S.B. Engineering College, Karur, Tamilnadu, India. 1 [email protected],
