KINGS COLLEGE OF ENGINEERING

Similar documents
Data Mining: Concepts and Techniques. Jiawei Han. Micheline Kamber. Simon Fräser University К MORGAN KAUFMANN PUBLISHERS. AN IMPRINT OF Elsevier

CS1011: DATA WAREHOUSING AND MINING TWO MARKS QUESTIONS AND ANSWERS

Copyright 2007 Ramez Elmasri and Shamkant B. Navathe. Slide 29-1

DATA WAREHOUSING - OLAP

1. What are the uses of statistics in data mining? Statistics is used to Estimate the complexity of a data mining problem. Suggest which data mining

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

Part 22. Data Warehousing

Week 3 lecture slides

DATA WAREHOUSING AND OLAP TECHNOLOGY

DATA WAREHOUSE E KNOWLEDGE DISCOVERY

OLAP & DATA MINING CS561-SPRING 2012 WPI, MOHAMED ELTABAKH

OLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP

Database Applications. Advanced Querying. Transaction Processing. Transaction Processing. Data Warehouse. Decision Support. Transaction processing

Data W a Ware r house house and and OLAP II Week 6 1

1. OLAP is an acronym for a. Online Analytical Processing b. Online Analysis Process c. Online Arithmetic Processing d. Object Linking and Processing

M Designing and Implementing OLAP Solutions Using Microsoft SQL Server Day Course

Database Management System Dr. S. Srinath Department of Computer Science & Engineering Indian Institute of Technology, Madras Lecture No.

New Approach of Computing Data Cubes in Data Warehousing

Data Warehousing. Read chapter 13 of Riguzzi et al Sistemi Informativi. Slides derived from those by Hector Garcia-Molina

CHAPTER 4 Data Warehouse Architecture

A Technical Review on On-Line Analytical Processing (OLAP)

Data Warehouse Design

Basics of Dimensional Modeling

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing

Fluency With Information Technology CSE100/IMT100

Data W a Ware r house house and and OLAP Week 5 1

Introduction to Data Warehousing. Ms Swapnil Shrivastava

European Archival Records and Knowledge Preservation Database Archiving in the E-ARK Project

OLAP and OLTP. AMIT KUMAR BINDAL Associate Professor M M U MULLANA

Week 13: Data Warehousing. Warehousing

Index Contents Page No. Introduction . Data Mining & Knowledge Discovery

OLAP. Business Intelligence OLAP definition & application Multidimensional data representation

DATA CUBES E Jayant Haritsa Computer Science and Automation Indian Institute of Science. JAN 2014 Slide 1 DATA CUBES

IST722 Data Warehousing

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Winter 2009 Lecture 15 - Data Warehousing: Cubes


Subject Description Form

OLAP Theory-English version

Review. Data Warehousing. Today. Star schema. Star join indexes. Dimension hierarchies

Data Warehousing and Online Analytical Processing

CS2032 Data warehousing and Data Mining Unit II Page 1

John R. Vacca INSIDE

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Overview of Data Warehousing and OLAP

(b) How data mining is different from knowledge discovery in databases (KDD)? Explain.

Data Warehousing and Data Mining. A.A Datawarehousing & Datamining 1

What is OLAP - On-line analytical processing

Data Warehousing, OLAP, and Data Mining

Data Warehousing and Decision Support. Introduction. Three Complementary Trends. Chapter 23, Part A

14. Data Warehousing & Data Mining

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Business Intelligence. Data Mining and Optimization for Decision Making

Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining

Anwendersoftware Anwendungssoftwares a. Data-Warehouse-, Data-Mining- and OLAP-Technologies. Online Analytic Processing

Chapter 3, Data Warehouse and OLAP Operations

Monitoring Genebanks using Datamarts based in an Open Source Tool

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

OLAP and Data Warehousing! Introduction!

Data Warehousing Concepts

Data Warehousing and OLAP

BUILDING BLOCKS OF DATAWAREHOUSE. G.Lakshmi Priya & Razia Sultana.A Assistant Professor/IT

Data Warehouse: Introduction

Hybrid OLAP, An Introduction

Learning Objectives. Definition of OLAP Data cubes OLAP operations MDX OLAP servers

Building Data Cubes and Mining Them. Jelena Jovanovic

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

OLAP Systems and Multidimensional Expressions I

Outline. Data Warehousing. What is a Warehouse? What is a Warehouse?

Multi-dimensional index structures Part I: motivation

When to consider OLAP?

Principles of Data Mining by Hand&Mannila&Smyth

Data Warehousing (DW) Online Analytical Processing (OLAP) Data Mining

Data Mining as Part of Knowledge Discovery in Databases (KDD)

Data Warehouse design

SAS BI Course Content; Introduction to DWH / BI Concepts

DATA WAREHOUSING APPLICATIONS: AN ANALYTICAL TOOL FOR DECISION SUPPORT SYSTEM

Data Mining. Vera Goebel. Department of Informatics, University of Oslo

Decision Support. Chapter 23. Database Management Systems, 2 nd Edition. R. Ramakrishnan and J. Gehrke 1

SQL Server Analysis Services Complete Practical & Real-time Training

Unit -3. Learning Objective. Demand for Online analytical processing Major features and functions OLAP models and implementation considerations

This tutorial will help computer science graduates to understand the basic-toadvanced concepts related to data warehousing.

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

CHAPTER 3. Data Warehouses and OLAP

Turkish Journal of Engineering, Science and Technology

Web Log Data Sparsity Analysis and Performance Evaluation for OLAP

Introduction to Data Mining

CSC 177 Data warehouse and Mining project. Pooja Vora Vishma Shah Guided by Prof. Meiliu lu

COM CO P 5318 Da t Da a t Explora Explor t a ion and Analysis y Chapte Chapt r e 3

Chapter 3: Cluster Analysis

3/17/2009. Knowledge Management BIKM eclassifier Integrated BIKM Tools

Data Preprocessing. Week 2

Chapter 20: Data Analysis

Transcription:

KINGS COLLEGE OF ENGINEERING DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING ACADEMIC YEAR 2011-2012 / ODD SEMESTER SUBJECT CODE\NAME: CS1011-DATA WAREHOUSE AND DATA MINING YEAR / SEM: IV / VII UNIT I BASICS OF DATA WAREHOUSING 1. Define Data warehouse? 2. What are operational databases? 3. Define OLTP? 4. Define OLAP? 5. How a database design is represented in OLAP systems? 6. Write short notes on multidimensional data model? 7. Define data cube? 8. What are facts? 9. What are dimensions? 10. Define dimension table? 11. Define fact table? 12. What are lattice of cuboids? 13. What is apex cuboid? 14. List out the components of star schema? 15. What is snowflake schema? 16. List out the components of fact constellation schema? Kings College of Engineering 1

17. Point out the major difference between the star schema and the snowflake schema? 18. Which is popular in the data warehouse design, star schema model (or) snowflake schema model? 19. Define concept hierarchy? 20. Define Total Order? 21. Define partial order? 22. Define schema hierarchy? 23. List out the OLAP operations in multidimensional data model? 24. What is roll-up operation? 25. What is drill-down operation? 26. What is slice operation? 27. What is dice operation? 28. What is pivot operation? 29. List out the views in the design of a data warehouse? 30. List out the steps of the data warehouse design process? 31. Define ROLAP? 32. Define MOLAP? 33. Define HOLAP? 34. What is data mart? 35. What are dependent and independent data marts? 36. What is virtual warehouse? 37. Define indexing? 38. What are the types of indexing? 39. Define metadata? Kings College of Engineering 2

40. Define VLDB? PART - B 1. Discuss the components of data warehouse. (16) 2. List out the differences between OLTP and OLAP. (16) 3. Discuss the various schematic representations in multidimensional model. (16) 4. Explain the OLAP operations multidimensional model. (16) 5. Explain the design and construction of a data warehouse. (16) 6. Explain the three-tier data warehouse architecture. (16) 7. (a) Explain indexing. (8) (b)write short notes on VLDB. (8) UNIT II DATA PREPROCESSING LANGUAGE ARCHITECTURE CONCEPT DESCRIPTION 1. Define Data cleaning. 2. Define Data mining. 3. What is Data Generalization? 4. Define Data mining Query Language. 5. What are the different types of coupling? 6. Define Data Integration. 7. Define Data Transformation. Kings College of Engineering 3

8. What is GIS? 9. What is the Graphical User Interface? 10. What is Concept Description? 11. What are the ways to control a generalization process? 12. What are the approaches categorized in flexible generalization of large dataset? 13. What are the methods of class comparison? 14. What is the difference between semi-tight coupling and tight coupling? 15. What are the uses of statistics in data mining? 16. What is the dispersion? 17. What is a q-q plot? PART B 1. What are the types of data pre-processing techniques? Explain in detail about them? (16) 2. (a) Explain the smoothing Techniques? (8) (b) Explain Data transformation in detail? (8) 3. Explain data reduction? (16) 4. Explain parametric methods and non-parametric methods of reduction? (16) 5. Explain Data Discretization and Concept Hierarchy Generation? (16) 6. Explain Data mining Primitives? (16) 7. Explain Attribute Oriented Induction? (16) 8. Explain Statistical measures in data bases? (16) UNIT III ASSOCIATION RULES Kings College of Engineering 4

1. Define Association Rule Mining. 2. When we can say the association rules are interesting? 3. Explain Association rule in mathematical notations. 4. Define support and confidence in Association rule mining. 5. How is association rules mined from large databases? 6. Describe the different classifications of Association rule mining. 7. What is the purpose of Apriori Algorithm? 8. Define anti-monotone property. 9. How to generate association rules from frequent item sets? 10. Give few techniques to improve the efficiency of Apriori algorithm. 11. What are the things suffering the performance of Apriori candidate generation technique. 12. Describe the method of generating frequent item sets without candidate generation. 13. Define Iceberg query. 14. Mention few approaches to mining Multilevel Association Rules 15. What are multidimensional association rules? 16. Define constraint-based Association Mining. PART - B 1. (a) Explain multilevel association rule? (8) (b) Explain Apriori algorithm. (8) 2. Explain the apriori algorithm for finding frequent item sets without candidate generation? (16) 3. (a) Explain the apriori algorithm for finding frequent item sets? (8) (b) Explain Apriori algorithm. (8) Kings College of Engineering 5

4. Explain mining single dimensional Boolean associated rules from transaction Database? (16) 5. Explain mining Multi dimensional Boolean associated rules from transaction Database? (16) 6. Explain constraint-based association mining? (16) UNIT IV CLASSIFICATION AND CLUSTERING 1. Define the concept of classification. 2. What is Decision tree? 3. What is Attribute Selection Measure? 4. Describe Tree pruning methods. 5. Define Pre Pruning 6. Define Post Pruning. 7. What is meant by Pattern? 8. Define the concept of prediction. 9. Define Clustering? 10. What do you mean by Cluster Analysis? 11. What are the fields in which clustering techniques are used? 12. What are the requirements of cluster analysis? 13. What is interval scaled variables? 14. Define Binary variables? And what are the two types of binary variables? 15. Define nominal, ordinal and ratio scaled variables? 16. What do u mean by partitioning method? 17. Define CLARA and CLARANS? 18. What is Hierarchical method? Kings College of Engineering 6

19. Differentiate Agglomerative and Divisive Hierarchical Clustering? 20. Define Density based method? 21. What is a DBSCAN? 22. What are Bayesian Classifiers? 23. Where is decision trees mainly used? 24. How will you solve a classification problem using decision trees? 25. What is decision tree pruning? 26. Explain ID3 27. What is Association rule? 28. What is Association rule? 29. Define support. 30. Define Confidence. 31. How is association rules mined from large databases? 32. What is the classification of association rules based on various criteria? 33. What are the different types of data used for cluster analysis? PART - B 1. Explain various classification methods? (16) 2. Explain Bayessian classification? (16) 3. Explain Classifier accuracy with examples? (16) 4. Explain Hierarchical method of classifications? (16) 5. Explain Partition Methods of clustering? (16) 6. Explain classification by Decision tree induction? (16) Kings College of Engineering 7

7. Explain Outlier analysis? (16) 8. Explain the types of data in cluster analysis. (16) 9. (a) Discuss the requirements of clustering in data mining. (8) (b) Write sort notes on clustering methods (8) UNIT V RECENT TRENDS 1. Define Spatial Databases. 2. What are Time-Series databases? 3. What is Time Series Analysis? 4. What is Smoothing? 5. Name some of the data mining applications 6. Define Multimedia Database PART B 1. Explain mining complex types of data? (16) 2. (a)discuss abut WWW. (8) (b) Write short notes on Text Database. (8) 3. Describe mining for Spatial Database? (16) 4. Describe mining for Multimedia Database. (16) 5. Explain data mining application? (16) Kings College of Engineering 8