Data Mining and Data Warehousing on US Farmer s Data

Similar documents
CSC 177 Fall 2014 Team Project Final Report

CSC 177 Data warehouse and Mining project. Pooja Vora Vishma Shah Guided by Prof. Meiliu lu

Course Outline: Course: Implementing a Data Warehouse with Microsoft SQL Server 2012 Learning Method: Instructor-led Classroom Learning

Delivering Business Intelligence With Microsoft SQL Server 2005 or 2008 HDT922 Five Days

Implementing a Data Warehouse with Microsoft SQL Server 2012

GEHC IT Solutions. Centricity Practice Solution. Centricity Analytics 3.0

Course DSS. Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

5.5 Copyright 2011 Pearson Education, Inc. publishing as Prentice Hall. Figure 5-2

Chapter 5 Business Intelligence: Data Warehousing, Data Acquisition, Data Mining, Business Analytics, and Visualization

Implementing a Data Warehouse with Microsoft SQL Server

SQL Server 2012 Business Intelligence Boot Camp

TIM 50 - Business Information Systems

DIPLOMA IN WEBDEVELOPMENT

Introduction Predictive Analytics Tools: Weka

Implementing a Data Warehouse with Microsoft SQL Server

Foundations of Business Intelligence: Databases and Information Management

Tutorials for Project on Building a Business Analytic Model Using Data Mining Tool and Data Warehouse and OLAP Cubes IST 734

2074 : Designing and Implementing OLAP Solutions Using Microsoft SQL Server 2000

DATA MINING USING PENTAHO / WEKA

Business Intelligence Tutorial

COURSE 20463C: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

SAP BO Course Details

Implementing a Data Warehouse with Microsoft SQL Server

Implementing a Data Warehouse with Microsoft SQL Server 2012 (70-463)

Web Development using PHP (WD_PHP) Duration 1.5 months

Data Warehousing and Data Mining in Business Applications

Presented by: Jose Chinchilla, MCITP

Fluency With Information Technology CSE100/IMT100

Course: SAS BI(business intelligence) and DI(Data integration)training - Training Duration: 30 + Days. Take Away:

Course MIS. Foundations of Business Intelligence

Implementing a Data Warehouse with Microsoft SQL Server 2012 MOC 10777

Chapter 5. Warehousing, Data Acquisition, Data. Visualization

<no narration for this slide>

CHAPTER 5: BUSINESS ANALYTICS

Course 6234A: Implementing and Maintaining Microsoft SQL Server 2008 Analysis Services

Implement a Data Warehouse with Microsoft SQL Server 20463C; 5 days

Business Intelligence Tutorial: Introduction to the Data Warehouse Center

SQL Server Administrator Introduction - 3 Days Objectives

Implementing a Data Warehouse with Microsoft SQL Server MOC 20463

COURSE OUTLINE MOC 20463: IMPLEMENTING A DATA WAREHOUSE WITH MICROSOFT SQL SERVER

A Brief Tutorial on Database Queries, Data Mining, and OLAP

MS 50511A The Microsoft Business Intelligence 2010 Stack

#mstrworld. No Data Left behind: 20+ new data sources with new data preparation in MicroStrategy 10

Foundations of Business Intelligence: Databases and Information Management

Microsoft Data Warehouse in Depth

DBTech Pro Workshop. Knowledge Discovery from Databases (KDD) Including Data Warehousing and Data Mining. Georgios Evangelidis

Implementing a Data Warehouse with Microsoft SQL Server 2012

LITERATURE SURVEY ON DATA WAREHOUSE AND ITS TECHNIQUES

University of Gaziantep, Department of Business Administration

WEB DEVELOPMENT COURSE (PHP/ MYSQL)

East Asia Network Sdn Bhd

Migrating a Discoverer System to Oracle Business Intelligence Enterprise Edition

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7

Oracle Data Miner (Extension of SQL Developer 4.0)

Online Courses. Version 9 Comprehensive Series. What's New Series

Implementing Data Models and Reports with Microsoft SQL Server 20466C; 5 Days

CHAPTER 4: BUSINESS ANALYTICS

Introduction to Data Mining and Machine Learning Techniques. Iza Moise, Evangelos Pournaras, Dirk Helbing

Management Decision Making. Hadi Hosseini CS 330 David R. Cheriton School of Computer Science University of Waterloo July 14, 2011

SQL Server 2005 Features Comparison

Course Outline. Module 1: Introduction to Data Warehousing

Chapter 14: Databases and Database Management Systems

Republic Polytechnic School of Information and Communications Technology C355 Business Intelligence. Module Curriculum

An Introduction to WEKA. As presented by PACE

Chapter 6 FOUNDATIONS OF BUSINESS INTELLIGENCE: DATABASES AND INFORMATION MANAGEMENT Learning Objectives

Bussiness Intelligence and Data Warehouse. Tomas Bartos CIS 764, Kansas State University

Implementing Data Models and Reports with Microsoft SQL Server 2012 MOC 10778

Data Warehousing Concepts

Microsoft. Course 20463C: Implementing a Data Warehouse with Microsoft SQL Server

Microsoft Implementing Data Models and Reports with Microsoft SQL Server

Module 1: Introduction to Data Warehousing and OLAP

Course 10777A: Implementing a Data Warehouse with Microsoft SQL Server 2012

Data. Data and database. Aniel Nieves-González. Fall 2015

Students who successfully complete the Health Science Informatics major will be able to:

City University of Hong Kong. Information on a Course offered by Department of Information Systems with effect from Semester B in 2013 / 2014

Analytics Canvas Tutorial: Cleaning Website Referral Traffic Data. N m o d a l S o l u t i o n s I n c. A l l R i g h t s R e s e r v e d

Foundations of Business Intelligence: Databases and Information Management

Data Mining, Predictive Analytics with Microsoft Analysis Services and Excel PowerPivot

Implementing a Data Warehouse with Microsoft SQL Server 2012

Beta: Implementing a Data Warehouse with Microsoft SQL Server 2012

Implementing Data Models and Reports with Microsoft SQL Server

Prerequisites. Course Outline

Keywords Big Data; OODBMS; RDBMS; hadoop; EDM; learning analytics, data abundance.

Implementing a Data Warehouse with Microsoft SQL Server 2012

Microsoft Services Exceed your business with Microsoft SharePoint Server 2010

Data Search. Searching and Finding information in Unstructured and Structured Data Sources

CSE 544 Principles of Database Management Systems. Magdalena Balazinska Fall 2007 Lecture 16 - Data Warehousing

Alejandro Vaisman Esteban Zimanyi. Data. Warehouse. Systems. Design and Implementation. ^ Springer

Data Warehousing and Data Mining

Horizontal Aggregations In SQL To Generate Data Sets For Data Mining Analysis In An Optimized Manner

End to End Microsoft BI with SQL 2008 R2 and SharePoint 2010

Welcome to the second half ofour orientation on Spotfire Administration.

Transcription:

Data Mining and Data Warehousing on US Farmer s Data Guide: Dr. Meiliu Lu Presented By, Yogesh Isawe Kalindi Mehta Aditi Kulkarni

* Data Warehousing Project * Introduction * Background * Technologies Explored * Implementation Steps * Demo * Future scope * Data Mining Project * Objective * Algorithm Applied * Demo * Learning Experience * References Agenda

Introduction * The primary objective of our project is to design data mart. * We have used Star schema to generate it. * This data mart answers questions related to US farmers market.

Background * Source : http://catalog.data.gov/dataset/farmers- markets- geographic- data * Dataset: US Farmer s Market Data * Farmer s Market Dataset: Fact table 5 Dimensions, 1907 records

Technologies Explored * Data Preprocessing * Microsoft Excel Spreadsheet * MySQL Server * Data Mart * MySQL Server * CSV to SQL Converter * PHP * Ajax * JQuery * Twitter Bootstrap * OLAP Operations * SQL Server Queries

Implementation Steps * Data Cleaning and Preprocessing * Data Mart * OLAP Operations

Data Cleaning and Preprocessing * Original data had 8000 rows, we trimmed data to 1907 rows. * Add missing values using SQL Script * Season duration is not consistent. To maintain consistency we add two columns for season start and end

SQL Script

Data Mart * Data mart is implemented on star schema base * Data Mart provided following information to user * Market Name, Address, Goods and Nutrition program available, Season details on basis of below attributes * State * City * Goods * Nutrition Program * Season Duration * Location Type

Market Market_ID Market_Name Website Goods Goods_ID Beakgoods Cheese Meat Wine Location Location_ID Location_Type Street State Zip Fact Table Market_ID Location_ID Season_ID Program_ID Goods_ID Program Program_ID WIC WICCash SNAP SFMNP Season Season_ID Season_start Season_end Star Schema

Database Queries * select m.market_id, m.market_name, CONCAT(l.street,l.city,l.state,l.zip) AS Address, s.season_start, s.season_end, l.location_type,p.wic,p.wicash, p.sfmnp,p.snap, g.bakedgoods,g.cheese, g.crafts,g.flowers,g.eggs,g.seafood,g.herbs,g.vegetables,g.honey,g.jams,g.maple,g.meat, g.nursery, g.nuts,g.plants,g.ploutry,g.prepared,g.soap,g.trees,g.wine from Season s,fact_table as f,market_details as m,program as p,location as l,goods as g where s.season_start >'$season_start' and s.season_end < '$season_end' and s.season_id=f.season_id

Fun Quiz * How many dimensions we have used for star schema? A. 6 B. 5

DEMO

Future Scope * Privileged user can insert new records in future * Integrate Google Maps for location and directions * Develop Mobile Application * Apply UI Validations and filtering option on data

DATA MINING PROJECT

Objective * Mining data to extract knowledge from available data. * Explore different data mining tools. * Apply different data mining algorithms to US Farmers Market Data

Algorithms Applied * Tool Used * Weka * Classification Algorithm * Logistic Algorithm * J48 * Clustering Algorithm * K- Means * EM Algorithm

Fun Quiz * Which tool is used for Data Mining? 1. Weka 2. Rapid Miner

DEMO

Classification Algorithms

Histogram of states on goods class

Logistic Algorithm On class SFMNP

J48 Algorithms with class Bake Goods

Decision Tree for class Bakesgoods

Clustering Algorithms

Simple K- Means Algorithm

EM Algorithm applied on Nutrition Program

Learning Experience * Analytical processing * Learned different data mining tools like Weka, rapid Miner * Learned about real time application for different data mining algorithms * Learn about new technologies like PHP, Ajax, JQuery, Twitter Bootstrap

References * Data Source: http://catalog.data.gov/dataset/farmers- markets- geographic- data * Weka Tutorial: http://youtu.be/m7kpibgedki * Rapid Miner Tutorial: https://www.youtube.com/watch? v=eyyghzsvzpm&list=pllyinnlbo1evvz2wjlwrp_jwgg 5It1O6

Questions