Contents

WEKA
  Introduction
  Background information
  Installation
  Where to get WEKA
  Downloading Information
  Opening the program
  Chooser Menu
  Preprocessing
  Classify
  Cluster
  Associate
  Select Attributes
  Visualization

Microsoft SQL Database
  Relational Database
  Procedures to access and use Database
  Loading flat files into relational database
  DTS
  Table Relationships
  Creating Views
  Queries to produce results
  Temp Table example
  Volume Data
Introduction

WEKA, formally the Waikato Environment for Knowledge Analysis, is a computer program developed at the University of Waikato in New Zealand, originally for the purpose of identifying information from raw data gathered from agricultural domains. WEKA supports many standard data mining tasks, such as data preprocessing, classification, clustering, regression, visualization and feature selection. The basic premise is that a computer application can be trained with machine learning techniques to derive useful information, in the form of trends and patterns, from data. WEKA is an open source application that is freely available under the GNU General Public License. Originally written in C, the WEKA application has been completely rewritten in Java and is compatible with almost every computing platform. It is user friendly, with a graphical interface that allows for quick setup and operation. WEKA operates on the assumption that the user data is available as a flat file or relation. This means that each data object is described by a fixed number of attributes, each usually of a specific type, normally alphanumeric or numeric values. The WEKA application gives novice users a tool to identify hidden information in databases and file systems, with simple-to-use options and visual interfaces.

Installation

Information about the program can be found by searching the Web for "WEKA Data Mining" or by going directly to the WEKA site. The site has a large amount of useful information on the program's benefits and background. New users might benefit from reading the user manual for the program. The main WEKA site links to this information, as well as to past experiments that can help new users identify the uses of particular interest to them. When ready to download the software, it is best to select the latest version offered on the site. The application is offered as a self-installing package. Installation is a simple procedure that places the complete program on the end user's machine, ready to use once extracted.
Opening the program

Once the program has been installed on the user's machine, it is opened from the programs or start menu; the exact location depends on the user's operating system. Figure 1 is an example of the initial opening screen on a computer with Windows XP.

Figure 1 Chooser screen

There are four options available on this initial screen:

Simple CLI- gives users without a graphical interface the ability to execute commands from a terminal-style window
Explorer- the graphical interface used to conduct experimentation on raw data
Experimenter- allows users to run different experimental variations on data sets and perform statistical manipulation
Knowledge Flow- essentially the same functionality as Explorer, with drag and drop operation. The advantage of this option is that it supports incremental learning from previous results

While all of these options can be useful for different applications, the rest of this user guide focuses on the Explorer option. After selecting the Explorer option the program starts and provides the user with a separate graphical interface.
Figure 2

Figure 2 shows the opening screen with the available options. At first only the Preprocess tab, in the top left corner, can be selected, because a data set must be presented to the application before it can be manipulated. After the data has been loaded the other tabs become active. There are six tabs:

1. Preprocess- used to choose the data file to be used by the application
2. Classify- used to train and test different learning schemes on the preprocessed data file under experimentation
3. Cluster- used to apply different tools that identify clusters within the data file
4. Associate- used to apply different rules to the data file that identify associations within the data
5. Select attributes- used to apply different rules to reveal changes based on the inclusion or exclusion of selected attributes from the experiment
6. Visualize- used to see what the various manipulations produced on the data set in a 2D format, as scatter plot and bar graph output

Once the initial preprocessing of the data set has been completed, the user can move between the tabs to change the experiment and view the results in real time. Moving from one tab to the next means that when a condition is exposed in one view it can immediately be examined, and visually changed, in another.
Preprocessing

In order to experiment with the application, the data set needs to be presented to WEKA in a format that the program understands. There are three options for presenting data to the program:

Open File- allows the user to select files residing on the local machine or recorded medium
Open URL- provides a mechanism to locate a file or data source at a different location specified by the user
Open Database- allows the user to retrieve files or data from a database source provided by the user

There are restrictions on the type of data that can be accepted into the program. Originally the software was designed to import only ARFF files; newer versions also accept other file types such as CSV, C4.5 and serialized instance formats. The extensions for these files include .csv, .arff, .names, .bsi and .data. Figure 3 shows an example of selection of the file weather.arff.

Figure 3

Once the initial data has been selected and loaded, the user can refine the experimental data. The options in the Preprocess window include optional filters to apply, and the user can select or remove attributes of the data set as necessary to isolate specific information. Picking from the available attributes lets users separate different parts of the data set for clarity in the experimentation, and deselecting attributes from the original data set changes which relationships among the attributes are examined. Many different filters are available within the Preprocess window, and the user can choose among them based on need and on the type of data present.
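Because ARFF is the format WEKA was originally built around, it can help to see what such a file contains. The following is a minimal sketch, not part of WEKA, that converts a small CSV text into ARFF text. The function names `csv_to_arff` and `is_number` and the three sample rows are assumptions made for illustration; quoting of values and missing values are deliberately not handled.

```python
import csv
import io


def is_number(text):
    """Return True if the text parses as a floating point number."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def csv_to_arff(csv_text, relation):
    """Convert CSV text (first row = attribute names) into ARFF text."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, data = rows[0], rows[1:]
    lines = ["@relation " + relation, ""]
    for i, name in enumerate(header):
        column = [row[i] for row in data]
        if all(is_number(value) for value in column):
            lines.append("@attribute %s numeric" % name)
        else:
            # Nominal attribute: list distinct values in order of first appearance.
            values = list(dict.fromkeys(column))
            lines.append("@attribute %s {%s}" % (name, ",".join(values)))
    lines += ["", "@data"]
    lines += [",".join(row) for row in data]
    return "\n".join(lines) + "\n"


# Hypothetical sample in the spirit of the weather data set.
sample = "outlook,temperature,play\nsunny,85,no\novercast,83,yes\nrainy,70,yes\n"
arff = csv_to_arff(sample, "weather")
print(arff)
```

The header section declares the relation name and one `@attribute` line per column, and the `@data` section carries the rows unchanged. That fixed list of typed attributes is the "flat file or relation" assumption described in the Introduction.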
Classify

The user has the option of applying many different algorithms to the data set, each of which should in theory produce a representation of the information that makes observation easier. It is difficult to know in advance which of the options will provide the best output for a given experiment. The best approach is to apply a mixture of the available choices independently and see which yields something close to the desired results. The Classify tab is where the user selects the classifier choices. Figure 4 shows some of the categories.

Figure 4

Again there are several options to be selected inside the Classify tab. The Test options box gives the user the choice of four different test modes to use on the data set:

1. Use training set
2. Supplied test set
3. Cross-validation
4. Percentage split

Any or all of the modes can be applied in turn to produce results that the user can compare. Additionally, inside the Test options box there is a menu of further settings which, depending on the choice, provide output options such as saving the results to file or specifying the random seed value to be applied for the classification. By default the classifiers in WEKA classify based on the last attribute in the data set. For a different attribute to be used, the user must select it in the options menu before testing is performed. Once the results have been calculated they are shown in the text box on the lower right. They can be saved to a file and retrieved later for comparison, or viewed within the window after changes have been made and different results derived.

Cluster

The Cluster tab opens the process that is used to identify commonalities, or clusters of occurrences, within the data set and produce information for the user to analyze. The Cluster window has a few options similar to those described for the Classify tab: use training set, supplied test set and percentage split. The fourth option is classes to clusters evaluation, which measures how well the clusters found match a pre-assigned class within the data. While in cluster mode users have the option of ignoring some of the attributes in the data set. This can be useful if specific attributes are causing the results to be out of range, or for large data sets. Figure 5 shows the Cluster window and some of its options.

Figure 5
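The percentage split and cross-validation test modes named in the Classify and Cluster sections both come down to dividing the instances into training and test portions. The sketch below illustrates the idea only; it is not WEKA's implementation (it does not stratify by class, for example). The function names and the values used in the example (66 percent, 10 folds, seed 1) are assumptions.

```python
import random


def percentage_split(instances, train_percent, seed=1):
    """Shuffle a copy of the instances and split it into (train, test)."""
    shuffled = list(instances)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_percent / 100)
    return shuffled[:cut], shuffled[cut:]


def cross_validation_folds(instances, k, seed=1):
    """Yield k (train, test) pairs; every instance is tested exactly once."""
    shuffled = list(instances)
    random.Random(seed).shuffle(shuffled)
    for fold in range(k):
        test = shuffled[fold::k]  # every k-th instance, starting at this fold
        train = [x for i, x in enumerate(shuffled) if i % k != fold]
        yield train, test


data = list(range(20))  # stand-in for 20 instances
train, test = percentage_split(data, 66)
folds = list(cross_validation_folds(data, 10))
print(len(train), "training /", len(test), "test instances in the split")
print(len(folds), "cross-validation folds")
```

Fixing the seed makes a run repeatable, which is why the random seed is exposed as an option: two schemes can only be compared fairly when they see the same split.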
Associate

The Associate tab opens a window for selecting the options for finding associations within the data set. The user selects one of the choices and presses Start to produce the results. There are few options in this window; they are shown in Figure 6 below.

Figure 6

Select Attributes

The next tab is used to select the specific attributes used for the calculation process. By default all of the available attributes are used in the evaluation of the data set. If the user wanted to exclude certain categories of the data, they would deselect those specific choices from the list in the cluster window. This is useful if some of the attributes are of a different form, such as alphanumeric data, that could alter the results. The software searches through the selected attributes to decide which of them best fit the desired calculation. To perform this, the user has to select two options: an attribute evaluator and a search method. Once this is done the program evaluates the data based on subsets of the attributes, then performs the necessary search for commonality with the data. Figure 7 shows the options for attribute evaluation. Figure 8 shows the options for the search method.

Figure 7
Figure 8
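The pairing of an attribute evaluator with a search method can be illustrated with a small sketch. Here the evaluator is information gain with respect to the class, and the "search" is simply ranking every attribute by its score. This is one possible combination chosen for illustration, not a reproduction of WEKA's code. The rows are the classic nominal weather toy data set, typed in by hand here as an assumption about what weather.arff contains.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())


def information_gain(rows, attr_index, class_index=-1):
    """Reduction in class entropy obtained by splitting on one attribute."""
    labels = [row[class_index] for row in rows]
    remainder = 0.0
    for value in set(row[attr_index] for row in rows):
        subset = [row[class_index] for row in rows if row[attr_index] == value]
        remainder += len(subset) / len(rows) * entropy(subset)
    return entropy(labels) - remainder


names = ["outlook", "temperature", "humidity", "windy"]
rows = [  # outlook, temperature, humidity, windy, play
    ("sunny", "hot", "high", "FALSE", "no"),
    ("sunny", "hot", "high", "TRUE", "no"),
    ("overcast", "hot", "high", "FALSE", "yes"),
    ("rainy", "mild", "high", "FALSE", "yes"),
    ("rainy", "cool", "normal", "FALSE", "yes"),
    ("rainy", "cool", "normal", "TRUE", "no"),
    ("overcast", "cool", "normal", "TRUE", "yes"),
    ("sunny", "mild", "high", "FALSE", "no"),
    ("sunny", "cool", "normal", "FALSE", "yes"),
    ("rainy", "mild", "normal", "FALSE", "yes"),
    ("sunny", "mild", "normal", "TRUE", "yes"),
    ("overcast", "mild", "high", "TRUE", "yes"),
    ("overcast", "hot", "normal", "FALSE", "yes"),
    ("rainy", "mild", "high", "TRUE", "no"),
]

# Evaluator: score each attribute. Search method: rank all of them by score.
ranking = sorted(
    ((information_gain(rows, i), name) for i, name in enumerate(names)),
    reverse=True,
)
for gain, name in ranking:
    print("%-12s %.3f" % (name, gain))
```

An attribute with a gain close to zero tells the learner almost nothing about the class, so it is a natural candidate to deselect before classification.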
Visualization

The last tab in the window is the Visualize tab. By this point calculations and comparisons have been carried out on the data set, and attributes and methods of manipulation have been chosen. The final piece of the puzzle is looking at the information that has been derived throughout the process. The user can now see the fruit of their efforts in a two-dimensional representation of the information. The first screen the user sees on selecting the Visualize tab is a matrix of plots, in which each attribute in the data set is plotted against every other attribute. If necessary, a scroll bar gives access to all of the plots. The user can select a specific plot from the matrix to view its contents for analysis. The grid layout lets the user pick the attribute pairing they want and makes the data easier to understand. Once a specific plot has been selected, the user can change the attributes from one view to another, which provides flexibility. Figure 9 shows the plot matrix view.

Figure 9

The scatter plot matrix gives the user a visual representation of the manipulated data sets for selection and analysis. The attributes are listed across the top and again from top to bottom, giving the user easy access to the area of interest. Clicking on a plot brings up a separate window with the selected scatter plot. The user can then look at a visualization of the selected attributes and select areas of the scatter plot with a selection window, or click on points within the plot to see each point's specific information. Figure 10 shows the scatter plot for two attributes and the points derived from the data set. There are a few options for viewing the plot that could be helpful to the user. It is formatted like an X/Y graph, yet it can show any of the attributes that appear on the main scatter plot matrix.
This is handy when the scale of an attribute cannot be judged on one axis but can on the other. Within the plot the points can be adjusted with a feature called jitter. This option shifts the individual points slightly so that, where data points lie very close together, users can reveal multiple occurrences hidden in the initial plot. Figure 11 shows an example of this point selection and the results the user sees.
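The effect of jitter can be sketched in a few lines. This is an illustration of the idea only: the use of uniform random noise, the function name `jitter` and the sample points are assumptions, and WEKA's own jitter control may work differently.

```python
import random


def jitter(points, amount, seed=0):
    """Return a copy of the points with small uniform noise added to x and y."""
    rng = random.Random(seed)
    return [
        (x + rng.uniform(-amount, amount), y + rng.uniform(-amount, amount))
        for x, y in points
    ]


# Three instances share the same coordinates and would be drawn as one dot.
points = [(1.0, 2.0), (1.0, 2.0), (1.0, 2.0), (3.0, 4.0)]
moved = jitter(points, 0.1)
print(len(set(points)), "distinct positions before jitter")
print(len(set(moved)), "distinct positions after jitter")
```

Jitter changes only where points are drawn, not the underlying data, so it is safe to use for revealing overlapping instances.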
Figure 10

Figure 11

There are a few options to manipulate the view for the identification of subsets or to separate the data points on the plot.

Polyline- can be used to segment different values for additional visual clarity on the plot. This is useful when there are many data points represented on the graph.
Rectangle- this tool is helpful for selecting instances within the graph for copying or clarification.
Polygon- users can connect points to segregate information and isolate points for reference.

This user guide is meant to help users become familiar with some of the features within the Explorer portion of the WEKA data mining software application and is for informational purposes only. It is a summary of the user information found on the program's main web site. For a more comprehensive and in-depth version, users can visit the main site for examples and FAQs about the program.
Microsoft SQL Server User Manual
Relational Database:

Our team decided to load the provided data into a Microsoft SQL Server database. Mr. Washington gave us five text files, four of which are tab delimited files of New York Transit Authority data.

List of Files:
Data field descriptions- a list of 11 column headers for 2006 and 2007 data
2006 incident data- 11 column list of incident data for year 2006
2007 incident data- 11 column list of incident data for year 2007
Station list- two column list of station ID and station descriptions
Trouble list- two column list of trouble ID and trouble descriptions

Later in the project, two more files were provided. These files had rider volume by day for years 2006 and 2007.

List of Files:
Volume2006.xls
Volume 2007.xls

Procedures to access and use Database:

The database server is an internal server, which means it can only be accessed directly from within Pace University grounds. For access off Pace campuses, the team needs to use the university's VPN dialer, which gives us network access as if we were inside the university. Once the dialer is downloaded and installed, we can connect using our Pace University issued user accounts ( address). Once we are virtually connected to the Pace University network, we can access the database server.

There are a number of tools that can be used with Microsoft SQL Server. We narrowed our preferences down to two: Quest Toad for SQL Server or SQL Server 2005 Express Edition. Both software applications are free to use. Once the software is downloaded and installed, we can connect to the database server and to our specific database. Here is the connection information:

Server Name: csis-rose ( )
Authentication: SQL Server Authentication
Database Name: SubwayIncidents
User Name: SubwayIncidents
PW: <ask for password>

Loading flat files into relational database:

Before loading data into our relational database system, we first had to consider some preliminary questions: table structure, data types, and relational integrity. We decided to take the six data files and import each into its own table. Deciding on data types was more complex. Because the two incident data files (2006 and 2007) contained information about stations and trouble codes (from the station list and trouble code list), we wanted to create relationships between those tables.
Here are the table structures:

Incident Table (tblsubwaydata2007 and tblsubwaydata2006)
Train Station Table (tblstationlist)
Trouble List Table (tbltroublelist)
Volume Table (tblvolume2006 and tblvolume2007)

These six were created by executing SQL code. Here is an example of how the Incident table was created:

    CREATE TABLE [SubwayIncidents].[pcronin].[tblSubwayData2006] (
        [AssaultID] varchar (6) NULL,
        [IncidentNo] varchar (6) NULL,
        [IncidentDate] datetime NULL,
        [Duration] varchar (3) NULL,
        [LateTrains] varchar (3) NULL,
        [TerminalCancel] varchar (2) NULL,
        [EnrouteCancel] varchar (2) NULL,
        [StationID] int NULL,
        [TroubleCode] varchar (4) NULL,
        [TrainLine] varchar (2) NULL
    )

DTS:

After the tables were created with the desired data types and structure, we had to import the data from the text files into the relational tables inside SQL Server. We used DTS (Data Transformation Services) in Microsoft SQL Server to move the data from the text files into our database tables. Six packages were created, one for each file to table transfer.
These packages read the input file data and copy the data into the appropriate table columns. Here is a graphical design of one of the DTS packages:

This package starts with the Create Table icon, which runs the code (shown above) to create the table. Connection 1 is the text file with tab delimited data in it. Connection 2 is the destination table in our relational database. The black line between the two represents the SQL code that copies the data.
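For readers without DTS, the same file to table copy can be sketched in a few lines of script. The example below is an assumption-laden stand-in, not the DTS package itself: it uses Python's built-in SQLite engine instead of SQL Server, an in-memory string instead of the real station list file, and two invented station rows. Only the table and column names (tblstationlist, StationID, Station) follow this manual.

```python
import csv
import io
import sqlite3

# Hypothetical two-column, tab delimited station list (values invented for illustration).
station_file = io.StringIO("101\tExample Station A\n102\tExample Station B\n")

conn = sqlite3.connect(":memory:")  # SQLite stands in for the SQL Server database
conn.execute("CREATE TABLE tblstationlist (StationID INTEGER, Station TEXT)")

# Connection 1: read the tab delimited source, converting the ID column to an integer.
reader = csv.reader(station_file, delimiter="\t")
rows = [(int(station_id), name) for station_id, name in reader]

# Connection 2: copy the rows into the destination table.
conn.executemany("INSERT INTO tblstationlist (StationID, Station) VALUES (?, ?)", rows)
conn.commit()

loaded = conn.execute(
    "SELECT StationID, Station FROM tblstationlist ORDER BY StationID"
).fetchall()
print(loaded)
```

Converting each column to its target type while reading, as done for StationID here, corresponds to the data type decisions made when the tables were designed.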
The data copy is graphically represented here:

Table Relationships:

We now have six tables populated with the data that Mr. Washington provided us. The last step is to create relationships between them to protect data integrity. The main incident data are contained in two tables (tblsubwaydata2007 and tblsubwaydata2006), while tblstationlist and tbltroublelist are lookup, or validation, tables. These tables provide full text descriptions for the coded values in the incident tables.
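How a lookup-table relationship protects data integrity can be shown with a short sketch. It again uses Python's built-in SQLite engine as a stand-in for SQL Server, with cut-down versions of the station and incident tables and invented rows; the foreign key syntax shown is SQLite's, and the real tables were related through the SQL Server tools.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled

# Lookup (validation) table: one row per valid station.
conn.execute("CREATE TABLE tblstationlist (StationID INTEGER PRIMARY KEY, Station TEXT)")

# Incident table: StationID must match a row in the lookup table.
conn.execute(
    "CREATE TABLE tblsubwaydata2006 ("
    " IncidentNo TEXT,"
    " StationID INTEGER REFERENCES tblstationlist (StationID))"
)

conn.execute("INSERT INTO tblstationlist VALUES (101, 'Example Station A')")
conn.execute("INSERT INTO tblsubwaydata2006 VALUES ('000001', 101)")  # known station

try:
    conn.execute("INSERT INTO tblsubwaydata2006 VALUES ('000002', 999)")  # unknown station
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

print("incident with unknown station rejected:", rejected)
```

With the relationship in place, an incident row can never point at a station that is missing from the lookup table, so every join to the description tables finds a match.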
Here are the graphical relationships:

The relationships are bound by the tables' Primary Key (PK) and Foreign Key (FK) fields. The relationships:

1. tblstationlist.stationid = tblsubwaydata2006.stationid
2. tbltroublelist.troubleid = tblsubwaydata2006.troublecode

Creating Views:

To simplify comparing data between the two years, we created a view that combines the data. The code below shows how we union the data from the two tables.

    CREATE view vwsubwaydata as
    Select '2006' as [Year], AssaultID, IncidentNo, IncidentDate, Duration, LateTrains,
           TerminalCancel, EnrouteCancel, StationID, TroubleCode, TrainLine
    From tblsubwaydata2006
    Union all
    Select '2007' as [Year], AssaultID, IncidentNo, IncidentDate, Duration, LateTrains,
           TerminalCancel, EnrouteCancel, StationID, TroubleCode, TrainLine
    From tblsubwaydata2007

Most of the queries that we constructed below are based on this SQL Server view.

Instead of creating relationships for the volume data, we created views that join each of the two volume data tables with the incident table for the same year. The volume data contains a date field that represents the day of the year, with the total ridership volume for that day. The views join the volume data with the incident data by the date; both sides are reduced to month/day/year text so that any time of day is ignored. Here is the view creation for the 2006 data:

    Create view vwsubwaydata2006volume AS
    select s.*, v.rider
    from tblsubwaydata2006 s
    Inner Join tblvolume2006 v ON
        cast(datepart(month, IncidentDate) as varchar(2))+'/'+
        cast(datepart(day, IncidentDate) as varchar(2))+'/'+
        cast(datepart(year, IncidentDate) as varchar(4))
        =
        cast(datepart(month, Date) as varchar(2))+'/'+
        cast(datepart(day, Date) as varchar(2))+'/'+
        cast(datepart(year, Date) as varchar(4))

Here is the view creation for the 2007 data:

    Create view vwsubwaydata2007volume AS
    select s.*, v.rider
    from tblsubwaydata2007 s
    Inner Join tblvolume2007 v ON
        cast(datepart(month, IncidentDate) as varchar(2))+'/'+
        cast(datepart(day, IncidentDate) as varchar(2))+'/'+
        cast(datepart(year, IncidentDate) as varchar(4))
        =
        cast(datepart(month, Date) as varchar(2))+'/'+
        cast(datepart(day, Date) as varchar(2))+'/'+
        cast(datepart(year, Date) as varchar(4))

Queries used to produce results:

Using one of the two software tools mentioned above (TOAD or MS SQL Server Express), a user can execute these queries to produce results from the data tables and views created above.

Trouble Code
Lists all trouble codes with occurrence count and average duration of train delays for year 2007 and having more than two occurrences.

    Select [Year], sd.troublecode, TroubleDesc, Count(sd.TroubleCode) as Incidents,
           Avg((CAST(Duration AS int))) as AvgDuration
    From vwsubwaydata sd
    inner join tbltroublelist tl on sd.troublecode = tl.troubleid
    Where [Year] = '2007'
    Group By [Year], sd.troublecode, TroubleDesc
    Having Count(sd.TroubleCode) > 2
    Order By Incidents desc, AvgDuration desc

Train Line
Lists all train lines with occurrence count and average duration of train delays for the trouble code of Person Holding Doors and year 2007.

    Select TrainLine, Count(TrainLine) as Incidents,
           Avg((CAST(Duration AS int))) as AvgDuration
    From vwsubwaydata
    Where [Year] = '2007' and TroubleCode = '0741'
    Group by TrainLine
    Order by Incidents desc

Stations
Lists all stations with incident occurrence count for trouble code of Employee Assaulted By Cust.
and year 2007 and having more than two occurrences.

    Select Station, Count(sd.StationID) as Incidents
    From vwsubwaydata sd
    inner join tblstationlist sl on sd.stationid = sl.stationid
    Where [Year] = '2007' and TroubleCode = '4011'
    Group by Station
    Having Count(sd.StationID) > 2
    Order by Incidents desc

Person Holding Doors incidents that lead to an assault on an employee
This is a cursor. It queries all assaults and loops over the record set one row at a time to find associated Person Holding Doors incidents. The cursor first selects all assaults (trouble code 4011) for year 2007. It then loops over this record set to find incidents with the associated trouble code 0741, Person Holding Doors.

    -- Remove the temporary table if it is left over from an earlier run
    IF OBJECT_ID('tempdb..#TempIncidents') IS NOT NULL
        Drop table #TempIncidents

    Create Table #TempIncidents (AssaultID Varchar(6), Duration Varchar(3), StationID int, TrainLine Varchar(2))

    -- Holds the AssaultID of the current cursor row (variable name assumed)
    DECLARE @AssaultID varchar(6)

    DECLARE Assault_cursor CURSOR FOR
        Select AssaultID
        From vwsubwaydata sd
        Where sd.year = '2007' and TroubleCode = '4011'

    OPEN Assault_cursor
    FETCH NEXT FROM Assault_cursor INTO @AssaultID

    WHILE @@FETCH_STATUS = 0
    BEGIN
        -- Copy the door holding incidents that share this assault's ID
        Insert Into #TempIncidents
        Select AssaultID, Duration, StationID, TrainLine
        From vwsubwaydata sd
        Where TroubleCode = '0741' and AssaultID = @AssaultID

        -- Advance to the next assault; the loop ends when no rows remain
        FETCH NEXT FROM Assault_cursor INTO @AssaultID
    END

    CLOSE Assault_cursor
    DEALLOCATE Assault_cursor

    Select * From #TempIncidents

Queries based on the temporary table #TempIncidents:

Station
Lists the station and the train line with the number of occurrences and average duration of the delay during a door holding incident, for combinations having more than two occurrences.

    Select Station, TrainLine, Count(sd.StationID) as Incidents,
           Avg((CAST(Duration AS int))) as AvgDuration
    From #TempIncidents sd
    inner join tblstationlist sl on sd.stationid = sl.stationid
    Group by Station, TrainLine
    Having Count(sd.StationID) > 2
    Order by Incidents desc

Train Line
Lists the train line with the number of occurrences and average duration of the delay during a door holding incident.
    Select TrainLine, Count(TrainLine) as Incidents,
           Avg((CAST(Duration AS int))) as AvgDuration
    From #TempIncidents
    Group by TrainLine
    Order by Incidents desc

Queries based on Volume data

The following queries are based on the volume data, using the volume table and views created earlier.

Average number of riders by month

    select datepart(month, Date) as month, avg(rider)
    from tblvolume2007
    group by datepart(month, Date)
    order by avg(rider) desc

Total number of assaults by month

    select datepart(month, IncidentDate) as month, count(incidentno) as Assault
    from vwsubwaydata2007volume
    where TroubleCode = '4011'
    group by datepart(month, IncidentDate)
    order by count(incidentno) desc

Total number of "DELAYED BY TRACK/WORK GANGS" incidents by month

    select datepart(month, IncidentDate) as month, count(incidentno) as Incidents
    from vwsubwaydata2007volume
    where TroubleCode = '8204'
    group by datepart(month, IncidentDate)
    order by count(incidentno) desc

Total number of non assault incidents by month

    select datepart(month, IncidentDate) as month, count(incidentno) as Incidents
    from vwsubwaydata2007volume
    where TroubleCode <> '4011'
    group by datepart(month, IncidentDate)
    order by count(incidentno) desc
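The pattern used throughout these queries, a UNION ALL view over the yearly tables followed by a count grouped by month, can be tried without access to the project server. The sketch below is a stand-in under stated assumptions: it uses Python's built-in SQLite engine rather than SQL Server, so strftime replaces datepart, the tables are cut down to three columns, and all incident rows are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # SQLite stands in for SQL Server
for year in ("2006", "2007"):
    conn.execute(
        "CREATE TABLE tblsubwaydata%s (IncidentNo TEXT, IncidentDate TEXT, TroubleCode TEXT)"
        % year
    )

# Invented rows; dates are ISO strings so that SQLite's strftime can read them.
conn.executemany(
    "INSERT INTO tblsubwaydata2006 VALUES (?, ?, ?)",
    [("000001", "2006-01-15", "4011")],
)
conn.executemany(
    "INSERT INTO tblsubwaydata2007 VALUES (?, ?, ?)",
    [
        ("000002", "2007-01-03", "4011"),
        ("000003", "2007-01-20", "4011"),
        ("000004", "2007-02-11", "0741"),
        ("000005", "2007-03-05", "4011"),
    ],
)

# Same idea as vwsubwaydata: tag each row with its year and stack the two tables.
conn.execute(
    "CREATE VIEW vwsubwaydata AS "
    "SELECT '2006' AS Year, IncidentNo, IncidentDate, TroubleCode FROM tblsubwaydata2006 "
    "UNION ALL "
    "SELECT '2007' AS Year, IncidentNo, IncidentDate, TroubleCode FROM tblsubwaydata2007"
)

# Assaults (trouble code 4011) per month for 2007, busiest month first.
monthly = conn.execute(
    "SELECT CAST(strftime('%m', IncidentDate) AS INTEGER) AS Month, "
    "       COUNT(IncidentNo) AS Assaults "
    "FROM vwsubwaydata "
    "WHERE Year = '2007' AND TroubleCode = '4011' "
    "GROUP BY Month ORDER BY Assaults DESC, Month"
).fetchall()
print(monthly)
```

Because the year is carried as a column in the view, switching the comparison to 2006 only requires changing the filter value, not the table name.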
SAS BI Dashboard 3.1 User s Guide The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2007. SAS BI Dashboard 3.1: User s Guide. Cary, NC: SAS Institute Inc. SAS BI Dashboard
ical Manual Table of Contents I. Using ical... pg. 1 Calendar views and formats..pg. 1 Navigating the calendar.pg. 3 Searching the calendar..pg. 4 II. Administering the Calendar. pg. 5 Display options.pg.
Plugin for Microsoft Dynamics CRM 2013-2015 For On Premise and Online Deployments User Guide v. 2.3 April 2015 Contents 1. Introduction... 3 1.1. What s new in 2.3?... 3 2. Installation and configuration...
Network Connect Installation and Usage Guide I. Installing the Network Connect Client..2 II. Launching Network Connect from the Desktop.. 9 III. Launching Network Connect Pre-Windows Login 11 IV. Installing
- Solvent extraction Database - - manual - SXD user manual Three modes of functioning are available: Adminisitrator mode: database management and creation/deletion of users accounts. Operator mode: supply
Intellect Platform - The Workflow Engine Basic HelpDesk Troubleticket System - A102 Interneer, Inc. Updated on 2/22/2012 Created by Erika Keresztyen Fahey 2 Workflow - A102 - Basic HelpDesk Ticketing System
Introduction to Microsoft Access 2003 Zhi Liu School of Information Fall/2006 Introduction and Objectives Microsoft Access 2003 is a powerful, yet easy to learn, relational database application for Microsoft
For Windows Updated: August 2012 Table of Contents Section 1: Overview... 3 1.1 Introduction to SPSS Tutorials... 3 1.2 Introduction to SPSS... 3 1.3 Overview of SPSS for Windows... 3 Section 2: Entering
Data Mining SPSS 12.0 1. Overview Spring 2010 Instructor: Dr. Masoud Yaghini Introduction Types of Models Interface Projects References Outline Introduction Introduction Three of the common data mining
Logi Ad Hoc Reporting System Administration Guide Version 11.2 Last Updated: March 2014 Page 2 Table of Contents INTRODUCTION... 4 Target Audience... 4 Application Architecture... 5 Document Overview...
An Oracle White Paper September 2013 Oracle Data Miner (Extension of SQL Developer 4.0) Integrate Oracle R Enterprise Mining Algorithms into a workflow using the SQL Query node Denny Wong Oracle Data Mining
Using the Query Analyzer Using the Query Analyzer Objectives Explore the Query Analyzer user interface. Learn how to use the menu items and toolbars to work with SQL Server data and objects. Use object
Ovation Operator Workstation for Microsoft Windows Operating System Features Delivers full multi-tasking operation Accesses up to 200,000 dynamic points Secure standard operating desktop environment Intuitive
mylittleadmin for MS SQL Server Quick Start Guide version 3.5 1/25 CONTENT 1 OVERVIEW... 3 2 WHAT YOU WILL LEARN... 3 3 INSTALLATION AND CONFIGURATION... 3 4 BASIC NAVIGATION... 4 4.1. Connection 4 4.2.
SAS 9.4 PC Files Server Installation and Configuration Guide SAS Documentation The correct bibliographic citation for this manual is as follows: SAS Institute Inc. 2014. SAS 9.4 PC Files Server: Installation
INTEGRATING MICROSOFT DYNAMICS CRM WITH SIMEGO DS3 Often the most compelling way to introduce yourself to a software product is to try deliver value as soon as possible. Simego DS3 is designed to get you
Ofgem Carbon Savings Community Obligation (CSCO) Eligibility System User Guide 2015 Page 1 Table of Contents Carbon Savings Community Obligation... 3 Carbon Savings Community Obligation (CSCO) System...
Technical Paper Build Your First Web-based Report Using the SAS 9.2 Business Intelligence Clients A practical introduction to SAS Information Map Studio and SAS Web Report Studio for new and experienced
Running Descriptive and Correlational Analysis in Excel 2013 Tips for coding a survey Use short phrases for your data table headers to keep your worksheet neat, you can always edit the labels in tables
User Guide EPiServer 7 Mail Revision A, 2012 Table of Contents 3 Table of Contents Table of Contents 3 Introduction 5 About This Documentation 5 Accessing EPiServer Help System 5 Online Community on EPiServer
USER GUIDE: MaaS360 Services 05.2010 Copyright 2010 Fiberlink Corporation. All rights reserved. Information in this document is subject to change without notice. The software described in this document
Nintex Workflow for Project Server 2010 Help Last updated: Friday, March 16, 2012 1 Using Nintex Workflow for Project Server 2010 1.1 Administration and Management 1.2 Associating a Project Server Workflow
USING MS OUTLOOK In this tutorial you will learn how to use Microsoft Outlook with your EmailHosting.com account. You will learn how to setup an IMAP account, and also how to move your emails and contacts
Lesson 07: MS ACCESS - Handout Handout Introduction to database (30 mins) Microsoft Access is a database application. A database is a collection of related information put together in database objects.
Creating a universe on Hive with Hortonworks HDP 2.0 Learn how to create an SAP BusinessObjects Universe on top of Apache Hive 2 using the Hortonworks HDP 2.0 distribution Author(s): Company: Ajay Singh
USING MS OUTLOOK WITH FUSEMAIL In this tutorial you will learn how to use Microsoft Outlook with your FuseMail account. You will learn how to setup an IMAP account, and also how to move your emails and