Factors affecting the use of data mining in Mozambique: Constan'no Sotomane 12 November 2013
|
|
- Victoria Turner
- 8 years ago
- Views:
Transcription
1 Department of Computer and Systems Sciences Factors affecting the use of data mining in Mozambique: Toward a framework to facilitate the use of data mining Constan'no Sotomane 12 November 2013
2 Content Problem Area Research Objectives Research Questions Research strategy and methods Results Conclusions and Future Directions
3 Problem Area High capacity to collect and store data à Rapid increase of the amount of data à huge amount of data; Several data rich project are being established in Mozambique: Government Data Centre, the State Financial Management System (SISTAFE) and the integrated platform of service to the citizen; Data holds important information that can be used to support a better decision making and improve the competitive advantage in organizations;
4 Problem Area (2) Data mining can be used to extracts patterns from these huge amount of data; The implementation of data mining in organization can be affected by several factors dependents to the environment; The implementation of data mining as (an emergent technology) can be expensive and risky; There is the need to determine the factors that influence the application of data mining to avoid failure of the project and lose;
5 Research Objectives To determine the level of awareness and adoption of data mining in Mozambique. To determine factors affecting the use of data mining in Mozambique. The factors will be used to develop a framework to facilitate the implementation of data mining in Mozambique.
6 Research Questions Question 1: What is the level of awareness and adoption of data mining in Mozambique? Question 2: What are the factors affecting the use of data mining in Mozambique? Question 3: How to facilitate the application and use of data mining in Mozambique?
7 Research strategy and methodos (1) Interpretative Exploratory multiple case study Organization 1 à participatory observation Organization 2 à participatory observation Survey Maputo City à Interview and questionnaire Application of data mining using the dataset of Institution 1 STUDY 1 & STUDY4 Exploratory visit to Institutions to determine area for data mining application Selecting two institutions for data mining application Exploratory study on the factorms affecting the use of data mining in Mozabique Participarory- observation Application of data mining using the dataset of institution 2 STUDY 2 Survey in Maputo - use of data mining in Mozambique and related factories STUDY 3 Interview & questionaries Factors Affecting the Use of Data Mining in Mozambican Toward the framework
8 Research strategy and methods (2) Research Ques'ons Q1, Q2 Case Study 1 Organiz. 1 Q1, Q2 Observa'on Study 1,4 Awareness, Factors Case Study 2 Organiz. 2 Q1, Q2 Observa'on Study 2 Awareness, Factors Survey Maputo City Q1, Q2 Interviews Study 3 Awareness, Factors Ques'onnaires
9 Study 1: ICT for Automatic forecasting of Electrical Power Consumption: A case study in Maputo Objective 1: Develop short term forecast model for electricity consumption to complement the manual forecast of electrical consumption currently done at EDM; Objective 2: Observe the use, adoption and potential factors the implementation of the data mining at EDM.
10 The data set Electrical consumption of Maputo city: January 2003 December 2009; Weather data were not available we downloaded from internet (weather underground); Test set: January December 2008 Training set: January 2009 December 2009 Missing values, errors and outliers corrected by replacing with averages; Information about price and procedure were not possible to get
11 Characterization of the Electrical Consumption Using Visualization (1) Long term trend and seasonal variation of electrical load Variation of the electrical load per day and during the 5 years
12 Characterization of the Electrical Consumption Using Visualization (2) The electrical load is strongly correlated with the temperature and follow the season pattern The correlation with humidity is not strong as with temperature The long term trend of electrical consumption has similar pattern with GDP per capita during the period
13 Characterization of the Electrical Consumption Using Visualization (3) The clustering resulted on 3 classes: Class 0 Saturdays, Sundays and Holidays Class 1 Summer Work Days Class 2 Winter Work Days A linear regression Model was used to predict each class of the days L d + 1 ci = b 1 * L d ci + b 2 * T d + 1
14 Some results of the Prediction of Electrical Consumption Real Load Predicted Load Real Load Predicted Load M W H M W H Hours Work days - Summer Hours Work day - Winters Real Load Predicted Load Real Load Predicted Load M W H M W H Hours Saturdays Hours Sundays
15 Nr Observed Aspect/ Factor Finding Coments 1 Awarwness of data mining weak 2 Use of data mining None Forecast of electricity consumption 3 Data issues 3.1 Quality of data Acceptable 3.2 Existence of data warehouse No data werehouse 3.3 data integration Not integrated 3.4 Age of data 9 years available in timeline Available partially, in different granularity, with missing and anomalous values measurement of consumption are keep in different files available in different files, in different formats and from different formats 3.5 Sufficiency of data Acceptable Data of operation, weather not available as required 4 Privacy issues 4.1 Privacy high concern on privacy Avoid to provide certain data 5 Skills and Human resources 5.1 Existence of teamwork Not in data mining 5.2 Existence of skill Not in data mining 5.3 Understanding of data mining and techniques No 5.4 Awareness of data mining No 6 Stakeholder support issues 6.1 Existence of business champion Not in data mining 6.2 Commitment of top management Interested to see the result 6.3 Existence of change management Not observed 6.4 Organizational acceptance the Data mining is not used and is not known 6.5 Use of data mining results The data mining is not used and the awareness among the observed people is weak. It make dificult to observe this aspects 7 Organizational issues 7.1 Functional integration No observed 7.2 Alignment of IT and business Not Observed 7.3 Strategy of outsourcing Not observed 7.4 Interdisciplinary learning Not Observed The data mining is not used and the awareness among the observed people is weak. It make dificult to observe this aspects 8 Financial issues 8.1 costs Not Observed 9 Technology issues 9.1 Existence of adequate software and tools Yes 9.2 Existence of database yes 9.3 Existence of data warehouse No Not specific for data mining It is not relational database, but all the measurement are kept in files
16 Study 2: Extracting Pattern from Socioeconomic Database to Characterize Small Farmers with High and Low Corn Yield in Mozambique: A Data Mining Approach Objective 1: Use of data mining techniques to extract from the socioeconomic database the characteristics of small farmers with high and low maize yields in rural areas of Mozambique; Objective 2: Observe the use, adoption and potential factors the implementation of the data mining at MINAG;
17 The data set We used data of socio economic survey of Ministry of Agriculture of years 2007 and 2008; After cleaning the data were composed by 8821 instances and 109 attributes including attribute class (Corn Yield); Sparse, unbalanced, error, outliers and missing data; Outliers were not treated; Missing data were replaced by? Or new value; Attribute class was split in High Yield (Upper quartile -75%) and Low Yield (low quartile) Hotspot of Weka workbench was used;
18 Results: Tree of association rules for the class of small farmers with High Maize Yield
19 Results: Tree of association rules for the class of small formers with Low Maize Yield
20 Characterization of small formers with Low and High Yield of maize Nr Observed Aspect/ Factor 1 Awareness of data mining 2 Use of data mining 3 Data i ssues 3.1 Quality of data Finding weak None Comments There use of other statistical technique to analyses the dataset Limited quality there are some problem of accuracy, completeness and validity of the data as result of typing or wrong provision of i nformation 3.2 Existence of data warehouse No data warehouse 3.3 data i ntegration Not i ntegrated database Age of data 6 years Sufficiency of data enough data available Privacy i ssues Privacy high concern on privacy Skills and Human resources Existence of teamwork Not i n data mining Existence of skill Not i n data mining Understanding of data mining No and techniques Awareness of data mining No Stakeholder support i ssues Existence of business Not i n data mining champion Commitment of top Not observed management Existence of change Not observed management Organizational acceptance the Data mining i s not used and i s not known Use of data mining results Organizational i ssues Functional i ntegration No observed Alignment of IT and business Strategy of outsourcing Interdisciplinary l earning Financial i ssues costs Technology i ssues Existence of adequate 9.1 software and tools 9.2 Existence of database 9.3 Existence of data warehouse Not Observed Not observed Not Observed Measurement are kept i n a database i n SPSS some data are available from different institutions i n different formats Annual measurements Avoid to provide certain data There are considerable skills on statistics and other data analysis techniques. Limited skill i n data mining The data mining i s not used and the awareness among the observed people i s weak. It make difficult to observe this aspects Not Observed Not observed yes No Aggregated data were available from web and downloadable
21 Study 3: Factors Affecting the Use of Data Mining in Mozambique Objective 1: Determine the level of awareness and adoption of data mining in Mozambique; Objective 2: Determine factors affecting the adoption or use of data mining in Mozambique; Knowing the factors helps to determine when and under which condi'ons it should be implemented
22 Research approach, methodology Mix of qualitative and quantitative research methods Qualitative data : Focus group discussion involving twelve data analysts, ICT experts and managers; Open ended questions from the questionnaire Quantitative data: Closed questions to data annalists, ICT experts and managers; 64 valid from 110 total of respondents from Maputo city The selection of the respondents were the combination of purposive and snowball sampling
23 Result : Factors influencing the use and adoption of data mining Stakeholder support issues Existence of business champion Commitment of top management Existence of change management Non organizational acceptance Not use of data mining User request Organizational issues Functional integration Alignment of IT and business Strategy of outsourcing Interdisciplinary learning Financial issues costs Data issues Quality of data Availability of data warehouse Data integration Age of data (timeline) Insufficient data Skills and human resource issues Existence of teamwork Existence of skill Understanding of data mining and techniques Awareness of data mining Technology issues Existence of adequate software and tools Existence of database Existence of data warehouse
24 Result: Quality of data For the focus group the quality of data in Mozambique is low; The reasons for bad quality of data are: Inadequate data collection process; Historical situation of the country ; Limitation of technology for data collection and storage; Social aspects; Limited human capacity and Non use of data. For the participants of survey the quality of data is not low; The controversy about the perception of the level of quality of data indicates the difference of knowledge about the data: some use data other do not use;
25 Study 4: Short-term Load Forecasting of Electricity Consumption in Maputo Objective 1: After 1 year, verify the progress in term of data quality, availability and integration; Improve the short term load forecast of electrical consumption and apply it to the estimation of DAM of SAPP; Objective 2: verify the progress of the condition for the application of the data mining;
26 The data set Electrical consumption of Maputo city: January 2003 October 2012; Maximum temperature, minimum temperature of Maputo City: January 2003 October 12; Test set: January December 2011 Training set: January 2012 October 2012 Missing values, errors and outliers corrected by replacing with averages; Procedures and prices from the EDM, SAPP, ESKOM bokles
27 Method for forecasting electricity for DAM Df ci t+1 = b 1ci D ci t + b 2ci T M t+1 + b 3ci T m t+1 Working Day Summer Working day Winter Saturday Sunday Holiday Model c1 Model c2 Model c3 Model c4 Model c4 Df c1 t+1 Df c4 t+1 Forested of Electricity consump'on + - D f P k DAM Electricity DAM=round( P k D f ) 0.90 Target data for DM Available Electricity DAM=0 if ( P k D f ) 0 Clustering Expert knowledge (Provided by domain expert)
28 Comparison of the results obtained by model EDM and Model RR. The model RR loss less electricity than the model EDM (1) MAE= 1/N i=1 N Pk (Dr+DAM) M o d e l Model RR EDM MAE
29 Comparison of the results obtained by model EDM and Model RR (2)
30 Comparison of the results obtained by model EDM and Model RR (3)
31 Forecas1ng Electricity for DAM Nr Observed Aspect/ Factor 1 Awareness of data mining 2 Use of data mining 3 Data i ssues Finding weak None Coments 3.1 Quality of data Acceptable Available partially, with missing and anomalous values, quality i s i mproving 3.2 Existence of data warehouse No data warehouse measurement of consumption are keep i n different files 3.3 data i ntegration Not i ntegrated 3.4 Age of data 9 years 3.5 Sufficiency of data Acceptable 4 Privacy i ssues 4.1 Privacy high concern on privacy 5 Skills and Human resources 5.1 Existence of teamwork Not i n data mining 5.2 Existence of skill Not i n data mining Understanding of data mining and 5.3 No techniques 5.4 Awareness of data mining No 6 Stakeholder support i ssues 6.1 Existence of business champion Not i n data mining available i n different files, i n different formats and from different formats available i n timeline Data becoming more available Avoid to provide certain data 6.2 Commitment of top management Interested to see the result The data mining i s not used and the awareness among the observed people i s weak. It make difficult to the Data mining i s not used and i s not observe this aspects known 6.3 Existence of change management Not observed 6.4 Organizational acceptance 6.5 Use of data mining results 7 Organizational i ssues 7.1 Functional i ntegration 7.2 Alignment of IT and business 7.3 Strategy of outsourcing 7.4 Interdisciplinary l earning 8 Financial i ssues 8.1 costs 9 Technology i ssues Existence of adequate software 9.1 and tools No observed Not Observed Not observed Not Observed The data mining i s not used and the awareness among the observed people i s weak. It make difficult to observe this aspects Not Observed Yes Not specific for data mining 9.2 Existence of database yes It i s not relational database, but all the measurement are kept i n files 9.3 Existence of data warehouse No
32 Conclusion & Future direction Low awareness of DM; The application of data mining is at very beginning stage ( Pre-Natal); The identified factors are related to : Stakeholders, organization, quality and integration of data, skills, cost and technology; Develop a capability maturity model to provide guidance to organizations: Benchmarking; Provide means to gauge where there are and were they need to go in term of application of data mining; Provide guideline to move from one stage to a more mature stage.
33 Thank you
Constantino Sotomane DSV Report Series No. 14-012
FACTORS AFFECTING THE USE OF DATA MINING IN MOZAMBIQUE: Towards a framework to facilitate the use of data mining Constantino Sotomane DSV Report Series No. 14-012 Factors affecting the use of data mining
More informationvisualization tool for interpretation of empirical results from climate model: Does it add value to UbuntuNet Connect Kigali, Rwanda 2013
Development of a climate data visualization tool for interpretation of empirical results from climate model: Does it add value to different stakeholders? UbuntuNet Connect Kigali, Rwanda 2013 Co-Authors
More informationWhat is Data Mining? Data Mining (Knowledge discovery in database) Data mining: Basic steps. Mining tasks. Classification: YES, NO
What is Data Mining? Data Mining (Knowledge discovery in database) Data Mining: "The non trivial extraction of implicit, previously unknown, and potentially useful information from data" William J Frawley,
More informationnot possible or was possible at a high cost for collecting the data.
Data Mining and Knowledge Discovery Generating knowledge from data Knowledge Discovery Data Mining White Paper Organizations collect a vast amount of data in the process of carrying out their day-to-day
More informationA Case of Study on Hadoop Benchmark Behavior Modeling Using ALOJA-ML
www.bsc.es A Case of Study on Hadoop Benchmark Behavior Modeling Using ALOJA-ML Josep Ll. Berral, Nicolas Poggi, David Carrera Workshop on Big Data Benchmarks Toronto, Canada 2015 1 Context ALOJA: framework
More informationChapter 10. Key Ideas Correlation, Correlation Coefficient (r),
Chapter 0 Key Ideas Correlation, Correlation Coefficient (r), Section 0-: Overview We have already explored the basics of describing single variable data sets. However, when two quantitative variables
More informationPRACTICAL DATA MINING IN A LARGE UTILITY COMPANY
QÜESTIIÓ, vol. 25, 3, p. 509-520, 2001 PRACTICAL DATA MINING IN A LARGE UTILITY COMPANY GEORGES HÉBRAIL We present in this paper the main applications of data mining techniques at Electricité de France,
More informationPROGRAM DIRECTOR: Arthur O Connor Email Contact: URL : THE PROGRAM Careers in Data Analytics Admissions Criteria CURRICULUM Program Requirements
Data Analytics (MS) PROGRAM DIRECTOR: Arthur O Connor CUNY School of Professional Studies 101 West 31 st Street, 7 th Floor New York, NY 10001 Email Contact: Arthur O Connor, arthur.oconnor@cuny.edu URL:
More informationCOPYRIGHTED MATERIAL. Contents. List of Figures. Acknowledgments
Contents List of Figures Foreword Preface xxv xxiii xv Acknowledgments xxix Chapter 1 Fraud: Detection, Prevention, and Analytics! 1 Introduction 2 Fraud! 2 Fraud Detection and Prevention 10 Big Data for
More informationSupporting Effective Evaluations: A Guide to Developing Performance Measurement Strategies
Home > Centre of Excellence for Evaluation Supporting Effective Evaluations: A Guide to Developing Performance Measurement Strategies 6.0 Performance Measurement Strategy Framework 6.1 Overview of the
More informationGerard Mc Nulty Systems Optimisation Ltd gmcnulty@iol.ie/0876697867 BA.,B.A.I.,C.Eng.,F.I.E.I
Gerard Mc Nulty Systems Optimisation Ltd gmcnulty@iol.ie/0876697867 BA.,B.A.I.,C.Eng.,F.I.E.I Data is Important because it: Helps in Corporate Aims Basis of Business Decisions Engineering Decisions Energy
More informationBest Practices in Data Mining. Executive Summary
Executive Summary Prepared by: Database & Marketing Technology Council Authors: Richard Boire, Paul Tyndall, Greg Carriere, Rob Champion Released: August 2003 Executive Summary Canadian marketers have
More informationResource Advisor OVERVIEW
Resource Advisor OVERVIEW Resource Advisor Features Customize Site Dashboards Localized Preferences Meter Integration Data Entry Flexibility Manage Project Management Goal Setting & Forecasting Scenario
More informationShort-Term Forecasting in Retail Energy Markets
Itron White Paper Energy Forecasting Short-Term Forecasting in Retail Energy Markets Frank A. Monforte, Ph.D Director, Itron Forecasting 2006, Itron Inc. All rights reserved. 1 Introduction 4 Forecasting
More informationIBM Cognos Statistics
Cognos 10 Workshop Q2 2011 IBM Cognos Statistics Business Analytics software Cognos 10 Spectrum of Business Analytics Users Personas Capabilities IT Administrators Capabilities : Administration, Framework
More informationForecasting the first step in planning. Estimating the future demand for products and services and the necessary resources to produce these outputs
PRODUCTION PLANNING AND CONTROL CHAPTER 2: FORECASTING Forecasting the first step in planning. Estimating the future demand for products and services and the necessary resources to produce these outputs
More informationOUTLIER ANALYSIS. Data Mining 1
OUTLIER ANALYSIS Data Mining 1 What Are Outliers? Outlier: A data object that deviates significantly from the normal objects as if it were generated by a different mechanism Ex.: Unusual credit card purchase,
More informationReforming the Malawi Agricultural Development and Marketing Corporation (ADMARC): Poverty and Social Impact Analysis (PSIA)
Reforming the Malawi Agricultural Development and Marketing Corporation (ADMARC): Poverty and Social Impact Analysis (PSIA) Antonio Nucifora (AFTP1) PRSP Training Course, April 25, 2005 ADMARC (Agricultural
More informationEnhancing Sales and Operations Planning with Forecasting Analytics and Business Intelligence WHITE PAPER
Enhancing Sales and Operations Planning with Forecasting Analytics and Business Intelligence WHITE PAPER Table of Contents Introduction... 1 Analytics... 1 Forecast cycle efficiencies... 3 Business intelligence...
More informationIndex Contents Page No. Introduction . Data Mining & Knowledge Discovery
Index Contents Page No. 1. Introduction 1 1.1 Related Research 2 1.2 Objective of Research Work 3 1.3 Why Data Mining is Important 3 1.4 Research Methodology 4 1.5 Research Hypothesis 4 1.6 Scope 5 2.
More informationAdditional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm
Mgt 540 Research Methods Data Analysis 1 Additional sources Compilation of sources: http://lrs.ed.uiuc.edu/tseportal/datacollectionmethodologies/jin-tselink/tselink.htm http://web.utk.edu/~dap/random/order/start.htm
More informationBuilding Energy Management: Using Data as a Tool
Building Energy Management: Using Data as a Tool Issue Brief Melissa Donnelly Program Analyst, Institute for Building Efficiency, Johnson Controls October 2012 1 http://www.energystar. gov/index.cfm?c=comm_
More informationData Mining for Model Creation. Presentation by Paul Below, EDS 2500 NE Plunkett Lane Poulsbo, WA USA 98370 paul.below@eds.
Sept 03-23-05 22 2005 Data Mining for Model Creation Presentation by Paul Below, EDS 2500 NE Plunkett Lane Poulsbo, WA USA 98370 paul.below@eds.com page 1 Agenda Data Mining and Estimating Model Creation
More information2015 MATLAB Conference Perth 21 st May 2015 Nicholas Brown. Deploying Electricity Load Forecasts on MATLAB Production Server.
2015 MATLAB Conference Perth 21 st May 2015 Nicholas Brown Deploying Electricity Load Forecasts on MATLAB Production Server. Executive Summary This presentation will show how Alinta Energy used the MATLAB
More informationCRISP - DM. Data Mining Process. Process Standardization. Why Should There be a Standard Process? Cross-Industry Standard Process for Data Mining
Mining Process CRISP - DM Cross-Industry Standard Process for Mining (CRISP-DM) European Community funded effort to develop framework for data mining tasks Goals: Cross-Industry Standard Process for Mining
More informationModeling the Relation Between Driving Behavior and Fuel Consumption
WHITE PAPER Modeling the Relation Between Driving Behavior and Fuel Consumption Many companies are investing in coaching services for professional drivers with the goal of teaching them how to reduce fuel
More informationWhy big data? Lessons from a Decade+ Experiment in Big Data
Why big data? Lessons from a Decade+ Experiment in Big Data David Belanger PhD Senior Research Fellow Stevens Institute of Technology dbelange@stevens.edu 1 What Does Big Look Like? 7 Image Source Page:
More informationKnowledgeSTUDIO HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES
HIGH-PERFORMANCE PREDICTIVE ANALYTICS USING ADVANCED MODELING TECHNIQUES Translating data into business value requires the right data mining and modeling techniques which uncover important patterns within
More informationThe Prophecy-Prototype of Prediction modeling tool
The Prophecy-Prototype of Prediction modeling tool Ms. Ashwini Dalvi 1, Ms. Dhvni K.Shah 2, Ms. Rujul B.Desai 3, Ms. Shraddha M.Vora 4, Mr. Vaibhav G.Tailor 5 Department of Information Technology, Mumbai
More informationEnhancing Education Quality Assurance Using Data Mining. Case Study: Arab International University Systems.
Enhancing Education Quality Assurance Using Data Mining Case Study: Arab International University Systems. Faek Diko Arab International University Damascus, Syria f-diko@aeu.ac.sy Zaidoun Alzoabi Arab
More informationWhitepaper. Power of Predictive Analytics. Published on: March 2010 Author: Sumant Sahoo
Published on: March 2010 Author: Sumant Sahoo 2009 Hexaware Technologies. All rights reserved. Table of Contents 1. Introduction 2. Problem Statement / Concerns 3. Solutions / Approaches to address the
More informationtesto dello schema Secondo livello Terzo livello Quarto livello Quinto livello
Extracting Knowledge from Biomedical Data through Logic Learning Machines and Rulex Marco Muselli Institute of Electronics, Computer and Telecommunication Engineering National Research Council of Italy,
More informationEnhancing Sales and Operations Planning with Forecasting Analytics and Business Intelligence WHITE PAPER
Enhancing Sales and Operations Planning with Forecasting Analytics and Business Intelligence WHITE PAPER SAS White Paper Table of Contents Introduction.... 1 Analytics.... 1 Forecast Cycle Efficiencies...
More informationDESIGNING WEATHER INSURANCE CONTRACTS FOR FARMERS
DESIGNING WEATHER INSURANCE CONTRACTS FOR FARMERS In Malawi, Tanzania and Kenya Excerpts from final report to the Commodity Risk Management Group, ARD, World Bank June 2007 Download the full report here:
More informationNC STATE UNIVERSITY Exploratory Analysis of Massive Data for Distribution Fault Diagnosis in Smart Grids
Exploratory Analysis of Massive Data for Distribution Fault Diagnosis in Smart Grids Yixin Cai, Mo-Yuen Chow Electrical and Computer Engineering, North Carolina State University July 2009 Outline Introduction
More informationIntroduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p.
Introduction p. xvii Introduction to Big Data Analytics p. 1 Big Data Overview p. 2 Data Structures p. 5 Analyst Perspective on Data Repositories p. 9 State of the Practice in Analytics p. 11 BI Versus
More informationCSC 177 Fall 2014 Team Project Final Report
CSC 177 Fall 2014 Team Project Final Report Project Title, Data Mining on Farmers Market Data Instructor: Dr. Meiliu Lu Team Members: Yogesh Isawe Kalindi Mehta Aditi Kulkarni CSc 177 DM Project Cover
More informationForecaster comments to the ORTECH Report
Forecaster comments to the ORTECH Report The Alberta Forecasting Pilot Project was truly a pioneering and landmark effort in the assessment of wind power production forecast performance in North America.
More information480093 - TDS - Socio-Environmental Data Science
Coordinating unit: Teaching unit: Academic year: Degree: ECTS credits: 2015 480 - IS.UPC - University Research Institute for Sustainability Science and Technology 715 - EIO - Department of Statistics and
More informationSTATISTICA. Financial Institutions. Case Study: Credit Scoring. and
Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT
More informationBig Data The Next Phase Lessons from a Decade+ Experiment in Big Data
Big Data The Next Phase Lessons from a Decade+ Experiment in Big Data David Belanger PhD Senior Research Fellow Stevens Institute of Technology dbelange@stevens.edu 1 Outline Big Data Overview Thinking
More informationIntroduction. A. Bellaachia Page: 1
Introduction 1. Objectives... 3 2. What is Data Mining?... 4 3. Knowledge Discovery Process... 5 4. KD Process Example... 7 5. Typical Data Mining Architecture... 8 6. Database vs. Data Mining... 9 7.
More informationEnergy Performance Indicators (EnPI) Tim Dantoin Focus on Energy
Energy Performance Indicators (EnPI) Tim Dantoin Focus on Energy Learning Objectives Identify and test one or more EnPls. Identify factors that may affect EnPls. Establish an energy baseline. Analyze your
More informationQualitative Agriculture Product Analysis Based SPSS Software & Management using Cloud Computing
Qualitative Agriculture Product Analysis Based SPSS Software & Management using Cloud Computing Vaishali Madan 1, Shubhangi Borkar 2 1 Dept of computer sci and engg Nagpur Institute of Technology Nagpur,India
More informationHow to gather and evaluate information
09 May 2016 How to gather and evaluate information Chartered Institute of Internal Auditors Information is central to the role of an internal auditor. Gathering and evaluating information is the basic
More informationAvailable online at www.sciencedirect.com Available online at www.sciencedirect.com. Advanced in Control Engineering and Information Science
Available online at www.sciencedirect.com Available online at www.sciencedirect.com Procedia Procedia Engineering Engineering 00 (2011) 15 (2011) 000 000 1822 1826 Procedia Engineering www.elsevier.com/locate/procedia
More informationIMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH
IMPROVING DATA INTEGRATION FOR DATA WAREHOUSE: A DATA MINING APPROACH Kalinka Mihaylova Kaloyanova St. Kliment Ohridski University of Sofia, Faculty of Mathematics and Informatics Sofia 1164, Bulgaria
More informationTitle of dissertation. RONPAKU Fellow Name
Title of dissertation (1) A Comparison of Hand-arm Vibration Syndrome between Malaysian and Japanese Workers (2) Dose-response Relationship between Hand-transmitted Vibration and Hand-arm Vibration Syndrome
More informationCalculating the Probability of Returning a Loan with Binary Probability Models
Calculating the Probability of Returning a Loan with Binary Probability Models Associate Professor PhD Julian VASILEV (e-mail: vasilev@ue-varna.bg) Varna University of Economics, Bulgaria ABSTRACT The
More informationData Isn't Everything
June 17, 2015 Innovate Forward Data Isn't Everything The Challenges of Big Data, Advanced Analytics, and Advance Computation Devices for Transportation Agencies. Using Data to Support Mission, Administration,
More informationExploratory Data Mining and Data Cleaning. Wiley Series in Probability and Statistics
Brochure More information from http://www.researchandmarkets.com/reports/2174574/ Exploratory Data Mining and Data Cleaning. Wiley Series in Probability and Statistics Description: - Written for practitioners
More informationData Science & Big Data Practice
INSIGHTS ANALYTICS INNOVATIONS Data Science & Big Data Practice Manufacturing Internet of Things (IoT) Amplify Serviceability and Productivity by integrating machine /sensor data with Data Science What
More informationGLOSSARY OF EVALUATION TERMS
Planning and Performance Management Unit Office of the Director of U.S. Foreign Assistance Final Version: March 25, 2009 INTRODUCTION This Glossary of Evaluation and Related Terms was jointly prepared
More informationUnderstanding Characteristics of Caravan Insurance Policy Buyer
Understanding Characteristics of Caravan Insurance Policy Buyer May 10, 2007 Group 5 Chih Hau Huang Masami Mabuchi Muthita Songchitruksa Nopakoon Visitrattakul Executive Summary This report is intended
More informationHow To Predict Web Site Visits
Web Site Visit Forecasting Using Data Mining Techniques Chandana Napagoda Abstract: Data mining is a technique which is used for identifying relationships between various large amounts of data in many
More informationBetter planning and forecasting with IBM Predictive Analytics
IBM Software Business Analytics SPSS Predictive Analytics Better planning and forecasting with IBM Predictive Analytics Using IBM Cognos TM1 with IBM SPSS Predictive Analytics to build better plans and
More informationDecision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010
Decision Support Optimization through Predictive Analytics - Leuven Statistical Day 2010 Ernst van Waning Senior Sales Engineer May 28, 2010 Agenda SPSS, an IBM Company SPSS Statistics User-driven product
More informationExecutive Master of Public Administration. QUANTITATIVE TECHNIQUES I For Policy Making and Administration U6310, Sec. 03
INSTRUCTORS: PROFESSOR: Stuart E. Ward TEACHING ASSISTANT: Hamid Rashid E-Mail: sew9@columbia.edu hr99@columbia.edu Office Phone# 212.854.5941 (o) To Be Announced Office Room# 407A To Be Announced MEETING
More informationApplication of data mining techniques for energy modeling of HVAC sub-systems
Application of data mining techniques for energy modeling of HVAC sub-systems Mathieu Le Cam 1, Radu Zmeureanu 1, and Ahmed Daoud 2 1 Center for Zero Energy Building Studies, Department of Building, Civil,
More informationThis survey addresses individual projects, partnerships, data sources and tools. Please submit it multiple times - once for each project.
Introduction This survey has been developed jointly by the United Nations Statistics Division (UNSD) and the United Nations Economic Commission for Europe (UNECE). Our goal is to provide an overview of
More informationDATA MINING TECHNIQUES AND APPLICATIONS
DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra,
More informationCourse Syllabus. Purposes of Course:
Course Syllabus Eco 5385.701 Predictive Analytics for Economists Summer 2014 TTh 6:00 8:50 pm and Sat. 12:00 2:50 pm First Day of Class: Tuesday, June 3 Last Day of Class: Tuesday, July 1 251 Maguire Building
More informationIndex Insurance for Climate Impacts Millennium Villages Project A contract proposal
Index Insurance for Climate Impacts Millennium Villages Project A contract proposal As part of a comprehensive package of interventions intended to help break the poverty trap in rural Africa, the Millennium
More informationPentaho Data Mining Last Modified on January 22, 2007
Pentaho Data Mining Copyright 2007 Pentaho Corporation. Redistribution permitted. All trademarks are the property of their respective owners. For the latest information, please visit our web site at www.pentaho.org
More informationData quality in Accounting Information Systems
Data quality in Accounting Information Systems Comparing Several Data Mining Techniques Erjon Zoto Department of Statistics and Applied Informatics Faculty of Economy, University of Tirana Tirana, Albania
More informationCustomer Classification And Prediction Based On Data Mining Technique
Customer Classification And Prediction Based On Data Mining Technique Ms. Neethu Baby 1, Mrs. Priyanka L.T 2 1 M.E CSE, Sri Shakthi Institute of Engineering and Technology, Coimbatore 2 Assistant Professor
More informationThe Effect of Schedules on HVAC Runtime for Nest Learning Thermostat Users
WHITE PAPER SUMMARY The Effect of Schedules on HVAC Runtime for Nest Learning Thermostat Users Nest Labs, Inc. September 2013! 1 1. Introduction Since the launch of the Nest Learning Thermostat in October
More informationIslamic Republic of Afghanistan, Post Disaster Need Assessment (PDNA) Training Manual
Islamic Republic of Afghanistan, Ministry of Rural Rehabilitation and Development (MRRD) Ministry of Rural Rehabilitation & Development Post Disaster Need Assessment (PDNA) Training Manual Social Protection
More informationThe Potential Use of Remote Sensing to Produce Field Crop Statistics at Statistics Canada
Proceedings of Statistics Canada Symposium 2014 Beyond traditional survey taking: adapting to a changing world The Potential Use of Remote Sensing to Produce Field Crop Statistics at Statistics Canada
More informationRobichaud K., and Gordon, M. 1
Robichaud K., and Gordon, M. 1 AN ASSESSMENT OF DATA COLLECTION TECHNIQUES FOR HIGHWAY AGENCIES Karen Robichaud, M.Sc.Eng, P.Eng Research Associate University of New Brunswick Fredericton, NB, Canada,
More informationPredicting daily incoming solar energy from weather data
Predicting daily incoming solar energy from weather data ROMAIN JUBAN, PATRICK QUACH Stanford University - CS229 Machine Learning December 12, 2013 Being able to accurately predict the solar power hitting
More informationDemand Forecasting to Increase Profits on Perishable Items
Demand Forecasting to Increase Profits on Perishable Items Ankur Pandey Arun Chaubey Sanchit Garg Shahid Siddiqui Sharath Srinivas Forecasting Analytics, ISB Over stock Wastage Ordering/ Inventory of Perishable
More informationDATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM
INTERNATIONAL JOURNAL OF RESEARCH IN COMPUTER APPLICATIONS AND ROBOTICS ISSN 2320-7345 DATA MINING TECHNIQUES SUPPORT TO KNOWLEGDE OF BUSINESS INTELLIGENT SYSTEM M. Mayilvaganan 1, S. Aparna 2 1 Associate
More informationCI6227: Data Mining. Lesson 11b: Ensemble Learning. Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore.
CI6227: Data Mining Lesson 11b: Ensemble Learning Sinno Jialin PAN Data Analytics Department, Institute for Infocomm Research, A*STAR, Singapore Acknowledgements: slides are adapted from the lecture notes
More informationLluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining
Lluis Belanche + Alfredo Vellido Intelligent Data Analysis and Data Mining a.k.a. Data Mining II Office 319, Omega, BCN EET, office 107, TR 2, Terrassa avellido@lsi.upc.edu skype, gtalk: avellido Tels.:
More informationData Mining Applications in Manufacturing
Data Mining Applications in Manufacturing Dr Jenny Harding Senior Lecturer Wolfson School of Mechanical & Manufacturing Engineering, Loughborough University Identification of Knowledge - Context Intelligent
More informationData Mining Techniques Chapter 6: Decision Trees
Data Mining Techniques Chapter 6: Decision Trees What is a classification decision tree?.......................................... 2 Visualizing decision trees...................................................
More informationOutline: Demand Forecasting
Outline: Demand Forecasting Given the limited background from the surveys and that Chapter 7 in the book is complex, we will cover less material. The role of forecasting in the chain Characteristics of
More informationApplied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets
Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets http://info.salford-systems.com/jsm-2015-ctw August 2015 Salford Systems Course Outline Demonstration of two classification
More informationData Warehouse design
Data Warehouse design Design of Enterprise Systems University of Pavia 21/11/2013-1- Data Warehouse design DATA PRESENTATION - 2- BI Reporting Success Factors BI platform success factors include: Performance
More informationStatistical Sales Forecasting using SAP BPC
Statistical Sales Forecasting using SAP BPC Capgemini s unique statistical sales forecasting solution integrated with SAP BPC 10.0 helps global fortune 1000 company built robust & accurate sales forecasting
More informationStrategic Management System for Effective Health Care Planning (SMS-EHCP)
674 Strategic Management System for Effective Health Care Planning (SMS-EHCP) 1 O. I. Omotoso, 2 I. A. Adeyanju, 3 S. A. Ibraheem 4 K. S. Ibrahim 1,2,3,4 Department of Computer Science and Engineering,
More informationWeek 1. Exploratory Data Analysis
Week 1 Exploratory Data Analysis Practicalities This course ST903 has students from both the MSc in Financial Mathematics and the MSc in Statistics. Two lectures and one seminar/tutorial per week. Exam
More informationChapter 7: Data Mining
Chapter 7: Data Mining Overview Topics discussed: The Need for Data Mining and Business Value The Data Mining Process: Define Business Objectives Get Raw Data Identify Relevant Predictive Variables Gain
More informationData Mining with Weka
Data Mining with Weka Class 1 Lesson 1 Introduction Ian H. Witten Department of Computer Science University of Waikato New Zealand weka.waikato.ac.nz Data Mining with Weka a practical course on how to
More informationHow to use Big Data in Industry 4.0 implementations. LAURI ILISON, PhD Head of Big Data and Machine Learning
How to use Big Data in Industry 4.0 implementations LAURI ILISON, PhD Head of Big Data and Machine Learning Big Data definition? Big Data is about structured vs unstructured data Big Data is about Volume
More informationAPPENDIX 15. Review of demand and energy forecasting methodologies Frontier Economics
APPENDIX 15 Review of demand and energy forecasting methodologies Frontier Economics Energex regulatory proposal October 2014 Assessment of Energex s energy consumption and system demand forecasting procedures
More informationBusiness Analyst Work Plan. Presented by: Billie Johnson, CBAP CSM
Business Analyst Work Plan Presented by: Billie Johnson, CBAP CSM Agenda Topic Introduction Overview of a Business Analysis Work Plan Initiating a Business Analysis Effort Components of the Business Analysis
More informationData Warehousing and Data Mining in Business Applications
133 Data Warehousing and Data Mining in Business Applications Eesha Goel CSE Deptt. GZS-PTU Campus, Bathinda. Abstract Information technology is now required in all aspect of our lives that helps in business
More informationTOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY
TOWARD A DISTRIBUTED DATA MINING SYSTEM FOR TOURISM INDUSTRY Danubianu Mirela Stefan cel Mare University of Suceava Faculty of Electrical Engineering andcomputer Science 13 Universitatii Street, Suceava
More informationOLAP and Data Mining. Data Warehousing and End-User Access Tools. Introducing OLAP. Introducing OLAP
Data Warehousing and End-User Access Tools OLAP and Data Mining Accompanying growth in data warehouses is increasing demands for more powerful access tools providing advanced analytical capabilities. Key
More informationSilvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com
SPSS-SA Silvermine House Steenberg Office Park, Tokai 7945 Cape Town, South Africa Telephone: +27 21 702 4666 www.spss-sa.com SPSS-SA Training Brochure 2009 TABLE OF CONTENTS 1 SPSS TRAINING COURSES FOCUSING
More informationRequirements Fulfilled This course is required for all students majoring in Information Technology in the College of Information Technology.
Course Title: ITAP 3382: Business Intelligence Semester Credit Hours: 3 (3,0) I. Course Overview The objective of this course is to give students an understanding of key issues involved in business intelligence
More informationDATA AUDIT: Scope and Content
DATA AUDIT: Scope and Content The schedule below defines the scope of a review that will assist the FSA in its assessment of whether a firm s data management complies with the standards set out in the
More informationEnergy Savings from Business Energy Feedback
Energy Savings from Business Energy Feedback Behavior, Energy, and Climate Change Conference 2015 October 21, 2015 Jim Stewart, Ph.D. INTRODUCTION 2 Study Background Xcel Energy runs the Business Energy
More informationI D C M a r k e t S c a p e : W o r l d w i d e F i n a n c i a l S e r v i c e s C o n s u l t i n g 2 0 1 2 V e n d o r A n a l y s i s
Global Headquarters: 5 Speen Street Framingham, MA 01701 USA P.508.872.8200 F.508.935.4015 www.idc.com C O M P E T I T I V E A N A L Y S I S I D C M a r k e t S c a p e : W o r l d w i d e F i n a n c
More informationVOLUME 12: FEED THE FUTURE EVALUATION DESIGN TEMPLATE NOVEMBER 2015
VOLUME 12: FEED THE FUTURE EVALUATION DESIGN TEMPLATE NOVEMBER 2015 The attached Feed the Future Evaluation Design Template provides guidance to third party evaluators and USAID missions on recommended
More informationClimate and Weather. This document explains where we obtain weather and climate data and how we incorporate it into metrics:
OVERVIEW Climate and Weather The climate of the area where your property is located and the annual fluctuations you experience in weather conditions can affect how much energy you need to operate your
More informationTOR - Consultancy Announcement Final Evaluation of the Cash assistance and recovery support project (CARSP)
TOR - Consultancy Announcement Final Evaluation of the Cash assistance and recovery support project (CARSP) Organization Project Position type Adeso African Development Solutions and ACTED - Agency for
More informationCloud Computing Survey Perception of the companies. DPDP - Macedonia
Cloud Computing Survey Perception of the companies DPDP - Macedonia Survey regarding the awareness of the companies in relation to Cloud computing in Macedonia Executive summary The survey was conducted
More information