Hard hats for data miners: Myths and pitfalls of data mining
|
|
- Preston Butler
- 7 years ago
- Views:
Transcription
1 Hard hats for data miners: Myths and pitfalls of data mining T. Khabaza SPSS Advanced Data Mining Group Abstract The intrepid data miner runs many risks, such as being buried under mountains of data or vanishing along with the mysterious disappearing terabyte. This paper debunks some myths and sketches some hard hats for data miners. 1 Introduction Data mining is a business process, finding patterns in your data which you can use to do your business better. Through data mining we gain insight into a business problem; this insight may be of use in itself, but it also helps us to gain the other benefits of data mining, such as a predictive capability. This paper is about the practice of data mining; it is not a research paper, but reports lessons learned through solving practical business problems and through contact with many data mining users and potential users. There are many myths and misconceptions about data mining, and holding these misconceptions leads data mining users to run specific risks. The first half of this paper lists some common misconceptions about data mining, corrects them, and describes the risks to which they can lead. The second half of the paper lists other common problems or pitfalls of data mining, with their symptoms and cures. 2 Myths and misconceptions about data mining 2.1 Myth #1: Data mining is all about algorithms The ordinary business-person, attending a typical data mining conference, reading its proceedings, or even reading only the contents page of such a
2 14 Data Mining III proceedings, could be forgiven for thinking that data mining is all about advanced data analysis algorithms. This misconception might be summarised as all you need for data mining is good algorithms; the better your algorithms, the better your data mining, and its corollary advancing the state of the art in data mining means advancing knowledge of algorithms. To hold this view is to misunderstand the data mining process completely. Data mining is a business process, involving many elements such as formulating business goals, mapping business to data mining goals, acquiring, understanding and pre-processing the data, evaluating and presenting the results of analysis and deploying these results to achieve business benefit, as well as the modelling component. (A good explanation of this process can be found in the emerging industry standard process model CRISP-DM [1].) In their extreme form, the consequences of holding this misconception are disastrous for a data mining project, and such a project will fail to produce any useful results. In practice, this occurs only in the narrowest, most academic of projects, where useful results for the business are not absolutely required. In any project where there is a requirement for the results to benefit the business, the data miner who holds this misconception is forced to discard it, at least partially, and face the need for a broader view of the data mining process. This is not to denigrate those parts of data mining research which develop or improve data mining algorithms. Algorithms play a key role in data mining, and new or improved algorithms are one way in which the art of data mining advances. The problem occurs when we focus mainly or solely on algorithms and ignore the other!)()-!)5~0 of the data mining process. 2.2 Myth #2: Data mining is all about predictive accuracy Above I have rejected the notion that data mining is all about modelling algorithms, but within that part of data mining which is about algorithms, how can we judge the quality of an algorithm? Readin< data mining research literature might lead us to suppose that the main criterion for judging an algorithm is the predictive accuracy of the models it generates. This view completely misrepresents the role of algorithms in the data mining process. It is true that in order to be useful a predictive model should have some degree of accuracy, because this reflects whether the algorithm has really discovered patterns in the data. However, many other properties of an algorithm or a model affect its usefidness; examples include whether the model can be understood by the analyst, and whether it requires technical knowledge to understand the model or apply the algorithm. Considering the properties (other than predictive accuracy) which the data mining process requires of algorithms, we can see the likely consequences of holding this mistaken view: algorithms will be produced which can be used only by technology experts. These algorithms will have only the most limited role in a process which is driven by business expertise.
3 Data Mining III ~5 2.3 Myth #3: Data mining requires a data warehouse Data mining practitioners often hear statements like we are not ready for data mining yet, we need to build our data warehouse first. Such statements are based on the view that data warehousing is a pre-requisite for data mining. This is a subtle misconception about the relationship between data warehousing and data mining. It is true that data mining can benefit from the warehoused data being well organised, relatively clean, and easy to access. These benefits can accrue if the warehouse has been constructed with data mining specifically in mind, and with knowledge of the requirements of the data mining envisaged. If it has not, the warehoused data may be less useful for data mining than the source operational data, or in the worst case completely useless (for example in cases where only summary data is warehoused). To avoid this risk, it is usefhl to perform pilot data mining projects using operational data in order to determine the correct content and organisation for the warehouse. It is misleading to state that data mining requires a data warehouse; a more accurate summary of the relation would be that data mining can benefit from a data warehouse, but that to construct such a warehouse often requires data mining. 2.4 Myth #4: Data mining is all about vast quantities of data Early explanations of data mining in the computing press often start with statements like We now collect more data than ever, yet how are we to gain benefit from these vast data stores?. To focus on the size of data stores provides a convenient introduction to the topic of data mining, but subtly misrepresents its nature. Data mining becomes useful when data becomes too large or too complex to analyse by eye, that is anything larger than a few tens of examples and a handful of attributes. Many usefil data mining projects are performed on small or medium-sized datasets, for example containing only hundreds or thousands of records. Apart ffom its convenience in popular explanations, an association of data mining with vast datasets is also connected with the recent emphasis on performance and scalability of data mining tools. This drive to extend the reach of data mining tools to large data is perfectly justified there are many large datasets which it benefits us to mine. However it would be a mistake to believe that these large datasets are the sole focus of data mining. Holding this erroneous belief would lead us to produce tools which sacrifice usability for scalability, whereas in fact both aspects are essential. To quote a customer of a leading data mining tool: other data mining tools optimise machine time, but this tool optimises my time. Whether the datasets are large or small we much strive to optimise the user s time, and this may be assisted by scalability and performance.
4 16 Data Mining III 2.5 Myth #5: Data mining should be done by a technology expert Data mining technology, particularly modelling techniques, is of an advanced sort, and its workings are unlikely to be understood by the wider IT community. Some would claim that this means they should be applied only by technology experts who understand their workings. (This claim may be influenced by a historical association with statistical modelling algorithms, which are more open to misinterpretation than most data mining algorithms.) In fact, the very reverse is true, because of the paramount importance of business knowledge in data mining. When performed without business knowledge, data mining usually produces nonsensical or useless results (see pitfall #3 below). It is therefore essential that data mining is performed by someone with extensive knowledge of the business problem, which is very seldom combined with knowledge of the technology. It is the responsibility of data mining tool providers to ensure that tools are accessible to business rather than technology experts. It behoves the data mining community at large to make clear to potential users that data mining provides insight and useful suggestions, rather than mathematical certainty. 2.6 Myth #6: Neural networks are opaque and consequently useless an over-simplistic view of data mining Myth #6 is a relatively specific misconception about one family of modelling techniques (neural networks) which arises from a broader misunderstanding about the data mining process. One sometimes encounters the view that neural networks are not very useful in data mining because one cannot discover why they make the predictions that they do, or the rules that they use. This means that their predictions cannot be justified, and that they will not contribute much insight. While this argument reflects a correct emphasis on understandability of models and the insight produced by data mining, the conclusion about the disutility of neural networks is erroneous, and the argument reflects a mistaken view of the data mining process, possibly related to myth #1. This mistaken view regards data mining as a rather simple process: take the data, apply a modelling technique, use the results. This omits the iterative nature of the data mining process, and the way in which many techniques are used together to produce a result. Neural networks are used in a variety of ways in data mining projects, uses which are not impacted by the opacity of the models. Here are some examples:. Neural networks can be used for attribute selection, either by training them repeatedly with different combinations of attributes, or by using techniques of sensitivity analysis to rank the attributes by their impact on predictions. Neural networks can be used for pattern confiiation because they are particularly powerful pattern finders for many applications, neural networks can be used to confirm that a pattern exists, before spending effort on tuning other techniques to find it.
5 Data Mining III 1 7 Neural networks can be used in conjunction with other techniques, for example to improve the confidence of predictions by discarding those where the neural network disagrees with the predictions of another technique, or by using other techniques to analyse the behaviour of the neural networks. All of these uses of neural networks reflect the fact that the data mining process cannot be summarised as apply a modelling technique and use the results. Data mining facilities form a toolbox, whose contents are used in varied and sometimes surprising ways to solve a problem. 3 Pitfalls of data mining and their cures 3.1 Pitfall #1: Buried under mountains of data Data mining should be an interactive, iterative process where the analyst applies substantial business knowledge and is engaged with the data. However, those who hold myth #4 (that data mining is about vast quantities of data) often suppose that this process must be applied to all of the available data. This can lead to attempts to mine volumes of data for which the available hardware and software cannot provide an acceptable interactive response (for example, building a model within a few minutes). The data mining process becomes sluggish, and by the time a question is answered, the analyst cannot remember why it was asked. It is hard to feel that this process is generating insight. The cure for this malaise is usually some form of sampling. For example, if we have a million customers and a 20 /0 annual attrition (or chum ) rate, we need not plot our graphs or build our models using the fill million examples, or even half a million (leaving, say, half for independent results validation). Consider the following questions and answers: Q: How many chum profiles do we expect to find? A: Maybe ten. Q: How many examples of each profile do we need? A: Maybe a thousand. Conclusion: A sample of ten or twenty thousand churners, and an equivalent number of non-churners, will be sufficient for this analysis. Note that this does not mean that we will never encounter the need to build models from millions of examples, only that we should not assume that we must do so if this data is available. One interesting class of cases is those where we wish to find a rare profile. Suppose that we wish to find a specific phenomenon which causes only 1 % of churn. It might be thought that we must build models against the whole dataset in order to find it. However there are other approaches. For example we might find the common chum profiles first, using a relatively small sample to build the models, use these initial profdes to score the entire database, and then focus
6 18 Data Mining III subsequent analysis on the relatively small number who chum but were predicted not to do so. 3.2 Pitfall #2: The Mysterious Disappearing Terabyte This is a common phenomenon, but not always a pitfall. The phrase mysterious disappearing terabyte refers to the fact that for a given data mining problem, the amount of available and relevant data maybe much less than initially supposed. Consider the following scenario: You are a data mining consultant, and your client is a large bank, holding terabytes of data on its customers. There is some concern that the available computing resources will be inadequate to mining this volume of data. The bank wishes to mine information on credit risk. Different types of credit (for example personal loans, business loans, overdrafts) would present different patterns of credit risk, so each data mining project will concentrate on one type of borrower. A number of factors are judged (by the bank s domain experts) to be relevant. Are these factors collected by the bank? Yes, they have looked ahead and started collecting the relevant factors, eighteen months ago. Lots of borrowing has taken place in the intervening time so there should be no problem about data! How many bad debts of the relevant kind have occurred in that time? Plenty - almost a thousand! Thus the relevant data consists of less than a thousand cases of bad debt plus a sample from a plentifid supply of cases of good debt - say 3,000 records in all. Somehow, terabytes of data have softly and silently vanished away, fortunately not (quite) taking the data miner with them (this time). 3.3 Pitfall #3: Insufficient business knowledge I have emphasised previously the crucial role played by business knowledge in data mining. Without it, we can neither recognise useful results nor guide the data mining process towards them. It is sometimes supposed that the end user of data mining can reasonably take the attitude: here is the data, please go away and mine it, and come back with the answers. When a data mining project is organised in this way, at best the project will take many long and costly iterations to produce useful results, and at worst the results will be gibberish and the project will fail. This pitfall can only be avoided by involving the end user, and more specifically someone with a detailed knowledge of the business, at every stage of the data mining process. Ideally the data miner should be part of that business, but if a data mining consultant is used then the consultant should literally sit next to someone with the required business knowledge who understands the question under consideration. For this to work, a highly interactive data mining environment with good response time is required.. (A data mining consultant with general knowledge of the relevant industry is not sufficient detailed knowledge of the specific business is needed.)
7 Data Mining III Pitfall #4: Insufficient data knowledge In order to perfonm data mining we must be able to answer questions like what do the codes in this field mean?, and can there be more than one record per customer in this table?. In some cases this information is surprisingly hard to come by for example because the data expert has left the organisation or moved to another department, or in the case of legacy systems there may be no data expert at all. This problem is exacerbated when the database or data warehouse management is outsourced the external supplier is even less motivated than the user organisation to maintain the information in case it is needed in future. There is no simple cure for this problem. IT departments should be made aware of the need to maintain information about the organisation s databases, and when a data mining project is proposed we should consider how much data knowledge is available, and any risks caused by its absence or scarcity. 3.5 Pitfall #5: Erroneous assumptions, courtesy of the experts Business and data expertise are crucial resources for data mining, but that does not mean that the data miner should accept unquestioningly every statement of the experts. One benefit from data mining is that organisations discover surprising facts about their data and about their business. The data miner should seek to confirm the truth of experts statements so far as they relate to the data. Typical examples of erroneous or misleading statements would include: No customer can hold accounts of both these types. No case will include more than one event of this type. Only the following codes will be present in this field. Statements like this should be verified by examining the data. Data mining tools should make this easy. It is particularly important to check these issues when processing of the data will depend on them, so that mistakes in these assumptions can be spotted before they lead to errors in the treatment of data. 3.6 Pitfall #6: Incompatibility of data mining tools The data mining process requires a wide range of facilities, so it might be supposed that a wide variety of tools will be used. This can lead to a high overhead in switching contexts and converting data between different formats. At its worst this can lead to the omission of necessary steps, and even mild cases can seriously interfere with the exploratory character of data mining. The most readily available solution is to use a data mining toolkit in which all the required facilities are present in an integrated form. However, no toolkit will provide every possible facility, especially when the individual preferences of analysts are taken into account, so toolkits should also be open, and interface easily with other available tools and third-party options.
8 20 Data Mining III 3.7 Pitfall #7: Locked in the data jail-house In addition to openness with regard to tools, data mining systems should be open with regard to data. Some data mining tools require the data to be held in a proprietary format which is not compatible with commonly used database systems. (This is sometimes referred to as the data jail-house.) This can result in large overheads to transfer data into the format required, and difficulty in deploying the results into an organisation s systems. A good data mining tool will interface to your data via common standards. 3.8 Pitfall #8: Disorganized data mining This common pitfall is often a consequence of the apply the algorithm, use the results misconception (see myth #6). The data mining takes place in an ad-hoc manner, with no clear goals and no idea of how the results will be used. The consequences can be unusable results. To produce useful results, it is necessary to have clearly defined business and data mining goals, formulated early in the project, along with deployment plans. A simple way of ensuring this is to use a standard process such as CRISP-DM [1]; this ensures the correct preparation for data mining, and provides a common language for communication of methods and results. Data mining tools should support standard process models. 4 Conclusions Data mining is a business process, requiring extensive business knowledge and best practiced by, or in very close collaboration with, business experts. Data mining uses a variety of different kinds of techniques, and should not be focussed mainly or exclusively on modelling algorithms and their predictive accuracy. Each technique can play a variety of roles. Data miners should make intelligent decisions about the amount of data required, assuming neither that all of an organisation s data will be relevant, nor that all the available data will be required. Effective data mining requires flexible and interoperable techniques; this requirement is best met by integrated, open toolkits, which can interface to data via open standards. The data mining process can be characterised by interaction and engagement with the data in an iterative fashion. A standard data mining process model such as CRISP-DM helps to ensure the correct preparation for and use of data mining, and should be supported by data mining tools. References [1] Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C. and Wirth, R. CRISP-DM I. O Step-by-step data mining guide, CRISP-DM Consortium, 2000, available at org.
Tom Khabaza. Hard Hats for Data Miners: Myths and Pitfalls of Data Mining
Tom Khabaza Hard Hats for Data Miners: Myths and Pitfalls of Data Mining Hard Hats for Data Miners: Myths and Pitfalls of Data Mining By Tom Khabaza The intrepid data miner runs many risks, including being
More informationCRISP - DM. Data Mining Process. Process Standardization. Why Should There be a Standard Process? Cross-Industry Standard Process for Data Mining
Mining Process CRISP - DM Cross-Industry Standard Process for Mining (CRISP-DM) European Community funded effort to develop framework for data mining tasks Goals: Cross-Industry Standard Process for Mining
More informationActive Listening. Learning Objectives. By the end of this module, the learner will have
1 Active Listening Learning Objectives By the end of this module, the learner will have An understanding of what is meant by active listening Gained insight into your natural listening strengths and areas
More informationCRISP-DM, which stands for Cross-Industry Standard Process for Data Mining, is an industry-proven way to guide your data mining efforts.
CRISP-DM, which stands for Cross-Industry Standard Process for Data Mining, is an industry-proven way to guide your data mining efforts. As a methodology, it includes descriptions of the typical phases
More informationWhy Data Mining Research Does Not Contribute to Business?
Why Data Mining Research Does Not Contribute to Business? Mykola Pechenizkiy 1, Seppo Puuronen 1, Alexey Tsymbal 2 1 Dept. of Computer Science and Inf. Systems, University of Jyväskylä, Finland {mpechen,sepi}@cs.jyu.fi
More informationPlanning successful data mining projects
IBM SPSS Modeler Planning successful data mining projects A practical, three-step guide to planning your first data mining project and selling it internally Contents: 1 Executive summary 2 One: Start with
More informationREFLECTIONS ON THE USE OF BIG DATA FOR STATISTICAL PRODUCTION
REFLECTIONS ON THE USE OF BIG DATA FOR STATISTICAL PRODUCTION Pilar Rey del Castillo May 2013 Introduction The exploitation of the vast amount of data originated from ICT tools and referring to a big variety
More informationGROW@BU Coaching Tools
GROW@bournemouth.ac.uk Page 0 Contents 1. Introduction... 2 2. The Coaching Conversation... 3 3. Understanding Coaching Questions... 4 4. Starting a Coaching Conversation... 5 5. Coaching Behaviours...
More informationThe Unfortunate Little Secret About Current CRM Data Cleansing. (And how it destroys your bottom line.)
The Unfortunate Little Secret About Current CRM Data Cleansing. (And how it destroys your bottom line.) Until now clean data was more myth than fact. That s because there is a crucial difference between
More information7 Steps for Launching a Successful Manufacturing Big Data Project
SIGHT MACHINE WHITE PAPER 7 Steps for Launching a Successful Manufacturing Big Data Project Quick Take Manufacturers are coming up to speed on Big Data technologies and trends, but often have trouble figuring
More informationEvaluation: Designs and Approaches
Evaluation: Designs and Approaches Publication Year: 2004 The choice of a design for an outcome evaluation is often influenced by the need to compromise between cost and certainty. Generally, the more
More informationANALYSING THE SHORT STORY CONTENTS
English: The Short story. 1.x/ fall 2002/lm 1/5 ANALYSING THE SHORT STORY CONTENTS THE SHORT STORY GENRE CONVENTIONS: 2 In medias res beginning: 2 A limited number of characters. 2 Limited character description:
More informationCritical Analysis So what does that REALLY mean?
Critical Analysis So what does that REALLY mean? 1 The words critically analyse can cause panic in students when they first turn over their examination paper or are handed their assignment questions. Why?
More informationCenter for Effective Organizations
Center for Effective Organizations HR METRICS AND ANALYTICS USES AND IMPACTS CEO PUBLICATION G 04-8 (460) EDWARD E. LAWLER III ALEC LEVENSON JOHN BOUDREAU Center for Effective Organizations Marshall School
More informationFrom Cognitive Science to Data Mining: The first intelligence amplifier
From Cognitive Science to Data Mining: The first intelligence amplifier Tom Khabaza Abstract This paper gives a brief account of two hypotheses. First that data mining is a kind of intelligence amplifier,
More informationAn Introduction to Advanced Analytics and Data Mining
An Introduction to Advanced Analytics and Data Mining Dr Barry Leventhal Henry Stewart Briefing on Marketing Analytics 19 th November 2010 Agenda What are Advanced Analytics and Data Mining? The toolkit
More informationSolve your toughest challenges with data mining
IBM Software IBM SPSS Modeler Solve your toughest challenges with data mining Use predictive intelligence to make good decisions faster Solve your toughest challenges with data mining Imagine if you could
More informationStep-by-step data mining guide
Step-by-step data mining guide Pete Chapman (NCR), Julian Clinton (SPSS), Randy Kerber (NCR), Thomas Khabaza (SPSS), Thomas Reinartz (DaimlerChrysler), Colin Shearer (SPSS) and Rüdiger Wirth (DaimlerChrysler)
More informationAugmented reality enhances learning at Manchester School of Medicine
Augmented reality enhances learning at Manchester School of Medicine Welcome to the Jisc podcast. The University of Manchester is taking a unique approach to prescription training for its medical students
More informationalternative collection
01 technical this article considers two methods a company could adopt in order to speed up the collection of cash from its customers. Additionally, worked examples show how these methods can be evaluated
More informationHow a Hotel Valuation is Undertaken and What a Bank Really Needs from a Valuation
Hotel Valuation How a Hotel Valuation is Undertaken and What a Bank Really Needs from a Valuation This summary paper outlines what is involved in a hotel valuation, and how it can be useful, as well as
More informationSolve Your Toughest Challenges with Data Mining
IBM Software Business Analytics IBM SPSS Modeler Solve Your Toughest Challenges with Data Mining Use predictive intelligence to make good decisions faster Solve Your Toughest Challenges with Data Mining
More information100 Ways To Improve Your Sales Success. Some Great Tips To Boost Your Sales
100 Ways To Improve Your Sales Success Some Great Tips To Boost Your Sales 100 Ways To Improve Your Sales Success By Sean Mcpheat, Managing Director Of The Sales Training Consultancy What makes a successful
More informationLearning and Teaching
B E S T PRACTICES NEA RESEARCH BRIEF Learning and Teaching July 2006 This brief outlines nine leading research-based concepts that have served as a foundation for education reform. It compares existing
More informationGetting the best from your 360 degree feedback
1 Contents Getting the best from your 360 degree feedback... 3 What it is.... 3 And isn t.... 4 Using the system... 5 Choosing your respondents... 5 Choosing your competencies... 5 Compiling your questionnaire...
More informationBIBA Report on the Importance of Advice in the Small to Medium Enterprise Market
BIBA Report on the Importance of Advice in the Small to Medium Enterprise Market The best insurance is a BIBA broker www.biba.org.uk Member helpline: 0845 77 00 266 The FSA define advice as an opinion
More informationTest your talent How does your approach to talent strategy measure up?
1 Test your talent How does your approach to talent strategy measure up? Talent strategy or struggle? Each year at Head Heart + Brain we carry out research projects to help understand best practice in
More information360 feedback. Manager. Development Report. Sample Example. name: email: date: sample@example.com
60 feedback Manager Development Report name: email: date: Sample Example sample@example.com 9 January 200 Introduction 60 feedback enables you to get a clear view of how others perceive the way you work.
More informationAnalyzing Marketing Cases
Analyzing Marketing Cases What is a case? A case is a verbal snapshot of the whole or some part of an organization. The cases are all based upon problems and events that actually took place, although in
More informationThe top 10 secrets to using data mining to succeed at CRM
The top 10 secrets to using data mining to succeed at CRM Discover proven strategies and best practices Highlights: Plan and execute successful data mining projects using IBM SPSS Modeler. Understand the
More informationShell Mentoring Toolkit
Shell Mentoring Toolkit A reference document for mentors and mentees Human Resources LEARNING 25/07/2007 Copyright: Shell International Ltd 2006 CONTENTS What is Mentoring? 4 The Mentor s Role The Role
More informationGuidance paper - The use of calculators in the teaching and learning of mathematics
Guidance paper - The use of calculators in the teaching and learning of mathematics Background and context In mathematics, the calculator can be an effective teaching and learning resource in the primary
More informationTHE PREDICTIVE MODELLING PROCESS
THE PREDICTIVE MODELLING PROCESS Models are used extensively in business and have an important role to play in sound decision making. This paper is intended for people who need to understand the process
More informationThe changing role of the IT department in a cloud-based world. Vodafone Power to you
The changing role of the IT department in a cloud-based world Vodafone Power to you 02 Introduction With competitive pressures intensifying and the pace of innovation accelerating, recognising key trends,
More informationBBC Learning English Talk about English Business Language To Go Part 1 - Interviews
BBC Learning English Business Language To Go Part 1 - Interviews This programme was first broadcast in 2001. This is not a word for word transcript of the programme This series is all about chunks of language
More informationParticipants Manual Video Seven The OSCAR Coaching Model
Coaching Skills for Managers Online Training Programme Part One Fundamentals of Coaching Participants Manual Video Seven The OSCAR Coaching Model Developed by Phone: 01600 715517 Email: info@worthconsulting.co.uk
More informationGuide to cash flow management
Guide to cash flow management Cash flow management What is cash flow management? For a business to be successful, good cash flow management is crucial. Cash flow is the primary indicator of a business
More informationWhy Semantic Analysis is Better than Sentiment Analysis. A White Paper by T.R. Fitz-Gibbon, Chief Scientist, Networked Insights
Why Semantic Analysis is Better than Sentiment Analysis A White Paper by T.R. Fitz-Gibbon, Chief Scientist, Networked Insights Why semantic analysis is better than sentiment analysis I like it, I don t
More informationAN INTRODUCTION TO SOCIOLOGICAL THEORIES
An Introduction to Sociological Theories 1 1 AN INTRODUCTION TO SOCIOLOGICAL THEORIES Introduction Humans are social beings. Whether we like it or not, nearly everything we do in our lives takes place
More informationThe OPEN Manifesto Future City Principles
November 2013 The OPEN Manifesto Future City Principles 2 OPEN Manifesto The Open Manifesto paves the way for a more responsive city that adapts to the needs of its citizens. The success of Glasgow s Open
More informationNavigating Big Data business analytics
mwd a d v i s o r s Navigating Big Data business analytics Helena Schwenk A special report prepared for Actuate May 2013 This report is the third in a series and focuses principally on explaining what
More informationThe Journey into Speech Analytics
The Journey into Speech Analytics 2013 Edition ComputerTel Ltd. The Journey into Speech Analytics We have certainly noticed that Speech Analytics is now becoming a realistic consideration for the Contact
More information20-30 minutes, can be used within a longer activity
Locked-in 1 Age group successfully used with: Abilities assumed: Time: Size of group: 11 adult None 20-30 minutes, can be used within a longer activity anything from 2 to hundreds Focus What is an algorithm?
More informationCHAPTER 9. DEVELOPING IT SY STEM S Bringing IT System s to Life
CHAPTER 9 DEVELOPING IT SY STEM S Bringing IT System s to Life 9-2 Introduction Every Organization Is Using Information Technology But IT systems don t magically appear. Organizations spend billions of
More informationDETERMINING WHAT MEMBERS WANT: HOW TO DEVELOP A MEMBER SURVEY THAT ACTUALLY GENERATES THE INFORMATION YOU NEED.
DETERMINING WHAT MEMBERS WANT: HOW TO DEVELOP A MEMBER SURVEY THAT ACTUALLY GENERATES THE INFORMATION YOU NEED. Private clubs are interesting institutions. Where else are an institution s customers also
More informationA STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS
A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS Mrs. Jyoti Nawade 1, Dr. Balaji D 2, Mr. Pravin Nawade 3 1 Lecturer, JSPM S Bhivrabai Sawant Polytechnic, Pune (India) 2 Assistant
More informationPractical Machine Learning and Data Analysis
Data Analysis of Industrial Data Practical Machine Learning and Data Analysis Swedish Institute of Computer Science Outline Data Analysis of Industrial Data 1 Data Analysis of Industrial Data Analysing
More informationWriting an essay. This seems obvious - but it is surprising how many people don't really do this.
Writing an essay Look back If this is not your first essay, take a look at your previous one. Did your tutor make any suggestions that you need to bear in mind for this essay? Did you learn anything else
More informationDeveloping an Academic Essay
2 9 In Chapter 1: Writing an academic essay, you were introduced to the concepts of essay prompt, thesis statement and outline. In this chapter, using these concepts and looking at examples, you will obtain
More informationAudit Readiness Lessons Learned
Audit Readiness Lessons Learned Four Tips for Achieving a Smooth Audit It seems obvious: Prepare well and prepare ahead of time and the year-end audit does not have to be the painful experience most organizations
More information02 Edwards Yammer - Employee User Guide
Employee user Guide 3 What is Yammer? 3 Yammer in Edwards 4 What should I use Yammer for? 5 A Beginners Guide to Yammer 5 Sign up 5 I have my personal edwardsvacuum.com email address 5 I haven t got my
More informationINTERNATIONAL STANDARD ON ASSURANCE ENGAGEMENTS 3000 ASSURANCE ENGAGEMENTS OTHER THAN AUDITS OR REVIEWS OF HISTORICAL FINANCIAL INFORMATION CONTENTS
INTERNATIONAL STANDARD ON ASSURANCE ENGAGEMENTS 3000 ASSURANCE ENGAGEMENTS OTHER THAN AUDITS OR REVIEWS OF HISTORICAL FINANCIAL INFORMATION (Effective for assurance reports dated on or after January 1,
More informationcreativity ADAPTABIlITY PASSION www.webcon.co.in
creativity ADAPTABIlITY PASSION www.webcon.co.in TABlE OF contents 01 EXECUTIVE SUMMARY 02 BUSINESS STATEMENT 03 WEB DESIGN 04 SEO SERVICES 05 SOCIAL MEDIA 06 WHY WEBCON TECHNOLOGIES 07 QUALLTY ASSURANCE
More informationDigging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA
Digging for Gold: Business Usage for Data Mining Kim Foster, CoreTech Consulting Group, Inc., King of Prussia, PA ABSTRACT Current trends in data mining allow the business community to take advantage of
More informationHow does the problem of relativity relate to Thomas Kuhn s concept of paradigm?
How does the problem of relativity relate to Thomas Kuhn s concept of paradigm? Eli Bjørhusdal After having published The Structure of Scientific Revolutions in 1962, Kuhn was much criticised for the use
More informationCRISP-DM 1.0. Step-by-step data mining guide
CRISP-DM 1.0 Step-by-step data mining guide Pete Chapman (NCR), Julian Clinton (SPSS), Randy Kerber (NCR), Thomas Khabaza (SPSS), Thomas Reinartz (DaimlerChrysler), Colin Shearer (SPSS) and Rüdiger Wirth
More informationCFSD 21 ST CENTURY SKILL RUBRIC CRITICAL & CREATIVE THINKING
Critical and creative thinking (higher order thinking) refer to a set of cognitive skills or strategies that increases the probability of a desired outcome. In an information- rich society, the quality
More informationFive High Order Thinking Skills
Five High Order Introduction The high technology like computers and calculators has profoundly changed the world of mathematics education. It is not only what aspects of mathematics are essential for learning,
More informationEvaluating teaching. 6.1 What is teacher evaluation and why is it important?
6 Evaluating Just as assessment that supports is critical for student, teacher evaluation that focuses on providing accurate evidence of practice and supports improvement is central for teachers. Having
More informationRelative and Absolute Change Percentages
Relative and Absolute Change Percentages Ethan D. Bolker Maura M. Mast September 6, 2007 Plan Use the credit card solicitation data to address the question of measuring change. Subtraction comes naturally.
More informationExaminer s report P5 Advanced Performance Management June 2013
Examiner s report P5 Advanced Performance Management June 2013 Introduction Firstly, we would like to offer our congratulations to all of those candidates who achieved a pass at this diet and our commiserations
More informationThe Top 10 Secrets to Using Data Mining to Succeed at CRM
The Top 10 Secrets to Using Data Mining to Succeed at CRM Discover proven strategies and best practices Highlights: Plan and execute successful data mining projects. Understand the roles and responsibilities
More informationSTATISTICA. Financial Institutions. Case Study: Credit Scoring. and
Financial Institutions and STATISTICA Case Study: Credit Scoring STATISTICA Solutions for Business Intelligence, Data Mining, Quality Control, and Web-based Analytics Table of Contents INTRODUCTION: WHAT
More informationWhen being a good lawyer is not enough: Understanding how In-house lawyers really create value
When being a good lawyer is not enough: Understanding how In-house lawyers really create value Contents Foreword... 3 Do you really understand how In-house lawyers create value?... 4 Why creating value
More informationCOLUMN. Planning your SharePoint intranet project. Intranet projects on SharePoint need a clear direction APRIL 2011. Challenges and opportunities
KM COLUMN APRIL 2011 Planning your SharePoint intranet project Starting a SharePoint intranet project, whether creating a new intranet or redeveloping an existing one, can be daunting. Alongside strategy
More informationTesting, What is it Good For? Absolutely Everything!
Testing, What is it Good For? Absolutely Everything! An overview of software testing and why it s an essential step in building a good product Beth Schechner Elementool The content of this ebook is provided
More informationWorking with telecommunications
Working with telecommunications Minimizing churn in the telecommunications industry Contents: 1 Churn analysis using data mining 2 Customer churn analysis with IBM SPSS Modeler 3 Types of analysis 3 Feature
More informationCHAPTER 3 - CUSTOMER RELATIONSHIP MANAGEMENT (CRM)
CHAPTER 3 - CUSTOMER RELATIONSHIP MANAGEMENT (CRM) 3.1 INTRODUCTION The most important purpose of any service orientated organisation is to retain customers. It is critical for any organization to keep
More informationThree proven methods to achieve a higher ROI from data mining
IBM SPSS Modeler Three proven methods to achieve a higher ROI from data mining Take your business results to the next level Highlights: Incorporate additional types of data in your predictive models By
More informationQuality Meets the CEO
Quality Meets the CEO Jeffery E. Payne jepayn@rstcorp.com Reliable Software Technologies Corporate management does not care about quality. This is the cold, hard reality of the software world. Management
More informationB2B Customer Satisfaction Research
Circle Research White Paper B2B Customer Satisfaction B2B Customer Satisfaction Research IN SUMMARY This paper on B2B customer satisfaction research: Identifies why customer satisfaction matters Provides
More informationUsing Use Cases for requirements capture. Pete McBreen. 1998 McBreen.Consulting
Using Use Cases for requirements capture Pete McBreen 1998 McBreen.Consulting petemcbreen@acm.org All rights reserved. You have permission to copy and distribute the document as long as you make no changes
More informationFamily law a guide for legal consumers
Family law a guide for legal consumers Image Credit - Jim Harper A relationship breakdown is a difficult time for anyone. It is one of the most stressful experiences in life. Where you have to involve
More informationQuality. Expertise. Passion. Why you really need a SIAM Tooling Strategy
Quality. Expertise. Passion. Why you really need a SIAM Tooling Strategy To make multisourcing arrangements effective, customers must get suppliers to work together, both from the commercial and operational
More informationLluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining
Lluis Belanche + Alfredo Vellido Intelligent Data Analysis and Data Mining a.k.a. Data Mining II Office 319, Omega, BCN EET, office 107, TR 2, Terrassa avellido@lsi.upc.edu skype, gtalk: avellido Tels.:
More informationInterpretation of Financial Statements
Interpretation of Financial Statements Author Noel O Brien, Formation 2 Accounting Framework Examiner. An important component of most introductory financial accounting programmes is the analysis and interpretation
More information0.00 2000 2001 2002 2003 2004 2005 Year
1 Background 1.1 Healthcare organisations are increasingly generating and thus storing information in electronic format. This is an eclectic mix of Email, Microsoft Office Documents, Images as well as
More informationAbstract. White Paper on Application Modernization 1
Abstract The present day market is loaded with extreme competition that invites only the most innovative and smart organizations to compete together and stay above the water level. Added to this are the
More informationSmart Data Business 10 statements on the use of big data solutions in business
Smart Data Business 10 statements on the use of big data solutions in business Key results of a study conducted by Smart Data Accompanying Research in cooperation with the Business Application Research
More informationQ: What types of businesses/industries can benefit from the SBA loan programs? A: Most small owner-operated business can benefit from SBA loans
Interview with Alan Thomes, President, SBA Loan Division State Bank and Trust Company For many new start-ups and small businesses, an SBA loan may be an appropriate form of financing. In this interview
More informationSales Training Programme. Module 8. Closing the sale workbook
Sales Training Programme. Module 8. Closing the sale workbook Workbook 8. Closing the sale Introduction This workbook is designed to be used along with the podcast on closing the sale. It is a self learning
More informationFive Steps Towards Effective Fraud Management
Five Steps Towards Effective Fraud Management Merchants doing business in a card-not-present environment are exposed to significantly higher fraud risk, costly chargebacks and the challenge of securing
More informationCredit Score Basics, Part 1: What s Behind Credit Scores? October 2011
Credit Score Basics, Part 1: What s Behind Credit Scores? October 2011 OVERVIEW Today, credit scores are often used synonymously as an absolute statement of consumer credit risk. Or, credit scores are
More informationDon t Kill the Analyst Just Yet
Don t Kill the Analyst Just Yet How Ipsos Loyalty applies text analytics GAME CHANGERS Don t Kill the Analyst Just Yet Text Analytics clocks in as the #4 emerging methods priority for market researchers
More informationnot possible or was possible at a high cost for collecting the data.
Data Mining and Knowledge Discovery Generating knowledge from data Knowledge Discovery Data Mining White Paper Organizations collect a vast amount of data in the process of carrying out their day-to-day
More informationElaboration of Scrum Burndown Charts.
. Combining Control and Burndown Charts and Related Elements Discussion Document By Mark Crowther, Empirical Pragmatic Tester Introduction When following the Scrum approach a tool frequently used is the
More informationHow To Turn Big Data Into An Insight
mwd a d v i s o r s Turning Big Data into Big Insights Helena Schwenk A special report prepared for Actuate May 2013 This report is the fourth in a series and focuses principally on explaining what s needed
More informationData Discovery, Analytics, and the Enterprise Data Hub
Data Discovery, Analytics, and the Enterprise Data Hub Version: 101 Table of Contents Summary 3 Used Data and Limitations of Legacy Analytic Architecture 3 The Meaning of Data Discovery & Analytics 4 Machine
More informationPractical Jealousy Management
Florida Poly Retreat 2006 Practical Jealousy Management Part 1: On the Nature of Jealousy Jealousy is an unusual emotion in that it is an emotion rooted in other emotions. Often, the root of jealousy lies
More informationIMPORTANCE OF QUANTITATIVE TECHNIQUES IN MANAGERIAL DECISIONS
IMPORTANCE OF QUANTITATIVE TECHNIQUES IN MANAGERIAL DECISIONS Abstract The term Quantitative techniques refers to the methods used to quantify the variables in any discipline. It means the application
More informationA simple tscheme guide to securing electronic transactions
A simple tscheme guide to securing electronic transactions 1 A simple tscheme guide to securing electronic transactions Electronic Transactions An electronic transaction is best thought of as a type of
More informationMaking a positive difference for energy consumers. Competency Framework Band C
Making a positive difference for energy consumers Competency Framework 2 Competency framework Indicators of behaviours Strategic Cluster Setting Direction 1. Seeing the Big Picture Seeing the big picture
More informationHandouts for teachers
ASKING QUESTIONS THAT ENCOURAGE INQUIRY- BASED LEARNING How do we ask questions to develop scientific thinking and reasoning? Handouts for teachers Contents 1. Thinking about why we ask questions... 1
More informationHow to Choose a Social Media Monitoring and Review Analytics Tool. Make sure the greatest possible range of data is indexed
How to Choose a Social Media Monitoring and Review Analytics Tool by Josiah Mackenzie, ReviewPro Over the past year, a lot has changed in the hotel reputation management industry. And these changes require
More informationOrganizing an essay the basics 2. Cause and effect essay (shorter version) 3. Compare/contrast essay (shorter version) 4
Organizing an essay the basics 2 Cause and effect essay (shorter version) 3 Compare/contrast essay (shorter version) 4 Exemplification (one version) 5 Argumentation (shorter version) 6-7 Support Go from
More informationThe Big Data Deluge: Creating Serious Business Problems. Analytics: Harnessing Big Data Deluge to Acquire Business Power
The Big Data Deluge: Creating Serious Business Problems Analytics: Harnessing Big Data Deluge to Acquire Business Power Predictive Analytics: The Holy Grail of Big Data Analytics The Predictive Analytics
More informationBIG DATA S BIG FAILURE: The struggles businesses face in accessing the information they need
BIG DATA S BIG FAILURE: The struggles businesses face in accessing the information they need Great ideas, inspired people and outstanding companies can rise or fall for all kinds of reasons. Navigating
More informationBUYER S GUIDE. The Unified Communications Buyer s Guide to Picking the Right Cloud Telephony Solution
BUYER S GUIDE The Unified Communications Buyer s Guide to Picking the Right Cloud Telephony Solution There s no better time for a business to embrace unified communications solutions. Actually, that s
More informationClass 10. Data Mining and Artificial Intelligence. Data Mining. We are in the 21 st century So where are the robots?
Class 1 Data Mining Data Mining and Artificial Intelligence We are in the 21 st century So where are the robots? Data mining is the one really successful application of artificial intelligence technology.
More informationMeasuring the Impact of Volunteering
Measuring the Impact of Volunteering Why is measuring the impact of volunteering important? It is increasingly important for organisations or groups to describe the difference that volunteering makes to,
More information