Ethical Issues in Data Mining



Similar documents
Statistics 215b 11/20/03 D.R. Brillinger. A field in search of a definition a vague concept

The Bibliomining Process: Data Warehousing and Data Mining for Library Decision-Making

Data Mining and Exploration. Data Mining and Exploration: Introduction. Relationships between courses. Overview. Course Introduction

Data Mining for Manufacturing: Preventive Maintenance, Failure Prediction, Quality Control

What is Customer Relationship Management? Customer Relationship Management Analytics. Customer Life Cycle. Objectives of CRM. Three Types of CRM

A STUDY OF DATA MINING ACTIVITIES FOR MARKET RESEARCH

The KDD Process for Extracting Useful Knowledge from Volumes of Data

Healthcare Measurement Analysis Using Data mining Techniques

Data Mining and KDD: A Shifting Mosaic. Joseph M. Firestone, Ph.D. White Paper No. Two. March 12, 1997

Data Mining Analytics for Business Intelligence and Decision Support

Dynamic Data in terms of Data Mining Streams

Introduction to Data Mining

Jagir Singh, Greeshma, P Singh University of Northern Virginia. Abstract

A Spatial Decision Support System for Property Valuation

Three Perspectives of Data Mining

Customer Analysis - Customer analysis is done by analyzing the customer's buying preferences, buying time, budget cycles, etc.

Data Mining Solutions for the Business Environment

SPATIAL DATA CLASSIFICATION AND DATA MINING

International Journal of Computer Science Trends and Technology (IJCST) Volume 2 Issue 3, May-Jun 2014

Data Mining to Predict Mobility Outcomes for Older Adults Receiving Home Health Care

FREQUENT PATTERN MINING FOR EFFICIENT LIBRARY MANAGEMENT

A Case Study in Knowledge Acquisition for Insurance Risk Assessment using a KDD Methodology

TOWARDS SIMPLE, EASY TO UNDERSTAND, AN INTERACTIVE DECISION TREE ALGORITHM

Data Mining. Knowledge Discovery, Data Warehousing and Machine Learning Final remarks. Lecturer: JERZY STEFANOWSKI

How To Use Data Mining For Knowledge Management In Technology Enhanced Learning

Adding New Level in KDD to Make the Web Usage Mining More Efficient. Abstract. 1. Introduction [1]. 1/10

Single Level Drill Down Interactive Visualization Technique for Descriptive Data Mining Results

DATA QUALITY IN BUSINESS INTELLIGENCE APPLICATIONS

Chapter ML:XI. XI. Cluster Analysis

Knowledge Discovery and Data Mining: Towards a Unifying Framework

Data Mining System, Functionalities and Applications: A Radical Review

Database Marketing, Business Intelligence and Knowledge Discovery

Vishnu Swaroop Computer Science and Engineering College Madan Mohan Malaviya Engineering College Gorakhpur, India

TOTAL DATA WAREHOUSING:

NEW TECHNIQUE TO DEAL WITH DYNAMIC DATA MINING IN THE DATABASE

Interactive Exploration of Decision Tree Results

Data Mining Framework for Direct Marketing: A Case Study of Bank Marketing

Knowledge Discovery Process and Data Mining - Final remarks

DATA MINING - A DOMAIN SPECIFIC ANALYTICAL TOOL FOR DECISION MAKING

DATA MINING AND WAREHOUSING CONCEPTS

Text Mining: The state of the art and the challenges

Data Mining: Concepts and Techniques

The Management and Mining of Multiple Predictive Models Using the Predictive Modeling Markup Language (PMML)

Explanation-Oriented Association Mining Using a Combination of Unsupervised and Supervised Learning Algorithms

Machine Learning, Data Mining, and Knowledge Discovery: An Introduction

relevant to the management dilemma or management question.

Mining an Online Auctions Data Warehouse

Example application (1) Telecommunication. Lecture 1: Data Mining Overview and Process. Example application (2) Health

Knowledge-Driven Decision Support System Based on Knowledge Warehouse and Data Mining for Market Management

Cost Drivers of a Parametric Cost Estimation Model for Data Mining Projects (DMCOMO)

Introduction to Data Mining and Business Intelligence Lecture 1/DMBI/IKI83403T/MTI/UI

Lluis Belanche + Alfredo Vellido. Intelligent Data Analysis and Data Mining

INTERNATIONAL JOURNAL FOR ENGINEERING APPLICATIONS AND TECHNOLOGY DATA MINING IN HEALTHCARE SECTOR.

Standardization of Components, Products and Processes with Data Mining

A Review on Data Mining: Its Challenges, Issues and Applications

How To Create A Healthcare Data Management For Providers Solution From An Informatica Data Management Solution

Data Mining is sometimes referred to as KDD and DM and KDD tend to be used as synonyms

ETPL Extract, Transform, Predict and Load

Data Warehousing: A Technology Review and Update Vernon Hoffner, Ph.D., CCP EntreSoft Resouces, Inc.

20 A Visualization Framework For Discovering Prepaid Mobile Subscriber Usage Patterns

2.1. Data Mining for Biomedical and DNA data analysis

Rule based Classification of BSE Stock Data with Data Mining

A Business Intelligence Training Document Using the Walton College Enterprise Systems Platform and Teradata University Network Tools Abstract

Editors Prof. Amos DAVID & Prof. Charles UWADIA

DATA MINING AND KNOWLEDGE DISCOVERY FROM RESEARCH PROBLEMS

Data Mart/Warehouse: Progress and Vision

Why do statisticians "hate" us?

In-Database Analytics

Data Catalogs for Hadoop Achieving Shared Knowledge and Re-usable Data Prep. Neil Raden Hired Brains Research, LLC

Information Security in Big Data using Encryption and Decryption

ECLT 5810 E-Commerce Data Mining Techniques - Introduction. Prof. Wai Lam

A STUDY ON DATA MINING INVESTIGATING ITS METHODS, APPROACHES AND APPLICATIONS

A Framework for Data Warehouse Using Data Mining and Knowledge Discovery for a Network of Hospitals in Pakistan

Data Security and Privacy in Data Mining: Research Issues & Preparation

Overview Applications of Data Mining In Health Care: The Case Study of Arusha Region

Information Visualization WS 2013/14 11 Visual Analytics

An Overview of Temporal Data Mining

Decision Support System for predicting Football Game result

The Informatica Solution for Improper Payments

Transcription:

Ethical Issues in Data Mining Mandana Mir Moftakhari PhD Student at Hacettepe University, Department of Information Management. Email: mir_moftakhari@yahoo.com Güleda Doğan PhD Student & Research Assistant of Hacettepe University Department of Information Management. Email: gduzyol@hacettepe.edu.tr

We will discuss about: Big Data Knowledge Discovery Data mining Ethical issues in Data mining

Big Data! Data overload is a serious problem that has been grown by technical advances. Human beings have to cope with such overwhelming amounts of data and manage it in order to obtain relevant information and knowledge to solve their problems.

Big Data! Organizations have to overcome with massive data volume to achieve opportunities for: better decision-making gaining competitive advantages

knowledge Discovery in Databases as a Solution knowledge discovery in databases (KDD) is the nontrivial process of identifying valid novel potentially useful and ultimately understandable patterns in data (Fayyad, Piatetsky-Shapiro, Smyth and Uthurusamy,1996)

KDD Involves Different Steps Selection Preprocessing Transformation Data mining Interpretation or evaluation (Fayyad et al., 1996)

What Is Data Mining? Data mining as the center process of KDD is the analysis of (often large) observational data sets to find unsuspected relationships and to summarize the data in novel ways that are both understandable and useful to the data owner. (Hand, Mannila and Smyth, 2001)

Data mining Using Areas Customer service support Prediction Estimation Forecasting Decision support Manage budget

Data Mining Process Identifying the aim areas Determining sources of data Gathering and cleaning the data into a data warehouse Choosing proper analyzing tools Finding new patterns Prepare reports and implementing the results.

Ethical issues in Data mining Individuals not only expect qualified services, but also they require high level privacy and security of their personal details. These issues cannot be overlooked because of their consequences and effects on consumers, individuals and society.

Ethical issues Privacy Data accuracy Database security Consent

Privacy Threats Many consumers feel that their privacy is violated by information-gathering practices. Secondary use of the personal information Handling misinformation Granulated access to personal information

Secondary Use of the Personal Information Recent surveys on privacy show a great concern about the use of personal data for purposes other than the one for which data has been collected.

Handling Misinformation Misinformation can cause serious and long-term damage, so individuals should be able challenge the correctness of data about themselves.

Granulated Access to Personal Information The access to personal data should be on a need-to-know basis, and limited to relevant information only.

Type of data Some types of personal information are seen as being more sensitive than others. What complicates this issue is that sensitivity level varies according to the individual.

Database security Database security inhibits the unauthorized dissemination of personal data.

Data accuracy Collected data have originated from many diverse, possibly external, sources. Might be noisy, obsolete, inaccurate, or incomplete Not enough new Different from the present situation of individuals

Consent The purpose of data mining is to discover new insights and new uses for the information that companies already have. This makes it nearly impossible to allow the consumer to have the right of giving informed consent for each use of his data.

Conclusion and Recommendations Data mining is the process of searching in order to discover relationships between data sets and find useful information. Ethical issues should be observed in all steps of the process.

Conclusion and Recommendations Consider the expectations of the customers Develop a customer-oriented privacy policy Research and understand all laws that may have jurisdiction over sensitive data Control access to data warehouses Give customers more control over their data Evaluate the quality of source data

References American Library Association. (1995). Code of ethics of the American Library Association. Bhambri, V. &Gagandeep, (2012).Coexistence of data mining and privacy of data.international Journal of Research in IT & Management,2(2). Brankovic, L., &Estivill-Castro, V. (1999, July). Privacy issues in knowledge discovery and data mining. In Australian institute of computer ethics conference. 89-99. Cary, C., Wen, H.J. &Mahatanankoon, P. (2003).Data mining: consumer privacy, ethical policy, and systems development practices. Human Systems Management, 22(4), 157 168. Cavoukian, A. (1998). Data mining: staking a claim on your privacy. Fayyad, U. M., Piatetsky-Shapiro, G., Smyth, P. &Uthurusamy, R.. (1996). Advances in knowledge discovery and data mining. Cambridge, Menlo Park, Calif.: AAAI Press.Retrieved on May,10, 2014, from http://www.amazon.ca/gp/product/0262560976 Fayyad, U. M., Piatetsky-Shapiro, G., & Smyth, P. (1996). From data mining to knowledge discovery in databases.ai Magazine, 17, 37 54. Nicholson, S. (2003). The bibliomining process: Data warehousing and data mining for library decision-making. Information Technology and Libraries, 22(4), 146-151. Nicholson, S. & Stanton, J. (2003). Gaining strategic advantage through bibliomining: Data mining for management decisions in corporate, special, digital, and traditional libraries. In Nemati, H. &Barko, C. (Eds.).Organizational data mining: Leveraging enterprise data resources for optimal performance. Hershey, PA: Idea Group Publishing. 247-262. Payne, D. &. Trumbach, C. C. (2009). Data mining: proprietary rights, people and proposals, Business Ethics Quarterly, vol. 18(3).