INFO-GB.3336 Data Mining for Business Analytics
Section 32 (Tentative version)
Spring 2014

Faculty: Yilu Zhou, Ph.D.
    Associate Professor, School of Business, Fordham University
    Adjunct Professor, Stern School of Business, New York University
    Email: yzhou@stern.nyu.edu
Class Time: 6:00PM-9:00PM Tuesdays
Class Location: TBD
Office Hour: 3:00PM-5:00PM Tuesdays or by appointment
Teaching Assistant: TBD

Course Description

This course will change the way you think about data and its role in business. Businesses, governments, and individuals create massive collections of data as a by-product of their activity. Increasingly, decision-makers rely on intelligent technology to analyze data systematically to improve decision-making. In many cases, automating analytical and decision-making processes is necessary because of the volume of data and the speed with which new data are generated.

We will examine how data mining techniques can be used to improve decision-making. We will study the fundamental principles and techniques of data mining, and we will examine real-world examples and cases to place data-mining techniques in context, to develop data-analytic thinking, and to illustrate that proper application is as much an art as it is a science. In addition, we will work hands-on with data mining software. The course is a combination of lectures, case studies, hands-on exercises, and a real-world project.

Prerequisites

None

Course Objectives

After taking this course, you should:

1. Approach business problems data-analytically. Think carefully and systematically about whether and how data can improve business performance, to make better-informed decisions for management, marketing, investment, etc.

2. Be able to interact competently on the topic of data mining for business intelligence. Know the basics of data mining processes, algorithms, and systems well enough to interact with CTOs, expert data miners, consultants, etc. Envision opportunities.

3. Have had hands-on experience mining data. Be prepared to follow up ideas or opportunities that present themselves, e.g., by performing pilot studies.

Focus and Interaction

The course will explain through lectures and real-world examples the fundamental principles, uses, and some technical details of data mining techniques. The emphasis is primarily on understanding the business application of data mining techniques, and secondarily on the variety of techniques. We will discuss the mechanics of how the methods work as necessary to understand the fundamental concepts and business application.

I will expect you to be prepared for class discussions by having satisfied yourself that you understand what we have done in the prior classes. The assigned readings will cover the fundamental material. The class meetings will be a combination of lectures/discussions on the fundamental material, discussions of business applications of the ideas and techniques, case discussions, and student exercises.

You are expected to attend every class session, to arrive prior to the starting time, to remain for the entire class, and to follow basic classroom etiquette, including having all electronic devices turned off and put away for the duration of the class (this is Stern policy, see below) and refraining from chatting or doing other work or reading during class. In general we will follow Stern default policies unless I state otherwise.
I will assume that you have read them and agree to abide by them: http://www.stern.nyu.edu/academicaffairs/policies/generalpolicies/defaultpoliciesforsterncourses/index.htm

Classroom Equipment

Students will be required to bring laptops with wireless Internet access to some classes in order to complete the class demos and exercises.

Office Hours and Email

Instructor's office hours: Tuesdays 3-5PM or by appointment

If you have questions about class material that you do not want to ask in class, or that would take us well off topic, please detain me after class, come to office hours to see me or the TA, or ask on the discussion board. You may also send e-mails to ask questions or set up appointments outside of office hours. Please type DM3336 in the subject line of every email that you send. I will check my email at least once a day during the week (M-F). Your email will get my priority when you put DM3336 in the subject. If you receive no reply within 48 hours, your email may have been overlooked, so please feel free to send another.

Course Homepage

The NYU Classes site is the main site for this course. Please check the page frequently for updates. You will find the following materials: syllabus, lecture schedule, lecture notes, group project, frequently asked questions, and course resources. Please print out the necessary materials for yourself. I will assume that you have read all announcements and class discussion.

Course Materials and Text Book

Lecture slides and handouts distributed by the instructor. All lecture slides will be posted on the NYU Classes website. You will be expected to flesh these out with your own note taking, and to ask questions about any material in the notes that is unclear after our class discussion. Depending on the direction our class discussion takes, we may not cover all material in the notes.

Required book:
Data Science for Business: Fundamental Principles of Data Mining and Data Analytic Thinking, 1st Edition, 2013
By Foster Provost, Tom Fawcett
ISBN: 1449361323

Supplemental book (optional):
Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management, 3rd Edition, 2011
By Gordon S. Linoff, Michael J. A. Berry
ISBN: 0470650931

Weka Book (optional):
Data Mining: Practical Machine Learning Tools and Techniques, 3rd Edition, 2011
By Ian Witten, Eibe Frank, Mark Hall
ISBN-10: 0123748569
o This book provides much more technical detail on the data mining techniques and is a very nice supplement for the student who wants to dig more deeply into the technical details. It also provides a comprehensive introduction to the Weka toolkit.

Grading

Student grades will approximately consist of the following elements as related to learning objectives:

Homework                               20%
Term Project                           30%
Participation and Class Contribution   20%
Final Quiz                             40%

At NYU Stern we seek to teach challenging courses that allow students to demonstrate differential mastery of the subject matter. Assigning grades that reward excellence and reflect differences in performance is important to ensuring the integrity of our curriculum. Students generally become engaged with this course and do excellent or very good work, receiving As and Bs, and only one or two perform only adequately or below and receive Cs or lower. Note that the actual distribution for this course and your own grade will depend upon how well each of you actually performs this particular semester.

Homework Assignments

Homework assignments are listed (by due date) in the class schedule below. Each homework comprises questions to be answered and/or hands-on tasks. Except as explicitly noted otherwise, you are expected to complete your assignments on your own without interacting with others. Completed assignments must be handed in on NYU Classes at least one hour prior to the start of class on the due date (that is, by 5PM), unless otherwise indicated. Assignments will be graded and returned promptly.

Answers to homework questions should be well thought out and communicated precisely, avoiding sloppy language, poor diagrams, and irrelevant discussion.

The hands-on tasks will be based on data that we will provide. You will mine the data to get hands-on experience in formulating problems and using the various techniques discussed in class. You will use these data to build and evaluate predictive models.
For the hands-on assignments you will use the (award-winning) toolkit Weka, part of the Pentaho open source business intelligence suite:
http://www.cs.waikato.ac.nz/ml/weka
www.pentaho.com

Important: In order to use Weka you must have access to a computer on which you can install software. If you do not have such a computer, please see me immediately so we can make alternative arrangements.

You should bring your computer to the second class. During that class we will have a lab session in which we will install and configure the software, get it running, and deal with the inevitable glitches that a few of you might experience. If you need additional help with using the data mining software, please see the Teaching Assistant.

Generally the Teaching Assistant should be the first point of contact for questions about and issues with the homework. If they cannot help you to your satisfaction, please do not hesitate to come see me.

Late Assignments

As stated above, assignments are to be submitted on NYU Classes at least one hour prior to the start of the class on the due date. Assignments up to 24 hours late will have their grade reduced by 25%; assignments up to one week late will have their grade reduced by 50%. After one week, late assignments will receive no credit. Please turn in your assignment early if there is any uncertainty about your ability to turn it in on time.

Term Project

A term project report will be prepared by student teams. We will give you the instructions on how to form your teams. Teams are encouraged to interact with the instructor and TA electronically or face-to-face in developing their project reports. You will submit a proposal for your project about halfway through the course. Each team will present its project at the end of the semester. We will discuss the project requirements and presentations in class.

Final Quiz

The final quiz will be a take-home to be completed during the week following the last class. The subject matter covered and the exact dates will be discussed in class.

Regrading

If you feel that a calculation, factual, or judgment error has been made in the grading of an assignment or exam, please write a formal memo to me describing the error, within one week after the class date on which that assignment was returned. Include documentation (e.g., pages in the book, a copy of class notes, etc.). I will make a decision and get back to you as soon as I can. Please remember that grading any assignment requires the grader to make many judgments as to how well you have answered the question. Inevitably, some of these go in your favor and possibly some go against. In fairness to all students, the entire assignment or exam will be regraded.
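As an illustration of the Late Assignments policy above, here is a minimal Python sketch of how the penalty tiers could be applied to a raw score. The function names are hypothetical, and two details are assumptions rather than stated policy: lateness is measured from the submission deadline (one hour before class), and a submission exactly 24 hours or exactly one week late falls in the more lenient tier.

```python
from datetime import timedelta


def late_penalty_multiplier(lateness: timedelta) -> float:
    """Fraction of the earned grade that is kept for a given lateness.

    Tiers follow the Late Assignments policy: up to 24 hours late loses 25%,
    up to one week late loses 50%, later than that earns no credit.
    Boundary handling (<=) is an assumption, not stated policy.
    """
    if lateness <= timedelta(0):
        return 1.0   # on time or early: no reduction
    if lateness <= timedelta(hours=24):
        return 0.75  # grade reduced by 25%
    if lateness <= timedelta(weeks=1):
        return 0.5   # grade reduced by 50%
    return 0.0       # more than one week late: no credit


def adjusted_grade(raw_score: float, lateness: timedelta) -> float:
    """Apply the late penalty to a raw assignment score."""
    return raw_score * late_penalty_multiplier(lateness)


if __name__ == "__main__":
    # A raw score of 80 submitted two hours after the deadline.
    print(adjusted_grade(80, timedelta(hours=2)))  # 60.0
```

The sketch only restates the published tiers; any exception granted by the instructor or TA would override it.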
Disability Policy

If you have a qualified disability and will require academic accommodation during this course, please contact the Moses Center for Students with Disabilities (CSD, 998-4980) and provide me with a letter from them verifying your registration and outlining the accommodations they recommend. If you will need to take an exam at the CSD, you must submit a completed Exam Accommodation Form to them at least one week prior to the scheduled exam time to be guaranteed accommodation.

Honor Code

We assume that you have complete integrity in all your class efforts. Violations of the University's Honor Code will be taken extremely seriously, and they will be addressed promptly according to the established procedures. Students are to adhere to the Code of Student Conduct, and other policies and regulations as adopted and promulgated by appropriate University authorities. Copies of these documents may be obtained from the Office of the Dean of Students or from the offices of the academic deans. No infractions will be tolerated. Students violating the Code of Student Conduct will be dismissed from class and will receive an F for the course.

Stern Honor Code: http://www.stern.nyu.edu/mba/studact/mjc/hc.html

Class Schedule (Tentative)

Topics are subject to change as the class progresses.

Class 1 (2/11): Introduction to Data Mining
    Topics: Overview of techniques, DM process, application domains; Syllabus
    Readings: Ch. 1 & 2
    Deliverables: Survey Sheet

Class 2 (2/18): Predictive Modeling I
    Topics: Basic terminology, classification, regression, decision tree; Weka Hands-on
    Readings: Ch. 3 & 4
    Deliverables: HW1 Due

Class 3 (2/25): Predictive Modeling II
    Topics: Attribute selection, machine learning, Support Vector Machines, Neural Network, Naive Bayes; Case: customer segmentation
    Readings: Ch. 3 & 4

Class 4 (3/4): Fitting and Overfitting Data
    Topics: Hold-out, cross-validation, learning curves; Case: disease prediction
    Readings: Ch. 5
    Deliverables: HW2 Due

Class 5 (3/11): Similarity, Distance, Nearest Neighbors
    Readings: Ch. 6

Spring Break (No class on 3/18)

Class 6 (3/25): Clustering
    Topics: Unsupervised Learning
    Readings: Ch. 6
    Deliverables: HW3 Due

Class 7 (4/1): What is a Good Model?
    Topics: Various mini cases; Classification, clustering, ranking, etc.; Project mid-term check
    Readings: Ch. 7
    Deliverables: Project Update Due

Class 8 (4/8): Recommender Systems
    Topics: Customer similarity, product similarity, networks, association rules; Case: Netflix
    Readings: Ch. 8

Class 9 (4/15): Online Advertisement
    Topics: User clickstream data, data validation, fraud detection, multi-channel integration
    Deliverables: HW4 Due

Class 10 (4/22): Representing and Mining Text
    Topics: Bag-of-words, TF-IDF, similarity; Case: News article and stock price; Movie box office and sentiment analysis
    Readings: Ch. 10

Class 11 (4/29): Data Visualization
    Topics: Basic visualization techniques, charts, ROC and AUC; Popular visualization toolkits

Class 12 (5/6): Presentation
    Topics: Final Quiz on NYU Classes
    Readings: Ch. 8
    Deliverables: HW5 Due; Project Report Due