IN THE CITY OF NEW YORK Decision Risk and Operations. Advanced Business Analytics Fall 2015



Similar documents
COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK DEPARTMENT OF INDUSTRIAL ENGINEERING AND OPERATIONS RESEARCH

Business Analytics Syllabus

Syllabus for MATH 191 MATH 191 Topics in Data Science: Algorithms and Mathematical Foundations Department of Mathematics, UCLA Fall Quarter 2015

Faculty of Science School of Mathematics and Statistics

Government of Russian Federation. Faculty of Computer Science School of Data Analysis and Artificial Intelligence

CSCI-599 DATA MINING AND STATISTICAL INFERENCE

Office: LSK 5045 Begin subject: [ISOM3360]...

Introduction to data mining

Introduction to predictive modeling and data mining

Statistics W4240: Data Mining Columbia University Spring, 2014

Introduction to Data Science: CptS Syllabus First Offering: Fall 2015

Brown University Department of Economics Spring 2015 ECON 1620-S01 Introduction to Econometrics Course Syllabus

Financial Optimization ISE 347/447. Preliminaries. Dr. Ted Ralphs

MGMT 280 Impact Investing Ed Quevedo

INFS5873 Business Analytics. Course Outline Semester 2, 2014

Course Description This course will change the way you think about data and its role in business.

DATA MINING FOR BUSINESS ANALYTICS

UNIVERSITY OF SOUTHERN CALIFORNIA Marshall School of Business BUAD 425 Data Analysis for Decision Making (Fall 2013) Syllabus

Syllabus. Finance 367: Investment Management

Lecture/Recitation Topic SMA 5303 L1 Sampling and statistical distributions

DSCI 3710 Syllabus: Spring 2015

Lecture: Mon 13:30 14:50 Fri 9:00-10:20 ( LTH, Lift 27-28) Lab: Fri 12:00-12:50 (Rm. 4116)

Data Mining and Business Intelligence CIT-6-DMB. Faculty of Business 2011/2012. Level 6

Section Format Day Begin End Building Rm# Instructor. 001 Lecture Tue 6:45 PM 8:40 PM Silver 401 Ballerini

AMIS 7640 Data Mining for Business Intelligence

CS Data Science and Visualization Spring 2016

QMB Business Analytics CRN Fall 2015 T & R 9.30 to AM -- Lutgert Hall 2209

Regression and Multivariate Data Analysis

BUSA 501: Introduction to Business Analytics

BUAD 310 Applied Business Statistics. Syllabus Fall 2013

Data Mining Carnegie Mellon University Mini 2, Fall Syllabus

OPMG 5811 LOGISTICS MANAGEMENT. Course Outline* Semester 2, 2012

Statistics 3202 Introduction to Statistical Inference for Data Analytics 4-semester-hour course

Course Syllabus. Purposes of Course:

QMB Business Analytics CRN Fall 2015 W 6:30-9:15 PM -- Lutgert Hall 2209

BA 561: Database Design and Applications Acct 565: Advanced Accounting Information Systems Syllabus Spring 2015

BA530 Financial Management Spring 2015

INSTRUCTOR INFORMATION Instructor: Adrienne Petersen Office: DMS 233 Office Hours: TuTh 11am-1pm by appointment

Learning outcomes. Knowledge and understanding. Competence and skills

Predictive Analytics Certificate Program

MD - Data Mining

AMIS 7640 Data Mining for Business Intelligence

Online Basic Statistics

IST565 M001 Yu Spring 2015 Syllabus Data Mining

INFO 2130 Introduction to Business Computing Spring 2013 Self-Paced Section 006

UNIVERSITY OF BRIDGEPORT School of Engineering COURSE SYLLABUS. TCMG-555-6T1 Project Management

Introduction to Machine Learning Lecture 1. Mehryar Mohri Courant Institute and Google Research

Australian School of Business School of Information Systems, Management and Technology

Revenue Management and Pricing

Statistics in Applications III. Distribution Theory and Inference

A Learning Path to Understanding Biology

Information and Decision Sciences (IDS)

Econometrics and Data Analysis I

Introduction to Physics I (PHYS ) Fall Semester 2012

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Requirements Fulfilled This course is required for all students majoring in Information Technology in the College of Information Technology.

IBM SPSS Modeler Professional

Intro. to Data Visualization Spring 2016

MATH 140 HYBRID INTRODUCTORY STATISTICS COURSE SYLLABUS

Audit Analytics. --An innovative course at Rutgers. Qi Liu. Roman Chinchila

KATE GLEASON COLLEGE OF ENGINEERING. John D. Hromi Center for Quality and Applied Statistics

BUSINESS INTELLIGENCE WITH DATA MINING FALL 2012 PROFESSOR MAYTAL SAAR-TSECHANSKY

Physics 21-Bio: University Physics I with Biological Applications Syllabus for Spring 2012

Big Data Analytics For Competitive Advantage Cross listed as MBAD7090, ITCS 6100, and HCIP 6103

SE 333/433 Software Testing and Quality Assurance

BUS 1200 Personal Financial Planning 1

FIN 357 BUSINESS FINANCE

College of Health and Human Services. Fall Syllabus

IDEAL PROGRAM PRST 224 Critical Thinking & Writing SYLLABUS ONLINE

ANALYTICS CENTER LEARNING PROGRAM

COLLEGE OF NATURAL AND HEALTH SCIENCE MATH 1213 College Algebra Classical Conversations Summer 2016 Course Summary

Regularized Logistic Regression for Mind Reading with Parallel Validation

ADV 3001: Advertising Strategy Summer 2014 M F, 2 nd (9:30 10:45)

Finance 471: DERIVATIVE SECURITIES Fall 2015 Prof. Liang Ma University of South Carolina, Moore School of Business

Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate

POL 204b: Research and Methodology

CS 2750 Machine Learning. Lecture 1. Machine Learning. CS 2750 Machine Learning.

King Saud University

THE UNIVERSITY OF HONG KONG FACULTY OF BUSINESS AND ECONOMICS

SNL BA Syllabus FA Successful Sales and Marketing Management for Business Today

02-201: Programming for Scientists

Strategic Use of Information Technology (CIS ) Summer /

ISM and 05D, Online Class Business Processes and Information Technology SYLLABUS Fall 2015

School of Business and Nonprofit Management Course Syllabus

Busn 135 Syllabus. Business Math using Excel. (Syllabus subject to change)

Executive Master of Public Administration. QUANTITATIVE TECHNIQUES I For Policy Making and Administration U6310, Sec. 03

Model selection in R featuring the lasso. Chris Franck LISA Short Course March 26, 2013

MTH 110: Elementary Statistics (Online Course) Course Syllabus Fall 2012 Chatham University

CRN: STAT / CRN / INFO 4300 CRN

Online Course Syllabus Page 1

QMB 3302 Business Analytics CRN Spring 2015 T R -- 11:00am - 12:15pm -- Lutgert Hall 2209

Transcription:

Advanced Business Analytics Fall 2015 Course Description Business Analytics is about information turning data into action. Its value derives fundamentally from information gaps in the economic choices of consumers and firms. Analytics unlocks this hidden value. Consider consumers. In economic theory, they make choices to maximize their total utility. That s all fine, but to truly optimize one s choices - to discover what s best among the vast array of alternatives available is hard. Doing it well requires a lot of information and the ability to sort through that information to decide effectively. The result is that consumer decision-making is highly imperfect in practice and a great deal of potential value goes unrealized as a result. Business analytics helps capture this latent value by improving our choices as consumers: Google s search engine provides fast, highly relevant web content, enhancing the value of your time online. Google Maps and Waze help you uncover a faster route home. Pandora s recommendation system makes it simple to find and enjoy the music you love; Netflix does the same for movies and television. Amazon makes it easy to discover great products to buy and streamlines the purchase transaction. Tripadvisor helps you create great vacations, while Okcupid finds you the perfect mate (well, at least a suitable one!). Each of these technologies enables us to capture value that would go unrealized without the help of data and analytics. Firms benefit from analytics too. While economic theory suggests firms organize themselves to maximize the value they deliver, real companies frequently waste resources, fail to understand customer needs and hence engage in unproductive activities. Data and analytics help them overcome these inefficiencies. Capital One uses analytics to match credit card offers to customers more accurately than their competition. WalMart uses analytics to manage its inventory in a way that allows it to serve its customers reliably at exceptionally low cost. Cleveland Clinic uses analytics to provide more targeted treatment of patients and to fine-tune therapies that produce better health outcomes at lower costs. Axioma uses analytics to construct portfolios that provide better risk-reward trade-offs for their clients. In each case, data and analytics helps these firm uncover new opportunities to focus their efforts on value-adding activities and hence increase the gap between the value they deliver and their cost to serve. This course is intended for GSB students that have taken the introductory Business Analytics course offered by the division of. The basic course presented predictive and prescriptive analytics tools in the context of business cases, with an emphasis on the challenges that can arise in implementing analytical approaches within an organization. This course goes beyond the basic analytic course in several ways. First, the course will cover the tools and theory at a deeper level. Second, the course will be based on R. R is a programming language and software environment for statistical computing and graphics. The R language is widely used among statisticians and data miners. In particular, we will use R-studio, a free and open source integrated development environment for R. Students should install R and R-studio prior to the beginning of the course.

The course emphasizes that business analytics is a practical discipline which requires mastery of both methodology and business applications. The concepts learned in this class should help you identify opportunities in which business analytics can be used to improve performance and support important decisions. It will teach you important tools that can be used to transform data into high-impact business decisions. Lastly, it should make you alert to the ways that analytics can be used and misused within an organization. Course topics include a review of basis statistical ideas, numerical and graphical methods for summarizing data, linear regression, logistic regression, subset selection, cross-validation, classification, decision trees, factor models, clustering, support vector machines, and other emerging data analytics methods. The course presents real-world examples where a significant competitive advantage has been obtained through large-scale data analysis. Examples include advertising, ecommerce, finance, health care, marketing, and revenue management. The ultimate goal is, of course, help to make better business decisions using advanced analytics. Instructor Prof. Guillermo Gallego Office Hours Tuesdays: 3-4pm Office: CEPSR 822 (8 th floor) Office Phone: 212 854 2935 Email: gmg2@columbia.edu Textbooks and Learning Materials There is no required textbook: all class materials will be available on the Canvas website. However, some books are very useful if you want to learn more and deeper about data analytics. The best way to learn is by doing (especially with programming). All of the books below have free on-line versions. Optional Textbook 1 (highly recommend, easy following with many examples and data sets): Data Mining and Business Analytics with R, by Johannes Ledolter; Publisher: Wiley (2013), ISBN- 13: 978-1118447147. A pdf file of the book can be downloaded by following the link: http://www.nataraz.in/data/ebook/hadoop/data_mining_and_business_analytics_with_r Johannes_Led olter.pdf Optional Textbook 2 (solid primer, with theory and explanation): An Introduction to Statistical Learning with Application in R, by Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani; Publisher: Springer (2013); ISBN-13: 978-1461471370. The book can be downloaded from http://wwwbcf.usc.edu/~gareth/isl/islr%20first%20printing.pdf Optional Textbook 3 (a great advanced text): Elements of Statistical Learning: Data Mining, Inference, and Prediction, by Trevor Hastie, Robert Tibshirani and Jerome Friedman (2009); ISBN 978-0-387-84858-7. This book requires some mathematical sophistication and goes beyond the material we will be covering. The book is free at http://statweb.stanford.edu/~tibs/elemstatlearn/index.html

Software: We require the R Statistical Software, which is powerful and free. R can be downloaded at the link below: http://www.cran.r-project.org/ Rstudio is a free platform for both writing and running R, available at www.rstudio.org. Some students find it friendlier than basic R. We do not assume that you have used R in a previous class. I will provide limited software instruction, in-class demonstration, and code to accompany lectures and assignments. However, this is not a class on R. Like any language, R is only learned by doing. You should install R as soon as possible and familiarize yourself with basic operations. Students can become proficient in a few weeks. Some manuals are very helpful to learn R, e.g., http://cran.r-project.org/manuals.html Additional resources: (a) Tutorials at data.princeton.edu/r are fantastic (and there are many others out there). (b) Youtube intros to R, e.g. the series from Google Developers. Canvas Site A Canvas site is set up for this course. Each student is expected to check the site throughout the semester as Canvas will be the primary venue for outside classroom communications between the instructors and the students. Attendance Policy Attendance and class participation are part of each student s course grade. Students are expected to attend all scheduled class sessions. Failure to attend class will result in an inability to achieve the objectives of the course. Regular attendance and active participation are required for students to successfully complete the course. You are expected to come to class prepared, and ready to discuss the pre-class reading, case or assignment questions. More details on the assignments will be provided on Canvas. Class participation is an important part of learning. If you have a question, it s likely that others do as well. I encourage active participation, and course grades will take into account students who make particularly strong contributions. Assignments There will be three group assignments (1-4 students). All homework assignment should be submitted through the Canvas links. Assignment 1. Due session A5 Assignment 2. Due session A8 Assignment 3. Due session A12 Group Project: 3-5 students form a group and work on the projects as a team. Students can identify a company or a scenario, collect data, use techniques taught in class to study the data patterns or to

predict future outcomes. Students are required to write a 4-6 page project report, and present in class using Power Point slides. Project proposal due session A6 Project presentations due session A-makeup Final Exam: the final exam is a take home individual written exam. It will be given on session A12. The final exam is due by midnight December 20. Late submission including assignments, projects and exams will not be accepted. Study Group (not required, but highly recommend) Many students learn better and faster when working in a group, so I encourage collaborative learning. The study groups can be different from your project groups. Evaluation and Grading Assignment Weight Attendance and participation in class 10% discussion Homework 30% Project 20% Final Exam 40% Total 100% Syllabus at a Glance Session A1 (September 11): Motivation, course description, Introduction to R Session A2 (September 12): Statistical learning, bias-variance tradeoff; logistic regression Session A3 (September 25): Linear and quadratic discrimination analysis; knn Session A4 (September 26): Multiple linear regression Session A5 (October 9): Cross-validation and the bootstrap Session A6 (October 23): Subset selection, ridge and lasso regressions Session A7 (October 24): Principal component analysis and partial least squares. Session A8 (November 6): Class canceled. Will make up class on December 18. Session A9 (November 21): Difference-in-difference. Propensity score matching Session A10 (December 4): Tree-based methods Session A11 (December 5): Support vector machines Session A12 (December 12): Simulation and Optimization in R Session A-makeup (December 18): Project presentations