Computing, technology & Data Analysis in the Graduate Curriculum. Duncan Temple Lang UC Davis Dept. of Statistics

Similar documents
Programming Languages & Tools

UNDERGRADUATE COMPUTER SCIENCE EDUCATION: A NEW CURRICULUM PHILOSOPHY & OVERVIEW

Proposal for Undergraduate Certificate in Large Data Analysis

RFI Summary: Executive Summary

A LOOK BACK: UNDERGRADUATE COMPUTER SCIENCE EDUCATION: A NEW CURRICULUM PHILOSOPHY & OVERVIEW

ANALYTICS CENTER LEARNING PROGRAM

Numerical Analysis. Professor Donna Calhoun. Fall 2013 Math 465/565. Office : MG241A Office Hours : Wednesday 10:00-12:00 and 1:00-3:00

HPC Wales Skills Academy Course Catalogue 2015

Eastern Washington University Department of Computer Science. Questionnaire for Prospective Masters in Computer Science Students

Assessment and Instruction: Two Sides of the Same Coin.

DEGREE PLAN INSTRUCTIONS FOR COMPUTER ENGINEERING

Computational Science and Informatics (Data Science) Programs at GMU

Introducing PgOpenCL A New PostgreSQL Procedural Language Unlocking the Power of the GPU! By Tim Child

SAS JOINT DATA MINING CERTIFICATION AT BRYANT UNIVERSITY

Introduction to MATLAB Gergely Somlay Application Engineer

Eastern Washington University Department of Computer Science. Questionnaire for Prospective Masters in Computer Science Students

Department of Computer Science

CS Matters in Maryland CS Principles Course

POL 204b: Research and Methodology

CURRICULUM VITAE Herbert L. Dershem

Program: Organizational Leadership. Department: Psychology. College: Arts & Sciences. Year:

Integrating a Big Data Platform into Government:

DIABLO VALLEY COLLEGE CATALOG

The following are the measurable objectives for graduated computer science students (ABET Standards):

Integration of Mathematical Concepts in the Computer Science, Information Technology and Management Information Science Curriculum

EC2000 CRITERION 2: A PROCEDURE FOR CREATING, ASSESSING, AND DOCUMENTING PROGRAM EDUCATIONAL OBJECTIVES

FORENSIC SCIENCE EDUCATION PROGRAMS ACCREDITATION COMMISSION. FEPAC Computing and Information Science Technology. Call for Comments September 2015

Computer Science Course Descriptions Page 1

Assessment for Master s Degree Program Fall Spring 2011 Computer Science Dept. Texas A&M University - Commerce

Gerald Roth. Department of Electrical Engineering and Computer Science School of Engineering Vanderbilt University Nashville, TN

OUTLINE FOR AN INTERDISCIPLINARY CERTIFICATE PROGRAM

INNOVATION IN UNDERGRADUATE COMPUTER SCIENCE EDUCATION

How To Get A Computer Science Degree At Coastal Carolina University

Self-Reflection Teaching. Susan M. Blunck, Ph.D. Assistant Clinical Professor Department of Education UMBC

02-201: Programming for Scientists

MEng, BSc Computer Science with Artificial Intelligence

FACULTY STUDY PROGRAMME FOR POSTGRADUATE STUDIES

Guide to the MSCS Program Sheet

SUMMER SCHOOL ON ADVANCES IN GIS

Data Science Certificate Program

Note: The modules offered and their timing are conditional upon the availability of faculty and may be subject to change.

Classroom Demonstrations of Big Data

AC : A COURSE SEQUENCE FOR INTEGRATING PROBLEM SOLVING AND CRITICAL THINKING IN A HYBRID OUTCOME-BASED IS/IT CURRICULUM

DOVER-SHERBORN HIGH SCHOOL PROGRAM OF STUDIES

Information and Decision Sciences (IDS)

Core Curriculum to the Course:

Biomedical Engineering (MS)

Page Overview... 2 Admission Requirements... 2 Additional Requirements... 3 Sample Timeline... 4 Sample Research Proposal... 5

MEng, BSc Applied Computer Science

Research and Digital Game- based Play: A Review of Martha Madison

DIGITAL FORENSICS SPECIALIZATION IN BACHELOR OF SCIENCE IN COMPUTING SCIENCE PROGRAM

Merit Review Broader Impacts Criterion: Representative Activities

Computer Science. 232 Computer Science. Degrees and Certificates Awarded. A.S. Degree Requirements. Program Student Outcomes. Department Offices

COMPUTER SCIENCE, BACHELOR OF SCIENCE (B.S.)

Curriculum Map. Discipline: Computer Science Course: C++

Master of Arts in Teaching/Science Education Master of Arts in Teaching/Mathematics Education

The Applied and Computational Mathematics (ACM) Program at The Johns Hopkins University (JHU) is

The CS Principles Project 1

Computer Science. Computer Science 213. Faculty and Offices. Degrees and Certificates Awarded. AS Computer Science Degree Requirements

Data Analysis with MATLAB The MathWorks, Inc. 1

Master Degree Program in Computer Science (CS)

Game Design as a Writing Course in the Liberal Arts

Master of Science (M.S.), Major in Software Engineering

Industrial Engineering Definition of Tuning

Curricular Vision. I. Introduction:

Course Descriptions: Undergraduate/Graduate Certificate Program in Data Visualization and Analysis

Interactive Computer Software in College Algebra

WHITE PAPER. Peter Drucker. intentsoft.com 2014, Intentional Software Corporation

Electrical and Computer Engineering Undergraduate Advising Manual

Computer Science. Master of Science

Big Data Executive Survey

Learning outcomes. Knowledge and understanding. Competence and skills

CURRICULUM VITAE EDUCATION:

COURSE SYLLABUS EDG 6931: Designing Integrated Media Environments 2 Educational Technology Program University of Florida

Data Science in the Statistics Curricula: Preparing Students to "Think with Data"

Department of Computer Science. BSc COMPUTER SCIENCE. At the forefront of today s digital world UNDERGRADUATE

Graduate Program Handbook M.S. and Ph.D. Degrees

The School of Communication & Journalism Diversity Plan Auburn University

MSCA Introduction to Statistical Concepts

LONDON SCHOOL OF COMMERCE. Programme Specification for the. Cardiff Metropolitan University. BSc (Hons) in Computing

Eastern Washington University Department of Computer Science. Questionnaire for Prospective Masters in Computer Science Students

Bachelor of Information Technology

Computer Science. B.S. in Computer & Information Science. B.S. in Computer Information Systems

Course Descriptions. preparation.

Transcription:

Computing, technology & Data Analysis in the Graduate Curriculum Duncan Temple Lang UC Davis Dept. of Statistics

JSM 08 Statistics is much broader than we represent in our educational programs Context of Scientific Discovery, not statistical methods! Data analysis and problems Technology has significantly altered science And hence statistics. We must respond! Computing & technology are essential elements of our practice, research and education

JSM 08 Extend stat. curricula with computing & technology Mix with introduction to modern statistical methods and real applications of data analysis This combination makes our students into more valuable contributors to scientific inquiry.

Calls for Change in view of statistics NRC 08 1962 Annals 1977 Analysis of Large Complex Data http://www.stat.fi/isi99/ proceedings/arkisto/varasto/ frie0060.pdf 1997

Resistance NRC 08 Nay-sayers often prioritize stat. topics over computing defend the mathematical foundations of our discipline based on conservatism, Frequently people who don t understand technology and computation and its role in practice and in reshaping opportunities for statistical thinking and research. But not competition between computing or mathematics both are tools for statistical concepts & practice.

NRC 08 After 46, 31, 11 years, it is time for action & change, not just talk. We must do the best we can and strive to get computing, technology and data analysis adequately into our curricula. Need to attract/retain researchers with modern, different perspective on data analysis & its impact.

Outline JSM 08 Why? What? For whom? By whom? How to achieve this?

JSM 08 Need our students to be computationally literate to be able to Do computations for their own research (simulations, implement methodology) to help with our research interact with scientists using complex data from diverse sources disseminate new statistical methods as software understand, critically appreciate and exploit new technologies as they emerge to allow us to do new types of data analysis.

JSM 08 Opportunity to teach statistical methodology that the students wouldn t necessarily see in a heuristic manner Improve their (exploratory) data analysis skills and intuition. Expose them to research by implementing computations within a paper.

JSM 08 Omitting computing & technology from our curricula means we are playing with one hand tied behind our back We cannot provide complete solutions to scientific problems, but merely prescriptions (not functioning/tangible tools) for how others can approach these problems. Software for doing statistics in the analytic pipeline

What - Broad Topics JSM 08 Fundamentals of scientific programming -... Computing for Research - profiling, C, basic parallel computing, object-oriented computing, R packages Computational Statistics - Lin. Alg, Numeric optimization, RNG, MCMC, EM, resampling, numerical integration, Data Technologies - Databases (SQL), Regular Expressions, XML, Web services. Visualization technologies - graphical techniques & software; dynamic, interactive & Web displays

Intro to Stat. Computing JSM 08 Operating system concepts - commonalities & differences file system (files, folders, binary/text), editors,... Types of languages - compiled/interpreted, vectorized/scalar, task-specific languages, Perl, Python, R, MATLAB, SAS Language elements - data types, subsetting, function calls, vectorized looping (apply()), control flow Input and Output (I/O) Writing functions - mechanics, design, *Debugging - tools, technique and philosophy/art, Efficiency - alg. complexity, idioms, profiling, interface to C/ FORTRAN/... Batch computing & remote shells

JSM 08 Vital to avoid teaching just the syntax of a particular language, or how to cut-andpaste & modify templates Need to teach concepts of computing, how to understand other languages, approach a computational task & abstract the ideas.

For whom? JSM 08 Different types of students - different courses Each student should take 2 computing classes Required class - Scientific Programming teach how to think & reason about computing and express stat. tasks as computations. ideally also cover R/MATLAB fundamentals, interface to C, efficiency, parallel computing (in context of data analysis). And one class in either Data Technologies, Computational Statistics, Advanced Computing.

Types of students & second course JSM 08 Student studying methodology research (theory) simulation, software development (e.g. R packages), efficiency, algorithmic complexity, numerical algorithms, parallel computing, streaming data, visualization Probability - simulation, RNG, efficiency, visualization. Applications - data technologies for accessing data, additional languages, efficiency, parallel programming, visualization.

JSM 08 For me, programming and the basics of data technologies I/O for complex data text manipulation & regular expressions databases XML are vital for all students working with data.

Masters Students JSM 08 What do they end up doing? Data manipulation and processing - data technologies Exploratory Data Analysis & Reports - visualization Simulations - programming Modeling - R, MATLAB, SAS. First class in stat. computing & then mix of visualization, data technologies, SAS Data, data, data...

Can we weave topics into existing classes? JSM 08 Not the programming class! Starting from nothing We need programming to be a fundamental class to emphasize its importance & establish culture of computing. provide solid, rich foundation for other topics, allow the students to absorb the concepts/reasoning over a quarter/semester, put in the context of data analysis/math. stat.

By whom? JSM 08 Rarely in our graduate programs More senior faculty haven t been exposed to this, so can t teach it, so students aren t exposed to it, so... Students left to learn it on their own with little encouragement or priority empirically results are poor with major misconceptions So very few instructors who can teach computing and technology

Computer Science Classes JSM 08 Can we send our students to Computer Science classes on programming? databases? text manipulation? to Applied Math for numerical analysis? optimization?...

JSM 08 No we do a different type of programming (vectorized, interpreted languages versus compiled scalar languages) a class in databases teaches internal details of database not how to use it. importantly, don t put these methods in the context of statistical data analysis.

How? JSM 08 Have to train instructors or train themselves? NSF grant (Nolan, Hansen, Temple Lang) to develop potential syllabi & topics for computing create resources for teaching - lecture notes, exercises/homeworks/projects/case-studies, text book teach instructors how to teach computing evangelize computing, technologies & data analysis within the community via papers, talks, etc.

JSM 08 May 2007 - Workshop for syllabi July 2008 - Workshop for teaching instructors 2009 - final workshop. What form? teach additional instructors (same as 2008)? summer school on technology, computing & data analysis for recent graduates, like New Researchers Conf.? summer school for PhD students starting research? small working group to complete materials for others to pick up?

JSM 08 Materials at Workshop 1 http://www.stat.berkeley.edu/twiki/workshop/compcurric Workshop 2 http://www.stat.berkeley.edu/~statcur/

Summary JSM 08 It is time to step up and do something about computing & technology & data analysis in our curricula. Must have dedicated statistical/scientific programming course Data technologies, advanced/research computing, numerical algorithms, visualization classes or individual topics

Summary ctd. JSM 08 Grow pool of potential instructors by teaching these classes now and teaching existing instructors via workshops & developing class materials What form for next workshop? Seek funding for additional workshops? Strategic Initiative from ASA, ISI,...

Actions JSM 08 Time for action on technolog & computing. Departments should introduce computing into graduate & undergraduate classes. Explicit computing classes Introductory programming, data technologies for scientific computing with data Second class or integrate topics into classes from: Data technologies, advanced/research computing, numerical algorithms, visualization classes or individual topics