Introducing open source statistical and data science tools to business analytics students and professionals
1 Detroit ASA January 2015 Introducing open source statistical and data science tools to business analytics students and professionals Mark Isken Assoc. Prof. of MIS School of Business Administration Oakland University
2 Abstract Tools such as Excel, SQL databases, SPSS and SAS have long been staples of the quantitative side of business education and professional practice. Recently, this community has seen a surge in popularity of open source tools of a more computational nature. In response, in the spring of 2014, I developed and delivered a course entitled "Practical Computing for Business Analytics" within the School of Business at Oakland University. This course relied entirely on open source software for course development, delivery and student work. Specifically we used the Linux OS along with R and Python. R Markdown documents and IPython notebooks were the primary teaching tools. I will describe my teaching methods, discuss how the course went, and share plans for the continued dissemination of this material in the business analytics community.
3 Mark Isken
- BSE, MSE, Ph.D. in Industrial and Operations Engineering from University of Michigan
- Operations analyst for William Beaumont Hospital and Henry Ford Health System and some small consulting companies (~10 years)
- Joined OU Fall 1999 as full-time faculty member of Dept. of Decision and Information Sciences
- I'm a techie: love working with computers and mathematical models to help solve business problems
- Teach/taught business analytics, statistics, computer simulation, intro MIS courses and healthcare operations mgt
- I remember when INFORMS was ORSA and TIMS and the controversy that ensued when the merger was proposed.
4 Management science, Business analytics, Machine learning, Decision science, Data science, Operations research, Analytics, Data visualization, OLAP, Statistics, Business intelligence, Data mining, Data warehousing, Knowledge discovery in databases, Big data
5 Healthcare Operations Analysis
- Internal business analysis / decision support consultant
- Simulation modeling
- Critical care tower, emergency departments
- Pneumatic tube systems, outpatient clinics, pharmacy robots
- Staffing and scheduling models: people, cases, tests, etc.
- Queueing, simulation, optimization
- Database and analytical tool development using Access, Excel, VBA and other software
- Various statistical and operations analysis studies
- Short term census forecasting
[Figure: Postpartum Staffing Needs, nurses by time of week (Sun 12 am through Sat 06 pm, in 6-hour steps)]
Introduction to BAM
6 My view of business analytics: Programming & databases; Math & stats; Domain knowledge; Art and craft of modeling; Communication, visualization, story-telling
7 Spreadsheet Based Modeling & Decision Support (taught since 2001)
- Management science modeling: Simulation, Optimization
- Data analysis: Database, EDA/data viz, Statistics, OLAP/DW, Data Mining
- Application Development: System design, User Forms, Automation, Environment customization, error prevention & handling
- Basic foundation: Modeling, Spreadsheet Modeling/engineering, VBA
8 Getting started with Free and Open Source Software (FOSS)
- PhD days: using FORTRAN based network flow algorithms for scheduling problems in addition to commercial tools like IBM's OSL and CPLEX
- Clearly saw that FOSS allowed me to learn by code exploration
- It allowed me to create decision support apps with sophisticated code built in that didn't force end user organizations (hospitals with no extra $$$) to buy expensive commercial software
- I could extend this software to solve my specific problem better
- Wrote dissertation in LaTeX
- My research experience, along with several years as a practicing industrial engineer with two large healthcare systems and a few years of university teaching, launched my real plunge into the world of FOSS
9 FOSS for analytics in practice
- A smallish healthcare analytics firm run by a good friend of mine from grad school days
- Very Microsoft-centric place and client base (SQL Server, .NET apps, Excel, Access, PPT)
- I've been introducing and helping people get up to speed with things like R and Python to overcome common limitations of their current Excel-centric analytical workflow practices:
  - ad hoc and non-reproducible data cleaning and transforming
  - lots of pointing and clicking for repetitive tasks
  - sketchy documentation of analytical workflow
- Ease of doing things with R (via the apply family or plyr and ggplot2) and Python (via pandas and matplotlib) such as:
  - percentile calculations within pivot style or group by analysis
  - small multiples
- Both of the above are hideous to do in Excel and even the first is tough in specialized tools like Tableau.
- I've got beginner level tutorials on these things on hselab.org, both in R and Python
10 No fun to do in Excel: small multiples, percentiles by group
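To make the "percentiles by group" point concrete, here is a minimal pandas sketch. It is not taken from the talk or from the hselab.org tutorials; the data, the column names (unit, los_hours) and the chosen percentiles are hypothetical and only illustrate the idea of a group by analysis that computes percentiles per group.

```python
import pandas as pd

# Hypothetical data: length of stay (hours) for patients in two units.
visits = pd.DataFrame({
    "unit": ["ED"] * 5 + ["ICU"] * 5,
    "los_hours": [1.0, 2.0, 3.0, 4.0, 5.0, 10.0, 20.0, 30.0, 40.0, 50.0],
})

# Group by unit, then ask for the median and the 95th percentile.
# unstack() turns the result into one row per unit and one column
# per requested percentile.
percentiles = (
    visits.groupby("unit")["los_hours"]
    .quantile([0.5, 0.95])
    .unstack()
)

print(percentiles)
```

The whole analysis is a few lines of text that can be rerun on new data, which is the reproducibility argument made on the previous slide.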
11 hselab.org
- This is my primary outlet for sharing tutorials, teaching materials, FOSS and other analytics related things
  - Tutorials and guides
  - Blog posts
  - Links to my FOSS projects
  - Working on Shiny apps
  - Open courses
- Science, engineering, research are all evolving in response to calls for reproducibility, open access to data and results, changes to publishing models and the possibilities offered by FOSS along with internet infrastructure that facilitates organic evolution of social and technical ecosystems
- Got me thinking: we really needed a course on this stuff within the School of Business...
12 MIS 480/680 - Practical Computing for Business Analytics Hey MBAs! Microsoft isn't the only game in town. If you really want to do analytics in the business world, you better learn to do some programming!
13 Structure
Fourteen 3-hour sessions in the 202EH Computer Teaching Lab
First half of the semester
- Session 1: Intro to analytics
- 2: Intro to R and R Studio
- 3: Exploratory data analysis with R
- 4: Group by analysis and more stats - R
- 5: Linear models in R
- 6: Data mining in R (knn, cluster, Rattle)
- 7/8: Text files, regex, Linux tools (e.g. shell, grep, basic scripting)
Second half of the semester
- 9/10: Intro to Python
- 11: Data analysis and plotting in Python
- 12: Data acquisition, prep and more analysis
- 13: Time series, datetime analysis in Python
- 14: Overview of Hadoop & MapReduce
14 Open Source Tools
15 Our computing appliance - pcba
Computer running Windows, Mac OS, or Linux
- Programs: MS Office, Notepad, Browser, VirtualBox
- Documents: spreadsheets, Word documents, text files, pdf
- Virtual machines: pcba
VM running Lubuntu Linux (pcba)
- Programs: R, R Studio, R packages, Python (Anaconda), Geany, OpenOffice, Browser, File Manager, Shell
- Documents: R scripts, Python programs, OpenOffice documents, text files, pdf
16 No one book really fit
17 Why R and Python?
- Both R and Python are widely used in the data science and business analytics worlds
- A quote from Enterprise Data Analysis and Visualization: An Interview Study on the growing need for technically adept analysts: When discussing recruitment, one Chief Scientist said "analysts that can't program are disenfranchised here"
- Both support a combination of interactive use via tools like R Studio and IPython along with programmatic use via text scripting
- Huge communities and ecosystems supporting R and Python for analytics work
- Both facilitate reproducible analysis
- Some things that are simply hideously difficult to do in tools like Excel or a database are simple in R and/or Python:
  - Group By or Pivoting type analysis for operations such as percentiles
  - Small multiples and other complex graphing/charting/plotting
  - Documenting and reproducing complex series of data cleaning and transformations
18 Flow of a typical class
- Guided exploration of topics via interactive use of R Markdown documents or IPython Notebooks
- In class assignment where I act as roving consultant
- Open lab time for homework and project work
  - Collaborate with classmates
  - Get questions answered by me
19 R Markdown documents
- Mixture of markdown (simple plain text formatting) and executable R code chunks
- Facilitates authoring informative and reproducible analysis documents
- Can generate output in numerous forms including PDF, HTML, MS Word
- Can publish resultant HTML directly to RPubs
- Used in PCBA as an interactive session delivery, exploration and note taking method, and for homework submissions and project deliverables
IPython notebooks
- Facilitate interactive Python computing in a browser based environment
- Mixture of markdown and Python
- Inline plotting
- Magic commands for interacting with the OS
20 IPython Notebooks: nbviewer; notebooks are just JSON text files; gallery of interesting notebooks; Fernando Perez
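Because a notebook is just JSON text, it can be inspected with nothing but the standard library. The sketch below is an illustration, not material from the course: the notebook content is hand-written and only loosely modeled on version 4 of the notebook format (an assumption; notebooks from other versions are laid out differently), and the cell contents are made up.

```python
import json

# Hand-written stand-in for the text of a .ipynb file (assumed layout,
# loosely following notebook format version 4).
notebook_text = json.dumps({
    "nbformat": 4,
    "nbformat_minor": 0,
    "metadata": {},
    "cells": [
        {"cell_type": "markdown", "metadata": {},
         "source": ["# Monty Hall simulation"]},
        {"cell_type": "code", "metadata": {}, "execution_count": None,
         "outputs": [], "source": ["import random"]},
    ],
})

# Parse the text back into ordinary Python dicts and lists.
notebook = json.loads(notebook_text)

# List the type of each cell, in document order.
cell_types = [cell["cell_type"] for cell in notebook["cells"]]
print(cell_types)
```

The same few lines work on a real file by replacing notebook_text with the contents read from disk, which is part of why notebooks play well with text-oriented tools.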
21 Homework assignments
HW0 - Intro to PCBA
- guided exploration of pcba virtual appliance
- overview reading from DDS and exploration of links on course website
HW1 - Intro to R
- use R Studio, create an RMarkdown document
- data importing and exporting
- view and modify dataframes (change data types, add cols)
- answer questions about some R lists, vectors, arrays, matrices
- generate html from Rmd file
HW2 - EDA with R
- EDA: summary stats, group by, plots
- data reshaping
HW3 - Predictive modeling with R
- regression models to predict MLB winning percentage
- try out a few predictive modeling techniques for the Kaggle Titanic Challenge
- feature engineering
HW4 - Simulating the Monty Hall 3-Door Problem with Python
- skeleton code and comments provided in IPython notebook
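For readers unfamiliar with HW4, a Monty Hall simulation might look roughly like the following. This is not the skeleton code handed out in the course; the function names (play_once, win_rate), the trial count and the seed are hypothetical choices for illustration.

```python
import random


def play_once(switch, rng):
    """Play one game; return True if the player ends up with the car."""
    doors = [0, 1, 2]
    car = rng.choice(doors)
    pick = rng.choice(doors)
    # The host opens a door that is neither the player's pick nor the car.
    opened = rng.choice([d for d in doors if d != pick and d != car])
    if switch:
        # Move to the one remaining closed door.
        pick = next(d for d in doors if d != pick and d != opened)
    return pick == car


def win_rate(switch, trials=100_000, seed=0):
    """Estimate the probability of winning for a given strategy."""
    rng = random.Random(seed)
    wins = sum(play_once(switch, rng) for _ in range(trials))
    return wins / trials


stay_rate = win_rate(switch=False)
switch_rate = win_rate(switch=True)
print(f"stay: {stay_rate:.3f}  switch: {switch_rate:.3f}")
```

With enough trials the estimates should settle near the theoretical values of 1/3 for staying and 2/3 for switching.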
22 Final Projects
Options
1. Analyze dataset of interest
2. Research into techniques and/or tools
3. Compete in active Kaggle competition
A few of the resulting projects
- Neural nets for the Kaggle bike share competition
- financial portfolio analysis with Python and tkinter (for GUI)
- exploration of R packages for financial analysis
- blackjack simulator in Python for exploring different playing strategies
- a tutorial for creating a basic R Shiny app
- maps of website use based on Apache logs using Python, pandas, matplotlib
- using EDA, knn, decision trees to explore factors affecting vehicle fuel economy
23 Student mix
- R/Python Analytics, Summer I 2014: MBA 6, BS-MIS 3, MS-STA 1, MSITM 16, Post-Bac 1
- Spreadsheet based Business Analytics, Fall 2014: BS-FIN 1, BS-MIS 2, BS-POM 1, MACC 7, MBA 12, MSITM 13
24 Our analytical profiles
I'll give everyone an index card and on it you'll profile yourself (on a relative scale) with respect to the following dimensions:
- Computer programming
- Math
- Statistics
- Data visualization
- Machine learning / data mining
- Modeling
- Domain expertise
- Communication and presentation skills
25 Why learn to use Linux for analytics?
- Linux widely used in the data science and analytics world
- Linux shell FAR superior to Windows command line application
  - Powerful shell scripting language
  - Tab completion
- Command line is often way more efficient than GUI
- Linux is free (as in freedom and beer) and open source
- Sets you apart from other business analysts who only know Windows and Microsoft applications
26 The Geek Factor
- Using and creating FOSS earns you geek cred
- It's fun to use tools like R and Python and Linux
- Do an import this and then an import antigravity in an IPython notebook or any Python shell
- Do you think MS inspires this kind of thing? :)
- FOSS facilitates users becoming more tech savvy
- Real geeks use Linux; but seriously:
  - command line use
  - github
  - installing software
  - getting your hands dirty
  - leveraging the Unix philosophy of small focused tools that you can put together to do amazing things
    $ wc -l *.pdb | sort -g | head -1
- Here's the overview presentation I use as part of the hands-on session to introduce B-school students to Linux
27 About Practical Computing for Business Analytics
- introduced B-school students to the non-Microsoft world that exists
  - Linux shell scripting, Linux OS, and the world of FOSS for Linux
  - R and Python
- created a Lubuntu based computing appliance distributed as .ova exported from VirtualBox
  - free!
  - I could get it totally configured by installing and setting up the software as I wanted
  - students didn't waste time trying to get a myriad of tools working on their systems
  - minimized hassle on our OU IT staff
- entire course was created and delivered with FOSS
- "I wouldn't use Windows at all if not for Excel"
- An open version of the course website is available from my hselab site in the courses section
- In Summer I 2015, the course will be offered again (and every Summer I): MIS 447 Practical computing for data analytics
