GETTING STARTED WITH R AND DATA ANALYSIS

Similar documents
U N D E R S TA N D I N G T H E D N A O F DATA SCIENCE Persontyle Ltd. All rights reserved.

INTRODUCTION TO DATA SCIENCE USING R

CLUSTER ANALYSIS WITH R

MACHINE LEARNING BASICS WITH R

ESS event: Big Data in Official Statistics. Antonino Virgillito, Istat

Importing Data into R

From The Little SAS Book, Fifth Edition. Full book available for purchase here.

How NewZapp Track can help your Marketing

Diagrams and Graphs of Statistical Data

WROX Certified Big Data Analyst Program by AnalytixLabs and Wiley

ECON 424/CFRM 462 Introduction to Computational Finance and Financial Econometrics

A Correlation of. to the. South Carolina Data Analysis and Probability Standards

A GENERAL TAXONOMY FOR VISUALIZATION OF PREDICTIVE SOCIAL MEDIA ANALYTICS

Today's Topics. COMP 388/441: Human-Computer Interaction. simple 2D plotting. 1D techniques. Ancient plotting techniques. Data Visualization:

Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate

Research Note What is Big Data?

Data Visualization Techniques

DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7

SAS REPORTS ON YOUR FINGERTIPS? SAS BI IS THE ANSWER FOR CREATING IMMERSIVE MOBILE REPORTS

Our Raison d'être. Identify major choice decision points. Leverage Analytical Tools and Techniques to solve problems hindering these decision points

Table of Contents. Table of Contents 3

Better planning and forecasting with IBM Predictive Analytics

Team Members: Christopher Copper Philip Eittreim Jeremiah Jekich Andrew Reisdorph. Client: Brian Krzys

9.2 User s Guide SAS/STAT. Introduction. (Book Excerpt) SAS Documentation

WebFOCUS RStat. RStat. Predict the Future and Make Effective Decisions Today. WebFOCUS RStat

Microsoft Excel. Qi Wei

Downloading, Configuring, and Using the Free SAS University Edition Software

imc FAMOS 6.3 visualization signal analysis data processing test reporting Comprehensive data analysis and documentation imc productive testing

Summarizing and Displaying Categorical Data

Using SPSS, Chapter 2: Descriptive Statistics

Data Mining. SPSS Clementine Clementine Overview. Spring 2010 Instructor: Dr. Masoud Yaghini. Clementine

2015 Workshops for Professors

imc FAMOS 6.3 visualization signal analysis data processing test reporting Comprehensive data analysis and documentation imc productive testing

MicroStrategy Analytics Express User Guide

R Graphics Cookbook. Chang O'REILLY. Winston. Tokyo. Beijing Cambridge. Farnham Koln Sebastopol

PRINCE2 Project Management Certification

Data Visualization Techniques

Course Title: Advanced Topics in Quantitative Methods: Educational Data Science Practicum

IST687 Applied Data Science

PRINCE2 Project Management Certification

Market Pricing Override

NEXT Analytics Business Intelligence User Guide

Time Series Analysis AMS 316

BIG DATA Driven Innovations in the Life Insurance Industry

CONTENTS PREFACE 1 INTRODUCTION 1 2 DATA VISUALIZATION 19

APPROACHABLE ANALYTICS MAKING SENSE OF DATA

Programming Languages & Tools

White Paper. Redefine Your Analytics Journey With Self-Service Data Discovery and Interactive Predictive Analytics

Tutorial 2: Reading and Manipulating Files Jason Pienaar and Tom Miller

Systat: Statistical Visualization Software

THE COMPARISON OF DATA MINING TOOLS

An introduction to using Microsoft Excel for quantitative data analysis

R Tools Evaluation. A review by Global BI / Local & Regional Capabilities. Telefónica CCDO May 2015

Figure 1. An embedded chart on a worksheet.

RIFIS Ad Hoc Reports

Interactive Data Mining and Visualization

A Review of "Free" Massive Open Online Content (MOOC) for SAS Learners

Big Data Executive Survey

Exploratory Data Analysis with #codemash

SAS/GRAPH Network Visualization Workshop 2.1

Microsoft Office 2010: Access 2010, Excel 2010, Lync 2010 learning assets

TIBCO Spotfire Business Author Essentials Quick Reference Guide. Table of contents:

Steven Wood Software, 2 Harksome Hill, West Hunsbury Northampton. NN4 9YF. United Kingdom

Computer Programming for the Social Sciences

Predictive Analytics Certificate Program

Excel 2010: Create your first spreadsheet

Oracle Advanced Analytics 12c & SQLDEV/Oracle Data Miner 4.0 New Features

Introduction to SPSS 16.0

Egress Switch Secure Workspace 2.x. Administration Guide

GeoGebra Statistics and Probability

Visualizing the Top 400 Universities

The document may be freely distributed in its entirety, either digitally or in printed format, to all EPiServer Mail users.

Exploratory Spatial Data Analysis

Ignite Your Creative Ideas with Fast and Engaging Data Discovery

Executive Dashboard. User Guide

CSU, Fresno - Institutional Research, Assessment and Planning - Dmitri Rogulkin

Why Taking This Course? Course Introduction, Descriptive Statistics and Data Visualization. Learning Goals. GENOME 560, Spring 2012

A Visualization is Worth a Thousand Tables: How IBM Business Analytics Lets Users See Big Data

SPSS Introduction. Yi Li

Easily Identify Your Best Customers

SAP Predictive Analysis Installation

Best Practices in Data Visualizations. Vihao Pham January 29, 2014

Best Practices in Data Visualizations. Vihao Pham 2014

Big Data Analytics in R

CE 504 Computational Hydrology Computational Environments and Tools Fritz R. Fiedler

Introduction Course in SPSS - Evening 1

What is Data Analysis. Kerala School of MathematicsCourse in Statistics for Scientis. Introduction to Data Analysis. Steps in a Statistical Study

Transcription:

GETTING STARTED WITH R AND DATA ANALYSIS [Learn R for effective data analysis]

LEARN PRACTICAL SKILLS REQUIRED FOR VISUALIZING, TRANSFORMING, AND ANALYZING DATA IN R One day course for people who are just starting with R as well as for data analysts who are switching to R from other statistical software, such as SAS, SPSS or Excel. R IS THE LEADING TOOL FOR STATISTICS, DATA ANALYSIS, AND MACHINE LEARNING R is an open source environment for statistical computing. R was created in 1993 by Ross Ihaka and Robert Gentleman of the University of Aukland, New Zealand. It is a GNU project which is similar to the S language and environment which was developed at Bell Laboratories by John Chambers and colleagues. R provides an integrated environment for data manipulation, calculation and graphical display. Worldwide, millions of statisticians and data scientists use R for data manipulation, statistical modelling, and graphical analysis. R provides a wide variety of statistical (linear and nonlinear modelling, classical statistical tests, time-series analysis, classification, clustering, and many machine learning methods) and graphical techniques, and is highly extensible. You can easily find, download and use 4000 plus methods in statistics, predictive modeling, and machine learning free of charge.

WHAT IS THIS COURSE ABOUT? Are you tasked with analysing data, or are you about to be? Have you been using proprietary data analysis tools and would like to explore open source? If so, you are in for a treat as we introduce you to the most popular and rapidly growing open source statistical package R. In this course you will learn how to use R for effective data analysis, how to install and configure software necessary for a statistical programming environment, learn and practice R programming language concepts, common and useful R commands. You will learn to use R for reading data, writing functions, making informative graphs, and applying basic statistical methods. As more and more business activity is digitized, new sources of information and ever-cheaper equipment combine to bring us into a new era: one in which large amounts of digital information exist on virtually any topic of interest to a business Andrew McAfee and Erik Brynjolfsson

WHAT WILL YOU LEARN? + Overview and introduction to R + Getting started and working with data + Basic data types and operations + Data Frames + Importing, saving, exporting, and re-using data + Common R functions for numbers, factors, text, and dates + Vector-oriented computation + Sorting, ranking, and printing + Reading and writing data with R + Reading tables and CSV files, row and column headers, delimiters. Built-in data. + Cleaning and transforming data + Programming efficiently in R + Writing R scripts and functions + Loops and conditions + Flow control, functions and classes in R, executing R scripts from GUI and command line + Visualizing data and Exploratory Data Analysis + Basic data summary functions/ summary statistics + Exploring and plotting relationships between variables + Visualizations for categorical and continuous data, scatter plots, box plots, pie charts, histograms, bar plots, dot charts, and the char object. WHO SHOULD TAKE THIS COURSE? The course is aimed at business professionals, data analysts, technologists, journalists, software developers, academics and students who already have some basic competence in using statistics but wish to begin using R for the first time.

PREREQUISITES Basic knowledge of statistics and programming languages. WHAT SHOULD I BRING? Along with bringing your laptop and charger, don t forget to bring loads of curiosity, scepticism, eagerness to participate and the desire to learn. COURSE INSTRUCTORS Persontyle trainers are passionate about meeting each participants learning needs. They have been chosen both for their extensive practical Data Science and Machine Learning experience and for their ability to educate and interact with natural empathy. All of our trainers have worked on a variety of data science and Machine Learning projects. They share their academic knowledge and real-world experience and each individual adds their own unique perspective to the course. Our trainers present in a style that is informal, entertaining and highly interactive. Guest Speakers Business leaders, Data Science practitioners, and academic researchers covering use cases, case studies and sharing practical experience of applying Data Science and Machine Learning in their organizations. I keep saying the sexy job in the next ten years will be statisticians. People think I'm joking, but who would've guessed that computer engineers would've been the sexy job of the 1990s? Professor Hal Varian, Chief Economist at Google

RETURN ON INVESTMENT (ROI) CONVINCE YOUR BOSS The advent of the data driven connected era means that analyzing massive scale, messy, noisy, and unstructured data is going to increasingly form part of everyone's work. The School of Data Science learning programs provide a unique investment opportunity that pay s for itself many times over. World-class Instructors Develop Practical Data Science Skills Real World Industry Use Cases Short Courses For Time Convenience Value For Money "For the best return on your money, pour your purse into your head." Benjamin Franklin Limited seats. We encourage you to register as soon as you can. Register Now For corporate bookings or to organize on-site training email hello@persontyle.com or call now +44 (0)20 3239 3141 THE SCHOOL OF DATA SCIENCE The School of Data Science, a project of Persontyle, specializes in designing and delivering structured, relevant and practical learning experiences for all of us to understand data science in simple human terms. /school Follow us on Twitter @schooltds Like us on Facebook Get in touch! hello@personyyle.com