Accessing bigger datasets in R using SQLite and dplyr
|
|
|
- Trevor Palmer
- 10 years ago
- Views:
Transcription
1 Accessing bigger datasets in R using SQLite and dplyr Amherst College, Amherst, MA, USA March 24, 2015 [email protected]
2 Thanks to Revolution Analytics for their financial support to the Five College Statistics Program and the Biostatistics Program at UMass/Amherst to Nick Reich, Emily Ramos, Kostis Gourgoulias, and Andrew Bray (co-organizers of the Meetup) suggestions of meeting topics, speakers and locations always welcomed
3 Upcoming meetups working with geographic data in R 3/26 at 7:00pm DataFest 3/27-3/29 learn about plotly 4/2 at 4:00pm updates from RStudio: J.J. Allaire 7/15 (save the date!) statistical analysis of big genetics and genomics data Xihong Lin, 11/16 (save the date!) using docker for fun and profit anyone willing?
4 Acknowledgements joint work with Ben Baumer (Smith College) and Hadley Wickham (Rice/RStudio) supported by NSF grant (building a community around modeling, statistics, computation and calculus) more information at
5 Prelude New Frontier in statistical thinking (Chamandy, JSM 2013) teaching precursors to big data/foundations of data science as an intellectually coherent theme (Pruim, Calvin College) growing importance of computing and ability to think with data (Lambert, Google) key capacities in statistical computing (Nolan and Temple Lang, TAS 2010) Statistics and the modern student (Gould, ISR 2010) Building precursors for data science in the first and second courses in statistics (Horton, Baumer, and Wickham, 2015)
6 Prelude (cont.) How to accomplish this? start in the first course build on capacities in the second course develop more opportunities for students to apply their knowledge in practice (internships, collaborative research, teaching assistants) new courses focused on Data Science today s goal: helping DataFesters grapple with larger datasets
7 Data Expo 2009 Ask students: have you ever been stuck in an airport because your flight was delayed or cancelled and wondered if you could have predicted it if you d had more data? (Wickham, JCGS, 2011)
8 Data Expo 2009 dataset of flight arrival and departure details for all commercial flights within the USA, from October 1987 to April 2008 (but we now have through the begininning of 2015) large dataset: more than 180 million records aim: provide a graphical summary of important features of the data set winners presented at the JSM in 2009; details at
9 Airline Delays Codebook (abridged) Year Month DayOfMonth DayOfWeek 1=Monday DepTime departure time UniqueCarrier OH = Comair, DL = Delta, etc. TailNum plane tail number ArrDelay arrival delay, in minutes Origin BDL, BOS, SFO, etc. Dest Full details at
10 Data Expo 2009 winners
11 Data Expo 2009 winners
12 Data Expo 2009 winners
13 Data Expo 2009 winners
14 Data Expo 2009 winners
15 Goal facilitate use of database technology for instructors and students without training in this area demonstrate how to use SQLite in conjunction with dplyr in R to achieve this goal help enable the next generation of data scientists
16 Background on databases and SQL relational databases (invented in 1970) like electronic filing cabinets to organize masses of data (terabytes) fast and efficient useful reference: Learning MySQL, O Reilly 2007 Horton, Baumer, and Wickham (2015, in press),
17 Client and server model server: manages data client: ask server to do things use R as both the client and the server
18 SQL Structured Query Language special purpose programming language for managing data developed in early 1970 s standardized (multiple times) most common operation is query (using SELECT)
19 Why SQLite? advantage: free, quick, dirty, simple (runs locally), database can be copied from one system (Mac OS X) to Windows or Linux disadvantage: not as robust, fast, or flexible than other free alternatives such as MySQL (which run remotely) For personal use, SQLite is ideal (and is what I will demonstrate today). For a class, I d recommend MySQL.
20 Creating the airline delays database 1 download the data (30gb uncompressed) 2 load the data 3 add indices (to speed up access to the data, takes some time) 4 establish a connection (using src sqlite()) 5 start to make selections (which will be returned as R objects) using dplyr package 6 features lazy evaluation (data only accessed when needed)
21 Key idioms for dealing with big(ger) data select: subset variables filter: subset rows mutate: add new columns summarise: reduce to a single row arrange: re-order the rows Hadley Wickham, bit.ly/bigrdata4
22 Accessing the database (see test-sqlite.rmd) my_db <- src_sqlite(path="ontime.sqlite3") # link to some useful tables flights <- tbl(my_db, "flights") airports <- tbl(my_db, "airports") airlines <- tbl(my_db, "airlines") airplanes <- tbl(my_db, "airplanes")
23 Accessing the database (see test-sqlite.rmd) > airports Source: sqlite [ontime.sqlite3] From: airports [3,376 x 7] IATA Airport City State 1 00M Thigpen Bay Springs MS 2 00R Livingston Municipal Livingston TX 3 00V Meadow Lake Colorado Springs CO 4 01G Perry-Warsaw Perry NY 5 01J Hilliard Airpark Hilliard FL 6 01M Tishomingo County Belmont MS Variables not shown: Country (chr), Latitude (dbl), Longitude (dbl)
24 Accessing the database (see test-sqlite.rmd) airportcounts <- flights %>% filter(dest %in% c( ALB, BDL, BTV )) %>% group_by(year, DayOfMonth, DayOfWeek, Month, Dest) %>% summarise(count = n()) %>% collect() # this forces the evaluation airportcounts <- airportcounts %>% mutate(date = ymd(paste(year, "-", Month, "-", DayOfMonth, sep=""))) head(airportcounts)
25 GROUP BY ## Source: local data frame [6 x 7] ## Groups: Year, DayOfMonth, DayOfWeek, Month ## ## Year DayOfMonth Day Month Dest count Date ## ALB ## BDL ## BTV ## ALB ## BDL ## BTV
26 SQL commands ## Year Month DayOfMonth Day DepTime CRSDepTime ArrTime CRS ## Variables not shown: UniqueCarrier (chr), FlightNum (int ## ActualElapsedTime (int), CRSElapsedTime (int), AirTime ( ## (int), DepDelay (int), Origin (chr), Dest (chr), Distanc tbl(my_db, sql("select * FROM flights LIMIT 4")) # or tbl(my_db, "flights") %>% head() ## Source: sqlite [ontime.sqlite3] ## From: <derived table> [?? x 29] ## ## ## ## ## ## (int), TaxiOut (int), Cancelled (int), CancellationCode ## (chr), CarrierDelay (chr), WeatherDelay (chr), NASDelay ## SecurityDelay (chr), LateAircraftDelay (chr)
27 Which month is it best to travel (airline averages/bdl)? 30 AvgArrivalDelay
28 Which day is it best to travel (airline averages from BDL)? 30 Avg Delay by Airline (in minutes)
29 Setup on your own machine (data from 2014) Update existing packages: update.packages() Install necessary packages: RSQLite, dplyr, tidyr, mosaic, knitr, nycflights13, lubridate, igraph, markdown Create a new project in RStudio and directory to store the data and datab Download files: see list at Set up the database: run load-sqlite.r Take it for a spin: run test-sqlite.rmd
30 Next steps Want more data? See the additional instructions to access more than just the data from 2014 See also: the dplyr package vignette on databases See also: the precursors paper (in press) at
31 Accessing bigger datasets in R using SQLite and dplyr Amherst College, Amherst, MA, USA March 24, 2015 [email protected]
Teaching Precursors to Data Science in Introductory and Second Courses in Statistics
Teaching Precursors to Data Science in Introductory and Second Courses in Statistics Nicholas Horton, [email protected] April 28, 2015 Resources available at http://www.amherst.edu/~nhorton/precursors
VOL. 5, NO. 2, August 2015 ISSN 2225-7217 ARPN Journal of Systems and Software 2009-2015 AJSS Journal. All rights reserved
Big Data Analysis of Airline Data Set using Hive Nillohit Bhattacharya, 2 Jongwook Woo Grad Student, 2 Prof., Department of Computer Information Systems, California State University Los Angeles nbhatta2
Bigger data analysis. Hadley Wickham. @hadleywickham Chief Scientist, RStudio. Thursday, July 18, 13
http://bit.ly/bigrdata3 Bigger data analysis Hadley Wickham @hadleywickham Chief Scientist, RStudio July 2013 http://bit.ly/bigrdata3 1. What is data analysis? 2. Transforming data 3. Visualising data
Big Data Analysis with Revolution R Enterprise
Big Data Analysis with Revolution R Enterprise August 2010 Joseph B. Rickert Copyright 2010 Revolution Analytics, Inc. All Rights Reserved. 1 Background The R language is well established as the language
Introduc)on to RHadoop Master s Degree in Informa1cs Engineering Master s Programme in ICT Innova1on: Data Science (EIT ICT Labs Master School)
Introduc)on to RHadoop Master s Degree in Informa1cs Engineering Master s Programme in ICT Innova1on: Data Science (EIT ICT Labs Master School) Academic Year 2015-2106 Contents Introduc1on to MapReduce
Big Data Analysis with Revolution R Enterprise
REVOLUTION WHITE PAPER Big Data Analysis with Revolution R Enterprise By Joseph Rickert January 2011 Background The R language is well established as the language for doing statistics, data analysis, data-mining
Hands-On Data Science with R Dealing with Big Data. [email protected]. 27th November 2014 DRAFT
Hands-On Data Science with R Dealing with Big Data [email protected] 27th November 2014 Visit http://handsondatascience.com/ for more Chapters. In this module we explore how to load larger datasets
How To Calculate Estimatepi On An Ec2 Cluster
deathstar seamless cloud computing for R Whit Armstrong [email protected] KLS Diversified Asset Management May 11, 2012 deathstar seamless cloud computing for R 1 / 27 What is deathstar? deathstar
Scalable Data Analysis in R. Lee E. Edlefsen Chief Scientist UserR! 2011
Scalable Data Analysis in R Lee E. Edlefsen Chief Scientist UserR! 2011 1 Introduction Our ability to collect and store data has rapidly been outpacing our ability to analyze it We need scalable data analysis
Sawmill Log Analyzer Best Practices!! Page 1 of 6. Sawmill Log Analyzer Best Practices
Sawmill Log Analyzer Best Practices!! Page 1 of 6 Sawmill Log Analyzer Best Practices! Sawmill Log Analyzer Best Practices!! Page 2 of 6 This document describes best practices for the Sawmill universal
LabStats 5 System Requirements
LabStats Tel: 877-299-6241 255 B St, Suite 201 Fax: 208-473-2989 Idaho Falls, ID 83402 LabStats 5 System Requirements Server Component Virtual Servers: There is a limit to the resources available to virtual
Classroom Demonstrations of Big Data
Classroom Demonstrations of Big Data Eric A. Suess Abstract We present examples of accessing and analyzing large data sets for use in a classroom at the first year graduate level or senior undergraduate
RevoScaleR 7.3 HPC Server Getting Started Guide
RevoScaleR 7.3 HPC Server Getting Started Guide The correct bibliographic citation for this manual is as follows: Revolution Analytics, Inc. 2014. RevoScaleR 7.3 HPC Server Getting Started Guide. Revolution
Project 2 Database Design and ETL
Project 2 Database Design and ETL Out: October 7th, 2015 1 Introduction: What is this project all about? We ve now studied many techniques that help in modeling data (E-R diagrams), which can then be migrated
Cloud Computing Capstone Task 2 Report
Cloud Computing Capstone Task 2 Report Paul Lo [email protected] 2016-02-23 I. System Architecture I deployed 6 EC2 instances in Hadoop cluster for Spark tasks One master and secondary NameNode, plus
Big Data for Science (and other) Students
Big Data for Science (and other) Students Randall Pruim Calvin College Questions to My Colleagues 1. What is the largest data set that students encounter in your respective majors? 2. In the context of
Computational Statistics: A Crash Course using R for Biologists (and Their Friends)
Computational Statistics: A Crash Course using R for Biologists (and Their Friends) Randall Pruim Calvin College Michigan NExT 2011 Computational Statistics Using R Why Use R? Statistics + Computation
Product Guide. Sawmill Analytics, Swindon SN4 9LZ UK [email protected] tel: +44 845 250 4470
Product Guide What is Sawmill Sawmill is a highly sophisticated and flexible analysis and reporting tool. It can read text log files from over 800 different sources and analyse their content. Once analyzed
Dashboard Engine for Hadoop
Matt McDevitt Sr. Project Manager Pavan Challa Sr. Data Engineer June 2015 Dashboard Engine for Hadoop Think Big Start Smart Scale Fast Agenda Think Big Overview Engagement Model Solution Offerings Dashboard
Bringing Big Data Modelling into the Hands of Domain Experts
Bringing Big Data Modelling into the Hands of Domain Experts David Willingham Senior Application Engineer MathWorks [email protected] 2015 The MathWorks, Inc. 1 Data is the sword of the
BiDAl: Big Data Analyzer for Cluster Traces
BiDAl: Big Data Analyzer for Cluster Traces Alkida Balliu, Dennis Olivetti, Ozalp Babaoglu, Moreno Marzolla, Alina Sirbu Department of Computer Science and Engineering University of Bologna, Italy BigSys
HALOGEN. Technical Design Specification. Version 2.0
HALOGEN Technical Design Specification Version 2.0 10th August 2010 1 Document Revision History Date Author Revision Description 27/7/09 D Carter, Mark Widdowson, Stuart Poulton, Lex Comber 1.1 First draft
Big R: Large-scale Analytics on Hadoop using R
2014 IEEE International Congress on Big Data Big R: Large-scale Analytics on Hadoop using R Oscar D. Lara, Weiqiang Zhuang, and Adarsh Pannu IBM Silicon Valley Laboratory San Jose, CA, USA Email: {odlaraye,
CSE 6040 Computing for Data Analytics: Methods and Tools. Lecture 1 Course Overview
CSE 6040 Computing for Data Analytics: Methods and Tools Lecture 1 Course Overview DA KUANG, POLO CHAU GEORGIA TECH FALL 2014 Fall 2014 CSE 6040 COMPUTING FOR DATA ANALYSIS 1 Course Staff Instructor Da
How To Perform Predictive Analysis On Your Web Analytics Data In R 2.5
How to perform predictive analysis on your web analytics tool data June 19 th, 2013 FREE Webinar by Before we start... www Q & A? Our speakers Carolina Araripe Inbound Marketing Strategist @Tatvic http://linkd.in/yazvvn
Turnout Response System (TRS) Version 1.8
Turnout Response System (TRS) Version 1.8 Copyright 2013-2014 by Mark Hessling . This material may be distributed only subject to the terms and conditions set forth in the Open Publication
Hardware and Software Requirements for Installing California.pro
Hardware and Requirements for Installing California.pro This document lists the hardware and software requirements to install and run California.pro. Workstation with SQL Server Recommended: 64-Bit Windows
Your Best Next Business Solution Big Data In R 24/3/2010
Your Best Next Business Solution Big Data In R 24/3/2010 Big Data In R R Works on RAM Causing Scalability issues Maximum length of an object is 2^31-1 Some packages developed to help overcome this problem
MySQL Manager. User Guide. July 2012
July 2012 MySQL Manager User Guide Welcome to AT&T Website Solutions SM We are focused on providing you the very best web hosting service including all the tools necessary to establish and maintain a successful
Tableau's data visualization software is provided through the Tableau for Teaching program.
A BEGINNER S GUIDE TO VISUALIZATION Featuring REU Site Collaborative Data Visualization Applications June 10, 2014 Vetria L. Byrd, PhD Advanced Visualization, Director REU Coordinator Visualization Scientist
User Guide. Analytics Desktop Document Number: 09619414
User Guide Analytics Desktop Document Number: 09619414 CONTENTS Guide Overview Description of this guide... ix What s new in this guide...x 1. Getting Started with Analytics Desktop Introduction... 1
NaviCell Data Visualization Python API
NaviCell Data Visualization Python API Tutorial - Version 1.0 The NaviCell Data Visualization Python API is a Python module that let computational biologists write programs to interact with the molecular
The Five College Guide to Statistics
Nicholas J. Horton Daniel T. Kaplan Randall Pruim The Five College Guide to Statistics Project MOSAIC With R Contents 1 Introduction 13 2 Getting Started with RStudio 15 3 One Quantitative Variable 25
inforouter V8.0 Server & Client Requirements
inforouter V8.0 Server & Client Requirements Please review this document thoroughly before proceeding with the installation of inforouter Version 8. This document describes the minimum and recommended
Computing, technology & Data Analysis in the Graduate Curriculum. Duncan Temple Lang UC Davis Dept. of Statistics
Computing, technology & Data Analysis in the Graduate Curriculum Duncan Temple Lang UC Davis Dept. of Statistics JSM 08 Statistics is much broader than we represent in our educational programs Context
Chapter 4 IT Infrastructure and Platforms
Chapter 4 IT Infrastructure and Platforms Essay Questions: 1. Identify and describe the stages of IT infrastructure evolution. 2. Identify and describe the technology drivers of IT infrastructure evolution.
Introduction to SQL for Data Scientists
Introduction to SQL for Data Scientists Ben O. Smith College of Business Administration University of Nebraska at Omaha Learning Objectives By the end of this document you will learn: 1. How to perform
Excel Power Tools. David Onder and Alison Joseph. NCAIR 2015 Conference
Excel Power Tools David Onder and Alison Joseph NCAIR 2015 Conference 10,382 students Master s Comprehensive Mountain location Residential and Distance 2 Why Pivot Tables Summarize large datasets Quickly
Cookbook 23 September 2013 GIS Analysis Part 1 - A GIS is NOT a Map!
Cookbook 23 September 2013 GIS Analysis Part 1 - A GIS is NOT a Map! Overview 1. A GIS is NOT a Map! 2. How does a GIS handle its data? Data Formats! GARP 0344 (Fall 2013) Page 1 Dr. Carsten Braun 1) A
MultiAlign Software. Windows GUI. Console Application. MultiAlign Software Website. Test Data
MultiAlign Software This documentation describes MultiAlign and its features. This serves as a quick guide for starting to use MultiAlign. MultiAlign comes in two forms: as a graphical user interface (GUI)
A Dynamic Programming Approach for 4D Flight Route Optimization
A Dynamic Programming Approach for 4D Flight Route Optimization Christian Kiss-Tóth, Gábor Takács Széchenyi István University, Győr, Hungary IEEE International Conference on Big Data Oct 27-30, 2014 Washington
Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate
Data Science and Business Analytics Certificate Data Science and Business Intelligence Certificate Description The Helzberg School of Management has launched two graduate-level certificates: one in Data
The SkySQL Administration Console
www.skysql.com The SkySQL Administration Console Version 1.1 Draft Documentation Overview The SkySQL Administration Console is a web based application for the administration and monitoring of MySQL 1 databases.
Editing Locally and Using SFTP: the FileZilla-Sublime-Terminal Flow
Editing Locally and Using SFTP: the FileZilla-Sublime-Terminal Flow Matthew Salim, 20 May 2016 This guide focuses on effective and efficient offline editing on Sublime Text. The key is to use SFTP for
LCMON Network Traffic Analysis
LCMON Network Traffic Analysis Adam Black Centre for Advanced Internet Architectures, Technical Report 79A Swinburne University of Technology Melbourne, Australia [email protected] Abstract The Swinburne
SUSE Cloud Installation: Best Practices Using an Existing SMT and KVM Environment
Best Practices Guide www.suse.com SUSE Cloud Installation: Best Practices Using an Existing SMT and KVM Environment Written by B1 Systems GmbH Table of Contents Introduction...3 Use Case Overview...3 Hardware
Visualizing More Than Twenty Years of Flight Data for the Raleigh-Durham International Airport
CS-BIGS 5(1): 51-57 2012 CS-BIGS http://www.bentley.edu/centers/csbigs/crotty.pdf Visualizing More Than Twenty Years of Flight Data for the Raleigh-Durham International Airport Michael T. Crotty SAS Institute,
Department of Geography Program in Planning, Faculty of Arts and Science University of Toronto GGR 273 H1S: GIS II Course Outline Winter 2015
Department of Geography Program in Planning, Faculty of Arts and Science University of Toronto GGR 273 H1S: GIS II Course Outline Winter 2015 Course General Information Course title GIS II Course number
SkySQL Data Suite. A New Open Source Approach to MySQL Distributed Systems. Serge Frezefond V1212.01
SkySQL Data Suite A New Open Source Approach to MySQL Distributed Systems Serge Frezefond V1212.01 Agenda SkySQL Cloud Data Suite Architecture SkySQL Cloud Data Suite on Amazon EC2 Components for automated
BIOS 6660: Analysis of Biomedical Big Data Using R and Bioconductor, Fall 2015 Computer Lab: Education 2 North Room 2201DE (TTh 10:30 to 11:50 am)
BIOS 6660: Analysis of Biomedical Big Data Using R and Bioconductor, Fall 2015 Computer Lab: Education 2 North Room 2201DE (TTh 10:30 to 11:50 am) Course Instructor: Dr. Tzu L. Phang, Assistant Professor
Outline. What is Big data and where they come from? How we deal with Big data?
What is Big Data Outline What is Big data and where they come from? How we deal with Big data? Big Data Everywhere! As a human, we generate a lot of data during our everyday activity. When you buy something,
Tackling Big Data with MATLAB Adam Filion Application Engineer MathWorks, Inc.
Tackling Big Data with MATLAB Adam Filion Application Engineer MathWorks, Inc. 2015 The MathWorks, Inc. 1 Challenges of Big Data Any collection of data sets so large and complex that it becomes difficult
Background on Elastic Compute Cloud (EC2) AMI s to choose from including servers hosted on different Linux distros
David Moses January 2014 Paper on Cloud Computing I Background on Tools and Technologies in Amazon Web Services (AWS) In this paper I will highlight the technologies from the AWS cloud which enable you
Installation Manual for Grid Monitoring Tool
Installation Manual for Grid Monitoring Tool Project No: CDAC/B/SSDG/GMT/2004/025 Document No: SSDG/GMT/2004/025/INST-MAN/2.1 Control Status: Controlled (Internal Circulation Only) Author : Karuna Distribution
UQC103S1 UFCE47-20-1. Systems Development. uqc103s/ufce47-20-1 PHP-mySQL 1
UQC103S1 UFCE47-20-1 Systems Development uqc103s/ufce47-20-1 PHP-mySQL 1 Who? Email: [email protected] Web Site www.cems.uwe.ac.uk/~jedawson www.cems.uwe.ac.uk/~jtwebb/uqc103s1/ uqc103s/ufce47-20-1 PHP-mySQL
Prediction and Fuzzy Logic at ThomasCook to automate price. automate price settings of last minute offers
Prediction and Fuzzy Logic at ThomasCook to automate price settings of last minute offers Jan Wijffels: [email protected] BNOSAC - Belgium Network of Open Source Analytical Consultants www.bnosac.be
SEIZE THE DATA. 2015 SEIZE THE DATA. 2015
1 Copyright 2015 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. Deep dive into Haven Predictive Analytics Powered by HP Distributed R and
Case Study 2 SPR500 Fall 2009
Case Study 2 SPR500 Fall 2009 6 th November 2009 Due Date: 9 th December 2009 Securing Sotnec's web site using Linux Firewall technology Sotnec corporation, an Open Source Company, consists of a small
Spam Marshall SpamWall Step-by-Step Installation Guide for Exchange 5.5
Spam Marshall SpamWall Step-by-Step Installation Guide for Exchange 5.5 What is this document for? This document is a Step-by-Step Guide that can be used to quickly install Spam Marshall SpamWall on Exchange
Android Development Tutorial. Nikhil Yadav CSE40816/60816 - Pervasive Health Fall 2011
Android Development Tutorial Nikhil Yadav CSE40816/60816 - Pervasive Health Fall 2011 Database connections Local SQLite and remote access Outline Setting up the Android Development Environment (Windows)
The data between TC Monitor and remote devices is exchanged using HTTP protocol. Monitored devices operate either as server or client mode.
1. Introduction TC Monitor is easy to use Windows application for monitoring and control of some Teracom Ethernet (TCW) and GSM/GPRS (TCG) controllers. The supported devices are TCW122B-CM, TCW181B- CM,
LoadRunner and Performance Center v11.52 Technical Awareness Webinar Training
LoadRunner and Performance Center v11.52 Technical Awareness Webinar Training Tony Wong 1 Copyright Copyright 2012 2012 Hewlett-Packard Development Development Company, Company, L.P. The L.P. information
How, What, and Where of Data Warehouses for MySQL
How, What, and Where of Data Warehouses for MySQL Robert Hodges CEO, Continuent. Introducing Continuent The leading provider of clustering and replication for open source DBMS Our Product: Continuent Tungsten
1. Product Information
ORIXCLOUD BACKUP CLIENT USER MANUAL LINUX 1. Product Information Product: Orixcloud Backup Client for Linux Version: 4.1.7 1.1 System Requirements Linux (RedHat, SuSE, Debian and Debian based systems such
Online Backup Client User Manual
For Mac OS X Software version 4.1.7 Version 2.2 Disclaimer This document is compiled with the greatest possible care. However, errors might have been introduced caused by human mistakes or by other means.
Session 85 IF, Predictive Analytics for Actuaries: Free Tools for Life and Health Care Analytics--R and Python: A New Paradigm!
Session 85 IF, Predictive Analytics for Actuaries: Free Tools for Life and Health Care Analytics--R and Python: A New Paradigm! Moderator: David L. Snell, ASA, MAAA Presenters: Brian D. Holland, FSA, MAAA
MicroStrategy Analytics Express User Guide
MicroStrategy Analytics Express User Guide Analyzing Data with MicroStrategy Analytics Express Version: 4.0 Document Number: 09770040 CONTENTS 1. Getting Started with MicroStrategy Analytics Express Introduction...
Sisense. Product Highlights. www.sisense.com
Sisense Product Highlights Introduction Sisense is a business intelligence solution that simplifies analytics for complex data by offering an end-to-end platform that lets users easily prepare and analyze
The Whole OS X Web Development System
The Whole OS X Web Development Title slide Building PHP/MySQL Web Databases for OS X Scot Hacker Webmaster, UC Berkeley s Graduate School of Journalism The Macworld Conference on Dreamweaver January 6-7,
Frequently Asked Questions. Frequently Asked Questions. 2013 SSLPost Page 1 of 31 [email protected]
Frequently Asked Questions 2013 SSLPost Page 1 of 31 [email protected] Table of Contents 1 What is SSLPost Cloud? 3 2 Why do I need SSLPost Cloud? 4 3 What do I need to use SSLPost Cloud? 5 4 Which Internet
MIGRATING TO AVALANCHE 5.0 WITH MS SQL SERVER
MIGRATING TO AVALANCHE 5.0 WITH MS SQL SERVER This document provides instructions for migrating to Avalanche 5.0 from an installation of Avalanche MC 4.6 or newer using MS SQL Server 2005. You can continue
Machine Problem 3 (Option 1): Mobile Video Chat
Machine Problem 3 (Option 1): Mobile Video Chat CS414 Spring 2011: Multimedia Systems Instructor: Klara Nahrstedt Posted: Apr 4, 2011 Due: 11:59pm Apr 29, 2011 Introduction You may have heard that Apple
1 Log visualization at CNES (Part II)
1 Log visualization at CNES (Part II) 1.1 Background For almost 2 years now, CNES has set up a team dedicated to "log analysis". Its role is multiple: This team is responsible for analyzing the logs after
Using. DataTrust Secure Online Backup. To Protect Your. Hyper-V Virtual Environment. 1 P a g e
Using DataTrust Secure Online Backup To Protect Your Hyper-V Virtual Environment. 1 P a g e Table of Contents: 1. Backing Up the Guest OS with DataTrustOBM 3 2. Backing up the Hyper-V virtual machine files
Understanding ArcGIS Deployments in Public and Private Cloud. Marwa Mabrouk
Understanding ArcGIS Deployments in Public and Private Cloud Marwa Mabrouk Agenda Back to Basics What are people doing? New Features Using ArcGIS in the Cloud - Private Cloud - Public Cloud Technical Demos
DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7
DATA MINING TOOL FOR INTEGRATED COMPLAINT MANAGEMENT SYSTEM WEKA 3.6.7 UNDER THE GUIDANCE Dr. N.P. DHAVALE, DGM, INFINET Department SUBMITTED TO INSTITUTE FOR DEVELOPMENT AND RESEARCH IN BANKING TECHNOLOGY
Data journalism: what it can do for you
Data journalism: what it can do for you NCSWA workshop, January 12, 2013 Peter Aldhous, San Francisco Bureau Chief [email protected] Twitter: @paldhous From the ashes of the news industry, a phoenix?
1. The orange button 2. Audio Type 3. Close apps 4. Enlarge my screen 5. Headphones 6. Questions Pane. SparkR 2
SparkR 1. The orange button 2. Audio Type 3. Close apps 4. Enlarge my screen 5. Headphones 6. Questions Pane SparkR 2 Lecture slides and/or video will be made available within one week Live Demonstration
WhatsUp Gold v16.2 MSP Edition Deployment Guide This guide provides information about installing and configuring WhatsUp Gold MSP Edition to central
WhatsUp Gold v16.2 MSP Edition Deployment Guide This guide provides information about installing and configuring WhatsUp Gold MSP Edition to central and remote sites. Contents Table of Contents Using WhatsUp
Lesson 7 - Website Administration
Lesson 7 - Website Administration If you are hired as a web designer, your client will most likely expect you do more than just create their website. They will expect you to also know how to get their
Getting Started with R and RStudio 1
Getting Started with R and RStudio 1 1 What is R? R is a system for statistical computation and graphics. It is the statistical system that is used in Mathematics 241, Engineering Statistics, for the following
POPI Cloud Backups. Overview. The Challenges
POPI Cloud Backups Overview POPI Online Backup offers a simple, Secure, rapid deployment solution that is customizable and cost effective to implement a Backup strategy throughout your organisation. POPI
MANIPULATION OF LARGE DATABASES WITH "R"
MANIPULATION OF LARGE DATABASES WITH "R" Ana Maria DOBRE, Andreea GAGIU National Institute of Statistics, Bucharest Abstract Nowadays knowledge is power. In the informational era, the ability to manipulate
Inge Os Sales Consulting Manager Oracle Norway
Inge Os Sales Consulting Manager Oracle Norway Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database Machine Oracle & Sun Agenda Oracle Fusion Middelware Oracle Database 11GR2 Oracle Database
Bomgar KACE Integration 101. System Requirements. Configure Dell KACE
Bomgar KACE Integration 101 This is a comprehensive guide on how to easily integrate your Bomgar appliance with your K1000. This guide is for Admins who have purchased the Bomgar appliance and NOT a setup
Automated Penetration Testing with the Metasploit Framework. NEO Information Security Forum March 19, 2008
Automated Penetration Testing with the Metasploit Framework NEO Information Security Forum March 19, 2008 Topics What makes a good penetration testing framework? Frameworks available What is the Metasploit
