PROGRAMMING FOR BIOLOGISTS. BIOL 6297 Monday, Wednesday 10 am -12 pm

Similar documents
sftp - secure file transfer program - how to transfer files to and from nrs-labs

CLC Server Command Line Tools USER MANUAL

CONNECTING TO DEPARTMENT OF COMPUTER SCIENCE SERVERS BOTH FROM ON AND OFF CAMPUS USING TUNNELING, PuTTY, AND VNC Client Utilities

Remote Access to Unix Machines

SSH and Basic Commands

Laboration 3 - Administration

How to Tunnel Remote Desktop Through SSH on a Windows Computer

Miami University RedHawk Cluster Connecting to the Cluster Using Windows

Access Instructions for United Stationers ECDB (ecommerce Database) 2.0

Linux Overview. Local facilities. Linux commands. The vi (gvim) editor

NaviCell Data Visualization Python API

INSTALLING KAAZING WEBSOCKET GATEWAY - HTML5 EDITION ON AN AMAZON EC2 CLOUD SERVER

What Does Tequila Have to Do with Managing Macs? Using Open Source Tools to Manage Mac OS in the Enterprise!

HPCC - Hrothgar Getting Started User Guide

Connecting to the School of Computing Servers and Transferring Files

HP-UX Essentials and Shell Programming Course Summary

Cisco Networking Academy Program Curriculum Scope & Sequence. Fundamentals of UNIX version 2.0 (July, 2002)

Author A.Kishore/Sachin VNC Background

Adafruit's Raspberry Pi Lesson 6. Using SSH

CASHNet Secure File Transfer Instructions

Cloud Server powered by Mac OS X. Getting Started Guide. Cloud Server. powered by Mac OS X. AKJZNAzsqknsxxkjnsjx Getting Started Guide Page 1

Instructions for Accessing the Advanced Computing Facility Supercomputing Cluster at the University of Kansas

FEEG Applied Programming 6 - Working Remotely on Linux Server

PuTTY/Cygwin Tutorial. By Ben Meister Written for CS 23, Winter 2007

An Introduction to High Performance Computing in the Department

TSM for Windows Installation Instructions: Download the latest TSM Client Using the following link:

Getting Started with HPC

Using a login script for deployment of Kaspersky Network Agent to Mac OS X clients

Tutorial. Reference for more thorough Mininet walkthrough if desired

IDS and Penetration Testing Lab II

IBM WebSphere Application Server Version 7.0

Command Line Interface User Guide for Intel Server Management Software

Navigating the Rescue Mode for Linux

Introduction to Running Hadoop on the High Performance Clusters at the Center for Computational Research

WinSCP PuTTY as an alternative to F-Secure July 11, 2006

There are many different ways in which we can connect to a remote machine over the Internet. These include (but are not limited to):

Installing a Symantec Backup Exec Agent on a SnapScale Cluster X2 Node or SnapServer DX1 or DX2. Summary

TREK HOSC PAYLOAD ETHERNET GATEWAY (HPEG) USER GUIDE

University of Toronto

Volume SYSLOG JUNCTION. User s Guide. User s Guide

Installing and Using No Machine to connect to the Redhawk Cluster. Mac version

Course plan. MSc on Bioinformatics for Health Sciences Academic Year Qualification Master's Degree

Version 5.0 Release Notes

Overview. Remote access and file transfer. SSH clients by platform. Logging in remotely

Download and Install the Citrix Receiver for Mac/Linux

How to Backup XenServer VM with VirtualIQ

Code Estimation Tools Directions for a Services Engagement

SSH Connections MACs the MAC XTerm application can be used to create an ssh connection, no utility is needed.

Viking VPN Guide Linux/UNIX

File Transfer Examples. Running commands on other computers and transferring files between computers

Introduction to UNIX and SFTP

Project 2: Firewall Design (Phase I)

VHA Innovations Program Future Technology Laboratory. Linux Workstation Remote Desktop Connection Manual

Using Symantec NetBackup with Symantec Security Information Manager 4.5

imhosted Web Hosting Knowledge Base

SendMIME Pro Installation & Users Guide

Make a folder named Lab3. We will be using Unix redirection commands to create several output files in that folder.

CONFIGURING ECLIPSE FOR AWS EMR DEVELOPMENT

SAP HANA Client Installation and Update Guide

Command Line - Part 1

Hosted Connecting Steps Client Installation Instructions

Quick Start Guide. User Manual. 1 March 2012

Installing the SSH Client v3.2.2 For Microsoft Windows

Oracle EXAM - 1Z Oracle Weblogic Server 11g: System Administration I. Buy Full Product.

Basic processing of next-generation sequencing (NGS) data

CycleServer Grid Engine Support Install Guide. version 1.25

SparkLab May 2015 An Introduction to

TIBCO ActiveMatrix BusinessWorks Plug-in for TIBCO Managed File Transfer Software Installation

Centers for Medicare and Medicaid Services. Connect: Enterprise Secure Client (SFTP) Gentran. Internet Option Manual

Wakanda Studio Features

There s a variety of software that can be used, but the approach described here uses freely available Cygwin software: (1) Cygwin/X (2) Cygwin/openssh

Using and Contributing Virtual Machines to VM Depot

Installation of PHP, MariaDB, and Apache

Click Studios. Passwordstate. Password Discovery, Reset and Validation. Requirements

SIMIAN systems. Setting up a Sitellite development environment on Windows. Sitellite Content Management System

Using the Yale HPC Clusters

Installing Bacula Client on Mac OS X Server

WinSCP for Windows: Using SFTP to upload files to a server

Cumulus 6 HELIOS Companion 2.0. Administrator Guide

Scheduling in SAS 9.4 Second Edition

MySQL Quick Start Guide

Microsoft Windows PowerShell v2 For Administrators

Tips for getting started! with! Virtual Data Center!

WS_FTP Professional 12

RSA SecurID Token User Guide February 12, 2015

Clearswift Information Governance

Upgrading a Single Node Cisco UCS Director Express, page 2. Supported Upgrade Paths to Cisco UCS Director Express for Big Data, Release 2.

Linux command line. An introduction to the Linux command line for genomics. Susan Fairley

1. Installation Overview

Introduction to MSI* for PubH 8403

Editing Files on Remote File Systems

An Introduction to Using Python with Microsoft Azure

How to Use Remote Desktop over a Secure Connection (SSH)

Scyld Cloud Manager User Guide

2 Advanced Session... Properties 3 Session profile... wizard. 5 Application... preferences. 3 ASCII / Binary... Transfer

Download/Install IDENTD

Contents. TSM Remote Server Installation guide

Mobile Labs Plugin for IBM Urban Code Deploy

Outpost Office Firewall

SETTING UP RASPBERRY PI FOR TOPPY FTP ACCESS. (Draft 5)

Transcription:

PROGRAMMING FOR BIOLOGISTS BIOL 6297 Monday, Wednesday 10 am -12 pm

Tomorrow is Ada Lovelace Day Ada Lovelace was the first person to write a computer program

Today s Lecture Overview of the course Philosophy & Goals Getting Started Logging onto xanadu (Jerry Ebalunode) Course Organization Grading UNIX operating system History Getting Started in UNIX Connecting remotely Working from the command-line

Contact Info Elizabeth Ostrowski SR2, Room 221E eaostrowski@uh.edu Office Hours: After class, Monday and Wednesday (or by appointment)

Course Goals Teach you a programming language Teach you how to carry out particular bioinformatics analyses You can teach yourself Everyone needs to learn something different (customize) Most intro programming is written for computer scientists or software engineers, not biologists. Data Scientist How to generate, analyze, and synthesize large data sets Introduce tools and techniques that are necessary for bioinformatics Use a computing cluster How to write scripts Universal features of languages e.g., loops How to build computational pipelines

Example Bioinformatics Task Text file of sequencing reads Sequence Alignments

An Example Pipeline Raw sequencing reads Mapping Perl script UNIX tools Ruby scripts MAQ Shell script List of Mismatches Filtering UNIX tools AWK List of high confidence SNPs Sequences UNIX tools Ruby BioRuby Genome Sequences Convert Gene Sequences Align Sequence Alignments Molecular Evolution Statistics R UNIX tools (Bio)Ruby RESULTS UNIX tools Muscle Macse Revtrans.py Ruby scripts UNIX tools Ruby Analysis package

What we will learn How to use the UNIX command line to efficiently submit, pipeline, and analyze large data sets Command-line to organize and sort data Shell programming Text editors (emacs) Awk and Sed: Extract and manipulate information from data sets Introduction to two programming languages An interpreted language (Python) A language for statistical analysis & data visualization (R) Ethics - Data Management and Reproducibility Good programming practices

Course Grading Attendance (10%) Class Exercises (30%) Submit answers by the end of class Quizzes (60%) Will drop the two worst grades No make-ups

The Hacker Mentality http://en.wikipedia.org/wiki/cheating Code is not like writing or works of art you are encouraged to be resourceful and to re-purpose code from anywhere you can. Use Google or online user help forums: Debugging Establish proper syntax However, for any graded assignments in this class: You may discuss strategies in general terms ( pseudocode ) Do not show each other actual code (written or typed)

Course Organization Part I => UNIX Operating System Part II => Programming Languages Python Part III => R Data visualization and statistics

Connecting to the UH cluster Connect via ssh ( secure shell ) Installed by default on any UNIX-based machine (Mac/Linux) Mac (Applications->Utilities) Linux Machine (Terminal) Windows PuTTY, a free ssh software Need: A user account IP address Format: $ ssh username@ipaddress For example: ssh eaostrow@xanadu.tlc2.uh.edu ssh elizabeth@171.28.41.6 For our class: ssh biol6297eo1@xanadu.tlc2.uh.edu Check out a node to use in interactive mode: $ qsub I $ exit Do not run jobs (i.e., work) on the login nodes!! Substitute your user name here

Let s Practice Practice Exercises: Learn Code the Hard Way (LCTHW) Command-line http://cli.learncodethehardway.org/book/ Start at: Paths, Folders and Directories

Shell is the user interface Locally, access the shell by opening a Terminal Remotely, use ssh (secure shell) This will open a secure (encrypted) session From a UNIX machine open a Terminal: $ ssh hpc13f52@xanadu.tlc2.uh.edu! Command prompt User name Host name From a Windows machine: Use PuTTY, or some other SSH software

Notes for Mac Users MacOSX is a UNIX-based operating system, but most UNIX utilities are not installed by default Two methods to get these tools from Apple: Install Xcode Preferences -> Command-line tools Download from Apple Developer Website Must register with Apple ID Or install a virtual machine Consider a ports software (e.g., Macports, Fink, or Homebrew)

Getting Started Windows Users: Install PuTTY and/or WinScp http://www.putty.org http://ged.msu.edu/angus/tutorials/using-putty-on-windows.html http://rcc.its.psu.edu/user_guides/remote_connectivity/putty/ Linux Users: Open a Terminal Mac Users: Open a Terminal Developer Tools: http://www.cnet.com/how-to/install-command-line-developer-toolsin-os-x/