Introduction to bioknoppix: Linux for the life sciences



Similar documents
Cloud Computing Solutions for Genomics Across Geographic, Institutional and Economic Barriers

PARALLEL & CLUSTER COMPUTING CS 6260 PROFESSOR: ELISE DE DONCKER BY: LINA HUSSEIN

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

SGI. High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems. January, Abstract. Haruna Cofer*, PhD

Week Overview. Installing Linux Linux on your Desktop Virtualization Basic Linux system administration

WinBioinfTools: Bioinformatics Tools for Windows Cluster. Done By: Hisham Adel Mohamed

Datzilla. Error Reporting and Tracking for NOAA Data

icer Bioinformatics Support Fall 2011

Cloud BioLinux: Pre-configured and On-demand Bioinformatics Computing for the Genomics Community

out of this world guide to: POWERFUL DEDICATED SERVERS

Bioinformatics Grid - Enabled Tools For Biologists.

112 Linton House Union Street London SE1 0LH T: F:

Building a Top500-class Supercomputing Cluster at LNS-BUAP

Powerful Dedicated Servers

EMBL Identity & Access Management

Debian Med. Integrated software environment for all medical purposes based on Debian GNU/Linux. Andreas Tille. OSWC, Malaga Debian.

A Comparison of VMware and {Virtual Server}

Getting Started with HPC

HPC Cloud. Focus on your research. Floris Sluiter Project leader SARA

CentOS Linux 5.2 and Apache 2.2 vs. Microsoft Windows Web Server 2008 and IIS 7.0 when Serving Static and PHP Content

Mississippi State University High Performance Computing Collaboratory Brief Overview. Trey Breckenridge Director, HPC

Additional Software and Hardware Requirements

Linux clustering. Morris Law, IT Coordinator, Science Faculty, Hong Kong Baptist University

With VTE On-Site you can have VTE CRM installed directly at your company.

Building Clusters for Gromacs and other HPC applications

PTK Forensics. Dario Forte, Founder and Ceo DFLabs. The Sleuth Kit and Open Source Digital Forensics Conference

Applied Micro development platform. ZT Systems (ST based) HP Redstone platform. Mitac Dell Copper platform. ARM in Servers

Scaling from 1 PC to a super computer using Mascot

VMWare Workstation 11 Installation MICROSOFT WINDOWS SERVER 2008 R2 STANDARD ENTERPRISE ED.

Leveraging Open Source / Freeware Solutions

Programming for GCSE Topic H: Operating Systems

Deliverable D 6.1 Website

Visualization and Data Analysis with VIDA. Joe Corkery OpenEye Scientific Software

INDIAN INSTITUTE OF TECHNOLOGY KANPUR Department of Mechanical Engineering

Lecture 6: Operating Systems and Utility Programs

Pipeline Pilot Enterprise Server. Flexible Integration of Disparate Data and Applications. Capture and Deployment of Best Practices

HPC Wales Skills Academy Course Catalogue 2015

AklaBox. The Ultimate Document Platform for your Cloud Infrastructure. Installation Guideline

Installing Ubuntu inside Windows using VirtualBox

Computational infrastructure for NGS data analysis. José Carbonell Caballero Pablo Escobar

Visual UpTime Select Server Specifications

An Introduction to High Performance Computing in the Department

Web Hosting. Hosting. Cloud File Hosting. The Genio Group (214)

Chapter 8 Operating Systems and Utility Programs

Click to view Web Link, click Chapter 8, Click Web Link from left navigation, then click BIOS below Chapter 8 p. 395 Fig. 8-4.

Hypervisor Software and Virtual Machines. Professor Howard Burpee SMCC Computer Technology Dept.

Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing

The Ultimate Business & Enterprise Hosting Solutions.

2: Computer Performance

High Performance. CAEA elearning Series. Jonathan G. Dudley, Ph.D. 06/09/ CAE Associates

Cluster Computing at HRI

Overview. Open source toolchains. Buildroot features. Development process

Lab - Dual Boot - Vista & Windows XP

Early Cloud Experiences with the Kepler Scientific Workflow System

vnas Series All-in-one NAS with virtualization platform

Database Management System Choices. Introduction To Database Systems CSE 373 Spring 2013

Capacity Planning for Microsoft SharePoint Technologies

Open Computers & Softwares Inventory New Generation

Installing and Upgrading to Windows 7

SERVER CLUSTERING TECHNOLOGY & CONCEPT

Migrating from Linux to Mac OS X. David Wheeler Kineticode, Inc.

Creating Library Website Using Open Source Content Management System

Tekla Structures 18 Hardware Recommendation

Chapter 5: System Software: Operating Systems and Utility Programs

OPERATING SYSTEMS Software in the Background. Chapter 2

112 Linton House Union Street London SE1 0LH T: F:

Efficiency of Web Based SAX XML Distributed Processing

Bio-Linux as a Tool for Bioinformatics Training

This guide specifies the required and supported system elements for the application.

138 To satisfy a prerequisite, the student must have earned a letter grade of A, B, C or CR in the prerequisite course, unless otherwise stated.

Installation Manual for Grid Monitoring Tool

Efficiency Considerations of PERL and Python in Distributed Processing

ST ALOYSIUS COLLEGE (AUTONOMOUS)

Built for Business. Ready for the Future.

Enterprise Edition Technology Overview

Hands-On Microsoft Windows Server Chapter 12 Managing System Reliability and Availability

Introduction to Running Computations on the High Performance Clusters at the Center for Computational Research

CMS Query Suite. CS4440 Project Proposal. Chris Baker Michael Cook Soumo Gorai

Transcription:

Introduction to bioknoppix: Linux for the life sciences Carlos M Rodríguez Rivera Humberto Ortiz Zuazaga

Who are we? Short: Bunch of computer geeks. Long: The High Performance Computing facility of the University of Puerto Rico, is presently developing a technology, service and computing infrastructure for the research and education community of the University. http://www.hpcf.upr.edu

HPCf Services Internet2 connectivity to participating institutions. Software development (c, python, perl, php, etc.). Scientific Computing (Blast, Emboss, Gaussian, etc). Databases (MySQL, Postgres, Oracle). Web Services for research (web hosting, email, mirrors). Video Conferencing (Access Grid, H323). Training and support.

Bioinformatics Resource Center BiRC Cafeina SGI Origin 300 shared memory supercomputer 24 GB of RAM 32 Processors Gigabit ethernet Espresso Linux cluster 172 Xeon 2.4 GHz 85 GB of RAM Gigabit ethernet

Bioinformatics Resource Center BiRC Areas supported genomic and proteomic databases sequence analysis software phylogeny software protein structure prediction and visualization bioinformatics programming microarray data visualization and analysis biostatistics research support services and training

What is bioknoppix? Bioknoppix is a customized distribution of knoppix linux live cd, loaded with bioinformatics applications. A linux live cd is a fully functional operating system that boots from the cd without the need of being installed. The nice feature of bioknoppix is that besides using some RAM it doesn't touch the host computer. Being ideal for demos, life sciences students, workshops, etc.

Linux? Why not Windows? Historically bioinformatics applications were developed for High Performance Computing environments, which are generally Unix based. Linux is open source GPL (free as in speech and could be free as in beer). No add on costs. Linux distribution are generally released with compilers, scripting languages, web servers, mail servers, databases, many more. This fact make more feasible for low budget projects to develop on linux vs windows. Many more...

Bioknoppix a short story Knoppix was used at the HPCf for diagnostics, PC repairs and demos. Humberto was giving a bioinformatics class. Wouldn't be nice to have a bioinformatics version of knoppix for the class. Bioknoppix was born.

Applications inside bioknoppix Open source, Open source! Emboss sequence analysis suite Jemboss Emboss interface Artemis genome viewer ClustalX ClustalW graphical interface Cn3D NCBI's 3D viewer ImageJ image processing BioPython python tools BioPerl perl tools Bioconductor microarray and biostatistics analysis tools Rasmol Molecular viewer

Full bioinformatics development environment included With bioknoppix you get a full development environment. Libraries for development in C, C++ are available on bioknoppix like on many linux distributions. On top of that bioknoppix contains libraries for biology applications development under python, perl, and R. Also you can develop applications to be integrated with Emboss.

Bioknoppix Mission To have a working environment attractive to the life science community. Break the ice, such that people lose the fear of linux. Give a sample of the freely available tools and alternatives to licensed software.

Bioknoppix howto Get the CD Download from http://bioknoppix.hpcf.upr.edu/downloads Buy it for a nominal fee: http://cart.cheapbytes.com/cgi-bin/cart/0070011034.html http://linuxcd.org/view_item.php?id_version=500 Boot the cd Make sure your computer boots from the cd (BIOS) Insert the CD, turn on the PC and wait...

DEMO

Similar and related projects VigyaanCD http://www.vigyaancd.org/ DNA-Linux http://www.dnalinux.com/ Bio-linux http://www.biolinux.org/ BioBrew http://bioinformatics.org/biobrew/

Recommended links http://bioinformatics.org/ http://knopper.net/knoppix/ http://www.knoppix.net/ http://opensource.org/ http://www.linux.org/ http://bioinformatics.ubc.ca/resources/links_directory/

Take home message Use open source tools, good for your health and your pocket. There is a whole universe outside licensed software. Introduce to the resources and services offered by the High Performance Computing facility and the Bioinformatics Resource Center. Of course, where to get bioknoppix http://bioknoppix.hpcf.upr.edu