A Web Based Software for Synonymous Codon Usage Indices
|
|
|
- Jonah Horn
- 9 years ago
- Views:
Transcription
1 International Journal of Information and Computation Technology. ISSN Volume 3, Number 3 (2013), pp International Research Publications House irphouse.com /ijict.htm A Web Based Software for Synonymous Codon Usage Indices Anu Sharma 1, S.B. Lal 2, DC Mishra 3, Sudhir Srivastava 4 and Anil Rai 5 Centre for Agricultural Bioinformatics, Indian Agricultural Statistics Research Institute, Library Avenue, Pusa, New Delhi , INDIA. Abstract The genetic codes have degeneracy. Most of the amino acids are encoded by more than one codon. Codons encoding the same amino acid are called synonymous codons. The patterns of codon usage vary considerably among organisms, and also among genes from the same genome. This phenomenon is known as codon biasness. Various factors contribute to codon usage biasness like gene expression level, %G+C composition, GC skewness, transcriptional selection. These factors form a pattern called synonymous codon usage pattern which explains the causes of variations present in the genes. Several indices have been used to measure the degree of non-random usage of synonymous codons in a gene. With the exponential increase in the volume of the sequence information, detailed statistical analysis of codon usage is highly desired. Complete analysis of codon usage for gene expression studies requires using many softwares and standard statistical packages for visual representation of data. Some of packages for codon usage analysis are stand alone and are not easily accessible. So, comprehensive web based software for codon usage analysis is highly required by researchers. This paper describes a web based software for analyzing the non-random usage of synonymous codons using various indices developed by researchers. This software is developed using Java, JSP, Apache Tomcat Server and MS-Access. The software will be highly useful for biologists, statisticians and computer scientist involved in biological research. Keywords: Gene Expression Identification, Indices for Codon Usage, Synonymous Codon Usage, Software, Web.
2 148 Anu Sharma et al 1. Introduction The spread of Internet and the growing demand of services from the web users have changed and are still changing the way to organize the work or the study. Statistical software packages have been used for decades to perform statistical analyses. Rapid advancements on the internet technology front have expanded the potential for these packages. An online software development environment allows data sets and analyses to be shared and researchers to communicate with each other quickly and conveniently. Statistical analysis of codon usage remains low may be due to lack of programs for codon usage. Many Commercial software packages available for multivariate analysis are not specifically designed to deal with the biological problems. With the advancement in web technology, it is desired to make available these analyses on the web for quick reference. This paper describes a comprehensive web solution named, WebSynCod, for synonymous codon usage analysis for gene expression identification using client-server architecture. This system can be accessed any time from arbitrary platforms through internet. It includes online analysis using of indices of codon usage. The software has been developed using three-tier client and server architecture. This software will help researchers in carrying out analysis on web. 2. Background Codon usage indices have been extensively studied in the literature for the tabulation and investigation of codon usage. Two types of codon usage indices have been constructed one for calculating codon usage deviation and second that measure codon bias towards a subset of preferred codons. Some of the indices used for codon usage deviation are P2 (Gouy and Gautier, 1982), P (Gribskov et al., 1984), GC3 (Nichols et al. 1980), GC Skew, effective number of Codon usage (ENC, Wright, 1990), relative synonymous codon usage (Sharp et al., 1986), frequency of optimal codon (Ikemura 1981), codon bias index (Benetzen and Hall, 1982) and codon adaptation index (Sharp and Li, 1987). Vetrivel, U. et al. (2007) has developed a software named ACUA (Automated Codon Usage Tool) to perform high throughput sequence analysis aiding statistical profiling of codon usage. Gupta and Ghosh (2000) have developed a non-redundant codon usage database from the complete genomes of 17 organisms. GC percentage at the coding region as well as the three different codon positions was tabulated for each organism. Nakamura et al. (1996) had developed programs that tabulate codon usage of species directly from publically available databases. John (1999) had developed software named codonw to simplify the Multivariate analysis (correspondence analysis) of codon and amino acid usage. It also calculates standard indices of codon usage. But it does not have any in-built graphics for visual representation of results. Countcodon program is web based program to count the number of codons only ( was written based on the C programming language to calculate synonymous codon usage order (SCUO) for each open reading frame (ORF). It is freely available from
3 A Web Based Software for Synonymous Codon Usage Indices Complete analysis of codon usage for gene expression studies requires the calculation of many indices. Some of the packages available for codon usage analysis are stand alone and are not easily accessible. So, comprehensive web based software for codon usage analysis is highly required by researchers. 3. three-tier Architecture of Websyncod WebSYNCod is implemented as a layered structure with each layer corresponding to a different functionality. The three-tier architecture of WebSYNCod is given in Fig. 1. Fig. 1: Three Tier Architecture of WebSYNCod. The User Interface Layer has been implemented using Hyper Text Markup Language (HTML) and JavaScript. Server Side Application Layer has been implemented using Java Server Pages (JSP). Database Layer has been implemented using MySQL. 4. Software Description The Software for Synonymous Codon Usage Analysis (WebSYNCod) has been developed for web platform and programming has been done with the JSP and Java programming language. It has been developed on Intel based computer with 166 MHzclock speed, Microsoft Windows 7 Operating System and 2.0 GB RAM. NetBeans development environment has been used as a platform for development of the software.
4 150 Anu Sharma et al The Home page (Fig. 2) of the software presents the user with a brief welcome note on the software. User may create new account or may log on using the existing account. Fig. 2: Home page of WebSYNCod. 4.1 Input data handling Sequences to be analyzed should be in a single file. A header line is defined as anyline whose first character is a right angled bracket >. There may beany number of header lines but they must precede each sequence, and the second or subsequentheader lines are ignored. Those lines whosefirst character is not > are considered to be sequence data. Sequences must be in thecorrect reading frame, and should not contain untranslated 5 or 3 sequence. WebSYNCod assumesthat the first character of the sequence is the first base of the first codon. The format of each line of sequence data is relaxed; sequences can be either upper or lower case characters. Input lines may be any width and contain spaces and/or numbers. Input data handlingmodule has been designed and developed for reading data for computation of WebSYNCod. Client is required to upload the input data in fasta file format or can also paste or enter the data as shown in Fig. 3.
5 A Web Based Software for Synonymous Codon Usage Indices 151 Fig. 3: Uploading Fasta File. 4.2 Codon Usage Indices The software provides the facility for calculation of base nucleotide composition, bases nucleotide composition at third position, GC and GC3 contents. The software also calculates Codon Adaption Index (CAI), Codon Bias Index (CBI), RSCU and frequency of optimal codons as shown in Fig. 5. Selection of some options of indices will cause the software to prompt for additional files. The indices CAI, CBI and Fop, quantify the adaptation of codon usage towards a set of optimal codons. While optimal codons are known for E.Coli and Saccharomyces cerevisiaeare in-built into software, for most species they are not known. Therefore, selecting one of these indices will cause the software to prompt for a personal choice of optimal codons. Fig. 4: Screen Showing Codon Usage Indices. 5. Conclusion WenSYNCod provides online facility for gene expression identification using synonymous codon usage analysis after it is hosted through a web server. It can save time by doing complex calculations automatically on its own and generating results in understandable format. The software is user friendly and does not demand expertise of computer programming. User can register, login, see results and save result in Excel
6 152 Anu Sharma et al file for further processing using client interface online. Administrator interface of the software helps in development and maintenance of user database. References [1] Bennetzen, J. L., and Hall, B. D. (1982).Codon selection in yeast.journal of BiologicalChemistry, 257, pp [2] Gouy, M., and Gautier, C. (1982).Codon usage in bacteria correlation with gene expressivity.nucleic Acids Research, 10, pp [3] Grantham, R., Gautier, C., Gouy, M., Mercier, R. and Pave, A. (1980b).Codon catalogue usage and the genome hypothesis.nucleic Acids Research, 8, pp [4] Grantham, R., C. Gautier, M. Gouy, M. Jacobzone and R. Mercier, (1981). Codon catalogue usage is a genome strategy for genome expressivity. Nucleic Acids Research, 9, pp r43-r75. [5] Gribskov, M., Devereux, J. and Burgess, R. (1984). The codon preference plot: graphic analysis of protein coding sequences and prediction of gene expression. Nucleic Acids Research, 12, pp [6] Gupta, S. K. and Ghosh, T. C. (2000), CUCG: A non-redundant codon usage database from complete genomes, Current Science, vol. 78, no. 1. [7] Ikemura, T. (1981a). Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli system. Journal of Molecular Biology, 151, [8] John, F.P. (1999), Analysis of Codon usage, Ph.D. Thesis. [9] Nakamura, Y., Wada, K., Wada, Y., Doi, H. and Kanaya, S. (1996). Codon usage tabulated from the international DNA sequence databases. Nucleic Acids Research, 24, pp [10] Nichols, B., Miozzari, G., Cleemput, M. V., Bennett, G. and Yanofsky, C. (1980).Nucleotide sequence of the trpg regions of Escherichia coli, Shingelladysenteriae, salmonella typimurium and Serratiaamrcescens.Journal of Molecular Biology, 142, pp [11] Sharp, P. M., Tuohy, T. M. F. and Mosurski, K. R. (1986). Codon usage in yeast cluster-analysis clearly differentiates highly and lowly expressed genes. Nucleic Acids Research, 14, pp [12] Sharp, P. M., and Li, W. H. (1987a).The codon adaptation index - a measure of directional synonymous codon usage bias, and its potential applications.nucleic Acids Research, 15, pp [13] Vetrivel, U., Arunkumar, V. and Dorairaj, S. (2007). ACUA: A software tool for automated codon usage analysis. Bioinformation, 2(2), pp [14] Wright, F. (1990). The effective number of codon used in a gene. Gene, 87, pp23-29.
(http://genomes.urv.es/caical) TUTORIAL. (July 2006)
(http://genomes.urv.es/caical) TUTORIAL (July 2006) CAIcal manual 2 Table of contents Introduction... 3 Required inputs... 5 SECTION A Calculation of parameters... 8 SECTION B CAI calculation for FASTA
A Multiple DNA Sequence Translation Tool Incorporating Web Robot and Intelligent Recommendation Techniques
Proceedings of the 2007 WSEAS International Conference on Computer Engineering and Applications, Gold Coast, Australia, January 17-19, 2007 402 A Multiple DNA Sequence Translation Tool Incorporating Web
Bioinformatics Resources at a Glance
Bioinformatics Resources at a Glance A Note about FASTA Format There are MANY free bioinformatics tools available online. Bioinformaticists have developed a standard format for nucleotide and protein sequences
Biological Sequence Data Formats
Biological Sequence Data Formats Here we present three standard formats in which biological sequence data (DNA, RNA and protein) can be stored and presented. Raw Sequence: Data without description. FASTA
Bioinformatics Grid - Enabled Tools For Biologists.
Bioinformatics Grid - Enabled Tools For Biologists. What is Grid-Enabled Tools (GET)? As number of data from the genomics and proteomics experiment increases. Problems arise for the current sequence analysis
Investigations on Hierarchical Web service based on Java Technique
Investigations on Hierarchical Web service based on Java Technique A. Bora, M. K. Bhuyan and T. Bezboruah, Member, IAENG Abstract We have designed, developed and implemented a hierarchical web service
Distributed Bioinformatics Computing System for DNA Sequence Analysis
Global Journal of Computer Science and Technology: A Hardware & Computation Volume 14 Issue 1 Version 1.0 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc.
Basic Concepts of DNA, Proteins, Genes and Genomes
Basic Concepts of DNA, Proteins, Genes and Genomes Kun-Mao Chao 1,2,3 1 Graduate Institute of Biomedical Electronics and Bioinformatics 2 Department of Computer Science and Information Engineering 3 Graduate
GenBank, Entrez, & FASTA
GenBank, Entrez, & FASTA Nucleotide Sequence Databases First generation GenBank is a representative example started as sort of a museum to preserve knowledge of a sequence from first discovery great repositories,
DnaSP, DNA polymorphism analyses by the coalescent and other methods.
DnaSP, DNA polymorphism analyses by the coalescent and other methods. Author affiliation: Julio Rozas 1, *, Juan C. Sánchez-DelBarrio 2,3, Xavier Messeguer 2 and Ricardo Rozas 1 1 Departament de Genètica,
REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf])
820 REGULATIONS FOR THE DEGREE OF BACHELOR OF SCIENCE IN BIOINFORMATICS (BSc[BioInf]) (See also General Regulations) BMS1 Admission to the Degree To be eligible for admission to the degree of Bachelor
Biological Databases and Protein Sequence Analysis
Biological Databases and Protein Sequence Analysis Introduction M. Madan Babu, Center for Biotechnology, Anna University, Chennai 25, India Bioinformatics is the application of Information technology to
Online Pest Management Information System
Jour. Ind. Soc. Ag. StatistiCs 55(2),2002: 184-188 Online Pest Management Information System Soubhratra Das, Basant Kumar and P.K. Malhotra Indian Agricultural Statistics Research Institute, New Delhi-I/O
SSDA.Analysis - A Class Library for Analysis of Sample Survey Data
Vol.2, Issue.1, Jan-Feb 2012 pp-242-246 ISSN: 2249-6645 SSDA.Analysis - A Class Library for Analysis of Sample Survey Data Anu Sharma 1 and S. B. Lal 1 1 (Indian Agricultural Statistics Research Institute)
PROC. CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE 2006 1. E-mail: [email protected]
BIOINFTool: Bioinformatics and sequence data analysis in molecular biology using Matlab Mai S. Mabrouk 1, Marwa Hamdy 2, Marwa Mamdouh 2, Marwa Aboelfotoh 2,Yasser M. Kadah 2 1 Biomedical Engineering Department,
Algorithms in Computational Biology (236522) spring 2007 Lecture #1
Algorithms in Computational Biology (236522) spring 2007 Lecture #1 Lecturer: Shlomo Moran, Taub 639, tel 4363 Office hours: Tuesday 11:00-12:00/by appointment TA: Ilan Gronau, Taub 700, tel 4894 Office
Gene Models & Bed format: What they represent.
GeneModels&Bedformat:Whattheyrepresent. Gene models are hypotheses about the structure of transcripts produced by a gene. Like all models, they may be correct, partly correct, or entirely wrong. Typically,
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS
BASIC STATISTICAL METHODS FOR GENOMIC DATA ANALYSIS SEEMA JAGGI Indian Agricultural Statistics Research Institute Library Avenue, New Delhi-110 012 [email protected] Genomics A genome is an organism s
SGI. High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems. January, 2012. Abstract. Haruna Cofer*, PhD
White Paper SGI High Throughput Computing (HTC) Wrapper Program for Bioinformatics on SGI ICE and SGI UV Systems Haruna Cofer*, PhD January, 2012 Abstract The SGI High Throughput Computing (HTC) Wrapper
RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison
RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the
Graph theoretic approach to analyze amino acid network
Int. J. Adv. Appl. Math. and Mech. 2(3) (2015) 31-37 (ISSN: 2347-2529) Journal homepage: www.ijaamm.com International Journal of Advances in Applied Mathematics and Mechanics Graph theoretic approach to
Introduction to Bioinformatics 3. DNA editing and contig assembly
Introduction to Bioinformatics 3. DNA editing and contig assembly Benjamin F. Matthews United States Department of Agriculture Soybean Genomics and Improvement Laboratory Beltsville, MD 20708 [email protected]
BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16
Course Director: Dr. Barry Grant (DCM&B, [email protected]) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems
Name Class Date. Figure 13 1. 2. Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.
13 Multiple Choice RNA and Protein Synthesis Chapter Test A Write the letter that best answers the question or completes the statement on the line provided. 1. Which of the following are found in both
Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM [email protected]
Lecture 11 Data storage and LIMS solutions Stéphane LE CROM [email protected] Various steps of a DNA microarray experiment Experimental steps Data analysis Experimental design set up Chips on catalog
Module 10: Bioinformatics
Module 10: Bioinformatics 1.) Goal: To understand the general approaches for basic in silico (computer) analysis of DNA- and protein sequences. We are going to discuss sequence formatting required prior
CCR Biology - Chapter 9 Practice Test - Summer 2012
Name: Class: Date: CCR Biology - Chapter 9 Practice Test - Summer 2012 Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Genetic engineering is possible
Introduction to Genome Annotation
Introduction to Genome Annotation AGCGTGGTAGCGCGAGTTTGCGAGCTAGCTAGGCTCCGGATGCGA CCAGCTTTGATAGATGAATATAGTGTGCGCGACTAGCTGTGTGTT GAATATATAGTGTGTCTCTCGATATGTAGTCTGGATCTAGTGTTG GTGTAGATGGAGATCGCGTAGCGTGGTAGCGCGAGTTTGCGAGCT
Vector NTI Advance 11 Quick Start Guide
Vector NTI Advance 11 Quick Start Guide Catalog no. 12605050, 12605099, 12605103 Version 11.0 December 15, 2008 12605022 Published by: Invitrogen Corporation 5791 Van Allen Way Carlsbad, CA 92008 U.S.A.
Bioruptor NGS: Unbiased DNA shearing for Next-Generation Sequencing
STGAAC STGAACT GTGCACT GTGAACT STGAAC STGAACT GTGCACT GTGAACT STGAAC STGAAC GTGCAC GTGAAC Wouter Coppieters Head of the genomics core facility GIGA center, University of Liège Bioruptor NGS: Unbiased DNA
Student Attendance Through Mobile Devices
Student Attendance Through Mobile Devices Anurag Rastogi Kirti Gupta Department of Computer Science and Engineering National Institute of Technology Rourkela Rourkela-769 008, Odisha, India Student Attendance
ASSOCIATION RULE MINING ON WEB LOGS FOR EXTRACTING INTERESTING PATTERNS THROUGH WEKA TOOL
International Journal Of Advanced Technology In Engineering And Science Www.Ijates.Com Volume No 03, Special Issue No. 01, February 2015 ISSN (Online): 2348 7550 ASSOCIATION RULE MINING ON WEB LOGS FOR
NASSI-SCHNEIDERMAN DIAGRAM IN HTML BASED ON AML
Volume 6, Number 3, 2013 NASSI-SCHNEIDERMAN DIAGRAM IN HTML BASED ON AML László Menyhárt Abstract: In an earlier work I defined an extension of XML called Algorithm Markup Language (AML) for easy and understandable
Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources
1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools
MORPHEUS. http://biodev.cea.fr/morpheus/ Prediction of Transcription Factors Binding Sites based on Position Weight Matrix.
MORPHEUS http://biodev.cea.fr/morpheus/ Prediction of Transcription Factors Binding Sites based on Position Weight Matrix. Reference: MORPHEUS, a Webtool for Transcripton Factor Binding Analysis Using
XTendTraders.com Trading room simulator
2011 2012 XTendTraders.com Trading room simulator BELGHITI ALAOUI Mohammed IMAFA BEN HAMOUDA Ahmed IMAFA EL FERACHI Anas AL EL HAJJI Khalil AL Polytech Nice Sophia Antipolis SI4 AL/IMAFA 2011 2012 1 CONTENTS
LabGenius. Technical design notes. The world s most advanced synthetic DNA libraries. [email protected] V1.5 NOV 15
LabGenius The world s most advanced synthetic DNA libraries Technical design notes [email protected] V1.5 NOV 15 Introduction OUR APPROACH LabGenius is a gene synthesis company focussed on the design and manufacture
FINDING RELATION BETWEEN AGING AND
FINDING RELATION BETWEEN AGING AND TELOMERE BY APRIORI AND DECISION TREE Jieun Sung 1, Youngshin Joo, and Taeseon Yoon 1 Department of National Science, Hankuk Academy of Foreign Studies, Yong-In, Republic
When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want
1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very
International Journal of Engineering Technology, Management and Applied Sciences. www.ijetmas.com November 2014, Volume 2 Issue 6, ISSN 2349-4476
ERP SYSYTEM Nitika Jain 1 Niriksha 2 1 Student, RKGITW 2 Student, RKGITW Uttar Pradesh Tech. University Uttar Pradesh Tech. University Ghaziabad, U.P., India Ghaziabad, U.P., India ABSTRACT Student ERP
Annex 6: Nucleotide Sequence Information System BEETLE. Biological and Ecological Evaluation towards Long-Term Effects
Annex 6: Nucleotide Sequence Information System BEETLE Biological and Ecological Evaluation towards Long-Term Effects Long-term effects of genetically modified (GM) crops on health, biodiversity and the
Human Genome Organization: An Update. Genome Organization: An Update
Human Genome Organization: An Update Genome Organization: An Update Highlights of Human Genome Project Timetable Proposed in 1990 as 3 billion dollar joint venture between DOE and NIH with 15 year completion
SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications
Product Bulletin Sequencing Software SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications Comprehensive reference sequence handling Helps interpret the role of each
Guide for Bioinformatics Project Module 3
Structure- Based Evidence and Multiple Sequence Alignment In this module we will revisit some topics we started to look at while performing our BLAST search and looking at the CDD database in the first
DNA and the Cell. Version 2.3. English version. ELLS European Learning Laboratory for the Life Sciences
DNA and the Cell Anastasios Koutsos Alexandra Manaia Julia Willingale-Theune Version 2.3 English version ELLS European Learning Laboratory for the Life Sciences Anastasios Koutsos, Alexandra Manaia and
Web Mining using Artificial Ant Colonies : A Survey
Web Mining using Artificial Ant Colonies : A Survey Richa Gupta Department of Computer Science University of Delhi ABSTRACT : Web mining has been very crucial to any organization as it provides useful
Exon: a web-based software toolkit for DNA sequence analysis
Exon: a web-based software toolkit for DNA sequence analysis Diogo Pratas, Armando J. Pinho, and Sara P. Garcia Abstract Recent advances in DNA sequencing methodologies have caused an exponential growth
An Introduction to Genomics and SAS Scientific Discovery Solutions
An Introduction to Genomics and SAS Scientific Discovery Solutions Dr Karen M Miller Product Manager Bioinformatics SAS EMEA 16.06.03 Copyright 2003, SAS Institute Inc. All rights reserved. 1 Overview!
Japan Communication India Skill Development Center
Japan Communication India Skill Development Center Java Application System Developer Course Detail Track 2a Java Application Software Developer: Phase1 SQL Overview 70 Introduction Database, DB Server
Focusing on results not data comprehensive data analysis for targeted next generation sequencing
Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes
Teaching and Training of Agricultural Statistics
Teaching and Training of Agricultural Statistics V.K. BHATIA Indian Agricultural Statistics Research Institute, Library Avenue, Pusa, New Delhi-110 012, India www.iasri.res.in [email protected]; [email protected]
Intro to Bioinformatics
Intro to Bioinformatics Marylyn D Ritchie, PhD Professor, Biochemistry and Molecular Biology Director, Center for Systems Genomics The Pennsylvania State University Sarah A Pendergrass, PhD Research Associate
Overview of Eukaryotic Gene Prediction
Overview of Eukaryotic Gene Prediction CBB 231 / COMPSCI 261 W.H. Majoros What is DNA? Nucleus Chromosome Telomere Centromere Cell Telomere base pairs histones DNA (double helix) DNA is a Double Helix
Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals
Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Xiaohui Xie 1, Jun Lu 1, E. J. Kulbokas 1, Todd R. Golub 1, Vamsi Mootha 1, Kerstin Lindblad-Toh
Japan Communication India Skill Development Center
Japan Communication India Skill Development Center Java Application System Developer Course Detail Track 1B Java Application Software Developer: Phase1 DBMS Concept 20 Entities Relationships Attributes
Genome Explorer For Comparative Genome Analysis
Genome Explorer For Comparative Genome Analysis Jenn Conn 1, Jo L. Dicks 1 and Ian N. Roberts 2 Abstract Genome Explorer brings together the tools required to build and compare phylogenies from both sequence
Translation Study Guide
Translation Study Guide This study guide is a written version of the material you have seen presented in the replication unit. In translation, the cell uses the genetic information contained in mrna to
How Sequencing Experiments Fail
How Sequencing Experiments Fail v1.0 Simon Andrews [email protected] Classes of Failure Technical Tracking Library Contamination Biological Interpretation Something went wrong with a machine
ANALYSIS OF ENTITY-ATTRIBUTE-VALUE MODEL APPLICATIONS IN FREELY AVAILABLE DATABASE MANAGEMENT SYSTEMS FOR DNA MICROARRAY DATA PROCESSING 1.
JOURNAL OF MEDICAL INFORMATICS & TECHNOLOGIES Vol. 20/2012, ISSN 1642-6037 entity-attribute-value model, relational database management system, DNA microarray Tomasz WALLER 1, Damian ZAPART 1, Magdalena
FREE computing using Amazon EC2
FREE computing using Amazon EC2 Seong-Hwan Jun 1 1 Department of Statistics Univ of British Columbia Nov 1st, 2012 / Student seminar Outline Basics of servers Amazon EC2 Setup R on an EC2 instance Stat
Gene Expression Macro Version 1.1
Gene Expression Macro Version 1.1 Instructions Rev B 1 Bio-Rad Gene Expression Macro Users Guide 2004 Bio-Rad Laboratories Table of Contents: Introduction..................................... 3 Opening
Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing
Efficient Parallel Execution of Sequence Similarity Analysis Via Dynamic Load Balancing James D. Jackson Philip J. Hatcher Department of Computer Science Kingsbury Hall University of New Hampshire Durham,
Decision Support System for Trait Specific Germplasm Identified Through Multi-location Evaluation
International Journal of Genetic Engineering and Biotechnology. ISSN 0974 3073 Volume 5, Number 2 (2014), pp. 127-132 International Research Publication House http://www.irphouse.com Decision Support System
SMRT Analysis v2.2.0 Overview. 1. SMRT Analysis v2.2.0. 1.1 SMRT Analysis v2.2.0 Overview. Notes:
SMRT Analysis v2.2.0 Overview 100 338 400 01 1. SMRT Analysis v2.2.0 1.1 SMRT Analysis v2.2.0 Overview Welcome to Pacific Biosciences' SMRT Analysis v2.2.0 Overview 1.2 Contents This module will introduce
Evaluation of Open Source Data Cleaning Tools: Open Refine and Data Wrangler
Evaluation of Open Source Data Cleaning Tools: Open Refine and Data Wrangler Per Larsson [email protected] June 7, 2013 Abstract This project aims to compare several tools for cleaning and importing
BUDAPEST: Bioinformatics Utility for Data Analysis of Proteomics using ESTs
BUDAPEST: Bioinformatics Utility for Data Analysis of Proteomics using ESTs Richard J. Edwards 2008. Contents 1. Introduction... 2 1.1. Version...2 1.2. Using this Manual...2 1.3. Why use BUDAPEST?...2
Japan Communication India Skill Development Center
Japan Communication India Skill Development Center Java Application System Developer Course Detail Track 2b Java Application Software Developer: Phase1 SQL Overview 70 Introduction Database, DB Server
Client-server 3-tier N-tier
Web Application Design Notes Jeff Offutt http://www.cs.gmu.edu/~offutt/ SWE 642 Software Engineering for the World Wide Web N-Tier Architecture network middleware middleware Client Web Server Application
Molecular Genetics. RNA, Transcription, & Protein Synthesis
Molecular Genetics RNA, Transcription, & Protein Synthesis Section 1 RNA AND TRANSCRIPTION Objectives Describe the primary functions of RNA Identify how RNA differs from DNA Describe the structure and
The Steps. 1. Transcription. 2. Transferal. 3. Translation
Protein Synthesis Protein synthesis is simply the "making of proteins." Although the term itself is easy to understand, the multiple steps that a cell in a plant or animal must go through are not. In order
MBARI Deep Sea Guide: Designing a web interface that represents information about the Monterey Bay deep-sea world.
MBARI Deep Sea Guide: Designing a web interface that represents information about the Monterey Bay deep-sea world. Pierre Venuat, University of Poitiers Mentors: Brian Schlining and Nancy Jacobsen Stout
Hidden Markov Models in Bioinformatics. By Máthé Zoltán Kőrösi Zoltán 2006
Hidden Markov Models in Bioinformatics By Máthé Zoltán Kőrösi Zoltán 2006 Outline Markov Chain HMM (Hidden Markov Model) Hidden Markov Models in Bioinformatics Gene Finding Gene Finding Model Viterbi algorithm
Sequence Formats and Sequence Database Searches. Gloria Rendon SC11 Education June, 2011
Sequence Formats and Sequence Database Searches Gloria Rendon SC11 Education June, 2011 Sequence A is the primary structure of a biological molecule. It is a chain of residues that form a precise linear
MAKING AN EVOLUTIONARY TREE
Student manual MAKING AN EVOLUTIONARY TREE THEORY The relationship between different species can be derived from different information sources. The connection between species may turn out by similarities
Library and information science research trends in India
Annals of Library and Studies Vol. 58, December 011, pp. 319-35 Library and information science research trends in India Rekha Mittal Senior Principal Scientist, CSIR-National Institute of Science Communication
Teaching Bioinformatics to Undergraduates
Teaching Bioinformatics to Undergraduates http://www.med.nyu.edu/rcr/asm Stuart M. Brown Research Computing, NYU School of Medicine I. What is Bioinformatics? II. Challenges of teaching bioinformatics
Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6
Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6 In the last lab, you learned how to perform basic multiple sequence alignments. While useful in themselves for determining conserved residues
Genetomic Promototypes
Genetomic Promototypes Mirkó Palla and Dana Pe er Department of Mechanical Engineering Clarkson University Potsdam, New York and Department of Genetics Harvard Medical School 77 Avenue Louis Pasteur Boston,
MATCH Commun. Math. Comput. Chem. 61 (2009) 781-788
MATCH Communications in Mathematical and in Computer Chemistry MATCH Commun. Math. Comput. Chem. 61 (2009) 781-788 ISSN 0340-6253 Three distances for rapid similarity analysis of DNA sequences Wei Chen,
Information and Knowledge Management Tools, Techniques
Information and Knowledge Management Tools, Techniques and Practices Editor Ajit K. Roy Formerly Principal Scientist and Head Social Science Section Central Institute of Freshwater Aquaculture (CIFA-ICAR),
SYSTEM OF MONITORING AND CONTROL FOR THE AUTOMATION OF INDUSTRIAL WASH MACHINES
SYSTEM OF MONITORING AND CONTROL FOR THE AUTOMATION OF INDUSTRIAL WASH MACHINES Catalin BUJDEI Liviu PERNIU Ion TRUICAN Mihai CARAMAN Automatics Department, Transilvania University of Brasov, M.Viteazu
Review of the Techniques for Smart Learning Systems
, pp.1-5 http://dx.doi.org/10.14257/astl.2016. Review of the Techniques for Smart Learning Systems Jaegeol Yim, Sangheon Kim 1 Departmet of Computer Engineering, Dongguk University at Gyeongju, 38066 Korea
