Package phylotools. R topics documented: February 20, Type Package Title Phylogenetic tools for Eco-phylogenetics Version 0.1.
|
|
|
- Mervin Booth
- 10 years ago
- Views:
Transcription
1 Type Package Title Phylogenetic tools for Eco-phylogenetics Version Date Package phylotools February 20, 2015 Author Jinlong Zhang, Nancai Pei, Xiangcheng Mi Maintainer Jinlong Zhang Building supermatrix for DNA barcodes using different genes, calculating the inequality among lineages and phylogenetic similarity for very large dataset using slicing methods by invoking Phylocom. Depends seqrflp, ape, picante, spaa, fields Suggests vegan License GPL-2 LazyLoad yes Repository CRAN Date/Publication :59:27 NeedsCompilation no R topics documented: phylotools-package add.mat aln2dat appendchar complement dat2phy del.tree.tip dimension edgesub fasta.split fmatch
2 2 phylotools-package formatxy framsub gini hcreorder imagevect inequality meangini phy2dat phylocom.comdist phylocom.comstruct phylocom.pd phylocom.phylosor phyloshuffle plotcut plotscale read.phylip rename.fasta resid.tree reverse rh rm.col RMPD seq2fasta sub.tip.label supermat uniquefasta write.mat Index 44 phylotools-package Phylogenetic tools for ecologists This package currently consists of a few functions for handling DNA-barcoding sequences to build supermatrix for further analysis with RAxML etc. Much more functions for conducting phylogenetic analysis would be added in the future, especially for community phylogentic analysis. Package: phylotools Type: Package Version: Date: License: GLP-2 LazyLoad: yes
3 phylotools-package 3 Jinlong Zhang, Nancai Pei, Xiangcheng Mi Maintainer: Jinlong Zhang <[email protected]> Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America ############################################### ## Example Part I###Building Supermatrix######## ############################################### ### Build super matrix dir <- system.file("extdata", package = "phylotools") setwd(dir) ## Supermatrix with "rbcla","matk","trnh-psba" supermat <- supermat(rbcl = "rbcla.phy", matk = "matk.phy", trn = c("trn1.phy", "trn2.phy","trn3.phy","trn4.phy")) ## Save to file write.mat(supermat, "result.phy") ## Delete file unlink("result.phy") ############################################### ## Example Part II###Create a image plot####### ############################################### ## Create an Image with legend from named numerical vectors x <- rnorm(600) labmat <- expand.grid(paste("x",1:30, sep = ""), paste("y", 1:20, sep = "")) lab <- paste(labmat[,1], labmat[,2], sep = "") imagevect(x, labels = lab, col = cm.colors(10)) ############################################### ## Example Part III###Handling fasta project###
4 4 phylotools-package ############################################### ## Handling Fasta files library(seqrflp) ## loading data data(fil.fas) ## Get the names of the sequences col1 <- gnames.fas(fil.fas) ## Generating new names col2 <- paste("seq", 1:length(col1), sep = "") reftable.rename <- data.frame(col1, col2) renamed <- rename.fasta(fil.fas, reftable.rename) ##Generate split factor levels. index <- rep(na, length(col2)) level1 <- seq(1, length(col2), by = 2) index[level1] <- 1 index[-level1] <- 2 ## Reference table reftable.split <- data.frame(col2, index) ## split the fasta object fasta.split(renamed, reftable.split) ################################################# ## Example Part IV###Phylosor for very large data ################################################# ## Make sure the PhyloCom can be invoked by command line ## res <- phylocom.phylosor(sample.file = "sample", ## phylo = "phylo") ################################################# ## Example Part V###Rescale the FDP data ####### ## to different scales ########################## ################################################# ## 20m ## plotscale(inputdata = BCI, len = 1000, wid = 500, ## scale = 20) ## 50m ## plotscale(inputdata = BCI, len = 1000, wid = 500, ## scale = 50) ## 100m ## plotscale(inputdata = BCI, len = 1000, wid = 500, ## scale = 100) ##################################################
5 add.mat 5 ## Example Part VI### ## Lineages inequality and ## Mean Gini Coefficient amonge ## lineages caused by imperfect sampling ################################################## data(bird.orders) rtr1 <- del.tree.tip(bird.orders,3) inequality(bird.orders, rtr1[[1]], h = ) data(bird.orders) to.drop <- c("craciformes", "Galliformes", "Gruiformes") droped <- drop.tip(bird.orders, to.drop) meangini(tree = bird.orders, subtree = droped, times = 10, plot = TRUE) meangini(tree = bird.orders, subtree = droped, times = 50, plot = TRUE) meangini(tree = bird.orders, subtree = droped, times = 100, plot = TRUE) meangini(tree = bird.orders, subtree = droped, times = 200, plot = TRUE) add.mat Add matrix to current matrix Concatenates second columns of matrix. add.mat(mat1, mat2) mat1 mat2 A dataframe with 2 columns. A dataframe with 2 columns. This function could be used to concatenate the second column of the second matrix with the first matrix s second column according to each matrix s first column (as the names of the sequences ). The "?" will be added automatically. A matrix with first column the names of the sequences, the second column the concatenated sequences.
6 6 aln2dat Jinlong Zhang Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America supermat ## add.mat example ######## dir <- system.file("extdata", package = "phylotools") setwd(dir) rbcla <- read.phylip("rbcla.phy") matk <- read.phylip("matk.phy" ) rbdat <- phy2dat(rbcla) matdat <- phy2dat(matk) add.mat(rbdat, matdat) aln2dat Convert ClustalX alignment data to dataframe Convert ClustalX alignment data to dataframe aln2dat(aln) aln A character string indicating the the aln format file generated by Clustal None. Dataframe
7 appendchar 7 Note Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America Jinlong Zhang <[email protected]> None. dir <- system.file("extdata", package = "phylotools") setwd(dir) ##aln2dat examle ######## test <- readlines("aln") aln2dat(test) appendchar Paste character according to the longest character. Paste character for each elements at the second column which is shorter than the longest one. appendchar(mat, pattern = "?") mat pattern data frame with two columns Character to be added to the end of elements None. A data frame with specified character appended to the end of each shorter elements.
8 8 complement Note None Jinlong Zhang Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America framsub ##appendchar example ######## aa <- c("1","2","3") bb <- c("abc","abcd","abcdefg") rrr <- data.frame(aa, bb) rrr appendchar(rrr) complement Get the complement sequences This function could be used to convert the sequences to its complement sequence. The input data must be fasta format. complement(fas) fas Fasta format object This function could be used to obtain the complement sequence given a fasta file. For example, the complement sequence for "5-TTGAACC-3" is "5-GGTTCAA-3".This may be used for converting the consensus sequence to its complement, when the user find that the sequence can not be aligned with other sequences.
9 dat2phy 9 The complement sequences in fasta format. Jinlong Zhang <[email protected]> None revcomp for more information. cat( ">No305", "NTTCGAAAAACACACCCACTACTAAAANTTATCAGTCACT", file = "dna1.fas", sep = "\n") sequence <- read.fasta("dna1.fas") complement(sequence) unlink("dna1.fas") dat2phy Convert dataframe to phylip format Convert dataframe to phylip format dat2phy(input, write = TRUE) input write dataframe with the first column the names of the sequence s and the second column the sequences Write the file to disk. Convert dataframe to phylip format, whose first column are the names of the sequence s and the second column the sequences
10 10 del.tree.tip Note A vector of phylip format. None Jinlong Zhang <[email protected]> Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America phy2dat ##dat2phy example ######## ##Convert dataframe to phylip object dir <- system.file("extdata", package = "phylotools") setwd(dir) test <- read.phylip("matk.phy") test2 <- phy2dat(test) dat2phy(test2) del.tree.tip Randomly Delete tree tip labels This function will randomly deleting tree tip labels, this is an internal function in phylotools del.tree.tip(tree, n) tree n The phylogenetic tree Number of tips to be deleted
11 dimension 11 The phylogenetic tree in class phylo Jinlong Zhang < > To be added help data(bird.orders) del.tree.tip(bird.orders, 4) dimension Get the dimension of the matrix defined by "XnYm" labels This is an internal function for handling "XnYm" labels, users do not need to call this function directly. dimension(x, unique = FALSE, sort = FALSE) x unique sort A vector contain the "XnYm" labels Logical, remove the duplicated value Logical, whether to sort the value in the result A list with the rows or columns Jinlong Zhang < [email protected] >
12 12 edgesub imagevect dimension(c("x1y4","x33y56")) edgesub Substitute character for a sequence from both sides. Substitute character for a sequence from both sides without changing the internal matched characters. edgesub(x, pattern = "-", replacement = "?") x pattern replacement The input DNA sequence. The character that to be substitute. The character that will replace the pattern. Note None. This function will return to the substituted pattern of the input DNA sequence. None Jinlong Zhang <[email protected]> Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America
13 fasta.split 13 framsub, for substitution of a dataframe, which is a general application of this function. ## edgesub example ######## #### Substitute each terminal of the sequence dna = "---ATTGCCTTAS--TTTAAAAAACCGTTC-----" edgesub(dna) fasta.split Split the fasta object to fasta objects Split the fasta object to fasta objects according to the groups given fasta.split(fasta, ref, save2disk = FALSE) fasta ref save2disk The fasta object Data frame with first column of the names of the sequences, the second column which group it belongs to. Whether to save the groups of fasta files to separate files This function may be used to split the trnh-psba sequences, given the order ( usually APGIII ) it belongs to. Returns a list, indicate the groups for the sequences Jinlong Zhang <[email protected]> Non.
14 14 fmatch rename.fasta library(seqrflp) ## loading data data(fil.fas) ## Get the names of the sequences col1 <- gnames.fas(fil.fas) ## Generating new names col2 <- paste("seq", 1:length(col1), sep = "") reftable.rename <- data.frame(col1, col2) renamed <- rename.fasta(fil.fas, reftable.rename) ##Generate split factor levels. index <- rep(na, length(col2)) level1 <- seq(1, length(col2), by = 2) index[level1] <- 1 index[-level1] <- 2 ## Reference table reftable.split <- data.frame(col2, index) ## split the fasta object fasta.split(renamed, reftable.split) fmatch Find the matching index Find the matching index fmatch(dna, pattern = "") dna pattern A character string including the DNA sequence. A character to be used in matching the DNA sequence. None.
15 formatxy 15 Note The max number the input character could be matched from left to right. None Jinlong Zhang Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America edgesub for a general application of this function. ### fmatch example ######## dna = "---ATTGCCTTAS--TTTAAAAAACCGTTC-----" fmatch(dna, "-") formatxy Formating XY label Add "0" in characters where should be added, inorder to make the character labels sortable. formatxy(x) x A character vector containing the labels in "XnYm" format This function will add "0" before the number n if it has fewer characters than the number that has more characters in the same position. Only then the elements "XnYm" in the label vector could be sorted.
16 16 framsub Jinlong Zhang < > imagevect formatxy(c("x1y1","x33y567")) framsub Pattern of substitutions of a dataframe. This function will replace the specified character in the egde of each element to the replacement character of the second column of the input dataframe. framsub(x, pattern = "-", replacement = "?") x pattern replacement The input dataframe. A character to be matched. A character to be replaced. This is a general application of the function edgesub.the only the matched characters at the egde of the input DNA string will be replaced. A dataframe with the matched characters at the two edges replaced. Jinlong Zhang <[email protected]> Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America
17 gini 17 edgesub ##framsub example ######## dir <- system.file("extdata", package = "phylotools") setwd(dir) testphy <- read.phylip("rbcla.phy") testdat <- phy2dat(testphy) framsub(testdat) gini Gini coefficient of inequality The Gini coefficient of inequality gini(x) x A numeric vector Gini coefficient is a measure of inequality. Gini coefficient is based on Lorenz s curve, and the range of this coefficient is between 0 and 1. The more lower the value, the more equality the distribution is. The algorithm of this function is directly adopted from package "ineq" by Achim Zeileis. Jinlong Zhang < [email protected] > F A Cowell: Measurement of Inequality, 2000, in A B Atkinson / F Bourguignon (Eds): Handbook of Income Distribution, Amsterdam meangini
18 18 hcreorder ### gini(c(1,3,2,1,1,1,4)) hcreorder Reorder the height for hclust object This function could be used to reorder the height values in hclust objects, especially when the hclust object was generated by as.hclust.phylo() in package ape. Since the method rect.hclust() or cutree() could not be applied to the hclust objected generated by ape before the height of the nodes be sorted in a increase order. hcreorder(x) x object in class hclust. It should be note that if the height of the nodes are not sorted in increasing order, the rect.hclust() or cutree() will give wrong results. If the hclust object is generated by hclust() on distance matrix, there is no need to call this function. However, as hclust object was specified manually, or if generated using ape from a phylogenetic tree using as.hclust.phylo, you may have to call hcreorder first to reorder the heights of nodes. It should be noted that the merge matrix in hclust has also been regenerated according the definition of hclust object. hclust object with heights been reordered Jinlong Zhang <[email protected]> None. hclust,cutree,rect.hclust
19 imagevect 19 ### help file for hreorder() ### load ape library library(ape) data(bird.orders) ### convert the bird.orders to class hclust bird.hclust <- as.hclust(multi2di(bird.orders)) ### ready to draw tree/dendrograms par(mfrow = c(1,3)) ### plot phylogenetic tress plot(bird.orders, direction = "downwards", cex = 1.5) ### the wrong results, because of the unordered height in hclust ### plot the bird.hclust plot(bird.hclust, hang = -1 ) ### add rectangles to the plot rect.hclust(bird.hclust, k = 4) ### the proper results. ### reorder the hclust object bird.hclust.ordered <- hcreorder(bird.hclust) ### plot the reordered hclust object plot(bird.hclust.ordered, hang = -1 ) ### add rectangles to the plot rect.hclust(bird.hclust.ordered, k = 4) ### call cutree to divide the taxon to groups/clades cutree(bird.hclust.ordered, k = 4) ## Not run imagevect Image plot of a numeric vector An image plot of a vector given the labels in "XnYn" in format. imagevect(x, labels, contour = FALSE, gridsize = 20, axes = TRUE, nlabx = 5, nlaby = 5,...)
20 20 imagevect x labels contour gridsize axes nlabx nlaby A numeric vector representing the value for each quadrat. A character vector representing the row and column for each value in x. Should be in format "X10Y5", which is account for column 10 (X = 10), and row 5(Y = 5). Whether to add contour lines to the existing image plot. Size for each quadrat or grid Whether to draw axes for the plot Number of labels to be added to the x axis Number of labels to be added to the y axis... Further arguments to passed on for plotting This function will automatically sort the values in vector according to the information of rows and columns that extracted from labels, and subsequently build a matrix for invoking the image.plot function in "fields" package. The labels must be in "XnYn" format, and must be in the same length with vec. Image plot for the vector. Jinlong Zhang < [email protected] > None x <- rnorm(600) labmat <- expand.grid(paste("x",1:30, sep = ""), paste("y", 1:20, sep = "")) lab <- paste(labmat[,1], labmat[,2], sep = "") imagevect(x, labels = lab, col = cm.colors(10))
21 inequality 21 inequality The inequality of a cutting time This function could be applied to calculate the inequality of a given time. inequality(tree, subtree, h, detail = FALSE) tree subtree h detail The phylogenetic tree, in class phylo as defined in ape The phylogenetic tree, usually the original tree with some of the tips removed. Numeric: The given time Logical,the time The inequality of the insufficient sampling is evaluated by Gini coefficient with a range of c(0, 1). If the value equals to zero, a ideally equality observed; if the values equals to 1, a extremely inequality observed. Jinlong Zhang < [email protected] > To be added data(bird.orders) rtr1 <- del.tree.tip(bird.orders,3) inequality(bird.orders, rtr1[[1]], h = )
22 22 meangini meangini Mean Gini coefficient Calculating the inequality of clade insufficient sampling Gini coefficient using Monte Carlo simulations meangini(tree, subtree, times = 1000, plot = FALSE) tree subtree times plot A phylogenetic tree in class phylo as in package ape A phylogenetic tree in class phylo, but have fewer tips than the input tree. Times for calculating the Gini coefficient Whether to plot the cutting pattern and the histogram Note This function could be used to calculating the inequality of clade insufficient sampling. The inequality can be measured by Gini coefficient, which is frequently used in economy. The Gini coefficient is based on Lorenz s curve, and the range of this coefficient is between 0 and 1. The more lower the value, the more equality the distribution is. A random time between the oldest node and the latest node in the phylogenetic tree was generated, and the phylogenetic tree was divided into two to n clades according to the time. For each clade in each "cutting" pattern, number of tips that can not be found in the perfect phylogenetic tree was recorded as number of "missing" tips. The number of missing tips was further divided by the number of expectation in the clade in perfect sampling tree. This ratio was calculated for each clade.the Gini coefficient was calculated for these values to represent the inequality of species that not included among clades. This procedure will be replicated for 1000 times in default and require intensively computationally power, this may last for minutes, even in handling small phylogenetic trees. Therefore, please be patient on the time consuming task. returns a list with the first the mean for all value, and the second a vector representing the all Gini coefficient for each randomly cutting pattern. This function require intensive computationally power, and can be very slow during handling large phylogenies. Jinlong Zhang < [email protected] >
23 phy2dat 23 Jinlong Zhang, Nathan Swenson, Xiangcheng Mi, Jihong Huang, Keping Ma, The imperfect sampling and the accuracy of phylogenetic patterns in communities. gini data(bird.orders) to.drop <- c("craciformes", "Galliformes", "Gruiformes") dropped <- drop.tip(bird.orders, to.drop) meangini(tree = bird.orders, subtree = dropped, times = 10, plot = TRUE) meangini(tree = bird.orders, subtree = dropped, times = 50, plot = TRUE) meangini(tree = bird.orders, subtree = dropped, times = 100, plot = TRUE) meangini(tree = bird.orders, subtree = dropped, times = 200, plot = TRUE) phy2dat Convert phylip file to dataframe Convert phylip file to dataframe phy2dat(x) x Vector of phylip format generated by Clustal software. Note Convert phylip file to dataframe. The phylip format vector can be read from local files via read.phylip, or even readlines. data frame with the first column the names of the input sequences, and the sequences in the second column. None
24 24 phylocom.comdist Jinlong Zhang Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America read.phylip ##phy2dat example ######## dir <- system.file("extdata", package = "phylotools") setwd(dir) test <- read.phylip("rbcla.phy") phy2dat(test) phylocom.comdist Calculating phylogenetic comdist using Phylocom This is a wrapper function for phylocom. phylocom.comdist(sample = "sample", phylo = "phylo", method = "comdist", aw = TRUE, file = "comdist.tx sample phylo method aw Character string, the name of the sample file used in Phylocom Character string, the name of the "phylo" file used in Phylocom character, only "comdist" or "comdistn" Logical, if TRUE, the comdist will be computed in accounting of abundance. Otherwise, for presence - absence. file Result File that to be saved to. Jinlong Zhang < [email protected] >
25 phylocom.comstruct 25 Campbell Webb, David Ackerly, Steve Kembel (2010) Phylocom software for the analysis of phylogenetic community structure and character evolution (with phylomatic and ecovolve) User S Manual Version 4.1 phylocom.pd,phylocom.comstruct ### Assume the file sample1 and phylo1 have been saved ### to the R working directory. The phylocom executable ### file has been copied to the same working directory ### Or could be invoked in console by command lines. ### in order to run phylocom in R, simply copy the ### following command: # i = 1 # phylocom.comdist(sample = paste("sample",i, sep = ""), # phylo = paste("phylo",i, sep = ""), # method = "comdist", aw = TRUE, # file = paste("comdist.aw", i, # ".txt", sep = "")) # phylocom.comstruct Calculating community phylogenetic structure using Phylocom This is a wrapper function for phylocom. phylocom.comstruct(sample = "sample", phylo = "phylo", swapmethod = 2, runs = 999, swaps = 1000, aw = TRUE, file = "comstruct.txt") sample phylo swapmethod runs swaps aw Character string, the name of the sample file used in Phylocom Character string, the name of the "phylo" file used in Phylocom Swap method used in phylocom Times to generating null models Times to swap Logical, if TRUE, the comdist will be computed in accounting of abundance. Otherwise, for presence - absence. file Result File that to be saved to.
26 26 phylocom.pd Jinlong Zhang < > Campbell Webb, David Ackerly, Steve Kembel (2010) Phylocom software for the analysis of phylogenetic community structure and character evolution (with phylomatic and ecovolve) User S Manual Version 4.1 phylocom.pd,phylocom.comdist ### Assume the file sample1 and phylo1 have been ### saved to the R working directory. ### The phylocom executable file has been copied ### to the same working directory ### Or could be invoked in console by command lines. ### in order to run phylocom in R, simply copy the ### following command: # i = 1 # phylocom.comstruct(sample = paste("sample",i, sep = ""), # phylo = paste("phylo",i, sep = ""), # swapmethod = 2, runs = 999, swaps = 1000, aw = TRUE, # file = paste("comstruct.aw", i,".txt", sep = "")) phylocom.pd Calculate phylogenetic diversity using Phylocom This is a wrapper function of phylocom for Calculate phylogenetic diversity using Phylocom phylocom.pd(sample = "sample", phylo = "phylo") sample phylo Character string, the name of the sample file used in Phylocom Character string, the name of the "phylo" file used in Phylocom PD data frame
27 phylocom.phylosor 27 Jinlong Zhang < > Campbell Webb, David Ackerly, Steve Kembel (2010) Phylocom software for the analysis of phylogenetic community structure and character evolution (with phylomatic and ecovolve) User S Manual Version 4.1 phylocom.comstruct,phylocom.comdist ### Assume the file sample1 and phylo1 have been saved to the R working directory. ### The phylocom executable file has been copied to the same working directory ### Or could be invoked in console by command lines. ### inorder to run phylocom in R, simply copy the following command: # i = 1 # PD.res <- phylocom.pd(sample = paste("sample",i, sep = ""), # phylo = paste("phylo",i, sep = ""), # ) # PD.res # phylocom.phylosor Calculating Phylosor similarity This function can be used to calculating phylosor (Phylogenetic Sorenson ) similarity between each pair of places phylocom.phylosor(sample.file = "sample", phylo = "phylo") sample.file Phylocom "Sample" file in the working directory. phylo The phylo file in the working directory, must be in newick format. This function has been tested for calculating phylosor for very large phylogenies (more than tips that generated by Phylomatic)
28 28 phyloshuffle a distance matrix Jinlong Zhang < [email protected] > Bryant, J.A., Lamanna, C., Morlon, H., Kerkhoff, A.J., Enquist, B.J. & Green, J.L. (2008) Microbes on mountainsides: Contrasting elevational patterns of bacterial and plant diversity. Proceedings of the National Academy of Sciences of the United States of America, 105, Morlon, H., Schwilk, D.W., Bryant, J.A., Marquet, P.A., Rebelo, A.G., Tauss, C., Bohannan, B.J.M. & Green, J.L. (2011) Spatial patterns of phylogenetic diversity. Ecology Letters, 14, phylocom.pd # Please make sure the Phylocom executable file can # be invoked by console by adding the system path for it. # res <- phylocom.phylosor(sample.file = "sample", phylo = "phylo") # phyloshuffle Tip label shuffling of a phylogenetic tree Internal function used in phylotools. phyloshuffle(tree) tree Phylogenetic tree Shuffle the phylogenetic tips
29 plotcut 29 Randomized phylogenetic tree Jinlong Zhang < > data(bird.orders) par(mfrow = c(1,2)) plot(bird.orders) rand <- phyloshuffle(bird.orders) plot(rand) plotcut Cutting the phylogenetic cutting pattern by a given time or times Add a line to the plot phylogenetic tree plotcut(tree, n) tree n The phylogenetic tree in class phylo defined in ape Number of randomly placed lines to add to. This function add lines to a phylogenetic tree randomly. There for it is only a demonstration for cutting patters of the phylogenetic tree. This cutting pattern could be used to evaluate the inequality of clade insufficient sampling as implemented in meangini Jinlong Zhang < [email protected] > meangini
30 30 plotscale data(bird.orders) plotcut(bird.orders, 10) plotscale Rescaling the individuals occurred in a Forestry Dynamic Plot Rescaling the individuals occurred in a Forestry Dynamic Plot. It will add a column called "gridnames". This function could make the analysis based on different scales easier. plotscale(inputdata, len = NULL, wid = NULL, scale = NULL) inputdata len wid scale A data matrix inputdata that has "tag", "x" and "y". Length of the FDP Width of the FDP Size of the quadrat The input data must he following columns, "Tag" : the tag number for each individual in the FDP. "x", "y" are the coordinats of the individuals in the FDP. A column will be added to the input matrix, to represent the position for each individual. For example, "X5Y20" means the corresponding individual lies at cell X = 5,y = 20. The labels could be further used to represent the species matrix. Jinlong Zhang < [email protected] > Nathan J. B. Kraft, Renato Valencia, David D. Ackerly (2008) Functional Traits and Niche-Based Tree Community Assembly in an Amazonian Forest Science 322, 580 ;
31 read.phylip 31 ## 20m ## plotscale(inputdata = BCI, len = 1000, wid = 500, ## scale = 20) ## 50m ## plotscale(inputdata = BCI, len = 1000, wid = 500, ## scale = 50) ## 100m ## plotscale(inputdata = BCI, len = 1000, wid = 500, ## scale = 100) read.phylip Read phylip file to memory. Read phylip file to memory. read.phylip(fil = NULL) fil file path for the input phylip file. Note The phylip format should be generated by the Clustal X software. Phylip objected to be used in further analysis. None. Jinlong Zhang <[email protected]> Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America
32 32 rename.fasta as readlines ## read.phylip() examle ######## dir <- system.file("extdata", package = "phylotools") setwd(dir) read.phylip("rbcla.phy") rename.fasta Renaming a fasta object given a reference table this function could be used to rename the sequences in the fasta object, given a reference dataframe. rename.fasta(fas, ref, fil = NULL, prefix = NULL) fas ref fil prefix the fasta object. Dataframe with first column of the names of the sequences, the second column which name to change into. A string indicating the file name, the result written to. prefix of which the names of the sequences. Prefix providing the flexibility of modifying the input names of sequences, for example, if the sequences names begin with "P_seq1", and unfortunately the names in reference table are lack of "P_". You may set the prefix as "P_", and without changing the reference table. fasta object Jinlong Zhang <[email protected]>
33 resid.tree 33 None gnames.fas library(seqrflp) ## loading data data(fil.fas) ## Get the names of the sequences col1 <- gnames.fas(fil.fas) ## Generating new names col2 <- paste("seq", 1:length(col1), sep = "") reftable.rename <- data.frame(col1, col2) ## rename.fasta(fil.fas, reftable.rename) resid.tree The undeleted terminals of a phylogenetic tree This is a internal function, and only handling the object that generated by del.tree.tip. resid.tree(tree, deltree) tree deltree A phylogenetic tree An object that generated by del.tree.tip The phylogenetic tree of the deleted tipes Jinlong Zhang < [email protected] > del.tree.tip
34 34 reverse data(bird.orders) del <- del.tree.tip(bird.orders,3) del.tree <- resid.tree(bird.orders, del) plot(del.tree) reverse Get the reverse sequence. This function could be used to obtain the reverse sequence of the input sequence. reverse(dna) dna A character string of input DNA sequence This function could be used to obtain the reverse sequence of the input sequence. A string contains the reverse sequence. Note None Jinlong Zhang <[email protected]> Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America fmatch
35 rh 35 ## reverse example ######## dna = "---ATTGCCTTAS--TTTAAAAAACCGTTC-----" reverse(dna) rh Generating random time between the oldest node and the latest node Random time between the oldest node and the latest node. This function only handle the ultrametric tree. rh(tree, times = 1) tree times The phylogenetic tree Number of random time values to be generated A vector representing the time, that randomly generated Note This function only handle the ultrametric tree. Jinlong Zhang < [email protected] > To be added data(bird.orders) rh(bird.orders, 10)
36 36 rm.col rm.col Delete specified columns for dataframe or matrix Delete specified columns for dataframe or matrix. rm.col(mat, to.rm = NULL) mat to.rm a dataframe or matrix, must have column names. a character vector representing the column names that to be removed. A dataframe or matrix (according to the input data)with certain column deleted. Jinlong Zhang < [email protected] > None x <- rnorm(600) dim(x) <- c(20,30) colnames(x) <- paste("col",1:30,sep = "") rm.col(x, to.rm = c("col1", "col3", "col8", "col22", "col30"))
37 RMPD 37 RMPD Ratio of mean phylogenetic distance of two phylogenetic trees Ratio of the mean phylogenetic distance between the tips in subtree to the mean phylogenetic distance between the tips in whole tree. RMPD(subtree, tree) subtree tree A subset of phylogenetic tree A phylogenetic tree RMPD > 1 indicate the tips in the "subtree" are less phylogenetic related than in "tree". This ratio could be used as a representative of phylogenetic relationships of species that occurred in subtree in comparison with in "tree". RMPD < 1 indicate the tips in "subtree" are more close related Jinlong Zhang < [email protected] > To be added resid.tree data(bird.orders) del <- del.tree.tip(bird.orders,3) del.tree <- resid.tree(bird.orders, del) RMPD(subtree = del.tree, tree = bird.orders)
38 38 seq2fasta seq2fasta Convert Seq file to fasta format This function convert.seq file generated by dnaman to fasta format. seq2fasta(file) file A character string indicate the ".seq" file name. Seq file is a DNA format generated by dnaman, often the exported sequences assembly results. Sometimes these files will have to be converted to fasta format in order to carry on further analysis. fasta file. Jinlong Zhang <[email protected]> To be added dir <- system.file("extdata", package = "phylotools") seq2fasta(paste(dir,"/dnaman.seq", sep = ""))
39 sub.tip.label 39 sub.tip.label Substitute the tip labels of a phylogenetic tree This function may be used in changing the tip labels of a phylogenetic tree according to a reference data table. sub.tip.label(tree, dat) tree dat Phylogenetic tree A dataframe with the first the tip labels and the second the substitutions. To be added A Phylogenetic tree with the tip labels substituted Jinlong Zhang <[email protected]> None library(ape) data(bird.families) tips <- bird.families$tip.label abr <- paste("fam",1:length(tips), sep = "") dat <- data.frame(tips, abr) ntree <- sub.tip.label(bird.families, dat)
40 40 supermat supermat Build super matrix using PHYLIP files This function could be used to build the super dataframe described by Kress et al.(2009). The out put dataframe could be converted to Phylip format file for phylogenetic analysis. supermat(rbcl = NULL, matk = NULL, trn = NULL) rbcl matk trn A character string indicting the name of rbcl phylip file. A character string indicting the name of matk phylip file. A vector of character strings indicting the names of trnh-psba phylip files. Note The phylip files should be first put into a directory. Users are encouraged to copy the phylip files which generated by software Clustal to a directory, and then set it as the working directory. The function will search the input phylip file names from the working directory and build the "super" dataframe. The dataframe then could be converted to phylip format by dat2phy and be written to the working directory by writelines so that to generate a super matrix. The super data frame to be converted to supermatrix. None. Jinlong Zhang <[email protected]> Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America dat2phy, writelines
41 uniquefasta 41 dir <- system.file("extdata", package = "phylotools") setwd(dir) ## Supermatrix with "rbcla","matk","trnh-psba" supermat1 <- supermat(rbcl = "rbcla.phy", matk = "matk.phy", trn = c("trn1.phy", "trn2.phy","trn3.phy","trn4.phy")) write.mat(supermat1, "result1.phy") ## Supermatrix with"matk","trnh-psba" supermat2 <- supermat(matk = "matk.phy", trn = c("trn1.phy", "trn2.phy","trn3.phy","trn4.phy")) write.mat(supermat2, "result2.phy") ## Supermatrix with "rbcla","matk" supermat3 <- supermat(matk = "matk.phy", rbcl = "rbcla.phy") write.mat(supermat3, "result3.phy") ## Supermatrix with "rbcla","trnh-psba" supermat4 <- supermat(rbcl = "rbcla.phy", trn = c("trn1.phy", "trn2.phy","trn3.phy","trn4.phy")) write.mat(supermat4, "result4.phy") ## Delete the results unlink(c("result1.phy", "result2.phy", "result3.phy", "result4.phy")) uniquefasta Deleting duplicated sequences in fasta object This function may be used to deleting the duplicated sequences in the fasta object. The sequences with its name same to any sequences will be ignored. Only the first sequence will be retained. uniquefasta(fasta) fasta The fasta object This function returns to the fasta object with unique sequence.
42 42 write.mat Note This function will remove the duplicated sequences and retain the first sequence only, all according to the names of the sequences. Jinlong Zhang None unique library(seqrflp) ## loading data data(fil.fas) ## Get the names of the sequences fasta.names <- gnames.fas(fil.fas) new.names0 <- substring(fasta.names,1,2 ) new.names <- paste(new.names0, sep = "") reftable.rename <- data.frame(fasta.names, new.names) renamed <- rename.fasta(fil.fas, reftable.rename) uniquefasta(renamed) write.mat Save supermatrix Save supermatrix to a phylip file write.mat(supermat, file = NULL) supermat file Supermatrix as build by function supermat File name to be specified
43 write.mat 43 None Save supermatrix to phylip file. Jinlong Zhang Kress W., Erickson D., Jones F., Swenson N., Perez R., Sanjur O., Bermingham E., Plant DNA barcodes and community phylogeny of a tropical forest dynamics plot in Panama. Proceedings of the National Academy of Sciences of the United States of America supermat dir <- system.file("extdata", package = "phylotools") file.copy(from = dir, to = getwd()) ## Supermatrix with "rbcla","matk","trnh-psba" supermat1 <- supermat(rbcl = "rbcla.phy", matk = "matk.phy", trn = c("trn1.phy", "trn2.phy","trn3.phy","trn4.phy")) write.mat(supermat1, "result1.phy")
44 Index Topic Algnment aln2dat, 6 Topic Phylosor phylocom.phylosor, 27 Topic \textasciitildekwd1 inequality, 21 plotcut, 29 Topic character appendchar, 7 Topic dataframe dat2phy, 9 framsub, 16 phy2dat, 23 Topic equality meangini, 22 Topic fasta complement, 8 fasta.split, 13 rename.fasta, 32 Topic hclust hcreorder, 18 Topic inequality gini, 17 Topic match fmatch, 14 Topic matrix add.mat, 5 Topic package phylotools-package, 2 Topic phylip dat2phy, 9 phy2dat, 23 read.phylip, 31 Topic phylocom phylocom.comdist, 24 phylocom.comstruct, 25 phylocom.pd, 26 Topic plot imagevect, 19 Topic print supermat, 40 Topic reverse reverse, 34 Topic substitution edgesub, 12 framsub, 16 Topic supermat supermat, 40 write.mat, 42 add.mat, 5 aln2dat, 6 appendchar, 7 complement, 8 cutree, 18 dat2phy, 9, 40 del.tree.tip, 10, 33 dimension, 11 edgesub, 12, fasta.split, 13 fmatch, 14, 34 formatxy, 15 framsub, 8, 13, 16 gini, 17, 23 gnames.fas, 33 hclust, 18 hcreorder, 18 help, 11 imagevect, 12, 16, 19 inequality, 21 meangini, 17, 22, 29 phy2dat, 10, 23 44
45 INDEX 45 phylocom.comdist, 24, 26, 27 phylocom.comstruct, 25, 25, 27 phylocom.pd, 25, 26, 26, 28 phylocom.phylosor, 27 phyloshuffle, 28 phylotools (phylotools-package), 2 phylotools-package, 2 plotcut, 29 plotscale, 30 read.phylip, 24, 31 readlines, 32 rect.hclust, 18 rename.fasta, 14, 32 resid.tree, 33, 37 revcomp, 9 reverse, 34 rh, 35 rm.col, 36 RMPD, 37 seq2fasta, 38 sub.tip.label, 39 supermat, 6, 40, 43 unique, 42 uniquefasta, 41 write.mat, 42 writelines, 40
DNA Barcoding in Plants: Biodiversity Identification and Discovery
DNA Barcoding in Plants: Biodiversity Identification and Discovery University of Sao Paulo December 2009 W. John Kress Department of Botany National Museum of Natural History Smithsonian Institution New
Cluster Analysis using R
Cluster analysis or clustering is the task of assigning a set of objects into groups (called clusters) so that the objects in the same cluster are more similar (in some sense or another) to each other
Package retrosheet. April 13, 2015
Type Package Package retrosheet April 13, 2015 Title Import Professional Baseball Data from 'Retrosheet' Version 1.0.2 Date 2015-03-17 Maintainer Richard Scriven A collection of tools
Package tagcloud. R topics documented: July 3, 2015
Package tagcloud July 3, 2015 Type Package Title Tag Clouds Version 0.6 Date 2015-07-02 Author January Weiner Maintainer January Weiner Description Generating Tag and Word Clouds.
Package bigrf. February 19, 2015
Version 0.1-11 Date 2014-05-16 Package bigrf February 19, 2015 Title Big Random Forests: Classification and Regression Forests for Large Data Sets Maintainer Aloysius Lim OS_type
Package MDM. February 19, 2015
Type Package Title Multinomial Diversity Model Version 1.3 Date 2013-06-28 Package MDM February 19, 2015 Author Glenn De'ath ; Code for mdm was adapted from multinom in the nnet package
Package hoarder. June 30, 2015
Type Package Title Information Retrieval for Genetic Datasets Version 0.1 Date 2015-06-29 Author [aut, cre], Anu Sironen [aut] Package hoarder June 30, 2015 Maintainer Depends
Package HadoopStreaming
Package HadoopStreaming February 19, 2015 Type Package Title Utilities for using R scripts in Hadoop streaming Version 0.2 Date 2009-09-28 Author David S. Rosenberg Maintainer
Package missforest. February 20, 2015
Type Package Package missforest February 20, 2015 Title Nonparametric Missing Value Imputation using Random Forest Version 1.4 Date 2013-12-31 Author Daniel J. Stekhoven Maintainer
Statistical Data Mining. Practical Assignment 3 Discriminant Analysis and Decision Trees
Statistical Data Mining Practical Assignment 3 Discriminant Analysis and Decision Trees In this practical we discuss linear and quadratic discriminant analysis and tree-based classification techniques.
R Graphics Cookbook. Chang O'REILLY. Winston. Tokyo. Beijing Cambridge. Farnham Koln Sebastopol
R Graphics Cookbook Winston Chang Beijing Cambridge Farnham Koln Sebastopol O'REILLY Tokyo Table of Contents Preface ix 1. R Basics 1 1.1. Installing a Package 1 1.2. Loading a Package 2 1.3. Loading a
Package empiricalfdr.deseq2
Type Package Package empiricalfdr.deseq2 May 27, 2015 Title Simulation-Based False Discovery Rate in RNA-Seq Version 1.0.3 Date 2015-05-26 Author Mikhail V. Matz Maintainer Mikhail V. Matz
Package MBA. February 19, 2015. Index 7. Canopy LIDAR data
Version 0.0-8 Date 2014-4-28 Title Multilevel B-spline Approximation Package MBA February 19, 2015 Author Andrew O. Finley , Sudipto Banerjee Maintainer Andrew
Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6
Introduction to Bioinformatics AS 250.265 Laboratory Assignment 6 In the last lab, you learned how to perform basic multiple sequence alignments. While useful in themselves for determining conserved residues
Dataframes. Lecture 8. Nicholas Christian BIOST 2094 Spring 2011
Dataframes Lecture 8 Nicholas Christian BIOST 2094 Spring 2011 Outline 1. Importing and exporting data 2. Tools for preparing and cleaning datasets Sorting Duplicates First entry Merging Reshaping Missing
Package hive. January 10, 2011
Package hive January 10, 2011 Version 0.1-9 Date 2011-01-09 Title Hadoop InteractiVE Description Hadoop InteractiVE, is an R extension facilitating distributed computing via the MapReduce paradigm. It
Dynamic Programming. Lecture 11. 11.1 Overview. 11.2 Introduction
Lecture 11 Dynamic Programming 11.1 Overview Dynamic Programming is a powerful technique that allows one to solve many different types of problems in time O(n 2 ) or O(n 3 ) for which a naive approach
Package TRADER. February 10, 2016
Type Package Package TRADER February 10, 2016 Title Tree Ring Analysis of Disturbance Events in R Version 1.2-1 Date 2016-02-10 Author Pavel Fibich , Jan Altman ,
Vector analysis - introduction Spatial data management operations - Assembling datasets for analysis. Data management operations
Vector analysis - introduction Spatial data management operations - Assembling datasets for analysis Transform (reproject) Merge Append Clip Dissolve The role of topology in GIS analysis Data management
Package uptimerobot. October 22, 2015
Type Package Version 1.0.0 Title Access the UptimeRobot Ping API Package uptimerobot October 22, 2015 Provide a set of wrappers to call all the endpoints of UptimeRobot API which includes various kind
Package cgdsr. August 27, 2015
Type Package Package cgdsr August 27, 2015 Title R-Based API for Accessing the MSKCC Cancer Genomics Data Server (CGDS) Version 1.2.5 Date 2015-08-25 Author Anders Jacobsen Maintainer Augustin Luna
Package plan. R topics documented: February 20, 2015
Package plan February 20, 2015 Version 0.4-2 Date 2013-09-29 Title Tools for project planning Author Maintainer Depends R (>= 0.99) Supports the creation of burndown
Visualization of Phylogenetic Trees and Metadata
Visualization of Phylogenetic Trees and Metadata November 27, 2015 Sample to Insight CLC bio, a QIAGEN Company Silkeborgvej 2 Prismet 8000 Aarhus C Denmark Telephone: +45 70 22 32 44 www.clcbio.com [email protected]
Environmental Remote Sensing GEOG 2021
Environmental Remote Sensing GEOG 2021 Lecture 4 Image classification 2 Purpose categorising data data abstraction / simplification data interpretation mapping for land cover mapping use land cover class
Package fimport. February 19, 2015
Version 3000.82 Revision 5455 Date 2013-03-15 Package fimport February 19, 2015 Title Rmetrics - Economic and Financial Data Import Author Diethelm Wuertz and many others Depends R (>= 2.13.0), methods,
Package treemap. February 15, 2013
Type Package Title Treemap visualization Version 1.1-1 Date 2012-07-10 Author Martijn Tennekes Package treemap February 15, 2013 Maintainer Martijn Tennekes A treemap is a space-filling
Package sjdbc. R topics documented: February 20, 2015
Package sjdbc February 20, 2015 Version 1.5.0-71 Title JDBC Driver Interface Author TIBCO Software Inc. Maintainer Stephen Kaluzny Provides a database-independent JDBC interface. License
Package wordcloud. R topics documented: February 20, 2015
Package wordcloud February 20, 2015 Type Package Title Word Clouds Version 2.5 Date 2013-04-11 Author Ian Fellows Maintainer Ian Fellows Pretty word clouds. License LGPL-2.1 LazyLoad
Package bigdata. R topics documented: February 19, 2015
Type Package Title Big Data Analytics Version 0.1 Date 2011-02-12 Author Han Liu, Tuo Zhao Maintainer Han Liu Depends glmnet, Matrix, lattice, Package bigdata February 19, 2015 The
Package dsstatsclient
Maintainer Author Version 4.1.0 License GPL-3 Package dsstatsclient Title DataSHIELD client site stattistical functions August 20, 2015 DataSHIELD client site
Package TimeWarp. R topics documented: April 1, 2015
Type Package Title Date Calculations and Manipulation Version 1.0.12 Date 2015-03-30 Author Tony Plate, Jeffrey Horner, Lars Hansen Maintainer Tony Plate
Figure 1. An embedded chart on a worksheet.
8. Excel Charts and Analysis ToolPak Charts, also known as graphs, have been an integral part of spreadsheets since the early days of Lotus 1-2-3. Charting features have improved significantly over the
Instructions for Use. CyAn ADP. High-speed Analyzer. Summit 4.3. 0000050G June 2008. Beckman Coulter, Inc. 4300 N. Harbor Blvd. Fullerton, CA 92835
Instructions for Use CyAn ADP High-speed Analyzer Summit 4.3 0000050G June 2008 Beckman Coulter, Inc. 4300 N. Harbor Blvd. Fullerton, CA 92835 Overview Summit software is a Windows based application that
How To Create A Tree In R
Definition of Formats for Coding Phylogenetic Trees in R Emmanuel Paradis October 24, 2012 Contents 1 Introduction 1 2 Terminology and Notations 2 3 Definition of the Class "phylo" 2 3.1 Memory Requirements........................
Query 4. Lesson Objectives 4. Review 5. Smart Query 5. Create a Smart Query 6. Create a Smart Query Definition from an Ad-hoc Query 9
TABLE OF CONTENTS Query 4 Lesson Objectives 4 Review 5 Smart Query 5 Create a Smart Query 6 Create a Smart Query Definition from an Ad-hoc Query 9 Query Functions and Features 13 Summarize Output Fields
NATIONAL STUDENT CLEARINGHOUSE RESEARCH CENTER
StudentTracker SM Detail Report NATIONAL STUDENT CLEARINGHOUSE RESEARCH CENTER 2300 Dulles Station Blvd., Suite 300, Herndon, VA 20171 Contents How the National Student Clearinghouse populates its database...
Polynomial Neural Network Discovery Client User Guide
Polynomial Neural Network Discovery Client User Guide Version 1.3 Table of contents Table of contents...2 1. Introduction...3 1.1 Overview...3 1.2 PNN algorithm principles...3 1.3 Additional criteria...3
Tutorial for the WGCNA package for R: I. Network analysis of liver expression data in female mice
Tutorial for the WGCNA package for R: I. Network analysis of liver expression data in female mice 2.a Automatic network construction and module detection Peter Langfelder and Steve Horvath November 25,
Detail Report Excel Guide for High Schools
StudentTracker SM Detail Report NATIONAL STUDENT CLEARINGHOUSE RESEARCH CENTER 2300 Dulles Station Blvd., Suite 300, Herndon, VA 20171 Contents How the National Student Clearinghouse populates its database...
Activity IT S ALL RELATIVES The Role of DNA Evidence in Forensic Investigations
Activity IT S ALL RELATIVES The Role of DNA Evidence in Forensic Investigations SCENARIO You have responded, as a result of a call from the police to the Coroner s Office, to the scene of the death of
Package LexisPlotR. R topics documented: April 4, 2016. Type Package
Type Package Package LexisPlotR April 4, 2016 Title Plot Lexis Diagrams for Demographic Purposes Version 0.3 Date 2016-04-04 Author [aut, cre], Marieke Smilde-Becker [ctb] Maintainer
Detailed information about Gerber, NC Drill and NC Route.
Title: Product: Summary: Manufacturing Output OrCAD / Allegro PCB Editor Detailed information about Gerber, NC Drill and NC Route. Author/Date: Beate Wilke / 27.05.2009 Table of Contents 1 Gerber... 2
Package DSsim. September 25, 2015
Package DSsim September 25, 2015 Depends graphics, splancs, mrds, mgcv, shapefiles, methods Suggests testthat, parallel Type Package Title Distance Sampling Simulations Version 1.0.4 Date 2015-09-25 LazyLoad
Decision Trees from large Databases: SLIQ
Decision Trees from large Databases: SLIQ C4.5 often iterates over the training set How often? If the training set does not fit into main memory, swapping makes C4.5 unpractical! SLIQ: Sort the values
Bayesian Phylogeny and Measures of Branch Support
Bayesian Phylogeny and Measures of Branch Support Bayesian Statistics Imagine we have a bag containing 100 dice of which we know that 90 are fair and 10 are biased. The
Package DCG. R topics documented: June 8, 2016. Type Package
Type Package Package DCG June 8, 2016 Title Data Cloud Geometry (DCG): Using Random Walks to Find Community Structure in Social Network Analysis Version 0.9.2 Date 2016-05-09 Depends R (>= 2.14.0) Data
TOOLS FOR T-RFLP DATA ANALYSIS USING EXCEL
TOOLS FOR T-RFLP DATA ANALYSIS USING EXCEL A collection of Visual Basic macros for the analysis of terminal restriction fragment length polymorphism data Nils Johan Fredriksson TOOLS FOR T-RFLP DATA ANALYSIS
Optimization of sampling strata with the SamplingStrata package
Optimization of sampling strata with the SamplingStrata package Package version 1.1 Giulio Barcaroli January 12, 2016 Abstract In stratified random sampling the problem of determining the optimal size
Package EstCRM. July 13, 2015
Version 1.4 Date 2015-7-11 Package EstCRM July 13, 2015 Title Calibrating Parameters for the Samejima's Continuous IRT Model Author Cengiz Zopluoglu Maintainer Cengiz Zopluoglu
II-13Category Plots. Chapter II-13
Chapter II-13 II-13Category Plots Overview... 316 Creating a Category Plot... 316 Combining Category Plots and XY Plots... 317 Modifying a Category Plot... 317 Bar and Category Gaps... 317 Tick Mark Positioning...
SAP BusinessObjects Business Intelligence (BI) platform Document Version: 4.1, Support Package 3-2014-04-03. Report Conversion Tool Guide
SAP BusinessObjects Business Intelligence (BI) platform Document Version: 4.1, Support Package 3-2014-04-03 Table of Contents 1 Report Conversion Tool Overview.... 4 1.1 What is the Report Conversion Tool?...4
DataPA OpenAnalytics End User Training
DataPA OpenAnalytics End User Training DataPA End User Training Lesson 1 Course Overview DataPA Chapter 1 Course Overview Introduction This course covers the skills required to use DataPA OpenAnalytics
Package erp.easy. September 26, 2015
Type Package Package erp.easy September 26, 2015 Title Event-Related Potential (ERP) Data Exploration Made Easy Version 0.6.3 A set of user-friendly functions to aid in organizing, plotting and analyzing
ACCESS 2007. Importing and Exporting Data Files. Information Technology. MS Access 2007 Users Guide. IT Training & Development (818) 677-1700
Information Technology MS Access 2007 Users Guide ACCESS 2007 Importing and Exporting Data Files IT Training & Development (818) 677-1700 [email protected] TABLE OF CONTENTS Introduction... 1 Import Excel
Package optirum. December 31, 2015
Title Financial Functions & More Package optirum December 31, 2015 This fills the gaps credit analysts and loan modellers at Optimum Credit identify in the existing R code body. It allows for the production
STATGRAPHICS Online. Statistical Analysis and Data Visualization System. Revised 6/21/2012. Copyright 2012 by StatPoint Technologies, Inc.
STATGRAPHICS Online Statistical Analysis and Data Visualization System Revised 6/21/2012 Copyright 2012 by StatPoint Technologies, Inc. All rights reserved. Table of Contents Introduction... 1 Chapter
Parallelization: Binary Tree Traversal
By Aaron Weeden and Patrick Royal Shodor Education Foundation, Inc. August 2012 Introduction: According to Moore s law, the number of transistors on a computer chip doubles roughly every two years. First
Package survpresmooth
Package survpresmooth February 20, 2015 Type Package Title Presmoothed Estimation in Survival Analysis Version 1.1-8 Date 2013-08-30 Author Ignacio Lopez de Ullibarri and Maria Amalia Jacome Maintainer
Microsoft Access 2010 Overview of Basics
Opening Screen Access 2010 launches with a window allowing you to: create a new database from a template; create a new template from scratch; or open an existing database. Open existing Templates Create
IBM SPSS Direct Marketing 23
IBM SPSS Direct Marketing 23 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 23, release
Topography of an Origin Project and Workspace
Origin Basics Topography of an Origin Project and Workspace When you start Origin, a new project opens displaying a worksheet window in the workspace. The worksheet is one type of window available in Origin.
Package benford.analysis
Type Package Package benford.analysis November 17, 2015 Title Benford Analysis for Data Validation and Forensic Analytics Version 0.1.3 Author Carlos Cinelli Maintainer Carlos Cinelli
IBM SPSS Direct Marketing 22
IBM SPSS Direct Marketing 22 Note Before using this information and the product it supports, read the information in Notices on page 25. Product Information This edition applies to version 22, release
Package neuralnet. February 20, 2015
Type Package Title Training of neural networks Version 1.32 Date 2012-09-19 Package neuralnet February 20, 2015 Author Stefan Fritsch, Frauke Guenther , following earlier work
Package brewdata. R topics documented: February 19, 2015. Type Package
Type Package Package brewdata February 19, 2015 Title Extracting Usable Data from the Grad Cafe Results Search Version 0.4 Date 2015-01-29 Author Nathan Welch Maintainer Nathan Welch
Package TSfame. February 15, 2013
Package TSfame February 15, 2013 Version 2012.8-1 Title TSdbi extensions for fame Description TSfame provides a fame interface for TSdbi. Comprehensive examples of all the TS* packages is provided in the
Molecular Clocks and Tree Dating with r8s and BEAST
Integrative Biology 200B University of California, Berkeley Principals of Phylogenetics: Ecology and Evolution Spring 2011 Updated by Nick Matzke Molecular Clocks and Tree Dating with r8s and BEAST Today
org.rn.eg.db December 16, 2015 org.rn.egaccnum is an R object that contains mappings between Entrez Gene identifiers and GenBank accession numbers.
org.rn.eg.db December 16, 2015 org.rn.egaccnum Map Entrez Gene identifiers to GenBank Accession Numbers org.rn.egaccnum is an R object that contains mappings between Entrez Gene identifiers and GenBank
Introduction to the data.table package in R
Introduction to the data.table package in R Revised: September 18, 2015 (A later revision may be available on the homepage) Introduction This vignette is aimed at those who are already familiar with creating
2.3 Identify rrna sequences in DNA
2.3 Identify rrna sequences in DNA For identifying rrna sequences in DNA we will use rnammer, a program that implements an algorithm designed to find rrna sequences in DNA [5]. The program was made by
Training Needs Analysis
Training Needs Analysis Microsoft Office 2007 Access 2007 Course Code: Name: Chapter 1: Access 2007 Orientation I understand how Access works and what it can be used for I know how to start Microsoft Access
Package hive. July 3, 2015
Version 0.2-0 Date 2015-07-02 Title Hadoop InteractiVE Package hive July 3, 2015 Description Hadoop InteractiVE facilitates distributed computing via the MapReduce paradigm through R and Hadoop. An easy
Package MixGHD. June 26, 2015
Type Package Package MixGHD June 26, 2015 Title Model Based Clustering, Classification and Discriminant Analysis Using the Mixture of Generalized Hyperbolic Distributions Version 1.7 Date 2015-6-15 Author
ICP Data Validation and Aggregation Module Training document. HHC Data Validation and Aggregation Module Training Document
HHC Data Validation and Aggregation Module Training Document Contents 1. Introduction... 4 1.1 About this Guide... 4 1.2 Scope... 4 2. Steps for Testing HHC Data Validation and Aggregation Module.. Error!
Package sendmailr. February 20, 2015
Version 1.2-1 Title send email using R Package sendmailr February 20, 2015 Package contains a simple SMTP client which provides a portable solution for sending email, including attachment, from within
Data Mining: Exploring Data. Lecture Notes for Chapter 3. Introduction to Data Mining
Data Mining: Exploring Data Lecture Notes for Chapter 3 Introduction to Data Mining by Tan, Steinbach, Kumar Tan,Steinbach, Kumar Introduction to Data Mining 8/05/2005 1 What is data exploration? A preliminary
An Introduction to Point Pattern Analysis using CrimeStat
Introduction An Introduction to Point Pattern Analysis using CrimeStat Luc Anselin Spatial Analysis Laboratory Department of Agricultural and Consumer Economics University of Illinois, Urbana-Champaign
Chemotaxis and Migration Tool 2.0
Chemotaxis and Migration Tool 2.0 Visualization and Data Analysis of Chemotaxis and Migration Processes Chemotaxis and Migration Tool 2.0 is a program for analyzing chemotaxis and migration data. Quick
Topographic Change Detection Using CloudCompare Version 1.0
Topographic Change Detection Using CloudCompare Version 1.0 Emily Kleber, Arizona State University Edwin Nissen, Colorado School of Mines J Ramón Arrowsmith, Arizona State University Introduction CloudCompare
Package GEOquery. August 18, 2015
Type Package Package GEOquery August 18, 2015 Title Get data from NCBI Gene Expression Omnibus (GEO) Version 2.34.0 Date 2014-09-28 Author Maintainer BugReports
A Step-by-Step Tutorial: Divergence Time Estimation with Approximate Likelihood Calculation Using MCMCTREE in PAML
9 June 2011 A Step-by-Step Tutorial: Divergence Time Estimation with Approximate Likelihood Calculation Using MCMCTREE in PAML by Jun Inoue, Mario dos Reis, and Ziheng Yang In this tutorial we will analyze
NCSS Statistical Software Principal Components Regression. In ordinary least squares, the regression coefficients are estimated using the formula ( )
Chapter 340 Principal Components Regression Introduction is a technique for analyzing multiple regression data that suffer from multicollinearity. When multicollinearity occurs, least squares estimates
SOAL-SOAL MICROSOFT EXCEL 1. The box on the chart that contains the name of each individual record is called the. A. cell B. title C. axis D.
SOAL-SOAL MICROSOFT EXCEL 1. The box on the chart that contains the name of each individual record is called the. A. cell B. title C. axis D. legend 2. If you want all of the white cats grouped together
Step by Step Guide to Importing Genetic Data into JMP Genomics
Step by Step Guide to Importing Genetic Data into JMP Genomics Page 1 Introduction Data for genetic analyses can exist in a variety of formats. Before this data can be analyzed it must imported into one
There are six different windows that can be opened when using SPSS. The following will give a description of each of them.
SPSS Basics Tutorial 1: SPSS Windows There are six different windows that can be opened when using SPSS. The following will give a description of each of them. The Data Editor The Data Editor is a spreadsheet
CD-HIT User s Guide. Last updated: April 5, 2010. http://cd-hit.org http://bioinformatics.org/cd-hit/
CD-HIT User s Guide Last updated: April 5, 2010 http://cd-hit.org http://bioinformatics.org/cd-hit/ Program developed by Weizhong Li s lab at UCSD http://weizhong-lab.ucsd.edu [email protected] 1. Introduction
Paper 109-25 Merges and Joins Timothy J Harrington, Trilogy Consulting Corporation
Paper 109-25 Merges and Joins Timothy J Harrington, Trilogy Consulting Corporation Abstract This paper discusses methods of joining SAS data sets. The different methods and the reasons for choosing a particular
Plotting: Customizing the Graph
Plotting: Customizing the Graph Data Plots: General Tips Making a Data Plot Active Within a graph layer, only one data plot can be active. A data plot must be set active before you can use the Data Selector
Ohio University Computer Services Center August, 2002 Crystal Reports Introduction Quick Reference Guide
Open Crystal Reports From the Windows Start menu choose Programs and then Crystal Reports. Creating a Blank Report Ohio University Computer Services Center August, 2002 Crystal Reports Introduction Quick
Clone Manager. Getting Started
Clone Manager for Windows Professional Edition Volume 2 Alignment, Primer Operations Version 9.5 Getting Started Copyright 1994-2015 Scientific & Educational Software. All rights reserved. The software
Aras Corporation. 2005 Aras Corporation. All rights reserved. Notice of Rights. Notice of Liability
Aras Corporation 2005 Aras Corporation. All rights reserved Notice of Rights All rights reserved. Aras Corporation (Aras) owns this document. No part of this document may be reproduced or transmitted in
KaleidaGraph Quick Start Guide
KaleidaGraph Quick Start Guide This document is a hands-on guide that walks you through the use of KaleidaGraph. You will probably want to print this guide and then start your exploration of the product.
Package HHG. July 14, 2015
Type Package Package HHG July 14, 2015 Title Heller-Heller-Gorfine Tests of Independence and Equality of Distributions Version 1.5.1 Date 2015-07-13 Author Barak Brill & Shachar Kaufman, based in part
1. Classification problems
Neural and Evolutionary Computing. Lab 1: Classification problems Machine Learning test data repository Weka data mining platform Introduction Scilab 1. Classification problems The main aim of a classification
Computational Assignment 4: Discriminant Analysis
Computational Assignment 4: Discriminant Analysis -Written by James Wilson -Edited by Andrew Nobel In this assignment, we will investigate running Fisher s Discriminant analysis in R. This is a powerful
When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want
1 When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want to search other databases as well. There are very
Bioinformatics Resources at a Glance
Bioinformatics Resources at a Glance A Note about FASTA Format There are MANY free bioinformatics tools available online. Bioinformaticists have developed a standard format for nucleotide and protein sequences
