Global microrna level regulation of EGFR-driven cell cycle protein network in breast cancer

Size: px
Start display at page:

Download "Global microrna level regulation of EGFR-driven cell cycle protein network in breast cancer"

Transcription

1 Molecular Systems Biology Peer Review Process File Global microrna level regulation of EGFR-driven cell cycle protein network in breast cancer Stefan Uhlmann, Heiko Mannsperger, Jitao David Zhang, Emőke-Ágnes Horvát, Christian Schmidt, Moritz Küblbeck, Frauke Henjes, Aoife Ward, Ulrich Tschulena, Katharina Zweig, Ulrike Korf, Stefan Wiemann, Özgür Sahin Corresponding author: Özgür Sahin, German Cancer Research Center DKFZ Review timeline: Submission date: 07 July 2011 Editorial Decision: 07 September 2011 Revision received: 28 October 2011 Editorial Decision: 22 November 2011 Revision received: 09 December 2011 Accepted: 12 December 2011 Transaction Report: (Note: With the exception of the correction of typographical or spelling errors that could be a source of ambiguity, letters and reports are not edited. The original formatting of letters and referee reports may not be reflected in this compilation.) 1st Editorial Decision 07 September 2011 Thank you again for submitting your work to Molecular Systems Biology. We have now heard back from the three referees who agreed to evaluate your manuscript. As you will see from the reports below, the referees find the topic of your study of potential interest. They raise, however, substantial concerns on your work, which, I am afraid to say, preclude its publication in its present form. The first two reviewers recognized the novelty of the approach applied in this work, and were broadly supportive. Nonetheless, two reviewers had clear concerns regarding the generality of these findings, and the first reviewer clearly indicated that some of these results must be tested in another cell line to demonstrate that they are not specific only to MDA-MB-231 cells. On a somewhat related topic the last reviewer wonders how many of these micrornas are actually expressed in MDA-MB-231 cells, and to what degree expression anti-correlation between microrna and targets across cell lines would have suggested these same regulatory relationships. In addition, the reviewers had clear concerns regarding the clarity of presentation, particularly in the figures, and made a series of suggestions for improvements. In addition, the editor found it somewhat hard to determine whether the bootstrap-based p-values for the inferred microrna-protein relationships are multiple-test corrected, which seems important when trying to assess the statistical significance of the bipartite graph as a whole. This should be clarified in any revised manuscript, and, if possible, an estimate of the FDR for the graph as a whole should be reported. If you feel you can satisfactorily deal with these points and those listed by the referees, you may wish to submit a revised version of your manuscript. Please attach a covering letter giving details of European Molecular Biology Organization 1

2 Molecular Systems Biology Peer Review Process File the way in which you have handled each of the points raised by the referees. A revised manuscript will be once again subject to review and you probably understand that we can give you no guarantee at this stage that the eventual outcome will be favorable. *PLEASE NOTE* As part of the EMBO Publications transparent editorial process initiative (see Molecular Systems Biology now publishes online a Review Process File with each accepted manuscript. Please be aware that in the event of acceptance, your cover letter/point-by-point document will be included as part of this file, which will be available to the scientific community. Authors may opt out of the transparent process at any stage prior to publication (contact us at msb@embo.org). More information about this initiative is available in our Instructions to Authors. Sincerely, Editor Molecular Systems Biology msb@embo.org Referee reports: Reviewer #1 (Remarks to the Author): Uhlmann et al., used a systems biology approach to identify mirnas controlling the EGFR pathway. The results show interesting networking aspects of this regulation. Moreover, it led to the identification of novel tumor suppressor mirnas. To the best of my knowledge, this is the first study of this kind and therefore, this manuscript can be considered of high relevance. The text is organized and well written. It certainly has a lot of potential to be accepted. However, it is important that the authors address the points listed below: 1)The entire study was done with only one cell line, MDA-MB-231. The authors do not present a rationale to use this particular line. To increase the relevance of their results, it is important that the most crucial results be recapitulated in at least another cell line. The use of a cell line from another tumor type is particularly encouraged. EGFR is important in glioblastoma, for instance. 2)It would be good if the authors include in the main body of the paper a table or a figure showing all the genes that were screened. 3)Figure 1 does not match the legend. This needs to be fixed. 4)Figure 2 has major problems. Parts A, B and C are too small and show poor resolution. I suggest authors to increase size and show these three parts as an independent figure. 5)Figure 4. To avoid confusion, use 1, 2,3... instead of (a), (b), (c). 6)The authors failed to discuss Boutz et al., 2011 (JBC 286(20): ). This is another example of mirna analysis using proteomics and demonstrates the networking aspects of mir-122 regulation. Therefore, this manuscript is related very closely to the content of this article. 7)To polish the low stringency network, the authors chose to filter the data by selecting only genes derived from predictions or already validated. These parameters might be too stringent. I wonder if the authors tried to perform a less stringent analysis, by including genes with mapped seed sequences. 8)The results shown in Figure 6 (luciferase and RNA analysis) do not entirely recapitulate the ones shown in figure 5. Do the authors have any comments on that? There are commercially available 3'UTR reporter collections (Switchgear Genomics, for instance). The authors might get better results using an optimized reporter. Reviewer #2 (Remarks to the Author): The paper by Uhlman et al present a comprehensive paper of a microrna screen based approach in order to identify the effect of micrornas on a set of human proteins with focus on teh EGFR protein network. This is the first time that a systematic approach is presented using all micrornas on a set of protein targets. The authors should be commended for undertaking a technically difficult proteomic approach utilising reverse phase arrays. The data analysis involved the development of network analysis tools that will be of interest for microrna based and other systems biology approaches. European Molecular Biology Organization 2

3 Molecular Systems Biology Peer Review Process File The main conclusion of the paper is that three micrornas (mir-124, mir147 and mir-193-3p) have been identified that target the EGFR driven cell cycle network. Extensive validations using a number of techniques (expression constructs, Western blots) are supporting the findings. This represents a novel finding and a model study for future investigations. Some points on the paper: 1.The main figures 2&3 are not very informative in their present form. Figure 2 parts A-B are too small and complex to be of any meaningful value, at least in an A4 format. These should be revised in order to make part A bigger and part be to be retained as a graph (cum. edge number vs z-score) to explain the thresholds used. The networks can be presented as separate suppl. Figures in an enlarged format. In Figure 2. C the extra orange pinheads" representing other mirnas are not very informative. Either include which mirnas are important or leave out. Alternatively the networks in part B can be presented as a new figure 3 and figure 3 can be moved into supplementary. 2. In the introduction the authors mention that transcriptomic approaches to mirna target identification have a weakness in not being able to identify targets regulated at the protein level, while proteomic approaches are either not amenable to high through put or not sensitive enough to detect large numbers of proteins. In the transcriptomic approaches the authors should include RISC- IP followed by RNA-seq (or array hybridization) as another tool that can identify directly irrespective of whether the steady state levels of an mrna or protein change). 3. The authors should comment on the fact that their results are obtained using a model cell line system and the microrna-target relationships must be further validated in large breast cancer sample collections through expression profiling or immunohistochemistry based approaches Reviewer #3 (Remarks to the Author): The paper of Uhlmann et. al. aims to describe the mirna regulation level of EGFR-driven cell cycle. The authors farther aim the development of a novel network analysis methodology. I have the following concerns regarding the paper: 1) The authors choose to investigate a number of proteins based on a previous publication and after analysis of deep sequencing data of the specific cells. Then they investigate the expression of 810 mirnas in relation of these proteins, but is not mentioned if all these mirnas ( and in which levels - from deep sequencing data for example ) are normally expressed in the same cells. 2) The first computational analysis involves a mirnome-protein interaction network where mirna are linked to proteins ( up- & down regulation). This data is then farther analyzed including computational mirna target prediction. The authors are not explaining if the results of Fig. 2c could not directly be computed by using computational target prediction and the expression levels of mirna and proteins (mirna present, protein down regulated, computational mirna:protein target present). 3) In fig. 3 the authors analyze the relation of the seed match between mirna and protein sequence to the repression of the protein expression. Such analysis has been done several times and is not producing some new knowledge. 4) The authors select three mirnas and investigate their targets and their role. Could it be that by just investigating which mirnas have the most significant targets in the first set of collected proteins the investigation will lead in a much simpler way to the same outcome? 1st Revision - authors' response 28 October 2011 European Molecular Biology Organization 3

4 Manuscript Number: MSB Title: Global microrna level regulation of EGFR-driven cell cycle protein network in breast cancer Reviewer #1 (Remarks to the Author): Uhlmann et al., used a systems biology approach to identify mirnas controlling the EGFR pathway. The results show interesting networking aspects of this regulation. Moreover, it led to the identification of novel tumor suppressor mirnas. To the best of my knowledge, this is the first study of this kind and therefore, this manuscript can be considered of high relevance. The text is organized and well written. It certainly has a lot of potential to be accepted. However, it is important that the authors address the points listed below: 1)The entire study was done with only one cell line, MDA-MB-231. The authors do not present a rationale to use this particular line. To increase the relevance of their results, it is important that the most crucial results be recapitulated in at least another cell line. The use of a cell line from another tumor type is particularly encouraged. EGFR is important in glioblastoma, for instance. We fully accepted this criticism. In line with the reviewer s suggestion, we have now extended our analyses to another breast cancer cell line (MCF-7) as well as to a glioblastoma cell line (U87). Three mirnas (mir-124, mir-147 and mir-193a-3p), which we had identified as tumor suppressors in the MDA-MB-231 breast cancer cell line with the networkbased proteomics approach (current Figure 6 and 7), have been investigated in these two additional cell lines, with same assaying techniques including cell viability and cell cycle, that we had initially applied in the MDA-MB-231 cell line. In addition, we performed luciferase reporter assays to validate the direct targeting and qrt-pcr to verify mrna level reduction to be recapitulated also in the other cell lines investigated. Indeed, all the major results we had obtained with the MDA-MB-231 cell line about these three mirnas were supported by results from the other two cell lines: Similar to the results we had obtained in MDA-MB-231 cell line, all three mirnas (mir-124, mir-147 and mir- 193a-3p) led to the reduction of cell population at S-phase and reduced cell viability of the other cell lines (Figure 7E and F). Furthermore, all direct targeting regulations were recapitulated in these two cell lines (Supplementary Figure S9A). The only differences observed were different targeting mechanisms for AKT2 by mir-124 or mir-147. While targeting of AKT2 by mir-124 or mir-147 was via translation repression in MDA-MB-231 cell line, it was via mrna cleavage in MCF-7 (in case of mir-147) and in U87 (in case of both mir-124 and mir-147) (Supplementary Figure S9B). Considering the different origin and heterogeneity of the additional cell lines tested (ER+ adenocarcinoma MCF-7 and glioblastoma U87), we conclude that our observations are indeed not cell-line specific, but can be generalized also to other cell systems. Along these lines we have added new text and figures in the revised version of the manuscript in the results (presented as Figure 7E, F and Supplementary Figure S9) and discussion within the main text (Page 14, paragraph 1 and Page 18, paragraph 2), and also provided further below. 1

5 Figure 7 (E) The effects of mirnas on cell cycle progression of MCF-7 and U87 cell lines. After overexpressing the mirnas in MCF-7 or U87 cells, total DNA content (7-AAD) and S-phase population (BrdU) were determined by flow cytometry after 48 hours. The red rectangles show the gating of proliferating cells in S- phase (percentages given in the rectangles). Results are shown as representative of three biological and three technical replicates. (F) The effects of mirnas on the viability of MCF-7 and U87 cell lines. After overexpressing the mirnas in MCF-7 or U87 cells, viable cells were counted by a luciferase-based viability assay after 72 hours. The results are shown as average of 2 biological and 5 technical replicates. Two asterisks (**) denote the p-value less than 0.01, three asterisks (***) denote the p-value less than Supplementary Figure S9. Validation of direct targeting and quantification of mrnas upon transfection with mir-124, mir-147 and mir-193a-3p in MCF-7 and U87 cell lines. (A) Validation of direct targeting of genes by mirnas. A luciferase reporter assay was performed after co-transfecting MCF-7 or U87 cells with the luciferase vector harbouring the 3 -UTR of the gene of interest and mirna mimics. Luciferase activities were measured after 48 hours of transfection, and results are presented as average of 4 biological and 3 technical replicates. P1, P2 and P3 denote the parts of 3 -UTRs analysed for the genes with very long or difficult to clone 3 -UTRs. (B) Quantification of effects of mir-124, mir-147 and mir-193a-3p on the mrna level by qrt-pcr in MCF-7 and U87 cell lines. The results are presented as average of 2 biological and 3 technical replicates. Two asterisks (**) denote the p-value less than 0.01, three asterisks (***) denote the p-value less than

6 2)It would be good if the authors include in the main body of the paper a table or a figure showing all the genes that were screened. In the revised version of the manuscript, the proteins which were screened are now provided as Table 1 in the main body of the paper. This table includes Official HGNC Gene Symbol, Protein Symbol, UniprotKB accession numbers and Protein Names. For simplicity we used merely protein names throughout the revised version of the manuscript. 3)Figure 1 does not match the legend. This needs to be fixed. This has been fixed in the revised version of the manuscript. 4)Figure 2 has major problems. Parts A, B and C are too small and show poor resolution. I suggest authors to increase size and show these three parts as an independent figure. We agree with this criticism which was also raised by the other reviewer (Reviewer 2). Accordingly, we have modified Figure 2 where the networks in Figure 2B are now new Figure 2C and 2D with higher resolution. Furthermore, the mirna-protein network in Figure 2C is now the current Figure 3 where all the names of mirnas regulating the proteins were given in Supplementary Table S7. 5)Figure 4. To avoid confusion, use 1, 2,3... instead of (a), (b), (c). In the revised version of the manuscript, the old Figure 4 is now Figure 5. Subfigures are presented as (1), (2), (3), (4) and (5) instead of (a), (b), (c), (d) and (e). 6)The authors failed to discuss Boutz et al., 2011 (JBC 286(20): ). This is another example of mirna analysis using proteomics and demonstrates the networking aspects of mir-122 regulation. Therefore, this manuscript is related very closely to the content of this article. We thank the reviewer for drawing our attention to this very relevant paper. In the revised version of the manuscript we cited and discussed this study in the Introduction section (Page 4, paragraph 1). 7)To polish the low stringency network, the authors chose to filter the data by selecting only genes derived from predictions or already validated. These parameters might be too stringent. I wonder if the authors tried to perform a less stringent analysis, by including genes with mapped seed sequences. In order to provide the reviewer as well as readers with a network based on less-stringent statistical significance (p<0.05) and less-stringent sequence-matching requirement (seed sequence mapping), we used pre-compiled microrna-mrna sequence mappings provided by the microrna.org database ( Release August 2010). In this dataset, all target site predictions which have a 6-mer or better were listed. 3

7 With this dataset, we built a network of less stringency than current Figure 3 (Figure 2C in the first submission), namely keeping all edges where p<0.05 and a target site (6-mer or better) is predicted. The network is visualized as below (which is also provided in the Supplementary Figure S5). Supplementary Figure S5: Intermediate network with relaxed seed-matching filter. Each yellow dot indicates one microrna, and proteins are shown in blue boxes. There are 241 micrornas and 25 genes/proteins (PIK3CB did not have microrna mapped to), and 355 edges connecting them. Note that edges in this network have been filtered by (1) screening z-score z<1.96 (corresponding to two-sided p<0.05) and (2) micrornas have been mapped to gene 3 -UTR by at least one six-mer match or better. As one could suspect, many more edges are visible in this network compared to the one shown in Figure 3. The complexity of this intermediate network (total edge number E=355) lies between the one in the current Figure 2C (E=1783), where no sequence-matching filter is set, and the network in the current Figure 3 (E=120), where a more stringent sequencematching filter based on evolutionary conservations is active. We prefer the network in the current Figure 3 to the intermediate network in the main text for the sake of visualization clarity, while providing the latter one as supplementary information (Supplementary Figure S5 and Supplementary Table S6). This part is now implemented into the main text (Page 7, paragraph 3 and page 8, paragraph 1). 8)The results shown in Figure 6 (luciferase and RNA analysis) do not entirely recapitulate the ones shown in figure 5. Do the authors have any comments on that? There are commercially 4

8 available 3'UTR reporter collections (Switchgear Genomics, for instance). The authors might get better results using an optimized reporter. For the initial submission, in Figure 6C and 6D we used gene symbols while we used protein symbols in Figure 5B and 5C. This obviously led to confusion. In order to improve consistency between these figures, we have changed all gene symbols in Figure 6 (new Figure 7) to protein symbols. Furthermore, we have prepared a table (Table 1) in the main body of the manuscript with both gene and protein symbols. We used protein symbols throughout the manuscript in the revised version of the manuscript. From the experimental point of view and as we explain in the revised version of the manuscript, we had tested the effects of each mirna (mir-124, mir-147 and mir-193a-3p) on the down-regulated proteins (old Figure 5B and 5C, new Figure 6B and 6C) in luciferase assays (old Figure 6C, new Figure 7C) to test direct targeting. Furthermore, to differentiate between targeting at mrna and protein levels, we had tested the effects of all three mirnas also at the mrna level (old Figure 6D, new Figure 7D). We had been aware of the commercially available 3 -UTR luciferase reporter construct collection that is suggested by the reviewer. We had tested constructs from this collection and compared the results with those obtained with the vectors that are described in the manuscript and that we have cloned using psicheck2. The results obtained with the psicheck2 system were more robust, likely because the Renilla luciferase and the Firefly normaliser are present in the same vector backbone. Introduction of Firefly luciferase in the psicheck2 (Promega) vector allows normalization of Renilla luciferase expression, achieving robust and reproducible results whereas a second vector is required for normalization in case of the alternative system which, however, could lead to different expression levels as well as to more stress in the cells. We compared the two reporter systems (one from Switchgear and one from psicheck2) for the 3 -UTR of the EGFR gene which is a known direct target of mir-7 (Webster et al, J Biol Chem, 2009) as an example. As the results below indicate the luciferase reporter assay results seem to be similar for the two systems, where the psicheck2 appears to be slightly more robust. These results are not part of manuscript, but are provided in the Figure below. 5

9 Figure I: Luciferase reporter assay results comparing the targeting of 3 -UTR of EGFR by mir-7 using both psicheck2 (vectors we used for validation of direct targeting) and plightswitch (vector commercially available from SwitchGear). Reviewer #2 (Remarks to the Author): The paper by Uhlman et al present a comprehensive paper of a microrna screen based approach in order to identify the effect of micrornas on a set of human proteins with focus on teh EGFR protein network. This is the first time that a systematic approach is presented using all micrornas on a set of protein targets. The authors should be commended for undertaking a technically difficult proteomic approach utilising reverse phase arrays. The data analysis involved the development of network analysis tools that will be of interest for microrna based and other systems biology approaches. The main conclusion of the paper is that three micrornas (mir-124, mir147 and mir-193-3p) have been identified that target the EGFR driven cell cycle network. Extensive validations using a number of techniques (expression constructs, Western blots) are supporting the findings. This represents a novel finding and a model study for future investigations. Some points on the paper: 1.The main figures 2&3 are not very informative in their present form. Figure 2 parts A-B are too small and complex to be of any meaningful value, at least in an A4 format. These should be revised in order to make part A bigger and part be to be retained as a graph (cum. edge number vs z-score) to explain the thresholds used. The networks can be presented as separate suppl. Figures in an enlarged format. In Figure 2. C the extra orange pinheads" representing other mirnas are not very informative. Either include which mirnas are important or leave out. Alternatively the networks in part B can be presented as a new figure 3 and figure 3 can be moved into supplementary. We thank the reviewer for his/her suggestions which were also raised by reviewer 1. In the revised version of the manuscript, as suggested by this reviewer, we kept old Figure 2A with better resolution and only the middle panel of Figure 2B (cumulative edge number vs z-score) as new Figure 2B. The networks (dense and sparse, left and right panels in old Figure2B) are now represented as new Figures 2C and 2D with increased size. This has indeed led to a better resolution of both figures. Furthermore, old Figure 2C is now presented as a separate figure (Figure 3). In order to improve the representation of its content, we initially removed the orange pinheads from the figure and wrote the names of the mirnas instead. However, this compromised the clarity and resulted in very complicated figure. Therefore, we prefer the network as shown in Figure 3 for the sake of visualization clarity. Instead of complicating the figure, we now provide the names of all mirnas that are shown as orange pinheads in Figure 3 in the supplementary information (Supplementary Table S7). The old Figure 3 is presented as Figure 4 in the revised version of the manuscript. 2. In the introduction the authors mention that transcriptomic approaches to mirna target identification have a weakness in not being able to identify targets regulated at the protein level, while proteomic approaches are either not amenable to high through put or not sensitive enough to detect large numbers of proteins. In the transcriptomic approaches the authors should include RISC-IP followed by RNA-seq (or array hybridization) as another tool 6

10 that can identify directly irrespective of whether the steady state levels of an mrna or protein change). We thank the reviewer for drawing our attention to this very relevant method used to identify targets of mirnas at transcriptomic level. This method relies on immunoprecipitation of the RISC complex and the associated transcripts which can then be identified and quantified using either array-based or sequence-based platforms. We now mention this methodology and implemented the references (Karginov FV, 2007, PNAS; Hanina et al, 2010, PLoS Genetics; Grey et al, 2010, PLoS Pathogens; Thomson et al, 2011, NAR) and discussed its potential advantages (e.g. identification of 5 -UTR targeting) and disadvantages (e.g. the necessity of stable interactions between mirna, mrnas and Ago proteins to survive immunoprecipitation process) in the Introduction section (Pages 3, paragraph 2). 3. The authors should comment on the fact that their results are obtained using a model cell line system and the microrna-target relationships must be further validated in large breast cancer sample collections through expression profiling or immunohistochemistry based approaches We are well aware of this fact which was also raised by another reviewer (Reviewer 1). In this line, first we have tested our results also in another breast cancer cell line as well as in a glioblastoma cell line (Figure 7E, F and Supplementary Figure S9). The findings we had made in the MDA-MB-231 cell line were recapitulated also in these other cell lines validating our conclusions and proving that the induced effects were not cell-line specific. Furthermore, in the revised version of the manuscript, we discussed that mirna-target relationships should further be validated in large cancer sample sets at mrna and/or protein level using expression profiling and/or IHC-based or protein array-based methods. This is now provided in the Discussion section (Pages 18, paragraph 2). Reviewer #3 (Remarks to the Author): The paper of Uhlmann et. al. aims to describe the mirna regulation level of EGFR-driven cell cycle. The authors farther aim the development of a novel network analysis methodology. I have the following concerns regarding the paper: 1) The authors choose to investigate a number of proteins based on a previous publication and after analysis of deep sequencing data of the specific cells. Then they investigate the expression of 810 mirnas in relation of these proteins, but is not mentioned if all these mirnas ( and in which levels - from deep sequencing data for example ) are normally expressed in the same cells. We examined the expression of mirbase registered mirnas in the cell system MDA-MB- 231 by using both in-house microarray profiling data and published next-generation sequencing (NGS) data (Sun et al, PLoS ONE, 2011; 16: e17490) (see Methods below). Both methods identified a large body of known human micrornas that were expressed in the MDA-MB-231 cell system (N=429 detected with microarray with p<0.01, and N=598 by 7

11 next-generation sequencing with maximum 2 mismatches to mature microrna sequences). Out of them 362 micrornas were identified to be expressed by both methods (the count of overlapping micrornas was highly significant, p<2.7e-18, hypergeometric test). Based on these data, we report approximately 52% (429) of the 810 micrornas tested to be above the detection limit of microrna microarray, and 74% (598) with next-generation sequencing. Thus, expression of a large proportion (45%, 362) of mirnome in the MDA-MB-231 cell system was supported by two independent technologies (please see figure below). Supplementary Figure S1: Expression levels of human micrornas in the MDA-MB-231 cell system using both array-based and sequence-based platforms. (A) Lines are density estimations of expression levels. In the microarray dataset, blue dots indicate the threshold of detection score (p<0.01, or equivalently, expression value x>6.22) N indicates the number of expressed micrornas reported by each technology. (B) The Venn diagram shows the overlap of the two lists. We reasoned that this information should not necessarily be part of the manuscript as in a gain-of-function screen (such as the mirna mimic library screen in our study) endogenous expression of the respective mirnas is not of equal importance as in a loss-of-function screens (e.g. using antagomirs - or sirnas). In the latter context lack of expression of target mirnas could have led to false-positive phenotypes. Still, in the revised manuscript, we provide the information on how many of the mirnas were indeed found to be expressed in the MDA-MB-231 cell line (Supplementary Figure S1 and Page 5, paragraph 2) and details on the expression analysis as Supplementary Information. 8

12 Methods of data analysis: 1. Microarray profiling of micrornas was performed with Illumina Human MicroRNA Expression Profiling Array (version 2). Background signal was calculated and subtracted with the default method of the commercial software shipped with the microarray. Raw data was normalized by the variance-stabilization normalization (VSN) algorithm with the vsn package in Bioconductor. We calculated the detection score (DS), a score ranging from 0 to 1, for each microrna present on the microarray: DS equals the probability that the microrna is not expressed: therefore DS=0 suggests microrna might be expressed, while DS=1 suggests the microrna s expression is under the detection limit of the microarray. The calculation of DS use expression signals of scramble control probes: if a probe matching the mature sequence of a potentially expressing microrna has similar signal intensities like background controls, its DS will be towards 1 and hence one may assume that its expression is under the detection limit. By setting the threshold of DS as DS<0.01, equivalent to the (vsn-normalized) expression values larger than 6.22, the microarray experiment reported 429 mirbase micrornas whose expression was in the dynamic range of detection of the microrna microarray (p<0.01). 2. The small RNA next-generation sequencing (NGS) data was re-analyzed from a publicly available study (Sun et al, PLoS ONE, 2011; 16: e17490). The reads were first trimmed off adaptor sequences with Flicker, an add-on to the Illumina Genome Analyzer Pipeline software. Then they were aligned with the ELAND software against mirbase (maximum mis-match per read: 2 bases). Altogether 16.7 million reads were used for trimming and aligning. We only retain 5.1 million (30%) reads, which were aligned to both human genomic DNA and mirbase mature microrna sequences. These reads consist of distinct sequence tags, which were mapped to 598 mature microrna major and minor product sequences. 2) The first computational analysis involves a mirnome-protein interaction network where mirna are linked to proteins ( up- & down regulation). This data is then farther analyzed including computational mirna target prediction. The authors are not explaining if the results of Fig. 2c could not directly be computed by using computational target prediction and the expression levels of mirna and proteins (mirna present, protein down regulated, computational mirna:protein target present). We thank the reviewer for this interesting suggestion. To test this, we performed computational target prediction based on expression correlation analyses with two publicly available datasets (NCI60 cancer cell line panel, covering 60 cancer cell lines of 9 different tumor types). In-silico prediction was able to imply the potential regulatory relationships in a small proportion of microrna-mrna and microrna/protein pairs that we had identified with the proteomics screening (Supplementary Table S9). However, the method misses a large part of the microrna-protein network identified in the screening (Figure 2C, New Figure 3). In addition, we note that the prediction method based on correlation provides a very long list of candidate microrna-gene pairs, even after multiple testing corrections, which impede one-by-one experimental validations. In the revised manuscript, we implemented the details of the expression correlation analysis in the Supplementary Information and discussed its limitation and relationships to our screening 9

13 approach in the main text (Page 16, paragraph 2). We reason, however, that the correlation analysis, parallel to predictions based on sequence matching between micrornas and mrnas, could indeed be used as filters to prioritize microrna-gene pairs for experimental validation. However, due to the high false-positive and false-negative rates, these computational methods cannot replace wet-lab experiments like the proteomics screening approach we have followed. This non-exchangeability has been further supported, from another aspect, by a recent study where Shabab et al determined the levels of mirnas and their targeted mrnas by laser captured micro-dissected (LCM) in ovarian cancer epithelial cells (CEPI) and compared with levels present in ovarian surface epithelial cells (OSE). They found that inverse correlations between changes in levels of mirnas and those of their mrna targets were identified in only 11% of the computational predictions. Furthermore, positively correlated targets were detected at a similar rate as negatively correlated ones (Shabab et al, 2011, PLoS ONE ; 6(7):e22508). Methods of in silico target prediction with expression correlation: We took advantages of two large-scale public expression datasets produced by different technologies, and performed computational target predictions. The two datasets are: 1. Paired mrna and microrna expression profiling with Agilent microarray platforms (Liu et al, Molecular Cancer Therapeutics, 2010; 9: ) (NCI-60-mm hereafter, mm stands for microrna-mrna) 2. MicroRNA expression profiling used in the NCI-60-mm study, paired with the protein profile data set of NCI-60 cell lines based on reverse-phase protein lysate arrays (Shankavaram et al, Molecular Cancer Therapeutics, 2007; 6: ) (NCI-60-mp hereafter, mp stands for microrna-protein) Both datasets take use of the well-studied NCI-60 tumor cell line panel representing nine different tumor types, and altogether they construct a three-tier network of microrna, mrna and proteins. Expression correlation analyses here are based on the simple assumption that their expression should reflect in the form of high (anti-)correlation provided there were true regulatory relationships between micrornas and genes (mrna or protein). Since in most known cases micrornas negatively regulate gene expression, we anticipate negative correlations to exist between the expression of micrornas and their targets. This assumption, however, is very naïve considering that (1) gene expression can be modulated by many factors, therefore, a linear correlation might not be expected between regulating pairs and (2) correlation does not imply causality, i.e., even a significant negative correlation coefficient was determined, no conclusion about targeting could be made. Correlation analysis with the NCI-60-mm dataset The mrna and microrna expression profiling data of NCI-60 cell lines (Liu et al, Molecular Cancer Therapeutics, 2010; 9: ) was downloaded from the CellMiner database ( After normalizations as described in the publication, both mrna and microrna probes were filtered by the "gisgenedetected" flag in the raw data file. Only those mrna/microrna probes that were expressed in at least one cell line were used: mrna probes and 422 microrna probes passed this filter. 10

14 Subsequently, probes with very low expression variations among cell lines were removed. Inter-quantile ranges (IQR) were used as filter, and only probes with IQR no less than 0.75 were further used for the correlation analysis (see Figure below). After applying this filter, mrna probes and 345 mirna probes remained for correlation analysis. Supplementary Figure S10: Distributions of interquantile ranges (IQR) in both mrna and microrna datasets of NCI-60 panel of cancer cell lines. Red dot line indicates the threshold, under which the probe will be removed. The top-right legend (A/B) shows the number of remaining probes (A) and that of all probes after the expression filter was applied (B). Pearson correlation coefficients were computed pairwise between each gene and microrna, giving a 24697x345 matrix. P-values were calculated based on Pearson correlation coefficients subsequently (with F-statistic, degree of freedom (1,58)). To alleviate the multiple testing problem, all p-values were corrected by the Benjamini-Hochberg (BH) method (less stringent, estimating FDR), as well as the Bonferroni method (very stringent) (102300) microrna-mrna pairs had BH-adjusted p<0.05 (0.01), and 4700 (3093) pairs had Bonferroni-adjusted p<0.05 (0.01). If we only focus on microrna-mrna pairs with significant negative correlations, pairs have BH-adjusted p<0.05 and 1590 pairs had Bonferroni-adjusted p<0.05. Correlation analysis with the NCI-60-mp dataset The dataset was also downloaded from the CellMiner database. Data pre-processing was done similarly as in the microrna/mrna correlation analysis above. Relative abundances of 89 proteins were available for the correlation analysis. The only difference was that no variancefiltering was performed on the protein data, since almost all proteins had moderate to large variances among cell lines. Pairwise Pearson correlation coefficients were also converted into p-values. After multiple testing corrections, 1311 (475) microrna-protein pairs had BH-adjusted p<0.05 (0.01); 77 (39) pairs had Bonferroni-adjusted p<0.05 (0.01). If we only focus on those micrornaprotein pairs with significant negative correlations, 594 pairs had BH-adjusted p<0.05 and 25 pairs had Bonferroni-adjusted p<

15 Co-mapping the results of correlation analyses and results of this study Out of 120 mirna-protein interactions shown in figure 2C (figure 3 in the revised manuscript), only 5 pairs could have been detected from significant negative correlations with the Benjamini-Hochberg multiple testing correction (BH p<0.05). Note that, none of them had Bonferroni adjusted p<0.05 (see below, Supplementary Table S9). mirna GeneSymbol z-score cor p-value p.bhadj p.bonadj hsa-mir-148a ERRFI e hsa-mir-194 MAPK e hsa-mir-200b PLCG e hsa-mir-429 PLCG e hsa-mir-93 CCND e Supplementary Table S9: Common mirna-target pairs obtained by both our screen approach and mirna/target correlation analyses. Among 89 proteins tested in the NCI-60-mp dataset, 4 proteins were also investigated in the current study (CDK4, GRB2, MAPK1 and PTPN11). In our work, we identified 29 TargetScan predicted micrornas to negatively regulate these 4 proteins (z<-1.96), forming 34 binary microrna-protein pairs (several micrornas regulated more than one of these four proteins). However, none of these pairs had significant negative correlations observed in the NCI-60-mp dataset (BH adjusted p-value p<0.05). One may criticize our method of multiple testing might be too strict, since we used all mirna-mrna (mirna-protein) data for the correction, instead of using only mrnas or proteins that were used in our study. In fact, even if we use uncorrected p value p<0.05 of the correlation coefficient as filtering threshold, only 12 microrna-mrna pairs could have been discovered from the NCI-60-mm study or merely one microrna-protein pair (mir- 153:GRB2) from the NCI-60-mp study. At such a significance level, however, there are microrna-mrna pairs and 5817 microrna-protein pairs to be tested. In summary, out of 120 mirna-protein interactions we have identified in this study, only very few (5~13 depending on the significance level) could have been computed from correlation analyses based on mrna-microrna-protein expression datasets of the NCI-60 panel. In conclusion, this approach misses most interactions. Some reasons for this discrepancy have been discussed above in the limitation of this method. Besides, inconsistencies between mrna and protein expression levels could further add to the low amount of information obtained only with the bioinformatic prediction. 3) In fig. 3 the authors analyze the relation of the seed match between mirna and protein sequence to the repression of the protein expression. Such analysis has been done several times and is not producing some new knowledge. We thank the reviewer for the suggestion. Having considered the volume of novel knowledge this figure delivers as well as the integrity of this work, however, we decided to keep this 12

16 figure in the revised manuscript. Our reasoning started with reviewing relevant approaches in literature: 1. Selbach et al., has done a similar analysis where they examined the effect of a single mirna (mir-155) on 3,299 proteins (Selbach et al, 2008, Nature). 2. Baek et al., did a similar analysis upon mir-124 expression and upon deletion of mir-223 in mouse neutrophils at both mrna and protein levels (Baek et al, 2008, Nature). 3. Boutz et al., also did a similar analysis by comparing downregulated proteins with predicted ones upon mir-122 expression (Boutz et al, 2011, JBC). However, none of these studies examined or compared the seed-3 -UTR match at genomewide mirna level on more than two dozens of proteins. Therefore, our analysis with high dimensional mirna number (810) vs moderate number of proteins (26), especially at the protein level, provides new knowledge to both mirna and protein research field. 4) The authors select three mirnas and investigate their targets and their role. Could it be that by just investigating which mirnas have the most significant targets in the first set of collected proteins the investigation will lead in a much simpler way to the same outcome? We have tested the approach suggested by the reviewer. Taking three different significance values of screening z-score (with two-sided p-values p<0.10, p<0.05 and p<0.01), the top 10 micrornas of having most significant protein targets are listed in the following table (Table I): p<0.10 p<0.05 p<0.01 hsa-mir-124 (17) hsa-mir-124 (16) hsa-mir-124 (7) hsa-mir-491 (13) hsa-mir-147 (10) hsa-mir-661 (7) hsa-mir-661 (13) hsa-mir-661 (10) hsa-mir-892b (7) hsa-mir-147 (12) hsa-mir-124* (9) hsa-mir-124* (5) hsa-mir-21 (12) hsa-mir-342 (9) hsa-mir-193a-3p (5) hsa-mir-892b (12) hsa-mir-17-3p (8) hsa-mir-26a (5) hsa-mir-124* (11) hsa-mir-193a-3p (8) hsa-mir-105 (4) hsa-mir-17-3p (11) hsa-mir-892b (8) hsa-mir-147 (4) hsa-mir-193a-3p (11) hsa-mir-21 (7) hsa-mir-21 (4) hsa-mir-26a (11) hsa-mir-491 (7) hsa-mir-491 (4) Table I: Top 10 micrornas having most significant protein targets. Each column shows the top 10 micrornas, ranked by the number of significantly regulated proteins at that given significance level. The numbers in parentheses indicate the respective number of protein targets. Our first observation was that despite slight variations in gene lists at different significance levels, many of these micrornas always had more targets than other micrornas: 7 micrornas (mir-124*, mir-124, mir-147, mir-193a-3p, mir-21, mir-491, mir-661 and mir-892b) were among the top 10 at all three levels. We take this as evidence supporting the robustness of our screening data: the relative importance of these micrornas remained stable, when measured by number of significantly regulated proteins, irrespective of the significance level that was set. 13

17 Indeed, all three micrornas selected by the network approach were among the abovementioned 7 micrornas. While mir-124 ranked the first at all three significance levels, mir- 147 and mir-193a-3p had more variations in the relative ranking: mir-147 ranked 3 rd /2 nd /3 rd (tied) at three levels, and mir-193a ranked 4 th /4 th /2 nd (tied). Although a simpler p-value approach might also bring us to the same conclusion (maybe at the cost of more experimental validations because of ties in the ranks), we believe that the network approach has the following advantages that the simple method of counting significant targets does not possess. By using the network approach, we do not only discern which micrornas regulate what proteins. Importantly, we could identify which proteins were co-regulated by micrornas, and whether these co-regulations are statistically significant. The ability of answering these questions is exclusively rendered by the bipartite network analysis. As visualized in new Figure 6C (old Figure 5C), by establishing the more-than-random regulation model, we could show that proteins controlling cell cycle in the MDA-MB-231 are significantly co-regulated by multiple micrornas, to which mir-124, mir-147 and mir-193a-3p belong to. We believe that the simplified method illustrated above might indeed have led us to conclude that cell cycle proteins would be co-regulated by these top micrornas, however, under these circumstances it would remain unclear whether or not this co-regulation was a random effect. We think it is important to address this question, because the aim of this study had not simply been to detect and analyze pair-wise regulatory relationships between micrornas and proteins: on top of that, we were out to discover patterns behind the scenes of microrna regulations. The network approach we have devised has given us the chance of having a glimpse of one such pattern: the more-than-random co-regulation of cell cycle proteins. Therefore, we devised and applied the network approach to build the consensus network of proteins that were non-randomly co-regulated by micrornas; and within this consensus network, the cell cycle module was used as an example to demonstrate the co-regulation induced by three micrornas. 14

18 Molecular Systems Biology Peer Review Process File 2nd Editorial Decision 22 November 2011 Thank you again for submitting your work to Molecular Systems Biology. We have now heard back from the two referees who agreed to evaluate this revised study. As you will see, the referees feel that the revisions made to this work have satisfied their concerns, and they are now supportive of publication. Before this work will be acceptable for publication at Molecular Systems Biology, we have some issues of an editorial nature that we would ask to address in a final revision of this work. 1. The editor feels that this work could use a careful and thorough writing revision to improve its readability. When revising this manuscript please keep the following general writing tips in mind: --Avoid overly-long paragraphs. Paragraph breaks should be clearly marked by an empty line or indentation of the first line. --Be concise. --Use active voice whenever possible. To help illustrate these points, I have attached an edited version of your manuscript with suggestions for changes in the abstract and introduction. Before resubmitting this work, we strongly encourage you to have the full document proof-read by a native English speaker. 2. The resolution of the supplied figure images remains somewhat low; the text in many cases is noticeably blocky/blurry when zooming in. You will get the best results if the figures are made directly in a professional quality vector graphics program like Illustrator or the free, opensource alternative Inkscape, and saved directly in EPS format (PDF may also be acceptable). Please feel free to contact us if you have any questions about these points. Thank you for submitting this paper to Molecular Systems Biology. Sincerely, Editor Molecular Systems Biology msb@embo.org Referee reports Reviewer #1 (Remarks to the Author): The authors managed to correct all the deficiencies and address the concerns raised by the reviewers. This a superb article that certainly will cause a great impact. It is definitely ready to be accepted. Reviewer #3 (Remarks to the Author): The authors have successfully answered my comments 2nd Revision - authors' response 09 December 2011 European Molecular Biology Organization 4

Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6

Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6 Analyzing microrna Data and Integrating mirna with Gene Expression Data in Partek Genomics Suite 6.6 Overview This tutorial outlines how microrna data can be analyzed within Partek Genomics Suite. Additionally,

More information

岑 祥 股 份 有 限 公 司 技 術 專 員 費 軫 尹 20100803

岑 祥 股 份 有 限 公 司 技 術 專 員 費 軫 尹 20100803 技 術 專 員 費 軫 尹 20100803 Overview of presentation Basic Biology of RNA interference Application of sirna for gene function? How to study mirna? How to deliver sirna and mirna? New prospects on RNAi research

More information

Tutorial for proteome data analysis using the Perseus software platform

Tutorial for proteome data analysis using the Perseus software platform Tutorial for proteome data analysis using the Perseus software platform Laboratory of Mass Spectrometry, LNBio, CNPEM Tutorial version 1.0, January 2014. Note: This tutorial was written based on the information

More information

OriGene Technologies, Inc. MicroRNA analysis: Detection, Perturbation, and Target Validation

OriGene Technologies, Inc. MicroRNA analysis: Detection, Perturbation, and Target Validation OriGene Technologies, Inc. MicroRNA analysis: Detection, Perturbation, and Target Validation -Optimal strategies to a successful mirna research project Optimal strategies to a successful mirna research

More information

Exiqon Array Software Manual. Quick guide to data extraction from mircury LNA microrna Arrays

Exiqon Array Software Manual. Quick guide to data extraction from mircury LNA microrna Arrays Exiqon Array Software Manual Quick guide to data extraction from mircury LNA microrna Arrays March 2010 Table of contents Introduction Overview...................................................... 3 ImaGene

More information

Regulation of telomeres by mirnas in human cancer

Regulation of telomeres by mirnas in human cancer Regulation of telomeres by mirnas in human cancer Stefan Schoeftner, PhD LNCIB, Trieste stefan.schoeftner@lncib.it SIES Discutiamo Insieme Florence, 2.11.214 Shelterin components and telomerase are key

More information

Outline. MicroRNA Bioinformatics. microrna biogenesis. short non-coding RNAs not considered in this lecture. ! Introduction

Outline. MicroRNA Bioinformatics. microrna biogenesis. short non-coding RNAs not considered in this lecture. ! Introduction Outline MicroRNA Bioinformatics Rickard Sandberg Dept. of Cell and Molecular Biology (CMB) Karolinska Institutet! Introduction! microrna target site prediction! Useful resources 2 short non-coding RNAs

More information

Guide for Data Visualization and Analysis using ACSN

Guide for Data Visualization and Analysis using ACSN Guide for Data Visualization and Analysis using ACSN ACSN contains the NaviCell tool box, the intuitive and user- friendly environment for data visualization and analysis. The tool is accessible from the

More information

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals

Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Systematic discovery of regulatory motifs in human promoters and 30 UTRs by comparison of several mammals Xiaohui Xie 1, Jun Lu 1, E. J. Kulbokas 1, Todd R. Golub 1, Vamsi Mootha 1, Kerstin Lindblad-Toh

More information

A role of microrna in the regulation of telomerase? Yuan Ming Yeh, Pei Rong Huang, and Tzu Chien V. Wang

A role of microrna in the regulation of telomerase? Yuan Ming Yeh, Pei Rong Huang, and Tzu Chien V. Wang A role of microrna in the regulation of telomerase? Yuan Ming Yeh, Pei Rong Huang, and Tzu Chien V. Wang Department of Molecular and Cellular Biology, Chang Gung University, Kwei San, Tao Yuan 333, Taiwan

More information

Core Facility Genomics

Core Facility Genomics Core Facility Genomics versatile genome or transcriptome analyses based on quantifiable highthroughput data ascertainment 1 Topics Collaboration with Harald Binder and Clemens Kreutz Project: Microarray

More information

1-7 2-8 3-9 4-10 5-11 6-12 7-13 8-14 9-15. Corresponding position of sirna GS

1-7 2-8 3-9 4-10 5-11 6-12 7-13 8-14 9-15. Corresponding position of sirna GS A C GS Target transcript 5ʼ- - 3ʼ 3ʼ- 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1-5ʼ 10-16 11-17 12-18 13-19 14-20 15-21 sivim-270 3ʼUTR 1-7 2-8 3-9 4-10 5-11 6-12 7-13 8-14 9-15 Corresponding

More information

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE

AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE ACCELERATING PROGRESS IS IN OUR GENES AGILENT S BIOINFORMATICS ANALYSIS SOFTWARE GENESPRING GENE EXPRESSION (GX) MASS PROFILER PROFESSIONAL (MPP) PATHWAY ARCHITECT (PA) See Deeper. Reach Further. BIOINFORMATICS

More information

MicroRNA Mike needs help to degrade all the mrna transcripts! Aaron Arvey ISMB 2010

MicroRNA Mike needs help to degrade all the mrna transcripts! Aaron Arvey ISMB 2010 Target mrna abundance dilutes microrna and sirna activity MicroRNA Mike needs help to degrade all the mrna transcripts! Aaron Arvey ISMB 2010 Target mrna abundance dilutes microrna and sirna activity Erik

More information

LightSwitch Luciferase Assay System

LightSwitch Luciferase Assay System Constructs, Assays & Services Assay System Promoter GoClones Synthetic Response Elements Pathway Cell Lines 3 UTR GoClones mirna Mimics & Inhibitors Synthetic mirna Targets Transfection Reagents Assay

More information

Gene Expression Analysis

Gene Expression Analysis Gene Expression Analysis Jie Peng Department of Statistics University of California, Davis May 2012 RNA expression technologies High-throughput technologies to measure the expression levels of thousands

More information

Relative Quantification of mirna Target mrnas by Real-Time qpcr. 1 Introduction. Gene Expression Application Note No. 4

Relative Quantification of mirna Target mrnas by Real-Time qpcr. 1 Introduction. Gene Expression Application Note No. 4 Gene Expression Application Note No. 4 Relative Quantification of mirna Target mrnas by Real-Time qpcr Ute Ernst, Jitao David Zhang, Anja Irsigler, Stefan Wiemann, Ulrich Tschulena Division: Molecular

More information

Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS)

Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS) Introduction to transcriptome analysis using High Throughput Sequencing technologies (HTS) A typical RNA Seq experiment Library construction Protocol variations Fragmentation methods RNA: nebulization,

More information

Network Analysis. BCH 5101: Analysis of -Omics Data 1/34

Network Analysis. BCH 5101: Analysis of -Omics Data 1/34 Network Analysis BCH 5101: Analysis of -Omics Data 1/34 Network Analysis Graphs as a representation of networks Examples of genome-scale graphs Statistical properties of genome-scale graphs The search

More information

Analysis of gene expression data. Ulf Leser and Philippe Thomas

Analysis of gene expression data. Ulf Leser and Philippe Thomas Analysis of gene expression data Ulf Leser and Philippe Thomas This Lecture Protein synthesis Microarray Idea Technologies Applications Problems Quality control Normalization Analysis next week! Ulf Leser:

More information

Next Generation Sequencing: Technology, Mapping, and Analysis

Next Generation Sequencing: Technology, Mapping, and Analysis Next Generation Sequencing: Technology, Mapping, and Analysis Gary Benson Computer Science, Biology, Bioinformatics Boston University gbenson@bu.edu http://tandem.bu.edu/ The Human Genome Project took

More information

PreciseTM Whitepaper

PreciseTM Whitepaper Precise TM Whitepaper Introduction LIMITATIONS OF EXISTING RNA-SEQ METHODS Correctly designed gene expression studies require large numbers of samples, accurate results and low analysis costs. Analysis

More information

FlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem

FlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem FlipFlop: Fast Lasso-based Isoform Prediction as a Flow Problem Elsa Bernard Laurent Jacob Julien Mairal Jean-Philippe Vert September 24, 2013 Abstract FlipFlop implements a fast method for de novo transcript

More information

PART 3.3: MicroRNA and Cancer

PART 3.3: MicroRNA and Cancer BIBM 2010 Tutorial: Epigenomics and Cancer PART 3.3: MicroRNA and Cancer Dec 18, 2010 Sun Kim at Indiana University Outline of Part 3.3 Background on microrna Role of microrna in cancer MicroRNA pathway

More information

Basic Analysis of Microarray Data

Basic Analysis of Microarray Data Basic Analysis of Microarray Data A User Guide and Tutorial Scott A. Ness, Ph.D. Co-Director, Keck-UNM Genomics Resource and Dept. of Molecular Genetics and Microbiology University of New Mexico HSC Tel.

More information

Frequently Asked Questions Next Generation Sequencing

Frequently Asked Questions Next Generation Sequencing Frequently Asked Questions Next Generation Sequencing Import These Frequently Asked Questions for Next Generation Sequencing are some of the more common questions our customers ask. Questions are divided

More information

Profiling of non-coding RNA classes Gunter Meister

Profiling of non-coding RNA classes Gunter Meister Profiling of non-coding RNA classes Gunter Meister RNA Biology Regensburg University Universitätsstrasse 31 93053 Regensburg Overview Classes of non-coding RNAs Profiling strategies Validation Protein-RNA

More information

Measuring gene expression (Microarrays) Ulf Leser

Measuring gene expression (Microarrays) Ulf Leser Measuring gene expression (Microarrays) Ulf Leser This Lecture Gene expression Microarrays Idea Technologies Problems Quality control Normalization Analysis next week! 2 http://learn.genetics.utah.edu/content/molecules/transcribe/

More information

Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey

Molecular Genetics: Challenges for Statistical Practice. J.K. Lindsey Molecular Genetics: Challenges for Statistical Practice J.K. Lindsey 1. What is a Microarray? 2. Design Questions 3. Modelling Questions 4. Longitudinal Data 5. Conclusions 1. What is a microarray? A microarray

More information

PREDA S4-classes. Francesco Ferrari October 13, 2015

PREDA S4-classes. Francesco Ferrari October 13, 2015 PREDA S4-classes Francesco Ferrari October 13, 2015 Abstract This document provides a description of custom S4 classes used to manage data structures for PREDA: an R package for Position RElated Data Analysis.

More information

Statistical issues in the analysis of microarray data

Statistical issues in the analysis of microarray data Statistical issues in the analysis of microarray data Daniel Gerhard Institute of Biostatistics Leibniz University of Hannover ESNATS Summerschool, Zermatt D. Gerhard (LUH) Analysis of microarray data

More information

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16 Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems

More information

Data Acquisition. DNA microarrays. The functional genomics pipeline. Experimental design affects outcome data analysis

Data Acquisition. DNA microarrays. The functional genomics pipeline. Experimental design affects outcome data analysis Data Acquisition DNA microarrays The functional genomics pipeline Experimental design affects outcome data analysis Data acquisition microarray processing Data preprocessing scaling/normalization/filtering

More information

Microarray Data Analysis. A step by step analysis using BRB-Array Tools

Microarray Data Analysis. A step by step analysis using BRB-Array Tools Microarray Data Analysis A step by step analysis using BRB-Array Tools 1 EXAMINATION OF DIFFERENTIAL GENE EXPRESSION (1) Objective: to find genes whose expression is changed before and after chemotherapy.

More information

G E N OM I C S S E RV I C ES

G E N OM I C S S E RV I C ES GENOMICS SERVICES THE NEW YORK GENOME CENTER NYGC is an independent non-profit implementing advanced genomic research to improve diagnosis and treatment of serious diseases. capabilities. N E X T- G E

More information

Supplementary Information

Supplementary Information Supplementary Information S1: Degree Distribution of TFs in the E.coli TRN and CRN based on Operons 1000 TRN Number of TFs 100 10 y = 619.55x -1.4163 R 2 = 0.8346 1 1 10 100 1000 Degree of TFs CRN 100

More information

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison RETRIEVING SEQUENCE INFORMATION Nucleotide sequence databases Database search Sequence alignment and comparison Biological sequence databases Originally just a storage place for sequences. Currently the

More information

Quantitative proteomics background

Quantitative proteomics background Proteomics data analysis seminar Quantitative proteomics and transcriptomics of anaerobic and aerobic yeast cultures reveals post transcriptional regulation of key cellular processes de Groot, M., Daran

More information

Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM lecrom@biologie.ens.fr

Lecture 11 Data storage and LIMS solutions. Stéphane LE CROM lecrom@biologie.ens.fr Lecture 11 Data storage and LIMS solutions Stéphane LE CROM lecrom@biologie.ens.fr Various steps of a DNA microarray experiment Experimental steps Data analysis Experimental design set up Chips on catalog

More information

0 value2. 3. Assign labels to the proteins in each interval of the ranked list

0 value2. 3. Assign labels to the proteins in each interval of the ranked list Ranked list of protein degrees in decreasing order 0 100 Proteins with few connections Proteins with many connections 1. Sort a random number between 80 and 98 (value1) 0 value1 2. Sort a random number

More information

Supplementary Figure 1.

Supplementary Figure 1. Supplementary Figure 1. (A) MicroRNA 212 enhances IS from pancreatic β-cells. INS-1 832/3 β-cells were transfected with precursors for mirnas 212, 375, or negative control oligonucleotides. 48 hrs after

More information

ALLEN Mouse Brain Atlas

ALLEN Mouse Brain Atlas TECHNICAL WHITE PAPER: QUALITY CONTROL STANDARDS FOR HIGH-THROUGHPUT RNA IN SITU HYBRIDIZATION DATA GENERATION Consistent data quality and internal reproducibility are critical concerns for high-throughput

More information

Superior TrueMAB TM monoclonal antibodies for the recognition of proteins native epitopes

Superior TrueMAB TM monoclonal antibodies for the recognition of proteins native epitopes Superior TrueMAB TM monoclonal antibodies for the recognition of proteins native epitopes Outlines Brief introduction of OriGene s mission on gene-centric product solution. TrueMAB monoclonal antibody

More information

Row Quantile Normalisation of Microarrays

Row Quantile Normalisation of Microarrays Row Quantile Normalisation of Microarrays W. B. Langdon Departments of Mathematical Sciences and Biological Sciences University of Essex, CO4 3SQ Technical Report CES-484 ISSN: 1744-8050 23 June 2008 Abstract

More information

Tutorial for Proteomics Data Submission. Katalin F. Medzihradszky Robert J. Chalkley UCSF

Tutorial for Proteomics Data Submission. Katalin F. Medzihradszky Robert J. Chalkley UCSF Tutorial for Proteomics Data Submission Katalin F. Medzihradszky Robert J. Chalkley UCSF Why Have Guidelines? Large-scale proteomics studies create huge amounts of data. It is impossible/impractical to

More information

Consistent Assay Performance Across Universal Arrays and Scanners

Consistent Assay Performance Across Universal Arrays and Scanners Technical Note: Illumina Systems and Software Consistent Assay Performance Across Universal Arrays and Scanners There are multiple Universal Array and scanner options for running Illumina DASL and GoldenGate

More information

Course on Functional Analysis. ::: Gene Set Enrichment Analysis - GSEA -

Course on Functional Analysis. ::: Gene Set Enrichment Analysis - GSEA - Course on Functional Analysis ::: Madrid, June 31st, 2007. Gonzalo Gómez, PhD. ggomez@cnio.es Bioinformatics Unit CNIO ::: Contents. 1. Introduction. 2. GSEA Software 3. Data Formats 4. Using GSEA 5. GSEA

More information

School of Nursing. Presented by Yvette Conley, PhD

School of Nursing. Presented by Yvette Conley, PhD Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression

More information

A Primer of Genome Science THIRD

A Primer of Genome Science THIRD A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:

More information

Delivering the power of the world s most successful genomics platform

Delivering the power of the world s most successful genomics platform Delivering the power of the world s most successful genomics platform NextCODE Health is bringing the full power of the world s largest and most successful genomics platform to everyday clinical care NextCODE

More information

S1 Text. Modeling deterministic single-cell microrna-p53-mdm2 network Figure 2 Figure 2

S1 Text. Modeling deterministic single-cell microrna-p53-mdm2 network Figure 2 Figure 2 S1 Text. Modeling deterministic single-cell microrna-p53-mdm2 network The schematic diagram of the microrna-p53-mdm2 oscillator is illustrated in Figure 2. The interaction scheme among the mrnas and the

More information

Focusing on results not data comprehensive data analysis for targeted next generation sequencing

Focusing on results not data comprehensive data analysis for targeted next generation sequencing Focusing on results not data comprehensive data analysis for targeted next generation sequencing Daniel Swan, Jolyon Holdstock, Angela Matchan, Richard Stark, John Shovelton, Duarte Mohla and Simon Hughes

More information

Module 1. Sequence Formats and Retrieval. Charles Steward

Module 1. Sequence Formats and Retrieval. Charles Steward The Open Door Workshop Module 1 Sequence Formats and Retrieval Charles Steward 1 Aims Acquaint you with different file formats and associated annotations. Introduce different nucleotide and protein databases.

More information

Comparing Methods for Identifying Transcription Factor Target Genes

Comparing Methods for Identifying Transcription Factor Target Genes Comparing Methods for Identifying Transcription Factor Target Genes Alena van Bömmel (R 3.3.73) Matthew Huska (R 3.3.18) Max Planck Institute for Molecular Genetics Folie 1 Transcriptional Regulation TF

More information

Graphical Modeling for Genomic Data

Graphical Modeling for Genomic Data Graphical Modeling for Genomic Data Carel F.W. Peeters cf.peeters@vumc.nl Joint work with: Wessel N. van Wieringen Mark A. van de Wiel Molecular Biostatistics Unit Dept. of Epidemiology & Biostatistics

More information

Head of College Scholars List Scheme. Summer Studentship. Report Form

Head of College Scholars List Scheme. Summer Studentship. Report Form Head of College Scholars List Scheme Summer Studentship Report Form This report should be completed by the student with his/her project supervisor. It should summarise the work undertaken during the project

More information

Web-Based Genomic Information Integration with Gene Ontology

Web-Based Genomic Information Integration with Gene Ontology Web-Based Genomic Information Integration with Gene Ontology Kai Xu 1 IMAGEN group, National ICT Australia, Sydney, Australia, kai.xu@nicta.com.au Abstract. Despite the dramatic growth of online genomic

More information

Exercise with Gene Ontology - Cytoscape - BiNGO

Exercise with Gene Ontology - Cytoscape - BiNGO Exercise with Gene Ontology - Cytoscape - BiNGO This practical has material extracted from http://www.cbs.dtu.dk/chipcourse/exercises/ex_go/goexercise11.php In this exercise we will analyze microarray

More information

TOOLS sirna and mirna. User guide

TOOLS sirna and mirna. User guide TOOLS sirna and mirna User guide Introduction RNA interference (RNAi) is a powerful tool for suppression gene expression by causing the destruction of specific mrna molecules. Small Interfering RNAs (sirnas)

More information

MASCOT Search Results Interpretation

MASCOT Search Results Interpretation The Mascot protein identification program (Matrix Science, Ltd.) uses statistical methods to assess the validity of a match. MS/MS data is not ideal. That is, there are unassignable peaks (noise) and usually

More information

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources 1 of 8 11/7/2004 11:00 AM National Center for Biotechnology Information About NCBI NCBI at a Glance A Science Primer Human Genome Resources Model Organisms Guide Outreach and Education Databases and Tools

More information

DeCyder Extended Data Analysis module Version 1.0

DeCyder Extended Data Analysis module Version 1.0 GE Healthcare DeCyder Extended Data Analysis module Version 1.0 Module for DeCyder 2D version 6.5 User Manual Contents 1 Introduction 1.1 Introduction... 7 1.2 The DeCyder EDA User Manual... 9 1.3 Getting

More information

Chapter 6. The stacking ensemble approach

Chapter 6. The stacking ensemble approach 82 This chapter proposes the stacking ensemble approach for combining different data mining classifiers to get better performance. Other combination techniques like voting, bagging etc are also described

More information

Protein Protein Interaction Networks

Protein Protein Interaction Networks Functional Pattern Mining from Genome Scale Protein Protein Interaction Networks Young-Rae Cho, Ph.D. Assistant Professor Department of Computer Science Baylor University it My Definition of Bioinformatics

More information

SUPPLEMENTAL DATA. Construct Registry title Registry number. P y = Ubc, Gal4 = 0, P x = none (Fig. 2) Ubc-CFP BBa_J176093

SUPPLEMENTAL DATA. Construct Registry title Registry number. P y = Ubc, Gal4 = 0, P x = none (Fig. 2) Ubc-CFP BBa_J176093 SUPPLEMENTAL DATA Construct Registry title Registry number P y = Ubc, Gal4 = 0, P x = none (Fig. 2) Ubc-CFP BBa_J176093 P y = Ubc, Gal4 = 0, P x = Ubc (Fig. 2) Test mir-luc sensor 1 BBa_J176142 P y = Ubc,

More information

Data, Measurements, Features

Data, Measurements, Features Data, Measurements, Features Middle East Technical University Dep. of Computer Engineering 2009 compiled by V. Atalay What do you think of when someone says Data? We might abstract the idea that data are

More information

Computational localization of promoters and transcription start sites in mammalian genomes

Computational localization of promoters and transcription start sites in mammalian genomes Computational localization of promoters and transcription start sites in mammalian genomes Thomas Down This dissertation is submitted for the degree of Doctor of Philosophy Wellcome Trust Sanger Institute

More information

micrornas Non protein coding, endogenous RNAs of 21-22nt length Evolutionarily conserved

micrornas Non protein coding, endogenous RNAs of 21-22nt length Evolutionarily conserved microrna 2 micrornas Non protein coding, endogenous RNAs of 21-22nt length Evolutionarily conserved Regulate gene expression by binding complementary regions at 3 regions of target mrnas Act as negative

More information

Summary of Discussion on Non-clinical Pharmacology Studies on Anticancer Drugs

Summary of Discussion on Non-clinical Pharmacology Studies on Anticancer Drugs Provisional Translation (as of January 27, 2014)* November 15, 2013 Pharmaceuticals and Bio-products Subcommittees, Science Board Summary of Discussion on Non-clinical Pharmacology Studies on Anticancer

More information

Micro RNAs: potentielle Biomarker für das. Blutspenderscreening

Micro RNAs: potentielle Biomarker für das. Blutspenderscreening Micro RNAs: potentielle Biomarker für das Blutspenderscreening micrornas - Background Types of RNA -Coding: messenger RNA (mrna) -Non-coding (examples): Ribosomal RNA (rrna) Transfer RNA (trna) Small nuclear

More information

Multivariate Analysis of Ecological Data

Multivariate Analysis of Ecological Data Multivariate Analysis of Ecological Data MICHAEL GREENACRE Professor of Statistics at the Pompeu Fabra University in Barcelona, Spain RAUL PRIMICERIO Associate Professor of Ecology, Evolutionary Biology

More information

Ingenuity Pathway Analysis (IPA )

Ingenuity Pathway Analysis (IPA ) ProductProfile Ingenuity Pathway Analysis (IPA ) For the analysis and interpretation of omics data IPA is a web-based software application for the analysis, integration, and interpretation of data derived

More information

SUPPLEMENTARY DATA 1

SUPPLEMENTARY DATA 1 SUPPLEMENTARY DATA 1 Supplementary Figure S1. Overexpression of untagged Ago2 inhibits the nuclear transport of the dotted foci of GFP-signal of myc-gfp-tnrc6a-nes-mut. (A C) HeLa cells expressing myc-gfp-tnrc6a-nes-mut

More information

Work Package 13.5: Authors: Paul Flicek and Ilkka Lappalainen. 1. Introduction

Work Package 13.5: Authors: Paul Flicek and Ilkka Lappalainen. 1. Introduction Work Package 13.5: Report summarising the technical feasibility of the European Genotype Archive to collect, store, and use genotype data stored in European biobanks in a manner that complies with all

More information

Supplementary Figure 1: Quality Assessment of Mouse Arrays. Supplementary Figure 2: Quality Assessment of Rat Arrays

Supplementary Figure 1: Quality Assessment of Mouse Arrays. Supplementary Figure 2: Quality Assessment of Rat Arrays Supplementary Figure 1: Quality Assessment of Mouse Arrays The mouse microarray data were subjected to an extensive quality-control procedure prior to conducting downstream analyses. We assessed the spread

More information

False Discovery Rates

False Discovery Rates False Discovery Rates John D. Storey Princeton University, Princeton, USA January 2010 Multiple Hypothesis Testing In hypothesis testing, statistical significance is typically based on calculations involving

More information

MeDIP-chip service report

MeDIP-chip service report MeDIP-chip service report Wednesday, 20 August, 2008 Sample source: Cells from University of *** Customer: ****** Organization: University of *** Contents of this service report General information and

More information

Introduction To Real Time Quantitative PCR (qpcr)

Introduction To Real Time Quantitative PCR (qpcr) Introduction To Real Time Quantitative PCR (qpcr) SABiosciences, A QIAGEN Company www.sabiosciences.com The Seminar Topics The advantages of qpcr versus conventional PCR Work flow & applications Factors

More information

GenomeStudio Data Analysis Software

GenomeStudio Data Analysis Software GenomeStudio Analysis Software Illumina has created a comprehensive suite of data analysis tools to support a wide range of genetic analysis assays. This single software package provides data visualization

More information

NATIONAL GENETICS REFERENCE LABORATORY (Manchester)

NATIONAL GENETICS REFERENCE LABORATORY (Manchester) NATIONAL GENETICS REFERENCE LABORATORY (Manchester) MLPA analysis spreadsheets User Guide (updated October 2006) INTRODUCTION These spreadsheets are designed to assist with MLPA analysis using the kits

More information

Quality Assessment of Exon and Gene Arrays

Quality Assessment of Exon and Gene Arrays Quality Assessment of Exon and Gene Arrays I. Introduction In this white paper we describe some quality assessment procedures that are computed from CEL files from Whole Transcript (WT) based arrays such

More information

From Reads to Differentially Expressed Genes. The statistics of differential gene expression analysis using RNA-seq data

From Reads to Differentially Expressed Genes. The statistics of differential gene expression analysis using RNA-seq data From Reads to Differentially Expressed Genes The statistics of differential gene expression analysis using RNA-seq data experimental design data collection modeling statistical testing biological heterogeneity

More information

mirnaselect pep-mir Cloning and Expression Vector

mirnaselect pep-mir Cloning and Expression Vector Product Data Sheet mirnaselect pep-mir Cloning and Expression Vector CATALOG NUMBER: MIR-EXP-C STORAGE: -80ºC QUANTITY: 2 vectors; each contains 100 µl of bacterial glycerol stock Components 1. mirnaselect

More information

Each figure of a manuscript should be submitted as a single file.

Each figure of a manuscript should be submitted as a single file. Introduction This page provides general information for authors creating figures to maximize the quality of those illustrations and to prepare artwork for submission to the Brazilian Journal of Medical

More information

Data Integration. Lectures 16 & 17. ECS289A, WQ03, Filkov

Data Integration. Lectures 16 & 17. ECS289A, WQ03, Filkov Data Integration Lectures 16 & 17 Lectures Outline Goals for Data Integration Homogeneous data integration time series data (Filkov et al. 2002) Heterogeneous data integration microarray + sequence microarray

More information

RT 2 Profiler PCR Array: Web-Based Data Analysis Tutorial

RT 2 Profiler PCR Array: Web-Based Data Analysis Tutorial RT 2 Profiler PCR Array: Web-Based Data Analysis Tutorial Samuel J. Rulli, Jr., Ph.D. qpcr-applications Scientist Samuel.Rulli@QIAGEN.com Pathway Focused Research from Sample Prep to Data Analysis! -2-

More information

Validation and Replication

Validation and Replication Validation and Replication Overview Definitions of validation and replication Difficulties and limitations Working examples from our group and others Why? False positive results still occur. even after

More information

In developmental genomic regulatory interactions among genes, encoding transcription factors

In developmental genomic regulatory interactions among genes, encoding transcription factors JOURNAL OF COMPUTATIONAL BIOLOGY Volume 20, Number 6, 2013 # Mary Ann Liebert, Inc. Pp. 419 423 DOI: 10.1089/cmb.2012.0297 Research Articles A New Software Package for Predictive Gene Regulatory Network

More information

NOVEL GENOME-SCALE CORRELATION BETWEEN DNA REPLICATION AND RNA TRANSCRIPTION DURING THE CELL CYCLE IN YEAST IS PREDICTED BY DATA-DRIVEN MODELS

NOVEL GENOME-SCALE CORRELATION BETWEEN DNA REPLICATION AND RNA TRANSCRIPTION DURING THE CELL CYCLE IN YEAST IS PREDICTED BY DATA-DRIVEN MODELS NOVEL GENOME-SCALE CORRELATION BETWEEN DNA REPLICATION AND RNA TRANSCRIPTION DURING THE CELL CYCLE IN YEAST IS PREDICTED BY DATA-DRIVEN MODELS Orly Alter (a) *, Gene H. Golub (b), Patrick O. Brown (c)

More information

Two-Way ANOVA tests. I. Definition and Applications...2. II. Two-Way ANOVA prerequisites...2. III. How to use the Two-Way ANOVA tool?...

Two-Way ANOVA tests. I. Definition and Applications...2. II. Two-Way ANOVA prerequisites...2. III. How to use the Two-Way ANOVA tool?... Two-Way ANOVA tests Contents at a glance I. Definition and Applications...2 II. Two-Way ANOVA prerequisites...2 III. How to use the Two-Way ANOVA tool?...3 A. Parametric test, assume variances equal....4

More information

InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis

InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis InSyBio BioNets: Utmost efficiency in gene expression data and biological networks analysis WHITE PAPER By InSyBio Ltd Konstantinos Theofilatos Bioinformatician, PhD InSyBio Technical Sales Manager August

More information

How To Cluster Of Complex Systems

How To Cluster Of Complex Systems Entropy based Graph Clustering: Application to Biological and Social Networks Edward C Kenley Young-Rae Cho Department of Computer Science Baylor University Complex Systems Definition Dynamically evolving

More information

Simplifying Data Interpretation with Nexus Copy Number

Simplifying Data Interpretation with Nexus Copy Number Simplifying Data Interpretation with Nexus Copy Number A WHITE PAPER FROM BIODISCOVERY, INC. Rapid technological advancements, such as high-density acgh and SNP arrays as well as next-generation sequencing

More information

Changes to UK NEQAS Leucocyte Immunophenotyping Chimerism Performance Monitoring Systems From April 2014. Uncontrolled Copy

Changes to UK NEQAS Leucocyte Immunophenotyping Chimerism Performance Monitoring Systems From April 2014. Uncontrolled Copy Changes to UK NEQAS Leucocyte Immunophenotyping Chimerism Performance Monitoring Systems From April 2014 Contents 1. The need for change 2. Current systems 3. Proposed z-score system 4. Comparison of z-score

More information

Comparative genomic hybridization Because arrays are more than just a tool for expression analysis

Comparative genomic hybridization Because arrays are more than just a tool for expression analysis Microarray Data Analysis Workshop MedVetNet Workshop, DTU 2008 Comparative genomic hybridization Because arrays are more than just a tool for expression analysis Carsten Friis ( with several slides from

More information

DIPARTIMENTO di SCIENZE BIOMEDICHE e BIOTECNOLOGICHE

DIPARTIMENTO di SCIENZE BIOMEDICHE e BIOTECNOLOGICHE July 15, 2015 Science editor of World Journal of Gastroenterology Jing Yu Email: j.yu@wjgnet.com Dear Prof. Jing Yu, please find enclosed a revised version of our paper Non-coding landscapes of Colorectal

More information

Genetomic Promototypes

Genetomic Promototypes Genetomic Promototypes Mirkó Palla and Dana Pe er Department of Mechanical Engineering Clarkson University Potsdam, New York and Department of Genetics Harvard Medical School 77 Avenue Louis Pasteur Boston,

More information

Next Generation Sequencing: Adjusting to Big Data. Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa 29.10.2013

Next Generation Sequencing: Adjusting to Big Data. Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa 29.10.2013 Next Generation Sequencing: Adjusting to Big Data Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa 29.10.2013 Outline Human Genome Project Next-Generation Sequencing Personalized Medicine

More information

An Introduction to Genomics and SAS Scientific Discovery Solutions

An Introduction to Genomics and SAS Scientific Discovery Solutions An Introduction to Genomics and SAS Scientific Discovery Solutions Dr Karen M Miller Product Manager Bioinformatics SAS EMEA 16.06.03 Copyright 2003, SAS Institute Inc. All rights reserved. 1 Overview!

More information

A Brief Study of the Nurse Scheduling Problem (NSP)

A Brief Study of the Nurse Scheduling Problem (NSP) A Brief Study of the Nurse Scheduling Problem (NSP) Lizzy Augustine, Morgan Faer, Andreas Kavountzis, Reema Patel Submitted Tuesday December 15, 2009 0. Introduction and Background Our interest in the

More information