The experiment will be conducted in two phases; find the optimal statistical detection method and apply to collected data.

Size: px
Start display at page:

Download "The experiment will be conducted in two phases; find the optimal statistical detection method and apply to collected data."

Transcription

1 SPECIFIC AIMS Plasmodium falciparum is the most lethal causative agent of malaria, affecting over 500 million annually with 1 to 3 million of those cases resulting in death 1. Widespread drug use has been the favored control mechanism but has lead to the evolution of drug and multi-drug resistance. Candidate targets for new drugs and vaccines are the proteins interacting between the host and pathogen. Examples include var genes that encode a family of antigens called pfemp1 2 and host Band3 binding merozoite surface protein 1 3 which have been identified on a case by case basis. This narrow approach to detecting such interactions is time consuming. An alternative, genome wide approach is to use the natural variation in the host and pathogen genes encoding the interacting proteins and investigate correlations between the observed genetic variation and parasitemia using methods similar to the epistatic association methods. The hypothesis behind the proposal is polymorphisms in genes encoding the interacting proteins can affect severity of infection, be identified using a statistical model, and suggest targets for vaccinations and new treatment. The hypothesis is based on the success of association studies to detect associations with HIV-1 viral load 4, type 2 diabetes 5-8, Crohn s disease, rheumatoid arthritis, type 1 diabetes, bipolar disorder, and coronary artery disease 8 when sampling is sufficient, and the emergence of computationally tractable epistatic detection methods. The experiment will be conducted in two phases; find the optimal statistical detection method and apply to collected data. 1. Construct and test methods to detect interacting genes. Human, pathogen, and resulting infection phenotypes will be simulated to provide data to test the interacting gene detection methods. Methods for detecting epistatic interactions will be modified to utilize the host and pathogen genotypes, and tested on the simulated data. 2. Detect interacting genes in P. falciparum and H. sapiens. Africans infected with P. falciparum will be enrolled, host and pathogen genotyped, and parasitemia measured. Additional information will be collected about the subjects for cofactor analysis. The resulting data will be evaluated using the model from aim 1. The specific aims will help in designing a useful tool to detect host pathogen interactions, gain insight to the host pathogen interactions seen in malaria infections, and extend to other hostpathogen combinations including but not limited to Human Immunodeficiency Virus (HIV), Hepatitis, and Mycobacterium tuberculosis. BACKGROUND AND SIGNIFICANCE P. falciparum as a human pathogen. P. falciparum has a multistage, two host life cycle; humans and mosquitoes (Figure A). Severity of the infection or parasitemia is measured as the percentage of infected red blood cells and the number of parasites per µl. Treatment of P. falciparum infection include a wide range of drugs; chloroquine, quinine, Fansidar, and antifolates to name a few. However, the evolution of drug resistance and multiple drug Page 1 of 11

2 resistance, coupled with no available vaccination 9 has made wide scale treatment a global health issue. I believe the proposal has relevance with respect to development of vaccinations and new treatments. Identifying genes that are interacting between the host and the pathogen and validation of the interactions using yeast two hybrid assays can lead to identifying means of disrupting the interactions using small molecule or pharmacogenomic approaches, and subsequently inhibit the pathogen. Figure A Plasmodium falciparum life cycle. A) Mosquitoes feed on a human blood and inject sporozoites into the human. The parasite migrates to the liver, goes through several phases of development, and emerges as a merozoite. Merozoites invade red blood cells, multiply as trophozoites, or develop into gametocytes. B) Mosquitoes biting an infected human take up gametocytes, which develop into male and female gametes. The gametes fuse to form a zygote in the mosquito gut, develop into an ookinete. The ookinete crosses the gut wall, turns into a sporozoite filled oocyst, bursts, and sporozoites migrate to the salivary glands 10. P. falciparum and human interactions. The intricate relationship of the parasite and human host, and ability of the parasite to escape immune responses 11, 12 suggest there is an evolution of interactions. These interactions have been identified by investigation of increased polymorphism in the pathogen, however this does not give indication to the host target. The experiment proposed would utilize the variability in both the host and pathogen to identify the interacting sites genome wide (Specific Aim 2). Genetics of P. falciparum. The 14 P. falciparum chromosomes have been fully sequenced in and genome wide microsatellite and single nucleotide polymorphisms (SNPs) identified 9, 16, 17. This variation has been used to quantify recombination rates 16, 18, reject demographic models[unpublished results], identify genes subject to selection 19, 20, and genes associated with drug resistance The genome is unique in base pair composition; 80% AT 13-15, and the majority (~65%) of the SNPs are nonsynonymous 9. The surplus of nssnps may be due to the lack of possible synonymous sites given the constraints caused by the codon bias 24, 25. Since the majority of the polymorphism are coding, the effects can be observed as variability in phenotypes such as drug resistance and parasitemia. Utilizing natural genomic variation. Genetic variation has been intensely investigated and mapped in humans, with the start of the HapMap 26 project to copy number variation 27. A common use of this information is association studies, where correlations between the genotypes and phenotypes are tested. Several large scale association studies have been reported, detecting susceptibility loci for a wide range of diseases; HIV, type 1 and 2 diabetes, coronary artery disease among others. An extension of these methods would be to use both the P. falciparum Page 2 of 11

3 genome and the human genome to detect variances contributing to infection; using natural variation in a host and pathogen to detect interacting genes associated with severity of infection, and suggest targets for vaccinations and new treatment approaches. Epistatic association methods. Association methods test for correlations between genotypes and phenotypes using a number of methods: linear regression, variants of the χ 2 test like genomic control 28 and Armitage χ 2 29, and logistic regression. Multilocus and epistatic models have been developed to detect genes interacting epistatically. The basic method of testing these effects is an exhaustive 2 locus model which has been shown to have maximum power of detection 30, 31 yet remains computationally intensive because there are L choose 2 combinations, where L is the number of markers sampled. In the case of a typical human genome study with 500,000 markers, this is approximately 1.25 x 11. This also imposes a multiple testing problem where several independent and correlated hypotheses are being tested. Two main approaches to addressing the combinatorial and multiple testing issues are 2 stage designs 31 and Bayesian methods 32, 33. The two-stage approach tests for marginal effects with a liberal effect size cut off in the first stage. In the second stage epistatic interactions are tested either a) within the stage one marker set, or b) stage one marker set x all markers 31. These significantly reduce the number of tests however both (a) or one (b) loci must have a marginal effect to be tested for epistatic effects; this case is not always biologically true. The Bayesian methods use Markov chain Monte Carlo (MCMC) to address the large parameter space. Using the Metropolis-Hastings algorithm parameters from a proposal distribution are proposed and updated based on an acceptance function. After a sufficient number of iterations, parameter estimates approach the true values. A natural extension of these methods is to incorporate a second genome and evaluate epistasis between genomes to detect interacting genes (Specific Aim 1). PRELIMINARY STUDIES <NONE> RESEARCH DESIGN AND METHODS Specific Aim #1. Construct and test methods to detect interacting genes. Rationale. Association studies have been successful at identifying susceptibility loci for several diseases, and epistatic methods are becoming more powerful. The next step is to consider epistasis between genomes as in the case of infecting pathogens and hosts. Using natural variation in the same way association studies do, interacting sites that contribute to severity of Page 3 of 11

4 infection can be identified. In this aim, a tool will be developed and tested to detect these interactions. Experimental design. Association methods are greatly varied in methods and ability to detect effects and are tested on simulated data. Simulating data provides a controlled and cost effective method to evaluate method performance, however it is difficult to simulate interactions when the under lying model is unknown. To circumvent the issue, all unique models will be simulated and used to evaluate model performance. Next, two main statistical methods: goodness of fit, and ANOVA will be used and evaluated in all combinations of host and pathogen genotype for ability to detect the interaction. A problem with evaluating all combinations of host and pathogen genotypes is many independent hypotheses are being tested, making the over all probability of rejecting the null inflated. Applying multiple testing corrections like the Bonferroni correction tend to be conservative, and reduce the ability to detect interactions. The main solution is to decrease the number of tests made. In this aim, two methods of reducing the number of tests; limiting the number of loci tested in the interactive method, and Bayesian methods. Li and Riech 34 addressed this issue of unknown biological epistatic model by enumerating and classifying all two-locus models in bialleic, diploid, case control framework. Similarly, in a bialleic, diploid- haploid, 2 means (0 or 1) phenotype system, there are 64 possible interaction models. Of these models, half are equivalent due to symmetry, 14 of the 32 models are unique, and 10 describe interactive effects (Table 1). Simulating phenotypes based on the 14 models and an additive model will provide an extensive data set of all possible interactive models. M1* M2 M3 M4 M5 M6 M7 M8* M9 M10 M11 M12 M13 M14 M15 M16 M17* M18 M19 M20 M21 M22 M23 M24 M25 M26 M27 M28* M29 M30 M31 M32 Table 1. Two-loci models. Rows index host genotype, columns index pathogen genotype, and entries correspond to phenotype mean. Shown are 32 of the phenotype models, the remaining models can be obtained by substituting 0 with 1 and vice versa. Of these 32 models several models are the same: M2, M3, M6 and M7; M4 and M5; M8 and Page 4 of 11

5 M22; M9, M14, M18 and M21; M10, M13, M19 and M20; M11 and M16; M12 and M15; M23 and M24; M25, M26, and M32; M29 and M31, yielding 14 unique models (designated in red). 4 models (denoted by *) lack interactive effects. Two methods for detecting interactions will be considered: likelihood goodness of fit comparison between full interaction (Table 2), host effect (Table 3), pathogen effect (Table 4), and null modelsp; and the two-way analysis of variance (ANOVA). While these methods are very similar, the emphasis of goodness of fit is on the linear regression fit for 4 models and assumes a normally distributed phenotype while the ANOVA is used to test the means based on the variance components. The number of tests for each method is L Host x L Pathogen where L is the number of loci. To address this high number of tests issue, two approaches to reducing the number of tests will be used. First, a two-stage design will be implemented, requiring host main effects to meet a minimum significance in order to be considered for the full model test. Since number of pathogen sites is relatively small (~5,000), all pathogen loci will be used in the second stage. Second, a Bayesian approach will be used to search the entire sample genotype combination space and refine the model to only the genes interacting. Host Genotype at locus A Pathogen Genotype at locus B B b mean AA µ AAB µ AAb µ AA. p 2 A p B p 2 A p b Aa µ AaB µ Aab µ Aa. 2p A p a p B 2p A p a p b aa µ aab µ aab µ aa. p 2 a p B p 2 a p b mean µ..b µ..b Table 2. Full interactive model. Genotypic means and frequencies. Host Geneotype at locus A AA Aa aa 2 p A 2p A p a 2 p a µ AA µ Aa µ aa Table 3. Host effects model. Genotypic means and frequencies. Pathogen Genotype at locus B B b P B p b µ B µ B Table 4. Pathogen effects model. Genotypic means and frequencies. A. Simulation of interacting data. Methods. A locus in a diploid host population, and a locus in a haploid pathogen population will be simulated hosts will be sampled without replacement, for each host a pathogen will be sampled with replacement to mimic infection. 15 phenotypes will be modeled according to the 14 unique models (Table 1) and an additive model distributed normally with variance = replicates will be generated for each phenotypic model. Additionally, data sets with realistic marker numbers, recombination, and linkage disequilibrium will be simulated and phenotypes assigned using a subset of the models in Table 1. B. Method design. Page 5 of 11

6 Methods. Two approaches will be tested: i) Fit full interactive (Table 2), host effect (Table 3), pathogen effect (Table 4), and null models and evaluate goodness of fit. For each loci combination the likelihoods of the all models will be evaluated using equation 1.! Equation 1 Sample log-likelihood 31. l i = "ln# " (x i " µ i )2 2# 2 l i is host pathogen pair i, x i is the phenotypic value, µ i is the mean of the interaction class (µ AAB, µ AaB, etc), and σ is the maximum likelihood estimate of residual standard error. Evans et al 31 comment that parameters are its sample values when phenotypes are normally distributed. Goodness of fit p values will be obtained by assuming G (equation 2) is approximated by a χ 2 distribution with degrees of freedom equal to the difference in the number of parameters estimated. Equation 2 Goodness of fit test statistic G = 2(lnL alternative " lnl null ) ii) Two-way ANOVA using genotypes as factors. Both methods will be evaluated using a full 2 dimensional scan to evaluate maximum power and use a! Bonferroni multiple testing correction based on 900,000 host markers and 5000 pathogen markers. The two-stage and Bayesian approaches will be applied to each method to reduce the number of tests and better search the parameter space. i) Two stage design. Main host effects will be tested and a liberal cut off ( p <.1, p < 0.05, p < 0.01) will be applied, and the reduced set of loci will then be evaluated in the two dimensional method as described above. ii) A Bayesian approach utilizing the Monte Carlo Markov chain (MCMC) will be constructed. The parameter space of all genotype effects will be searched using the Metropolis-Hastings algorithm 35 and refines the models to only the genes affecting the phenotype. C. Model evaluation. Methods. The models will be evaluated for power to detect the interactive sites and the false positive rates. The power of detection is the number of times the method detects the Page 6 of 11

7 interactive effect at a significance level.05/l Host x L Pathogen out of 1000 replicates. The false positive rate will calculated as the number of interactive effects detected when no interactions are modeled, or in data sets with multiple markers and linkage disequilibrium. Additionally, the time to complete the analysis will be measured, since time is often sacrificed for good approximations. Specific Aim #2: Detect interacting genes in P. falciparum and H. sapiens. Rationale. Once an effective host pathogen model of has been developed, it can be applied to data to detect interacting genes. Experimental design. Study enrollment will include Africans from Kenya to minimize the effect of population structure and phenotypic variability due to genetic backgrounds. Information about the patients for cofactor analysis will be recorded. Blood samples will be collected from the enrolled subjects, the extent of parasitemia measured, and both P. falciparum and human DNA extracted. The samples will be genotyped using microarray technology, which allows high throughput genotyping based on sample DNA hybridizing to probes on the array. The Affymetrix Genome-Wide Human SNP Array 6.0 will be used since it has a higher coverage in African populations compared to the Illumina alternatives 36. A. Sample and phenotype collection. Methods African subjects admitted to hospitals in Kenya for treatment malaria infections will be enrolled in the study via informed consent. State, gender, age, language group, previous infection status (yes/no), and days since onset of symptoms will be recorded for each sample for future population structure and cofactor investigations. Finger pricked blood will be collected and tested for infection using the stevor gene PCR 37 assay. Confirming infection, 1 unit of venus blood will be collected, portioned into 2 aliquots, one to be used for phenotyping, and the other for DNA extraction. Parasitemia will be evaluated within an hour of sampling using the protocol outlined by the Medical Chemical Corporation 38 using Giemsa-stained blood films, reporting the percentage of infected red blood cells per 100 red blood cells counted and the number of parasites per µl of blood. Record of treatment type will be taken for each individual, and follow up finger prick blood samples taken 2 and 6 days post treatment. Parasitemia will be measured using the same protocol. B. Sample DNA extraction and genotyping. Methods. 30 ml of the genotyping aliquot will be centrifuged in the ACCUSPIN System-HISTOPAQUE (Sigma-Aldrich, St. Louis, MO) to separate red and white blood cells. Red blood cells will be collected and used for the extraction of P. falciparum DNA using the method described by Jeffares, DC et al 17. Page 7 of 11

8 The white blood cell portion of the HISTOPAQUE tube and human DNA extracted using the DNAQuik TM Large Scale Blood kit (BioServe, Laurel, MD) and recommended protocol. P. falciparum genome variation will be assayed using (pending release and availability) the Affymetrix P. falciparum genotyping array. Alternatively, the Affymetrix Targeted genotyping array (Affymetrix, Santa Clara, CA) will be customized based on annotated SNPs by Mu, J et al 9 and Jeffares, DC et al 17, and used for the genotyping of P. falciparum. Human DNA will be processed for genotyping using the Affymetrix Genome-Wide Human SNP Nsp/Sty Assay Kit 5.0/6.0 and hybridized to the Affymetrix Genome-Wide SNP Array 6.0. C. Detection of host pathogen interacting genes. Methods. The best performing model from Specific Aim #1 will be applied to the collected data set, using parasitemia as the phenotype. The number of days between symptom onset and blood collection, age, gender, previous infection status, and sampling location can be used as cofactors in the model if they appear to have an effect on the phenotype. To address multiple testing issues, to reduce the number of tests, only coding SNPs will be used in the analysis as it is unlikely that noncoding SNPs are interacting and altering phenotype. Alternatively, rates of parasite clearance can be used based on the follow up data or resistance types 38 scored based on the follow up parasitemia measures can be used as the phenotype, using treatment as a cofactor. Expected Outcomes. The results of this experiment will include a novel statistical tool to detect host pathogen interactions that can be applied to a wide range of host pathogen and host parasite combinations. Additionally, known and novel candidate interactions in P. falciparum and humans will be detected. Pitfalls and limitations. Host pathogen interactions may be detectable in simulated data however the biological effects may not be strong enough to be detected. In this event, the collected data will be used for genome wide associations between the host and phenotype, and parasite and phenotype separately. Additionally the data set will be used to study localized coevolution of host and pathogen; investigating genetic combinations that are invariant in the host pathogen combination but otherwise variable. Simulations are an artificial representation of what is inferred to happen biologically. To obtain an improved testing data set, known P. falciparum and human interacting genes can be investigated in vitro and growth rates or parasitemia quantified in cell cultures with known Page 8 of 11

9 genetic background. Alternatively, HIV and host genotypes with known interactions will be compiled and used for method performance testing. The methods proposed assume a normal distribution of phenotypes, however this may not be the case biologically. Method 1 can be modified to use other distributions that may better represent the data. Alternatively, the ANOVA method will be robust to deviations from the assumed distributions. Page 9 of 11

10 References 1. Snow, R. W., Guerra, C. A., Noor, A. M., Myint, H. Y. & Hay, S. I. The global distribution of clinical episodes of Plasmodium falciparum malaria. Nature 434, (2005). 2. Su, X. Z. et al. The large diverse gene family var encodes proteins involved in cytoadherence and antigenic variation of Plasmodium falciparum-infected erythrocytes. Cell 82, (1995). 3. Goel, V. K. et al. Band 3 is a host receptor binding merozoite surface protein 1 during the Plasmodium falciparum invasion of erythrocytes. Proc Natl Acad Sci U S A 100, (2003). 4. Fellay, J. et al. A whole-genome association study of major determinants for host control of HIV-1. Science 317, (2007). 5. Saxena, R. et al. Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science 316, (2007). 6. Zeggini, E. et al. Replication of genome-wide association signals in UK samples reveals risk loci for type 2 diabetes. Science 316, (2007). 7. Scott, L. J. et al. A genome-wide association study of type 2 diabetes in Finns detects multiple susceptibility variants. Science 316, (2007). 8. Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls. Nature 447, (2007). 9. Mu, J. et al. Genome-wide variation and identification of vaccine targets in the Plasmodium falciparum genome. Nat Genet 39, (2007). 10. Wirth, D. F. Biological revelations. Nature 419, (2002). 11. Greenbaum, D. C. et al. A role for the protease falcipain 1 in host cell invasion by the human malaria parasite. Science 298, (2002). 12. Duraisingh, M. T. et al. Heterochromatin silencing and locus repositioning linked to regulation of virulence genes in Plasmodium falciparum. Cell 121, (2005). 13. Hyman, R. W. et al. Sequence of Plasmodium falciparum chromosome 12. Nature 419, (2002). 14. Hall, N. et al. Sequence of Plasmodium falciparum chromosomes 1, 3-9 and 13. Nature 419, (2002). 15. Gardner, M. J. et al. Sequence of Plasmodium falciparum chromosomes 2, 10, 11 and 14. Nature 419, (2002). 16. Su, X. et al. A genetic map and recombination parameters of the human malaria parasite Plasmodium falciparum. Science 286, (1999). 17. Jeffares, D. C. et al. Genome variation and evolution of the malaria parasite Plasmodium falciparum. Nat Genet 39, (2007). 18. Mu, J. et al. Recombination hotspots and population structure in Plasmodium falciparum. PLoS Biol 3, e335 (2005). 19. Escalante, A. A., Lal, A. A. & Ayala, F. J. Genetic polymorphism and natural selection in the malaria parasite Plasmodium falciparum. Genetics 149, (1998). 20. Polley, S. D. & Conway, D. J. Strong diversifying selection on domains of the Plasmodium falciparum apical membrane antigen 1 gene. Genetics 158, (2001). Page 10 of 11

11 21. Wilson, C. M. et al. Amplification of a gene related to mammalian mdr genes in drugresistant Plasmodium falciparum. Science 244, (1989). 22. Foote, S. J. et al. Several alleles of the multidrug-resistance gene are closely linked to chloroquine resistance in Plasmodium falciparum. Nature 345, (1990). 23. Sidhu, A. B., Verdier-Pinard, D. & Fidock, D. A. Chloroquine resistance in Plasmodium falciparum malaria parasites conferred by pfcrt mutations. Science 298, (2002). 24. Hey, J. Parasite populations: the puzzle of Plasmodium. Curr Biol 9, R565-7 (1999). 25. Mu, J. et al. Chromosome-wide SNPs reveal an ancient origin for Plasmodium falciparum. Nature 418, (2002). 26. A haplotype map of the human genome. Nature 437, (2005). 27. Redon, R. et al. Global variation in copy number in the human genome. Nature 444, (2006). 28. Bacanu, S. A., Devlin, B. & Roeder, K. The power of genomic control. Am J Hum Genet 66, (2000). 29. Devlin, B. & Roeder, K. Genomic control for association studies. Biometrics 55, (1999). 30. Marchini, J., Donnelly, P. & Cardon, L. R. Genome-wide strategies for detecting multiple loci that influence complex diseases. Nat Genet 37, (2005). 31. Evans, D. M., Marchini, J., Morris, A. P. & Cardon, L. R. Two-stage two-locus models in genome-wide association. PLoS Genet 2, e157 (2006). 32. Albrechtsen, A. et al. A Bayesian multilocus association method: allowing for higherorder interaction in association studies. Genetics 176, (2007). 33. Zhang, Y. & Liu, J. S. Bayesian inference of epistatic interactions in case-control studies. Nat Genet 39, (2007). 34. Li, W. & Reich, J. A complete enumeration and classification of two-locus disease models. Hum Hered 50, (2000). 35. Hastings, W. K. Monte Carlo Sampling Methods Using Markov CHains and Their Applications. Biometrika 57, (1970). 36. Barrett, J. C. & Cardon, L. R. Evaluating coverage of genome-wide association studies. Nat Genet 38, (2006). 37. Oyedeji, S. I. et al. Comparison of PCR-based detection of Plasmodium falciparum infections based on single and multicopy genes. Malar J 6, 112 (2007). 38. Corporation, T. M. C. Determination of Parasitemia Protocol. Page 11 of 11

Antigenic variation in Plasmodium falciparum : Erythrocyte invasion and immune escape mechanisms

Antigenic variation in Plasmodium falciparum : Erythrocyte invasion and immune escape mechanisms Antigenic variation in Plasmodium falciparum : Erythrocyte invasion and immune escape mechanisms Introduction Why does immunity to malaria take so long to develop? The parasite s survival depends on its

More information

Globally, about 9.7% of cancers in men are prostate cancers, and the risk of developing the

Globally, about 9.7% of cancers in men are prostate cancers, and the risk of developing the Chapter 5 Analysis of Prostate Cancer Association Study Data 5.1 Risk factors for Prostate Cancer Globally, about 9.7% of cancers in men are prostate cancers, and the risk of developing the disease has

More information

Core Facility Genomics

Core Facility Genomics Core Facility Genomics versatile genome or transcriptome analyses based on quantifiable highthroughput data ascertainment 1 Topics Collaboration with Harald Binder and Clemens Kreutz Project: Microarray

More information

Genomes and SNPs in Malaria and Sickle Cell Anemia

Genomes and SNPs in Malaria and Sickle Cell Anemia Genomes and SNPs in Malaria and Sickle Cell Anemia Introduction to Genome Browsing with Ensembl Ensembl The vast amount of information in biological databases today demands a way of organising and accessing

More information

Hierarchical Bayesian Modeling of the HIV Response to Therapy

Hierarchical Bayesian Modeling of the HIV Response to Therapy Hierarchical Bayesian Modeling of the HIV Response to Therapy Shane T. Jensen Department of Statistics, The Wharton School, University of Pennsylvania March 23, 2010 Joint Work with Alex Braunstein and

More information

Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs)

Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs) Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs) Single nucleotide polymorphisms or SNPs (pronounced "snips") are DNA sequence variations that occur

More information

Factors for success in big data science

Factors for success in big data science Factors for success in big data science Damjan Vukcevic Data Science Murdoch Childrens Research Institute 16 October 2014 Big Data Reading Group (Department of Mathematics & Statistics, University of Melbourne)

More information

SNPbrowser Software v3.5

SNPbrowser Software v3.5 Product Bulletin SNP Genotyping SNPbrowser Software v3.5 A Free Software Tool for the Knowledge-Driven Selection of SNP Genotyping Assays Easily visualize SNPs integrated with a physical map, linkage disequilibrium

More information

A Primer of Genome Science THIRD

A Primer of Genome Science THIRD A Primer of Genome Science THIRD EDITION GREG GIBSON-SPENCER V. MUSE North Carolina State University Sinauer Associates, Inc. Publishers Sunderland, Massachusetts USA Contents Preface xi 1 Genome Projects:

More information

Genetics of Rheumatoid Arthritis Markey Lecture Series

Genetics of Rheumatoid Arthritis Markey Lecture Series Genetics of Rheumatoid Arthritis Markey Lecture Series Al Kim akim@dom.wustl.edu 2012.09.06 Overview of Rheumatoid Arthritis Rheumatoid Arthritis (RA) Autoimmune disease primarily targeting the synovium

More information

Deterministic computer simulations were performed to evaluate the effect of maternallytransmitted

Deterministic computer simulations were performed to evaluate the effect of maternallytransmitted Supporting Information 3. Host-parasite simulations Deterministic computer simulations were performed to evaluate the effect of maternallytransmitted parasites on the evolution of sex. Briefly, the simulations

More information

Combining Data from Different Genotyping Platforms. Gonçalo Abecasis Center for Statistical Genetics University of Michigan

Combining Data from Different Genotyping Platforms. Gonçalo Abecasis Center for Statistical Genetics University of Michigan Combining Data from Different Genotyping Platforms Gonçalo Abecasis Center for Statistical Genetics University of Michigan The Challenge Detecting small effects requires very large sample sizes Combined

More information

Forensic DNA Testing Terminology

Forensic DNA Testing Terminology Forensic DNA Testing Terminology ABI 310 Genetic Analyzer a capillary electrophoresis instrument used by forensic DNA laboratories to separate short tandem repeat (STR) loci on the basis of their size.

More information

Genetics Lecture Notes 7.03 2005. Lectures 1 2

Genetics Lecture Notes 7.03 2005. Lectures 1 2 Genetics Lecture Notes 7.03 2005 Lectures 1 2 Lecture 1 We will begin this course with the question: What is a gene? This question will take us four lectures to answer because there are actually several

More information

A Genetic Analysis of Rheumatoid Arthritis

A Genetic Analysis of Rheumatoid Arthritis A Genetic Analysis of Rheumatoid Arthritis Introduction to Rheumatoid Arthritis: Classification and Diagnosis Rheumatoid arthritis is a chronic inflammatory disorder that affects mainly synovial joints.

More information

Autoimmunity and immunemediated. FOCiS. Lecture outline

Autoimmunity and immunemediated. FOCiS. Lecture outline 1 Autoimmunity and immunemediated inflammatory diseases Abul K. Abbas, MD UCSF FOCiS 2 Lecture outline Pathogenesis of autoimmunity: why selftolerance fails Genetics of autoimmune diseases Therapeutic

More information

Figure 14.2 Overview of Innate and Adaptive Immunity

Figure 14.2 Overview of Innate and Adaptive Immunity I M M U N I T Y Innate (inborn) Immunity does not distinguish one pathogen from another Figure 14.2 Overview of Innate and Adaptive Immunity Our first line of defense includes physical and chemical barriers

More information

Heritability: Twin Studies. Twin studies are often used to assess genetic effects on variation in a trait

Heritability: Twin Studies. Twin studies are often used to assess genetic effects on variation in a trait TWINS AND GENETICS TWINS Heritability: Twin Studies Twin studies are often used to assess genetic effects on variation in a trait Comparing MZ/DZ twins can give evidence for genetic and/or environmental

More information

GAW 15 Problem 3: Simulated Rheumatoid Arthritis Data Full Model and Simulation Parameters

GAW 15 Problem 3: Simulated Rheumatoid Arthritis Data Full Model and Simulation Parameters GAW 15 Problem 3: Simulated Rheumatoid Arthritis Data Full Model and Simulation Parameters Michael B Miller , Michael Li , Gregg Lind , Soon-Young

More information

A Multi-locus Genetic Risk Score for Abdominal Aortic Aneurysm

A Multi-locus Genetic Risk Score for Abdominal Aortic Aneurysm A Multi-locus Genetic Risk Score for Abdominal Aortic Aneurysm Zi Ye, 1 MD, Erin Austin, 1,2 PhD, Daniel J Schaid, 2 PhD, Iftikhar J. Kullo, 1 MD Affiliations: 1 Division of Cardiovascular Diseases and

More information

DnaSP, DNA polymorphism analyses by the coalescent and other methods.

DnaSP, DNA polymorphism analyses by the coalescent and other methods. DnaSP, DNA polymorphism analyses by the coalescent and other methods. Author affiliation: Julio Rozas 1, *, Juan C. Sánchez-DelBarrio 2,3, Xavier Messeguer 2 and Ricardo Rozas 1 1 Departament de Genètica,

More information

Innovations in Molecular Epidemiology

Innovations in Molecular Epidemiology Innovations in Molecular Epidemiology Molecular Epidemiology Measure current rates of active transmission Determine whether recurrent tuberculosis is attributable to exogenous reinfection Determine whether

More information

History of DNA Sequencing & Current Applications

History of DNA Sequencing & Current Applications History of DNA Sequencing & Current Applications Christopher McLeod President & CEO, 454 Life Sciences, A Roche Company IMPORTANT NOTICE Intended Use Unless explicitly stated otherwise, all Roche Applied

More information

Application Guide... 2

Application Guide... 2 Protocol for GenomePlex Whole Genome Amplification from Formalin-Fixed Parrafin-Embedded (FFPE) tissue Application Guide... 2 I. Description... 2 II. Product Components... 2 III. Materials to be Supplied

More information

Logistic Regression (1/24/13)

Logistic Regression (1/24/13) STA63/CBB540: Statistical methods in computational biology Logistic Regression (/24/3) Lecturer: Barbara Engelhardt Scribe: Dinesh Manandhar Introduction Logistic regression is model for regression used

More information

How many of you have checked out the web site on protein-dna interactions?

How many of you have checked out the web site on protein-dna interactions? How many of you have checked out the web site on protein-dna interactions? Example of an approximately 40,000 probe spotted oligo microarray with enlarged inset to show detail. Find and be ready to discuss

More information

Human Genome and Human Genome Project. Louxin Zhang

Human Genome and Human Genome Project. Louxin Zhang Human Genome and Human Genome Project Louxin Zhang A Primer to Genomics Cells are the fundamental working units of every living systems. DNA is made of 4 nucleotide bases. The DNA sequence is the particular

More information

Online Supplement to Polygenic Influence on Educational Attainment. Genotyping was conducted with the Illumina HumanOmni1-Quad v1 platform using

Online Supplement to Polygenic Influence on Educational Attainment. Genotyping was conducted with the Illumina HumanOmni1-Quad v1 platform using Online Supplement to Polygenic Influence on Educational Attainment Construction of Polygenic Score for Educational Attainment Genotyping was conducted with the Illumina HumanOmni1-Quad v1 platform using

More information

The Human Genome Project

The Human Genome Project The Human Genome Project Brief History of the Human Genome Project Physical Chromosome Maps Genetic (or Linkage) Maps DNA Markers Sequencing and Annotating Genomic DNA What Have We learned from the HGP?

More information

Biology Behind the Crime Scene Week 4: Lab #4 Genetics Exercise (Meiosis) and RFLP Analysis of DNA

Biology Behind the Crime Scene Week 4: Lab #4 Genetics Exercise (Meiosis) and RFLP Analysis of DNA Page 1 of 5 Biology Behind the Crime Scene Week 4: Lab #4 Genetics Exercise (Meiosis) and RFLP Analysis of DNA Genetics Exercise: Understanding how meiosis affects genetic inheritance and DNA patterns

More information

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics.

Service courses for graduate students in degree programs other than the MS or PhD programs in Biostatistics. Course Catalog In order to be assured that all prerequisites are met, students must acquire a permission number from the education coordinator prior to enrolling in any Biostatistics course. Courses are

More information

Chapter 13: Meiosis and Sexual Life Cycles

Chapter 13: Meiosis and Sexual Life Cycles Name Period Chapter 13: Meiosis and Sexual Life Cycles Concept 13.1 Offspring acquire genes from parents by inheriting chromosomes 1. Let s begin with a review of several terms that you may already know.

More information

(1-p) 2. p(1-p) From the table, frequency of DpyUnc = ¼ (p^2) = #DpyUnc = p^2 = 0.0004 ¼(1-p)^2 + ½(1-p)p + ¼(p^2) #Dpy + #DpyUnc

(1-p) 2. p(1-p) From the table, frequency of DpyUnc = ¼ (p^2) = #DpyUnc = p^2 = 0.0004 ¼(1-p)^2 + ½(1-p)p + ¼(p^2) #Dpy + #DpyUnc Advanced genetics Kornfeld problem set_key 1A (5 points) Brenner employed 2-factor and 3-factor crosses with the mutants isolated from his screen, and visually assayed for recombination events between

More information

Gene mutation and molecular medicine Chapter 15

Gene mutation and molecular medicine Chapter 15 Gene mutation and molecular medicine Chapter 15 Lecture Objectives What Are Mutations? How Are DNA Molecules and Mutations Analyzed? How Do Defective Proteins Lead to Diseases? What DNA Changes Lead to

More information

The Functional but not Nonfunctional LILRA3 Contributes to Sex Bias in Susceptibility and Severity of ACPA-Positive Rheumatoid Arthritis

The Functional but not Nonfunctional LILRA3 Contributes to Sex Bias in Susceptibility and Severity of ACPA-Positive Rheumatoid Arthritis The Functional but not Nonfunctional LILRA3 Contributes to Sex Bias in Susceptibility and Severity of ACPA-Positive Rheumatoid Arthritis Yan Du Peking University People s Hospital 100044 Beijing CHINA

More information

GENETIC DATA ANALYSIS

GENETIC DATA ANALYSIS GENETIC DATA ANALYSIS 1 Genetic Data: Future of Personalized Healthcare To achieve personalization in Healthcare, there is a need for more advancements in the field of Genomics. The human genome is made

More information

Biological Sciences Initiative. Human Genome

Biological Sciences Initiative. Human Genome Biological Sciences Initiative HHMI Human Genome Introduction In 2000, researchers from around the world published a draft sequence of the entire genome. 20 labs from 6 countries worked on the sequence.

More information

CCR Biology - Chapter 9 Practice Test - Summer 2012

CCR Biology - Chapter 9 Practice Test - Summer 2012 Name: Class: Date: CCR Biology - Chapter 9 Practice Test - Summer 2012 Multiple Choice Identify the choice that best completes the statement or answers the question. 1. Genetic engineering is possible

More information

GENOMIC SELECTION: THE FUTURE OF MARKER ASSISTED SELECTION AND ANIMAL BREEDING

GENOMIC SELECTION: THE FUTURE OF MARKER ASSISTED SELECTION AND ANIMAL BREEDING GENOMIC SELECTION: THE FUTURE OF MARKER ASSISTED SELECTION AND ANIMAL BREEDING Theo Meuwissen Institute for Animal Science and Aquaculture, Box 5025, 1432 Ås, Norway, theo.meuwissen@ihf.nlh.no Summary

More information

The role of IBV proteins in protection: cellular immune responses. COST meeting WG2 + WG3 Budapest, Hungary, 2015

The role of IBV proteins in protection: cellular immune responses. COST meeting WG2 + WG3 Budapest, Hungary, 2015 The role of IBV proteins in protection: cellular immune responses COST meeting WG2 + WG3 Budapest, Hungary, 2015 1 Presentation include: Laboratory results Literature summary Role of T cells in response

More information

Human Genome Organization: An Update. Genome Organization: An Update

Human Genome Organization: An Update. Genome Organization: An Update Human Genome Organization: An Update Genome Organization: An Update Highlights of Human Genome Project Timetable Proposed in 1990 as 3 billion dollar joint venture between DOE and NIH with 15 year completion

More information

Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources

Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources Computational Drug Repositioning by Ranking and Integrating Multiple Data Sources Ping Zhang IBM T. J. Watson Research Center Pankaj Agarwal GlaxoSmithKline Zoran Obradovic Temple University Terms and

More information

Mitochondrial DNA Analysis

Mitochondrial DNA Analysis Mitochondrial DNA Analysis Lineage Markers Lineage markers are passed down from generation to generation without changing Except for rare mutation events They can help determine the lineage (family tree)

More information

The Body s Defenses CHAPTER 24

The Body s Defenses CHAPTER 24 CHAPTER 24 The Body s Defenses PowerPoint Lectures for Essential Biology, Third Edition Neil Campbell, Jane Reece, and Eric Simon Essential Biology with Physiology, Second Edition Neil Campbell, Jane Reece,

More information

OpenMedicine Foundation (OMF)

OpenMedicine Foundation (OMF) Scientific Advisory Board Director Ronald Davis, Ph.D. Genome Technology Center Paul Berg, PhD Molecular Genetics Mario Capecchi, Ph.D Genetics & Immunology University of Utah Mark Davis, Ph.D. Immunology

More information

Major US Genomic Medicine Programs: NHGRI s Electronic Medical Records and Genomics (emerge) Network

Major US Genomic Medicine Programs: NHGRI s Electronic Medical Records and Genomics (emerge) Network Major US Genomic Medicine Programs: NHGRI s Electronic Medical Records and Genomics (emerge) Network Dan Roden Member, National Advisory Council For Human Genome Research Genomic Medicine Working Group

More information

Building risk prediction models - with a focus on Genome-Wide Association Studies. Charles Kooperberg

Building risk prediction models - with a focus on Genome-Wide Association Studies. Charles Kooperberg Building risk prediction models - with a focus on Genome-Wide Association Studies Risk prediction models Based on data: (D i, X i1,..., X ip ) i = 1,..., n we like to fit a model P(D = 1 X 1,..., X p )

More information

PRINCIPLES OF POPULATION GENETICS

PRINCIPLES OF POPULATION GENETICS PRINCIPLES OF POPULATION GENETICS FOURTH EDITION Daniel L. Hartl Harvard University Andrew G. Clark Cornell University UniversitSts- und Landesbibliothek Darmstadt Bibliothek Biologie Sinauer Associates,

More information

Bayesian coalescent inference of population size history

Bayesian coalescent inference of population size history Bayesian coalescent inference of population size history Alexei Drummond University of Auckland Workshop on Population and Speciation Genomics, 2016 1st February 2016 1 / 39 BEAST tutorials Population

More information

GENOMICS: REINVIGORATING THE FIELD OF PSYCHIATRIC RESEARCH

GENOMICS: REINVIGORATING THE FIELD OF PSYCHIATRIC RESEARCH Office of Communications www.broadinstitute.org T 617-714-7151 communications@broadinstitute.org 7 Cambridge Center, Cambridge, MA 02142 GENOMICS: REINVIGORATING THE FIELD OF PSYCHIATRIC RESEARCH For decades,

More information

SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis

SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis Goal: This tutorial introduces several websites and tools useful for determining linkage disequilibrium

More information

Integrating DNA Motif Discovery and Genome-Wide Expression Analysis. Erin M. Conlon

Integrating DNA Motif Discovery and Genome-Wide Expression Analysis. Erin M. Conlon Integrating DNA Motif Discovery and Genome-Wide Expression Analysis Department of Mathematics and Statistics University of Massachusetts Amherst Statistics in Functional Genomics Workshop Ascona, Switzerland

More information

False Discovery Rates

False Discovery Rates False Discovery Rates John D. Storey Princeton University, Princeton, USA January 2010 Multiple Hypothesis Testing In hypothesis testing, statistical significance is typically based on calculations involving

More information

1) Siderophores are bacterial proteins that compete with animal A) Antibodies. B) Red blood cells. C) Transferrin. D) White blood cells. E) Receptors.

1) Siderophores are bacterial proteins that compete with animal A) Antibodies. B) Red blood cells. C) Transferrin. D) White blood cells. E) Receptors. Prof. Lester s BIOL 210 Practice Exam 4 (There is no answer key. Please do not email or ask me for answers.) Chapters 15, 16, 17, 19, HIV/AIDS, TB, Quorum Sensing 1) Siderophores are bacterial proteins

More information

Big Data for Population Health and Personalised Medicine through EMR Linkages

Big Data for Population Health and Personalised Medicine through EMR Linkages Big Data for Population Health and Personalised Medicine through EMR Linkages Zheng-Ming CHEN Professor of Epidemiology Nuffield Dept. of Population Health, University of Oxford Big Data for Health Policy

More information

Analysis of gene expression data. Ulf Leser and Philippe Thomas

Analysis of gene expression data. Ulf Leser and Philippe Thomas Analysis of gene expression data Ulf Leser and Philippe Thomas This Lecture Protein synthesis Microarray Idea Technologies Applications Problems Quality control Normalization Analysis next week! Ulf Leser:

More information

School of Nursing. Presented by Yvette Conley, PhD

School of Nursing. Presented by Yvette Conley, PhD Presented by Yvette Conley, PhD What we will cover during this webcast: Briefly discuss the approaches introduced in the paper: Genome Sequencing Genome Wide Association Studies Epigenomics Gene Expression

More information

Data Analysis for Ion Torrent Sequencing

Data Analysis for Ion Torrent Sequencing IFU022 v140202 Research Use Only Instructions For Use Part III Data Analysis for Ion Torrent Sequencing MANUFACTURER: Multiplicom N.V. Galileilaan 18 2845 Niel Belgium Revision date: August 21, 2014 Page

More information

DNA Fingerprinting. Unless they are identical twins, individuals have unique DNA

DNA Fingerprinting. Unless they are identical twins, individuals have unique DNA DNA Fingerprinting Unless they are identical twins, individuals have unique DNA DNA fingerprinting The name used for the unambiguous identifying technique that takes advantage of differences in DNA sequence

More information

UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory. April, 2015

UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory. April, 2015 UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory April, 2015 1 Contents Overview... 3 Rare Variants... 3 Observation... 3 Approach... 3 ApoE

More information

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE ICH HARMONISED TRIPARTITE GUIDELINE QUALITY OF BIOTECHNOLOGICAL PRODUCTS: ANALYSIS

More information

Bio EOC Topics for Cell Reproduction: Bio EOC Questions for Cell Reproduction:

Bio EOC Topics for Cell Reproduction: Bio EOC Questions for Cell Reproduction: Bio EOC Topics for Cell Reproduction: Asexual vs. sexual reproduction Mitosis steps, diagrams, purpose o Interphase, Prophase, Metaphase, Anaphase, Telophase, Cytokinesis Meiosis steps, diagrams, purpose

More information

Molecular Diagnosis of Hepatitis B and Hepatitis D infections

Molecular Diagnosis of Hepatitis B and Hepatitis D infections Molecular Diagnosis of Hepatitis B and Hepatitis D infections Acute infection Detection of HBsAg in serum is a fundamental diagnostic marker of HBV infection HBsAg shows a strong correlation with HBV replication

More information

Single-Cell DNA Sequencing with the C 1. Single-Cell Auto Prep System. Reveal hidden populations and genetic diversity within complex samples

Single-Cell DNA Sequencing with the C 1. Single-Cell Auto Prep System. Reveal hidden populations and genetic diversity within complex samples DATA Sheet Single-Cell DNA Sequencing with the C 1 Single-Cell Auto Prep System Reveal hidden populations and genetic diversity within complex samples Single-cell sensitivity Discover and detect SNPs,

More information

Current Motif Discovery Tools and their Limitations

Current Motif Discovery Tools and their Limitations Current Motif Discovery Tools and their Limitations Philipp Bucher SIB / CIG Workshop 3 October 2006 Trendy Concepts and Hypotheses Transcription regulatory elements act in a context-dependent manner.

More information

Liver Disease and Therapy of Hepatitis B Virus Infections

Liver Disease and Therapy of Hepatitis B Virus Infections Liver Disease and Therapy of Hepatitis B Virus Infections University of Adelaide Catherine Scougall Arend Grosse Huey-Chi Low Allison Jilbert Fox Chase Cancer Center Chunxiao Xu Carol Aldrich Sam Litwin

More information

Statistics Graduate Courses

Statistics Graduate Courses Statistics Graduate Courses STAT 7002--Topics in Statistics-Biological/Physical/Mathematics (cr.arr.).organized study of selected topics. Subjects and earnable credit may vary from semester to semester.

More information

DNA-Analytik III. Genetische Variabilität

DNA-Analytik III. Genetische Variabilität DNA-Analytik III Genetische Variabilität Genetische Variabilität Lexikon Scherer et al. Nat Genet Suppl 39:s7 (2007) Genetische Variabilität Sequenzvariation Mutationen (Mikro~) Basensubstitution Insertion

More information

Marker-Assisted Backcrossing. Marker-Assisted Selection. 1. Select donor alleles at markers flanking target gene. Losing the target allele

Marker-Assisted Backcrossing. Marker-Assisted Selection. 1. Select donor alleles at markers flanking target gene. Losing the target allele Marker-Assisted Backcrossing Marker-Assisted Selection CS74 009 Jim Holland Target gene = Recurrent parent allele = Donor parent allele. Select donor allele at markers linked to target gene.. Select recurrent

More information

Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company

Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company Chapter 8: Recombinant DNA 2002 by W. H. Freeman and Company Genetic engineering: humans Gene replacement therapy or gene therapy Many technical and ethical issues implications for gene pool for germ-line gene therapy what traits constitute disease rather than just

More information

Lecture 13: DNA Technology. DNA Sequencing. DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology

Lecture 13: DNA Technology. DNA Sequencing. DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology Lecture 13: DNA Technology DNA Sequencing Genetic Markers - RFLPs polymerase chain reaction (PCR) products of biotechnology DNA Sequencing determine order of nucleotides in a strand of DNA > bases = A,

More information

Antibody Function & Structure

Antibody Function & Structure Antibody Function & Structure Specifically bind to antigens in both the recognition phase (cellular receptors) and during the effector phase (synthesis and secretion) of humoral immunity Serology: the

More information

Umm AL Qura University MUTATIONS. Dr Neda M Bogari

Umm AL Qura University MUTATIONS. Dr Neda M Bogari Umm AL Qura University MUTATIONS Dr Neda M Bogari CONTACTS www.bogari.net http://web.me.com/bogari/bogari.net/ From DNA to Mutations MUTATION Definition: Permanent change in nucleotide sequence. It can

More information

Presentation by: Ahmad Alsahaf. Research collaborator at the Hydroinformatics lab - Politecnico di Milano MSc in Automation and Control Engineering

Presentation by: Ahmad Alsahaf. Research collaborator at the Hydroinformatics lab - Politecnico di Milano MSc in Automation and Control Engineering Johann Bernoulli Institute for Mathematics and Computer Science, University of Groningen 9-October 2015 Presentation by: Ahmad Alsahaf Research collaborator at the Hydroinformatics lab - Politecnico di

More information

Population Genetics and Multifactorial Inheritance 2002

Population Genetics and Multifactorial Inheritance 2002 Population Genetics and Multifactorial Inheritance 2002 Consanguinity Genetic drift Founder effect Selection Mutation rate Polymorphism Balanced polymorphism Hardy-Weinberg Equilibrium Hardy-Weinberg Equilibrium

More information

Structure and Function of DNA

Structure and Function of DNA Structure and Function of DNA DNA and RNA Structure DNA and RNA are nucleic acids. They consist of chemical units called nucleotides. The nucleotides are joined by a sugar-phosphate backbone. The four

More information

NGS and complex genetics

NGS and complex genetics NGS and complex genetics Robert Kraaij Genetic Laboratory Department of Internal Medicine r.kraaij@erasmusmc.nl Gene Hunting Rotterdam Study and GWAS Next Generation Sequencing Gene Hunting Mendelian gene

More information

A white paper for consideration by the NIAID Microbial Sequencing Program

A white paper for consideration by the NIAID Microbial Sequencing Program Hepatitis C Virus Sequencing: Viral evolution, immune recognition and vaccine development. A white paper for consideration by the NIAID Microbial Sequencing Program Todd M. Allen, Ph.D. Assistant Professor

More information

HLA data analysis in anthropology: basic theory and practice

HLA data analysis in anthropology: basic theory and practice HLA data analysis in anthropology: basic theory and practice Alicia Sanchez-Mazas and José Manuel Nunes Laboratory of Anthropology, Genetics and Peopling history (AGP), Department of Anthropology and Ecology,

More information

Hardy-Weinberg Equilibrium Problems

Hardy-Weinberg Equilibrium Problems Hardy-Weinberg Equilibrium Problems 1. The frequency of two alleles in a gene pool is 0.19 (A) and 0.81(a). Assume that the population is in Hardy-Weinberg equilibrium. (a) Calculate the percentage of

More information

Appendix 2 Molecular Biology Core Curriculum. Websites and Other Resources

Appendix 2 Molecular Biology Core Curriculum. Websites and Other Resources Appendix 2 Molecular Biology Core Curriculum Websites and Other Resources Chapter 1 - The Molecular Basis of Cancer 1. Inside Cancer http://www.insidecancer.org/ From the Dolan DNA Learning Center Cold

More information

Gene Therapy and Genetic Counseling. Chapter 20

Gene Therapy and Genetic Counseling. Chapter 20 Gene Therapy and Genetic Counseling Chapter 20 What is Gene Therapy? Treating a disease by replacing, manipulating or supplementing a gene The act of changing an individual s DNA sequence to fix a non-functional

More information

Chapter 43: The Immune System

Chapter 43: The Immune System Name Period Our students consider this chapter to be a particularly challenging and important one. Expect to work your way slowly through the first three concepts. Take particular care with Concepts 43.2

More information

Introduction To Real Time Quantitative PCR (qpcr)

Introduction To Real Time Quantitative PCR (qpcr) Introduction To Real Time Quantitative PCR (qpcr) SABiosciences, A QIAGEN Company www.sabiosciences.com The Seminar Topics The advantages of qpcr versus conventional PCR Work flow & applications Factors

More information

Electronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider

Electronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider Electronic Medical Records and Genomics: Possibilities, Realities, Ethical Issues to Consider Daniel Masys, M.D. Affiliate Professor Biomedical and Health Informatics University of Washington, Seattle

More information

A trait is a variation of a particular character (e.g. color, height). Traits are passed from parents to offspring through genes.

A trait is a variation of a particular character (e.g. color, height). Traits are passed from parents to offspring through genes. 1 Biology Chapter 10 Study Guide Trait A trait is a variation of a particular character (e.g. color, height). Traits are passed from parents to offspring through genes. Genes Genes are located on chromosomes

More information

Simplifying Data Interpretation with Nexus Copy Number

Simplifying Data Interpretation with Nexus Copy Number Simplifying Data Interpretation with Nexus Copy Number A WHITE PAPER FROM BIODISCOVERY, INC. Rapid technological advancements, such as high-density acgh and SNP arrays as well as next-generation sequencing

More information

Y Chromosome Markers

Y Chromosome Markers Y Chromosome Markers Lineage Markers Autosomal chromosomes recombine with each meiosis Y and Mitochondrial DNA does not This means that the Y and mtdna remains constant from generation to generation Except

More information

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16

BIOINF 525 Winter 2016 Foundations of Bioinformatics and Systems Biology http://tinyurl.com/bioinf525-w16 Course Director: Dr. Barry Grant (DCM&B, bjgrant@med.umich.edu) Description: This is a three module course covering (1) Foundations of Bioinformatics, (2) Statistics in Bioinformatics, and (3) Systems

More information

Gene Mapping Techniques

Gene Mapping Techniques Gene Mapping Techniques OBJECTIVES By the end of this session the student should be able to: Define genetic linkage and recombinant frequency State how genetic distance may be estimated State how restriction

More information

Breast cancer and the role of low penetrance alleles: a focus on ATM gene

Breast cancer and the role of low penetrance alleles: a focus on ATM gene Modena 18-19 novembre 2010 Breast cancer and the role of low penetrance alleles: a focus on ATM gene Dr. Laura La Paglia Breast Cancer genetic Other BC susceptibility genes TP53 PTEN STK11 CHEK2 BRCA1

More information

Blood-Based Cancer Diagnostics

Blood-Based Cancer Diagnostics The Biotechnology Education Company Blood-Based Cancer Diagnostics EDVO-Kit 141 Store entire experiment at room temperature. EXPERIMENT OBJECTIVE: The objective of this experiment is to learn and understand

More information

Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses

Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses Using Repeated Measures Techniques To Analyze Cluster-correlated Survey Responses G. Gordon Brown, Celia R. Eicheldinger, and James R. Chromy RTI International, Research Triangle Park, NC 27709 Abstract

More information

Milk protein genetic variation in Butana cattle

Milk protein genetic variation in Butana cattle Milk protein genetic variation in Butana cattle Ammar Said Ahmed Züchtungsbiologie und molekulare Genetik, Humboldt Universität zu Berlin, Invalidenstraβe 42, 10115 Berlin, Deutschland 1 Outline Background

More information

Chromosomes, Mapping, and the Meiosis Inheritance Connection

Chromosomes, Mapping, and the Meiosis Inheritance Connection Chromosomes, Mapping, and the Meiosis Inheritance Connection Carl Correns 1900 Chapter 13 First suggests central role for chromosomes Rediscovery of Mendel s work Walter Sutton 1902 Chromosomal theory

More information

Validation and Replication

Validation and Replication Validation and Replication Overview Definitions of validation and replication Difficulties and limitations Working examples from our group and others Why? False positive results still occur. even after

More information

I. Genes found on the same chromosome = linked genes

I. Genes found on the same chromosome = linked genes Genetic recombination in Eukaryotes: crossing over, part 1 I. Genes found on the same chromosome = linked genes II. III. Linkage and crossing over Crossing over & chromosome mapping I. Genes found on the

More information

14.3 Studying the Human Genome

14.3 Studying the Human Genome 14.3 Studying the Human Genome Lesson Objectives Summarize the methods of DNA analysis. State the goals of the Human Genome Project and explain what we have learned so far. Lesson Summary Manipulating

More information

B I O I N F O R M A T I C S

B I O I N F O R M A T I C S B I O I N F O R M A T I C S Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: GENOME-WIDE ASSOCIATION STUDIES 1 Setting

More information

European Medicines Agency

European Medicines Agency European Medicines Agency July 1996 CPMP/ICH/139/95 ICH Topic Q 5 B Quality of Biotechnological Products: Analysis of the Expression Construct in Cell Lines Used for Production of r-dna Derived Protein

More information