CS2220 Introduction to Computational Biology

Similar documents
AP BIOLOGY 2010 SCORING GUIDELINES (Form B)

Lecture 6: Single nucleotide polymorphisms (SNPs) and Restriction Fragment Length Polymorphisms (RFLPs)

BioBoot Camp Genetics

Gene mutation and molecular medicine Chapter 15

Genetics Lecture Notes Lectures 1 2

Lecture 3: Mutations

1 Mutation and Genetic Change

SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis

Human Genome Organization: An Update. Genome Organization: An Update

Genomes and SNPs in Malaria and Sickle Cell Anemia

MUTATION, DNA REPAIR AND CANCER

PRINCIPLES OF POPULATION GENETICS

SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE

Genetics Module B, Anchor 3

A Primer of Genome Science THIRD

Basic Principles of Forensic Molecular Biology and Genetics. Population Genetics

Genetics 301 Sample Final Examination Spring 2003

Concluding lesson. Student manual. What kind of protein are you? (Basic)

Genetic information (DNA) determines structure of proteins DNA RNA proteins cell structure enzymes control cell chemistry ( metabolism )

Bio 102 Practice Problems Genetic Code and Mutation

Focusing on results not data comprehensive data analysis for targeted next generation sequencing

Just the Facts: A Basic Introduction to the Science Underlying NCBI Resources

Transcription and Translation of DNA

Algorithms in Computational Biology (236522) spring 2007 Lecture #1

SNPbrowser Software v3.5

The sequence of bases on the mrna is a code that determines the sequence of amino acids in the polypeptide being synthesized:

Cystic Fibrosis Webquest Sarah Follenweider, The English High School 2009 Summer Research Internship Program

Umm AL Qura University MUTATIONS. Dr Neda M Bogari

Population Genetics and Multifactorial Inheritance 2002

Name Class Date. Figure Which nucleotide in Figure 13 1 indicates the nucleic acid above is RNA? a. uracil c. cytosine b. guanine d.

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE E15

RNA & Protein Synthesis

somatic cell egg genotype gamete polar body phenotype homologous chromosome trait dominant autosome genetics recessive

Mendelian inheritance and the

Gene Models & Bed format: What they represent.

School of Nursing. Presented by Yvette Conley, PhD

Structure and Function of DNA

Breast cancer and the role of low penetrance alleles: a focus on ATM gene

Sickle cell anemia: Altered beta chain Single AA change (#6 Glu to Val) Consequence: Protein polymerizes Change in RBC shape ---> phenotypes

SNP Data Integration and Analysis for Drug- Response Biomarker Discovery

Forensic DNA Testing Terminology

DNA Replication & Protein Synthesis. This isn t a baaaaaaaddd chapter!!!

Genetic testing. The difference diagnostics can make. The British In Vitro Diagnostics Association

Protein Synthesis How Genes Become Constituent Molecules

Single Nucleotide Polymorphisms (SNPs)

Molecular Genetics. RNA, Transcription, & Protein Synthesis

How many of you have checked out the web site on protein-dna interactions?

The Human Genome Project

Genetic Mutations. Indicator 4.8: Compare the consequences of mutations in body cells with those in gametes.

Protein Synthesis. Page 41 Page 44 Page 47 Page 42 Page 45 Page 48 Page 43 Page 46 Page 49. Page 41. DNA RNA Protein. Vocabulary

LECTURE 6 Gene Mutation (Chapter )

Biology Final Exam Study Guide: Semester 2

Name Date Period. 2. When a molecule of double-stranded DNA undergoes replication, it results in

12.1 The Role of DNA in Heredity

Activity 7.21 Transcription factors

Lecture Series 7. From DNA to Protein. Genotype to Phenotype. Reading Assignments. A. Genes and the Synthesis of Polypeptides

BI122 Introduction to Human Genetics, Fall 2014

Biomedical Big Data and Precision Medicine

Gene Mapping Techniques

INTERNATIONAL CONFERENCE ON HARMONISATION OF TECHNICAL REQUIREMENTS FOR REGISTRATION OF PHARMACEUTICALS FOR HUMAN USE Q5B

GAW 15 Problem 3: Simulated Rheumatoid Arthritis Data Full Model and Simulation Parameters

Gene and Chromosome Mutation Worksheet (reference pgs in Modern Biology textbook)

Bioinformatics Resources at a Glance

13.2 Ribosomes & Protein Synthesis

From DNA to Protein. Proteins. Chapter 13. Prokaryotes and Eukaryotes. The Path From Genes to Proteins. All proteins consist of polypeptide chains

Appendix 2 Molecular Biology Core Curriculum. Websites and Other Resources

Biological Sciences Initiative. Human Genome

Chromosomes, Mapping, and the Meiosis Inheritance Connection

Paternity Testing. Chapter 23

To be able to describe polypeptide synthesis including transcription and splicing

Biology 1406 Exam 4 Notes Cell Division and Genetics Ch. 8, 9

Coding sequence the sequence of nucleotide bases on the DNA that are transcribed into RNA which are in turn translated into protein

Hardy-Weinberg Equilibrium Problems

Globally, about 9.7% of cancers in men are prostate cancers, and the risk of developing the

13.4 Gene Regulation and Expression

Translation Study Guide

Bob Jesberg. Boston, MA April 3, 2014

Y Chromosome Markers

Milk protein genetic variation in Butana cattle

Becker Muscular Dystrophy

14.3 Studying the Human Genome

2. True or False? The sequence of nucleotides in the human genome is 90.9% identical from one person to the next. False (it s 99.

Ms. Campbell Protein Synthesis Practice Questions Regents L.E.

The Steps. 1. Transcription. 2. Transferal. 3. Translation

Chapter 3. Chapter Outline. Chapter Outline 9/11/10. Heredity and Evolu4on

NGS and complex genetics

European Medicines Agency

CHROMOSOMES AND INHERITANCE

Chapter 4 Pedigree Analysis in Human Genetics. Chapter 4 Human Heredity by Michael Cummings 2006 Brooks/Cole-Thomson Learning

Introduction. What is Ecological Genetics?

Human Genome and Human Genome Project. Louxin Zhang

MCB41: Second Midterm Spring 2009

GWASrap User Manual v1.1

SOP 3 v2: web-based selection of oligonucleotide primer trios for genotyping of human and mouse polymorphisms

DNA, RNA, Protein synthesis, and Mutations. Chapters

Chapter 9 Patterns of Inheritance

The Future of the Electronic Health Record. Gerry Higgins, Ph.D., Johns Hopkins

Bio EOC Topics for Cell Reproduction: Bio EOC Questions for Cell Reproduction:

Commonly Used STR Markers

Sample Questions for Exam 3

Transcription:

CS2220 Introduction to Computational Biology WEEK 7: SINGLE (SIMPLE) NUCLEOTIDE POLYMORPHISMS (SNPS) 1 Dr. Mengling FENG Institute for Infocomm Research Massachusetts Institute of Technology mfeng@mit.edu

PLANS FOR WEEK 7 AND WEEK 8! Week 7, 1 st Oct 2015! 2 hours class: Single (Simple) Nucleotide Polymorphism! 1 hour briefing on project and forming of project teams! Week 8, 7 th Oct 2015! 2 hours class: Genome-wide Association Study (GWAS)! 1 hour Q&A on the lectures and project 2

WEEK 7 S LEARNING OBEJECTIVES! After the class, students should be able to! Define the concept of SNP! Elaborate various types of SNPs and their functions! Explain the applications of SNPs! Know the major initiatives and projects related to SNP! Use online resources to find out information about SNPs 3

SINGLE (SIMPLE) NUCLEOTIDE POLYMORPHISM THE DEFINITION! SNP is a DNA sequence variation occurring commonly within a population! A single nucleotide A, T, C &G, mutation! Must be common! Minor Allele Frequency (MAF) >1% 4

5

SINGLE (SIMPLE) NUCLEOTIDE POLYMORPHISM! ~15 million possible SNP sites in human genome, ~10 million common SNPs (MAF >5%)! ~12 million SNPs have been identified (dbsnp 2012 release 137)! Each individual may carry 3~5 million common SNPs (inherited) and ~120 new mutations! SNPs VS Individual Mutations! Natural Selection! Founder Effect 6

SNPS AS AN EVIDENCE FOR NATURE SELECTION! Many Africans carry SNPs around gene G6PD and CD40 ligand, which may lead to resistance to malaria 7

FOUNDER EFFECT! Examples:! The Amish group National Cancer Institute! Ashkennazi Jews after the Holocaust 8

TYPES OF SNPS! Non-coding SNPs! 5 Un-Translated Regions (UTR)! 3 Un-Translated Regions (UTR)! Introns! Integenic Regions (IGR)! Psuedogenes! Coding SNPs! Synonymous substitution! Non-synonymous substitution! Missense! Nonsense 9

! Take home message: FUNCTIONS OF SNPS! We still know very little about it! Genome-wide Association and other studies to identify associations and causations! Majority of SNPs are believed to be silent! Non-coding SNPs: regulatory functions! Splicing! Transcriptional regulation (promoter & TF binding sites)! Translational regulation (initiation or termination)! Regulate mrna target sites 10

FUNCTIONS OF SNPS SYNONYMOUS SUBSTITUTIONS! Do not trigger amino acid change in protein sequence! Were believed to be silent mutations! Recent studies shown that they can affect! Messenger RNA (mrna) splicing, stability, structure and protein folding => protein functions 11

FUNCTIONS OF SNPS NON-SYNONYMOUS SUBSTITUTIONS! Missense: change in amino acid of protein sequence! Nonsense: change in amino acid that lead to premature stop codon 12

APPLICATIONS OF SNPS! General Applications! Forensics! Paternity tests! Ancestry trace: immigration to the United Kingdom! Follow ethnic migrations 13

APPLICATIONS OF SNPS! Genetic marker for distinguishing traits! Predisposition for disease 14

APPLICATIONS OF SNPS! Genetic marker for distinguishing traits! Predisposition for disease! Drug efficacy! Drug adverse effect 15

APPLICATIONS OF SNPS! Genetic marker for distinguishing traits! Predisposition for disease! Drug efficacy! Drug adverse effect! Other traits 16

APPLICATIONS OF SNPS! Genetic marker for distinguishing traits! Predisposition for disease! Drug efficacy! Drug adverse effect! Other traits! Preventive medicine! Personalized and targeted medicine! Profession selection! etc 17

POPULATION GENETICS OF SNPS FOR FORENSIC AN INDIVIDUAL IDENTIFICATION PANEL 18

SNP PANEL SELECTION! SNPs data from 44 populations! Selection criteria! A small panel is preferred! Incomplete or damaged DNA samples! Reduce cost! For individual SNP! Average Heterozygosity > 0.4! Average Fixation Index Fst < 0.06! Linkage Disequilibrium ~ 0.01 19

HETEROZYGOSITY! Human beings are diploid organism! We carry two copies of a gene! For a gene having two alleles: A & a! Homozygote: AA and aa! Heterozygote: Aa! Heterozygosity! Percentage of heterozygot in the population! SNP selection criterion:! Average heterozygosity > 0.4! High genetic variations among individuals are preferred 20

ESTIMATION OF HETEROZYGOSITY THE HARDY-WEIBERG THEOREM 2 i=1 H E = 2 pq = 1 p 2 q 2 = 1 p i 2 k i=1 H E = 1 p i 2 21

FIXATION INDEX FST! A measure of differentiation of subpopulations σ s 2 p F st = σ s 2 p(1 p) is the variance of allele frequencies among different subpopulations is the average allele frequency across the population! Selection Criterion:! Fst < 0.06! Similar genetic profiles among subpopulations are preferred 22

LINKAGE DISEQUILIBRIUM (LD)! Measures the non-random association of alleles at different loci! In the study, r2 measure was used! Selection criterion:! LD ~ 0.01! Avoid picking up highly linked SNPs! Minimize redundancy 23

AN INDIVIDUAL IDENTIFICATION SNP PANEL THE RESULTS! Identified two sets of SNPs! Set I: 45 SNPs! Estimated average matching probability < 10-15! An two random individuals to have the same genotype will be very unlikely! Set II: 89 SNPs! Estimated average matching probability < 10-33 24

SNP AS A DISEASE BIO-MAKER CYSTIC FIBROSIS! A genetic disorder that affects mostly the lungs! Inherited in an autosomal recessive manner! Most common among people of Northern European ancestry 25 http://www.ncbi.nlm.nih.gov/books/nbk1250/

SNP AS A DISEASE BIO-MAKER GAUCHER DISEASE! A genetic disease in which fatty substances accumulate in cells and certain organs! Inherited in an autosomal recessive manner 26 http://www.ncbi.nlm.nih.gov/books/nbk1269/

BREAKTHROUGH OF THE YEAR 2007 HUMAN GENOME VARIATION 27

THE HAPMAP PROJECT http://hapmap.ncbi.nlm.nih.gov/index.html.en 28

1000 GENOME PROJECT http://www.1000genomes.org/data 29

ONLINE RESOUCES: SNPEDIA http://snpedia.com/index.php/snpedia 30

ONLINE RESOUCES: DBSNP http://www.ncbi.nlm.nih.gov/projects/snp/snp_ref.cgi?rs=487989 31

WEEK 7 S LEARNING OBEJECTIVES! After the class, students should be able to! Define the concept of SNP! Elaborate various types of SNPs and their functions! Explain the applications of SNPs! Know the major initiatives and projects related to SNP! Use online resources to find out information about SNPs! Understand the concept of haplotype and linkage disequilibrium 32