The p53 MUTATION HANDBOOK

Similar documents
( TUTORIAL. (July 2006)

Hands on Simulation of Mutation

UNIVERSITETET I OSLO Det matematisk-naturvitenskapelige fakultet

GENEWIZ, Inc. DNA Sequencing Service Details for USC Norris Comprehensive Cancer Center DNA Core

(A) Microarray analysis was performed on ATM and MDM isolated from 4 obese donors.

10 µg lyophilized plasmid DNA (store lyophilized plasmid at 20 C)

Table S1. Related to Figure 4

Part ONE. a. Assuming each of the four bases occurs with equal probability, how many bits of information does a nucleotide contain?

Mutations and Genetic Variability. 1. What is occurring in the diagram below?

Molecular analyses of EGFR: mutation and amplification detection

Introduction to Perl Programming Input/Output, Regular Expressions, String Manipulation. Beginning Perl, Chap 4 6. Example 1

DNA Bracelets

Next Generation Sequencing

DNA Sample preparation and Submission Guidelines

Mutation. Mutation provides raw material to evolution. Different kinds of mutations have different effects

Chapter 9. Applications of probability. 9.1 The genetic code

Gene Synthesis 191. Mutagenesis 194. Gene Cloning 196. AccuGeneBlock Service 198. Gene Synthesis FAQs 201. User Protocol 204

Inverse PCR & Cycle Sequencing of P Element Insertions for STS Generation

Molecular Facts and Figures

Supplementary Information. Binding region and interaction properties of sulfoquinovosylacylglycerol (SQAG) with human

Gene Finding CMSC 423

pcas-guide System Validation in Genome Editing

Drosophila NK-homeobox genes

Characterization of cdna clones of the family of trypsin/a-amylase inhibitors (CM-proteins) in barley {Hordeum vulgare L.)

Coding sequence the sequence of nucleotide bases on the DNA that are transcribed into RNA which are in turn translated into protein

SERVICES CATALOGUE WITH SUBMISSION GUIDELINES

Inverse PCR and Sequencing of P-element, piggybac and Minos Insertion Sites in the Drosophila Gene Disruption Project

ANALYSIS OF A CIRCULAR CODE MODEL

Module 6: Digital DNA

Mutation, Repair, and Recombination

The making of The Genoma Music

a. Ribosomal RNA rrna a type ofrna that combines with proteins to form Ribosomes on which polypeptide chains of proteins are assembled

Y-chromosome haplotype distribution in Han Chinese populations and modern human origin in East Asians

Provincial Exam Questions. 9. Give one role of each of the following nucleic acids in the production of an enzyme.

Gene and Chromosome Mutation Worksheet (reference pgs in Modern Biology textbook)

Marine Biology DEC 2004; 146(1) : Copyright 2004 Springer

ISTEP+: Biology I End-of-Course Assessment Released Items and Scoring Notes

Insulin Receptor Gene Mutations in Iranian Patients with Type II Diabetes Mellitus

Hiding Data in DNA. 1 Introduction

Association of IGF1 and IGFBP3 polymorphisms with colorectal polyps and colorectal cancer risk

Problem Set 3 KEY

The nucleotide sequence of the gene for human protein C

pcmv6-neo Vector Application Guide Contents

Heraeus Sepatech, Kendro Laboratory Products GmbH, Berlin. Becton Dickinson,Heidelberg. Biozym, Hessisch Oldendorf. Eppendorf, Hamburg

The DNA-"Wave Biocomputer"

How To Clone Into Pcdna 3.1/V5-His

Pipe Cleaner Proteins. Essential question: How does the structure of proteins relate to their function in the cell?

BD BaculoGold Baculovirus Expression System Innovative Solutions for Proteomics

All commonly-used expression vectors used in the Jia Lab contain the following multiple cloning site: BamHI EcoRI SmaI SalI XhoI_ NotI

NimbleGen SeqCap EZ Library SR User s Guide Version 3.0

Peptide bonds: resonance structure. Properties of proteins: Peptide bonds and side chains. Dihedral angles. Peptide bond. Protein physics, Lecture 5

Biological One-way Functions

Chlamydomonas adapted Green Fluorescent Protein (CrGFP)

Transmembrane Signaling in Chimeras of the E. coli Chemotaxis Receptors and Bacterial Class III Adenylyl Cyclases

Protein Synthesis. Page 41 Page 44 Page 47 Page 42 Page 45 Page 48 Page 43 Page 46 Page 49. Page 41. DNA RNA Protein. Vocabulary

and revertant strains. The present paper demonstrates that the yeast gene for subunit II can also be translated to yield a polypeptide

Synthetic Biology: DNA Digital Storage, Computation and the Organic Computer

Genomes and SNPs in Malaria and Sickle Cell Anemia

Protein Synthesis Simulation

Guidelines for Writing a Scientific Paper

Five-minute cloning of Taq polymerase-amplified PCR products

C YTOPLASMIC organelles proliferate by the growth and

Biopython Tutorial and Cookbook

TITRATION OF raav (VG) USING QUANTITATIVE REAL TIME PCR

An Introduction to Bioinformatics Algorithms Gene Prediction

Six Homeoproteins and a Iinc-RNA at the Fast MYH Locus Lock Fast Myofiber Terminal Phenotype

Insulin mrna to Protein Kit

Academic Nucleic Acids and Protein Synthesis Test

Event-specific Method for the Quantification of Maize MIR162 Using Real-time PCR. Protocol

PRACTICE TEST QUESTIONS

pentr Directional TOPO Cloning Kits

Bio 102 Practice Problems Genetic Code and Mutation

Archimer

inhibition of mitosis

Recap. Lecture 2. Protein conformation. Proteins. 8 types of protein function 10/21/10. Proteins.. > 50% dry weight of a cell

BOC334 (Proteomics) Practical 1. Calculating the charge of proteins

Transcription:

The p MUTATION HANDBOOK Version 1. /7 Thierry Soussi Christophe Béroud, Dalil Hamroun Jean Michel Rubio Nevado http://p/free.fr

The p Mutation HandBook By T Soussi, J.M. Rubio-Nevado, D. Hamroun and C. Béroud What is the p Mutation HandBook? The p Mutation Handbook is a compilation of multiple analyses performed using information from the UMD p Mutation database Which release of the UMD TP database was used for these analyses? Except when otherwise specified, the curated version of the 7_R1 release of the UMD p mutation database was used. What is the difference between the curated and uncurated version of the p database? The curated version of the UMD TP database has been purged from artefactual data known to affect p mutation analysis. More information can be found on our website (http://p.free.fr/database/p_database.html) or in our recent publication (Soussi, T., Asselain, B., Hamroun, D., Kato, S., Ishioka, C., Claustres, M. and Beroud, C. () Meta-analysis of the p mutation database for mutant p biological activity reveals a methodological bias in mutation detection. Clin Cancer Res, 1, -9.) How were the various analyses performed? Using novel software, the entire UMD TP mutation database was automatically analyzed resulting in the p Mutation Handbook as output. What type of information can be found in the p Mutation HandBook? The p Mutation Handbook presents an analysis of TP mutation distribution and mutational events in the most frequent human cancers. It also contains helpful reference pages about the human p sequence and how each codon of the protein is mutated in human cancer. text accompanies the various analyses The p mutation handbook is not intended to be a manuscript. The handbook provides crude data and figures that are open for interpretation and/or discussion by anyone. Data / figures included in this handbook can

be used and reproduced freely by anyone as long as the reference to the p mutation handbook is indicated. What is the reference of the p Mutation HandBook? T Soussi, J.M. Rubio-Nevado, D. Hamroun and C. Béroud. The p Mutation handbook, available online; http://p.free.fr What is the audience for this handbook? Every scientist interested in p mutations or more globally in mutation analysis in human cancer. What if I need a specific analysis which is not included in this p Mutation handbook? Just send us an email and, if your request is feasible, the analysis will be performed and added to the handbook. You can also make suggestions via our Forum (http://p.free.fr/forum) What is the future of the p Mutation HandBook? More analyses and novel information will be added to the next version. Stay tuned! Version tory 1.1 March 7 Modification of the CpG mutation analysis page 8 and 9 1. Release of the p Mutation HandBook, February 7

DNA substitution mutations are of two types. Transitions are interchanges of purines (A G) or of pyrimdines (C T), which therefore involve bases of similar shape. Transversions are interchanges between purine and pyrmidine bases, which therefore involve exchange of onering and two-ring structures. Although there are twice as many possible transversions, because of the molecular mechanisms by which they are generated, transition mutations occur at higher frequency than transversions p mutational events mutation at CpG 1 mutation outside CpG : transition : transversion Fr: frameshift mutation GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS Spectrum of p mutations in various types of tumors. A predominance of the GC->AT transition at the CpG dinucleotide (colon, ovary brain, or leukemia) is the consequence of spontaneous deamination of -methylcytosine. A high frequency of GC->TA transversion (Lung, Head and neck or HCC) is strongly indicative of exposure to exogenous carcinogens. CpG dinucleotide mutates at a high rate because cytosine is vulnerable to deamination. Cytosines in CpG dinucleotides are often methylated, and deamination of - methylcytosine (mc) produces thymidine. Deamination of unmethylated cytosine produces uracil (U), which can be removed by uracil glycosylase, but mc deamination generates thymine (T), which cannot be processed by this enzyme. The consequence in humans is that the mutation rate from mc to T is 1-fold to -fold higher than other transitions.

8 17 1 9 1 1 18 1 7 9 p CODON The human p protein consists of 9 amino acids with evolutionarily conserved domains (I to V). Domains II to V correspond to the DNA binding domain which is the target for p mutations. 9 % of p mutations occur in the central region which harbors four of the five highly conserved evolutionary domains. X-ray crystallography of p-dna complexes shows that this region is essential for the p-dna interaction [Cho, 199, #18]. However, this distribution might be slightly biased since most investigations have focused exclusively on this region of the p gene. More recent studies on all the coding exons (exons through 11) show that a considerable number of mutations are found in exons and 1. Mutations in exons -8 are significantly different from those found in exons, 9 (see exon distribution).

Distribution of p mutations in the various functional domains of p. The higher frequency of frameshift mutations between exon - and exon -8 domains is statistically significant (p<.1). Human p protein (Hp) can be divided into five domains, each corresponding to specific functions: i) The amino-terminus part 1- contains the acidic transactivation domain and the mdm protein binding site. It also contains the Highly Conserved Domain I (HCD I) (Yellow) II) Region -9 contains series repeated proline residues that are conserved in the majority of p. it also contains a second transactivation domain (Red). III) The central region (11-) contains the DNA binding domain. It is the target of 9% of p mutations found in human cancers. It contains HCD II to V (Blue). IV) The oligomerization domain (7-, TET) consists of a beta-strand, followed by an alpha-helix necessary for dimerization, as p is composed of a dimer of two dimers. A nuclear export signal (NES) is localized in this oligomerization domain (Green). V) The carboxy-terminus of p (-9) contains nuclear localization signals (NLS) and a non-specific DNA binding domain that binds to damaged DNA. This region is also involved in downregulation of DNA binding of the central domain.

p mutation frequency p MUTATION DATABASE ANALYSIS p mutant frequency 198 19 777 998 11 8 7 71 81% 8% 11% 1% % 7% % % 1% Hot spot mutations 1% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 17 CGC CAC 11 CAG Gln 8 CAT 79 71 TGT 8 8 7 GGC AGC Gly Ser 9 AGG AGC Ser 9 TAT TGT Tyr 1 CGA TGA Stop 8 Exon distribution Exon - Exon 9-11 Exon -8 1% 1% 7% % % 1% 8% 1% % Frameshift nsense Missense

p MUTATION DATABASE ANALYSIS p mutation distribution 8 17 1 9 1 1 18 1 7 9 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

all 1 1 p MUTATION DATABASE ANALYSIS FREQUENCY OF GC->TA TRANSVERSION All hepatocellular carcinoma rectal ca. lung (sclc) ovarian carcinoma bladder carcinoma endometrial tumor head and neck scc pancreatic cancer gastric carcinoma none breast carcinoma lung (nsclc) prostate ca. glioblastoma astrocytoma b-chronic lymphocytic leukemia skin squamous cell carcinoma b-cell lymphoma li-fraumeni synd. basal cell carcinoma colorectal carcinoma osteosarcoma gliomas esophageal scc esophageal adc CANCER 7 FREQUENCY OF GC->AT TRANSITION pancreatic cancer astrocytoma glioblastoma gastric carcinoma hepatocellular carcinoma bladder carcinoma esophageal scc ovarian carcinoma head and neck scc lung (nsclc) breast carcinoma 1 osteosarcoma endometrial tumor gliomas skin squamous cell carcinoma lung (sclc) b-cell lymphoma prostate ca. basal cell carcinoma esophageal adc none li-fraumeni synd. colorectal carcinoma rectal ca. b-chronic lymphocytic leukemia mutation at CpG mutation outside CpG CANCER

p MUTATION DATABASE ANALYSIS UV INDUCED MUTATION IN SKIN CANCER TANDEM MUTATIONS 1 MUTATIONS AT Py-Py SITES 1 1 8 Skin tumours Internal tumours Skin tumours Internal Tumours MUTATIONS AT DI-PYRIMIDINE SITES IN VARIOUS CANCER 1 8 AK BCC Melanoma Patches SCC XP Patients Breast ca. Colon ca.

BREAST CANCER p mutation frequency p mutant frequency 1 89 1 7 9 78 7% 9% 17% 1% % % 7% % 1% Hot spot mutations 8 7% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 17 CGC CAC 1 CAG Gln 9 CAT 87 GGC AGC Gly Ser TAT TGT Tyr TGT 1 CGA TGA Stop 1 7 ATG ATA Met Ile 9 8 7 Exon distribution EXON - EXON -8 EXON 9-11 1% 1% % 8% % % % 77% Frameshift nsense Missense %

BREAST CANCER p mutation distribution 8 7 1 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

CRC CANCER p mutation frequency p mutant frequency 9 77 1 119 7 11 78 81% 9% 1% 1% % 7% 7% % 1% Hot spot mutations 8 % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 17 CGC CAC 18 CAG Gln 19 178 CAT 177 8 1 GGC AGC Gly Ser 117 TGT 111 19 CGA TGA Stop 1 CGA TGA Stop 1 GGC GAC Gly Asp 8 Exon distribution EXON - EXON -8 EXON 9-11 1% % 9% 9% % % 1% 8% % Frameshift nsense Missense

CRC CANCER p mutation distribution 1 1 1 8 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 7 mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

ASTROCYTOMA p mutation frequency p mutant frequency 77 1 11 1 17 88% % 8% 1% 1% 7% % 19% 1% Hot spot mutations 1% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty TGT 89 17 CGC CAC 7 CAG Gln CAT 8 17 1 TAC TGC Tyr 8 179 CAT 7 1 TAC TGC Tyr GGC AGC Gly Ser Exon distribution EXON - EXON -8 8% 8% % 17% % EXON 9-11 % 7% 89% Frameshift nsense Missense 8%

ASTROCYTOMA p mutation distribution 1 1 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

GLIOBLASTOMA p mutation frequency p mutant frequency 71 1 7 1 9 1 89% % 8% 1% 1% 81% % 1% 1% Hot spot mutations 7 % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty TGT 1 17 CGC CAC CAG Gln 7 CAT 8 17 GGC AGC Gly Ser 1 7 ATG ATA Met Ile 1 18 CGC CAC 8 TAT TGT Tyr 8 Exon distribution EXON - EXON -8 7% % 8% % EXON 9-11 9% 89% Frameshift nsense Missense

GLIOBLASTOMA p mutation distribution 1 1 1 8 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

LUNG (NSCLC ) p mutation frequency p mutant frequency 1711 7 11 8 11 8 8% 9% 11% 1% % 71% 7% % 1% Hot spot mutations % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 18 CGC CTC Leu 1 CTT Leu 7 9 AGG AGT Ser 17 GTC TTC Val Phe TAT TGT Tyr 8 CTG Leu 7 GGC TGC Gly CAT 17 CGC CAC Exon distribution EXON - EXON -8 EXON 9-11 7% 1% 9% 1% % % 81% % 9% Frameshift nsense Missense

LUNG (NSCLC ) p mutation distribution 1 17 18 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG 1 mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

LUNG (SCLC ) p mutation frequency p mutant frequency 181 8 17 9 1 1 1 1 8% 1% 8% 1% % 78% 1% 1% 1% Hot spot mutations 8 % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty CTG Leu 1 17 GTC TTC Val Phe 8 98 GAG TAG Glu Stop 7 9 AGG AGT Ser 18 CGC CTC Leu 171 GAG TAG Glu Stop 17 CGC CAC 17 CGC TGC TGC TTC Phe GGC TGC Gly Exon distribution EXON - EXON -8 EXON 9-11 % % % 11% % 8% % 8% Frameshift nsense Missense 8%

LUNG (SCLC ) p mutation distribution 8 7 17 17 98 1 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG 1 mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

BLADDER CANCER p mutation frequency p mutant frequency 9 9 19 9 7 8% 9% % 1% % 79% 8% 1% 1% Hot spot mutations 9 11% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 8 GAG AAG Glu Lys 8 AGA ACA Thr CAG Gln 17 CGC CAC 7 CAT 8 AGA AAA Lys 1 TAT TGT Tyr 19 71 GAG AAG Glu Lys 18 GGC AGC Gly Ser 1 Exon distribution EXON - EXON -8 1% % 8% EXON 9-11 1% 7% 1% % % 87% Frameshift nsense Missense

BLADDER CANCER p mutation distribution 8 8 1 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

ENDOMETRIAL CANCER p mutation frequency p mutant frequency 1 1 177 1 8 9 1 111 8% 8% 11% 1% 1% 77% 8% 1% 1% Hot spot mutations 1 1% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 1 CAG Gln 1 17 CGC CAC 8 CAT TGC TTC Phe 1 CGA TGA Stop GGC AGC Gly Ser 8 17 GTC TTC Val Phe 1 TAC TGC Tyr Exon distribution EXON - EXON -8 EXON 9-11 % 7% 11% 8% % % % 81% 7% Frameshift nsense Missense

ENDOMETRIAL CANCER p mutation distribution 1 1 8 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

p mutation frequency ESOPHAGEAL ADC p mutant frequency 18 7 7 78 9 8 11 77% 1% 1% 1% % 8% 8% % 1% Hot spot mutations % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 17 CGC CAC 1 GGC AGC Gly Ser 1 CAT 1 CAG Gln 8 1 CGA TGA Stop 7 TGT 8 19 CGA TGA Stop 78 CCT TCT Pro Ser Exon distribution EXON - EXON -8 EXON 9-11 1% 1% % % 78% Frameshift nsense Missense %

ESOPHAGEAL ADC p mutation distribution 1 9 8 7 1 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

ESOPHAGEAL SCC p mutation frequency p mutant frequency 897 1 18 1189 8 88 11 8 7% 1% 1% 1% % % 1% % 1% Hot spot mutations 9 % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 17 CGC CAC 8 8 1 CAT TAT TGT Tyr CAG Gln 179 CAT 18 TGT 17 17 TGC TTC Phe 1 17 GTC TTC Val Phe 11 Exon distribution EXON - EXON -8 EXON 9-11 % % 1% 1% 9% % 1% 77% 8% Frameshift nsense Missense

ESOPHAGEAL SCC p mutation distribution 17 1 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

GASTRIC CANCER p mutation frequency p mutant frequency 89 7 8 1 1 9 8% 7% 9% 1% % 7% % 17% 1% Hot spot mutations 1% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 17 CGC CAC 9 8 CAT CAG Gln GGC AGC Gly Ser 9 TGT 19 CGA TGA Stop 1 1 CGA TGA Stop 1 TAT TGT Tyr 1 Exon distribution EXON - EXON -8 EXON 9-11 % 9% 7% % % 8% Frameshift nsense Missense

GASTRIC CANCER p mutation distribution 8 7 17 1 8 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

HCC p mutation frequency p mutant frequency 79 8 1 7 88% % % 1% % 78% 9% 1% 1% Hot spot mutations 1 % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 9 AGG AGT Ser TGT 19 17 GTC TTC Val Phe 1 1 ATC AAC Ile Asn 1 TAT TGT Tyr 9 9 AGG ATG Met 9 19 GCC CCC Ala Pro 7 17 CGC CAC 7 CAG Gln 7 1 CAT Exon distribution EXON - EXON -8 18% % % EXON 9-11 % 8% 1% % 9% % Frameshift nsense Missense

HCC p mutation distribution 9 1 1 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

HEAD AND NECK SCC p mutation frequency p mutant frequency 1 8 1918 17 7 7% 1% 1% 1% % % 8% % 1% Hot spot mutations 7 7% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty CAG Gln 8 17 CGC CAC CAT 8 1 TAT TGT Tyr 17 TGC TTC Phe 1 TGT TAT TGT Tyr 7 17 GTC TTC Val Phe Exon distribution EXON - EXON -8 EXON 9-11 1% 1% 7% 9% % % % 77% 1% Frameshift nsense Missense

HEAD AND NECK SCC p mutation distribution 7 17 1 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG 1 mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

LI-FRAUMENI SYNDROME p mutation frequency p mutant frequency 18 19 1 18 9 1 7 81% 1% 9% 1% % 71% 1% 17% 1% Hot spot mutations % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 17 CGC CAC 1 CAG Gln 1 1 GGC AGC Gly Ser 11 CAT 11 TGT 7 8 1 CGA TGA Stop 9 CGC CAC CGA TGA Stop Exon distribution EXON - EXON -8 EXON 9-11 8% 1% 8% Frameshift nsense Missense

LI-FRAUMENI SYNDROME p mutation distribution 1 1 1 1 8 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 7 mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

SKIN BCC p mutation frequency p mutant frequency 1 1 79 1 11 1 18 8% 1% % 1% 9% 8% 9% 9% 1% Hot spot mutations 1 1% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 19 CGA TGA Stop 1 1 1 TCC TTC Ser Phe 1 177 CCC CTC Pro Leu 11 8 9 179 CAT TAT Tyr 8 CAA Gln 8 17 TCC TTC Ser Phe 1 CCG TCG Pro Ser 1 CGA TGA Stop Exon distribution EXON - EXON -8 % 1% EXON 9-11 % 1% % 9% % 87% 71% Frameshift nsense Missense

SKIN BCC p mutation distribution 9 8 7 1 8 177 19 1 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 7 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

SKIN SCC p mutation frequency p mutant frequency 1 1 18 18 99 1 7 119 8% 11% % 1% 1% 8% 11% % 1% Hot spot mutations 1 11% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 8 9 8 179 CAT TAT Tyr 81 GAC GAT Asp Asp 8 GAA AAA Glu Lys 19 CGA TGA Stop 17 CAG TAG Gln Stop 1 CAG INS1B Gln Fs. Fr GGC AGC Gly Ser CAG Gln Exon distribution EXON - EXON -8 % % 7% % % EXON 9-11 % % % 89% Frameshift nsense Missense

SKIN SCC p mutation distribution 8 7 1 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

MELANOMA p mutation frequency p mutant frequency 8 9 1 7 8 89% % % 1% 1% 88% % % 1% Hot spot mutations 1 18% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 177 CCC TCC Pro Ser 9 CGC CAC 17 TCC TTC Ser Phe 187 GGT AGT Gly Ser 8 GAA AAA Glu Lys GGA GAA Gly Glu TCC CCC Ser Pro 1 7 CCG TCG Pro Ser 1 9 GGT GAT Gly Asp 1 Exon distribution EXON - EXON -8 % % % 1% % EXON 9-11 7% 88% 9% % Frameshift nsense Missense

MELANOMA p mutation distribution 1 1 7 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p mutational events 177 9 p CODON mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

PANCREAS CARCINOMA p mutation frequency p mutant frequency 9 19 7 7 1 1 11 198 79% % 1% 1% % 7% % % 1% Hot spot mutations 1 % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty CAT 17 CGC CAC 1 1 TGT 1 8 11 TAT TGT Tyr 1 CAG Gln 8 1 CGA CTA Leu 1 AAG AGG Lys 11 CCC TCC Pro Ser Exon distribution EXON - EXON -8 EXON 9-11 1% % % % % 8% Frameshift nsense Missense

PANCREAS CARCINOMA p mutation distribution 1 1 1 8 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

PROSTATE CANCER p mutation frequency p mutant frequency 17 7 18 1 1 1 11 8% % 8% 1% 7% 8% 7% 1% 1% Hot spot mutations 1 8% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty TGT 1 17 CGC CAC 7 18 GCC DEL1A Ala Fs. Fr 1 ATC AGC Ile Ser CAT 1 CAT 7 GTT TTT Val Phe 1 TAC TGC Tyr 1 CCG CCA Pro Pro Exon distribution EXON - EXON -8 EXON 9-11 % 7% % 11% % 11% 9% 87% 78% Frameshift nsense Missense

PROSTATE CANCER p mutation distribution 8 7 18 17 1 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

p mutation frequency RENAL CELL CARCINOMA p mutant frequency 8 1 1 7 1 7 8% % 1% 1% % 77% % 19% 1% Hot spot mutations 8% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty GGC TGC Gly 8 78 CCT CTT Pro Leu 8 17 TGC TTC Phe CAG Gln 9 GAG TAG Glu Stop 17 GTG GGG Val Gly 19 CCC DEL1A Pro Fs. Fr 9 AGG AGT Ser 9 AGG ATG Met Exon distribution EXON - EXON -8 EXON 9-11 1% % 81% Frameshift nsense Missense

RENAL CELL CARCINOMA p mutation distribution 9 8 7 1 78 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

OSTEOSARCOMA p mutation frequency p mutant frequency 18 191 7 1 8 11 7% 1% 17% 1% 1% % 1% % 1% Hot spot mutations % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty CAG Gln 1 CAT 8 17 CGC CAC 7 7 ATG ATT Met Ile 8 TGC TAC Tyr GGC AGC Gly Ser TGT 81 GAC AAC Asp Asn 81 GAC CAC Asp Exon distribution EXON - EXON -8 EXON 9-11 19% 1% 8% % 19% 78% Frameshift nsense Missense

OSTEOSARCOMA p mutation distribution 8 7 17 1 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

OVARIAN CANCER p mutation frequency p mutant frequency 17 11 19 18 17 1 9 7% 8% 1% 1% % % 8% 7% 1% Hot spot mutations 1 % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 17 CGC CAC 71 CAT CAG Gln 9 TAT TGT Tyr 7 TGT 1 8 8 GGC AGC Gly Ser 19 ATC ACC Ile Thr 7 ATG ATA Met Ile Exon distribution EXON - EXON -8 EXON 9-11 % % 1% 7% % % 1% 79% 1% Frameshift nsense Missense

OVARIAN CANCER p mutation distribution 9 8 7 1 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

p mutation frequency ACUTE MYELOID LEUKEMIA p mutant frequency 9 11 11 11 78 8% % 1% 1% % 79% % 1% 1% Hot spot mutations % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty 7 CAT TGT 17 CGC CAC TAT TGT Tyr 8 TGT TAT Tyr CAG Gln 1 TGC AGC Ser 17 GTT TTT Val Phe 17 TGC TAC Tyr Exon distribution EXON - EXON -8 EXON 9-11 8% % 8% Frameshift nsense Missense

ACUTE MYELOID LEUKEMIA p mutation distribution 1 1 8 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

p mutation frequency BURKITT LYMPHOMA p mutant frequency 87 9 1 9 8 8% 9% % 1% % 8% 7% 9% 1% Hot spot mutations % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty CAG Gln 1 18 CGC CAC 17 CGC CAC 1 CGA TGA Stop 8 TGT TAT Tyr ATC GAC Ile Asp 1 TAC CAC Tyr 1 CGA CAA Gln 7 ATG ATA Met Ile Exon distribution EXON - EXON -8 EXON 9-11 % 9% 8% Frameshift nsense Missense

BURKITT LYMPHOMA p mutation distribution 1 1 1 1 8 18 1 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

p mutation frequency B-CELL LYMPHOMA p mutant frequency 187 7 1 8 1 1 17 9% % 7% 1% % 88% % 9% 1% Hot spot mutations % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty CAG Gln 1 CAT 9 7 TGT 17 CGC CAC 19 CGA TGA Stop 8 18 CGC CAC 17 TGC GGC Gly TAT TGT Tyr Exon distribution EXON - EXON -8 EXON 9-11 7% % 9% Frameshift nsense Missense

B-CELL LYMPHOMA p mutation distribution 1 1 8 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

p mutation frequency MYELODYSPLASTIC SYNDROME p mutant frequency 9 1 11 1 71 8% % 9% 1% % 77% 8% 1% 1% Hot spot mutations % Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty TAT TGT Tyr 7 CAT 7 8 TGT TAT Tyr CAG Gln TAC TGC Tyr 7 ATG ATA Met Ile GGC AGC Gly Ser 11 TGC TAC Tyr 17 GTG CTG Val Leu 17 CGC CAC Exon distribution EXON - EXON -8 EXON 9-11 9% % 8% Frameshift nsense Missense

MYELODYSPLASTIC SYNDROME p mutation distribution 8 7 8 1 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events 1 mutation at CpG mutation outside CpG GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

p mutation frequency NON-HODGKIN'S LYMPHOMA p mutant frequency 19 11 78 8 9% % % 1% % 9% % % 1% Hot spot mutations 7% Codon WT Codon Mutant Codon WT AA Mutant AA Type CpG File Qty CAG Gln 8 17 CGC CAC 7 TAC AAC Tyr Asn CAT TGT 1 AAG CAG Lys Gln 1 ATG ACG Met Thr 18 GCC GTC Ala Val 11 TGC TAC Tyr 1 CAG Gln Exon distribution EXON - EXON -8 EXON 9-11 % % 9% Frameshift nsense Missense

NON-HODGKIN'S LYMPHOMA p mutation distribution 9 8 7 1 17 1 1 1 91 11 11 181 11 1 71 1 1 1 91 p CODON p mutational events mutation at CpG mutation outside CpG 1 GC->AT AT->GC GC->CG GC->TA AT->CG AT->TA Fr. MUTATIONAL EVENTS

DISTRIBUTION OF p MUTATION Amino acids residues are shown using both or 1 letter abbreviation Yellow: codon number White: wt codon Light orange: letter aa light blue: 1 letter aa The last lane shows the number of mutations found at this position in the UMD p database The frequency of p mutations is colored coded: Red: between 1 and 1 mutations Green: between 11 and 1 mutations Blue: > 1 mutations 1 7 8 9 1 11 1 1 1 1 1 17 18 19 ATG GAG GAG CCG CAG TCA GAT CCT AGC GTC GAG CCC CCT CTG AGT CAG GAA ACA TTT TCA Met Glu Glu Pro Gln Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gln Glu Thr Phe Ser M E E P Q S D P S V E P P L S Q E T F S 1 1 1 1 1 1 1 7 8 9 1 7 8 9 GAC CTA AAA CTA CTT CCT GAA AAC AAC GTT CTG TCC CCC TTG CCG TCC CAA GCA ATG Asp Leu Lys Leu Leu Pro Glu Asn Asn Val Leu Ser Pro Leu Pro Ser Gln Ala Met D L W K L L P E N N V L S P L P S Q A M 1 1 1 1 1 1 1 1 9 1 1 7 8 9 1 7 8 9 GAT GAT TTG ATG CTG TCC CCG GAC GAT ATT GAA CAA TTC ACT GAA GAC CCA GGT CCA Asp Asp Leu Met Leu Ser Pro Asp Asp Iso Glu Gln Phe Thr Glu Asp Pro Gly Pro D D L M L S P D D I E Q W F T E D P G P 1 1 7 8 1 8 1 1 7 8 9 7 71 7 7 7 7 7 77 78 79 8 GAT GAA GCT CCC AGA ATG CCA GAG GCT GCT CCC CCC GTG GCC CCT GCA CCA GCA GCT CCT Asp Glu Ala Pro Met Pro Glu Ala Ala Pro Pro Val Ala Pro Ala Pro Ala Ala Pro D E A P R M P E A A P P V A P A P A A P 11 1 11 9 9 CGC 8 17 1 R 81 8 8 8 8 8 87 88 89 9 91 9 9 9 9 9 97 98 99 1 ACA CCG GCG GCC CCT GCA CCA GCC CCC TCC CCC CTG TCA TCT TCT GTC CCT TCC CAG Thr Pro Ala Ala Pro Ala Pro Ala Pro Ser Pro Leu Ser Ser Ser Val Pro Ser Gln T P A A P A P A P S W P L S S S V P S Q 1 11 9 7 1 19 7 7 8 17 11 1 1 1 1 1 17 18 19 11 111 11 11 11 11 11 117 118 119 1 AAA ACC TAC CAG GGC AGC TAC GGT TTC CTG GGC TTC TTG CAT TCT GGG ACA GCC AAG Lys Thr Tyr Gln Gly Ser Tyr Gly Phe Leu Gly Phe Leu Ser Gly Thr Ala Lys K T Y Q G S Y G F R L G F L H S G T A K 8 1 8 11 1 1 9 8 7 1 1 7 11 8 11 11 1 1 1 1 1 17 18 19 1 11 1 1 1 1 1 17 18 19 1 TCT GTG ACT TGC ACG TAC TCC CCT GCC CTC AAC AAG ATG TTT TGC CAA CTG GCC AAG ACC Ser Val Thr Thr Tyr Ser Pro Ala Leu Asn Lys Met Phe Gln Leu Ala Lys Thr S V T C T Y S P A L N K M F C Q L A K T 11 17 11 9 8 8 11 1

11 1 1 1 1 1 17 18 19 1 11 1 1 1 1 1 17 18 19 1 TGC CCT GTG CAG CTG GTT GAT TCC ACA CCC CCG CCC GGC ACC CGC GTC CGC GCC ATG Pro Val Gln Leu Val Asp Ser Thr Pro Pro Pro Gly Thr Val Ala Met C P V Q L W V D S T P P P G T R V R A M 1 1 7 9 1 8 7 1 17 17 11 1 9 117 7 11 1 1 1 1 1 17 18 19 17 171 17 17 17 17 17 177 178 179 18 GCC ATC TAC AAG CAG TCA CAG CAC ATG ACG GAG GTT GTG AGG CGC TGC CCC CAC CAT GAG Ala Iso Tyr Lys Gln Ser Gln Met Thr Glu Val Val Pro Glu A I Y K Q S Q H M T E V V R R C P H H E 11 1 19 8 7 7 77 19 17 79 1 8 181 18 18 18 18 18 187 188 189 19 191 19 19 19 19 19 197 198 199 CGC TGC TCA GAT AGC GAT GGT CTG GCC CCT CCT CAG CAT CTT ATC CGA GTG GAA GGA AAT Ser Asp Ser Asp Gly Leu Ala Pro Pro Gln Leu Iso Val Glu Gly Asn R C S D S D G L A P P Q H L I R V E G N 8 9 7 18 1 7 7 1 189 11 19 8 8 1 1 7 8 9 1 11 1 1 1 1 1 17 18 19 TTG GTG GAG TAT TTG GAT GAC AGA AAC ACT TTT CGA CAT AGT GTG GTG GTG CCC TAT Leu Val Glu Tyr Leu Asp Asp Asn Thr Phe Ser Val Val Val Pro Tyr L R V E Y L D D R N T F H S V V V P Y 1 18 18 77 7 9 R 88 8 18 1 7 1 7 8 9 1 7 8 9 GAG CCG CCT GAG GTT GGC TCT GAC TGT ACC ACC ATC CAC TAC AAC TAC ATG TGT AAC AGT Glu Pro Pro Glu Val Gly Ser Asp Thr Thr Iso Tyr Asn Tyr Met Asn Ser E P P E V G S D C T T I H Y N Y M C N S 9 7 18 1 1 7 8 9 17 1 1 197 1 7 1 7 9 1 7 8 9 TCC TGC ATG GGC GGC ATG AAC AGG CCC ATC CTC ACC ATC ATC ACA CTG GAA GAC TCC Ser Met Gly Gly Met Asn Pro Iso Leu Thr Iso Iso Thr Leu Glu Asp Ser S C M G G M N R R P I L T I I T L E D S 8 198 7 718 1 8 1 7 1 9 1 7 7 8 7 17 7 1 7 8 9 7 71 7 7 7 7 77 78 79 8 AGT GGT AAT CTA CTG GGA AAC AGC TTT GAG GTG GTT TGT GCC TGT CCT GGG AGA Ser Gly Asn Leu Leu Gly Asn Ser Phe Glu Val Val Ala Pro Gly S G N L L G R N S F E V R V C A C P G R 8 19 19 18 8 97 8 1 1 98 18 7 9 8 7 1 81 8 8 8 8 8 87 88 89 9 91 9 9 9 9 9 97 98 99 GAC CGC ACA GAG GAA GAG AAT CTC CGC AAG AAA GGG GAG CCT CAC CAC GAG CTG CCC Asp Thr Glu Glu Glu Asn Leu Lys Lys Gly Glu Pro Glu Leu Pro D R R T E E E N L R K K G E P H H E L P 178 1 1 1 7 8 7 1 9 7 7 1 7 8 9 1 11 1 1 1 1 1 17 18 19 CCA GGG AGC ACT AAG CGA GCA CTG CCC AAC AAC ACC AGC TCC TCT CCC CAG CCA AAG AAG Pro Gly Ser Thr Lys Ala Leu Pro Asn Asn Thr Ser Ser Ser Pro Gln Pro Lys Lys P G S T K R A L P N N T S S S P Q P K K 18 1 7 1 1 1 9 1 1 1 1 1 1 7 11 1

1 7 8 9 1 7 8 9 AAA CCA CTG GAT GGA GAA TAT TTC ACC CTT CAG ATC GGG GAG CGC TTC GAG ATG Lys Pro Leu Asp Gly Glu Tyr Phe Thr Leu Gln Iso Gly Glu Phe Glu Met K P L D G E Y F T L Q I R G R E R F E M 8 1 1 1 1 8 7 1 1 7 8 9 1 7 8 9 TTC CGA GAG CTG AAT GAG GCC TTG GAA CTC AAG GAT GCC CAG GCT GGG AAG GAG CCA GGG Phe Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp Ala Gln Ala Gly Lys Glu Pro Gly F R E L N E A L E L K D A Q A G K E P G 7 7 1 1 1 7 8 9 7 71 7 7 7 7 7 77 78 79 8 GGG AGC AGG GCT CAC TCC AGC CAC CTG AAG TCC AAA AAG GGT CAG TCT ACC TCC CGC CAT Gly Ser Ala Ser Ser Leu Lys Ser Lys Lys Gly Gln Ser Thr Ser G S R A H S S H L K S K K G Q S T S R H 1 1 1 1 1 1 1 81 8 8 8 8 8 87 88 89 9 91 9 9 AAA AAA CTC ATG TTC AAG ACA GAA GGG CCT GAC TCA GAC Lys Lys Leu Met Phe Lys Thr Glu Gly Pro Asp Ser Asp K K L M F K T E G P D S D 1 1 1 1

CpG MUTATION AND LOSS OF p ACTIVITY POS: Codon position (1 to 9) WT: rmal base sequence of the codon in which the mutation occurred Mut: Sequence of the mutated codon WT AA: Wild type amino acid Mut: Mutant amino acid. Event: Mutational event: Number: Number of reccord in the database WAF1: ND: mutant activity available NR: t available Code: Polymorphism Mutant Mutation name Pos. WT Mut. WT AA Mut. AA Event Number WAF1 Code p.proleu CCG CTG Pro Leu C->T 9. p.propro CCG CCA Pro Pro G->A NR 1 p.ser9ser 9 AGC AGT Ser Ser C->T NR p.val1ile 1 GTC ATC Val Ile G->A 1 7.8 p.val1val 1 GTC GTT Val Val C->T NR p.glu11lys 11 GAG AAG Glu Lys G->A 1 7.8 p.asnasn AAC AAT Asn Asn C->T NR p.val1ile 1 GTT ATT Val Ile G->A.79 7 p.proleu CCG CTG Pro Leu C->T.8 8 p.propro CCG CCA Pro Pro G->A NR 9 p.pro7leu 7 CCG CTG Pro Leu C->T 1. 1 p.pro7pro 7 CCG CCA Pro Pro G->A 1 NR 11 p.asp8asp 8 GAC GAT Asp Asp C->T 1 NR 1 p.asp9asn 9 GAT AAT Asp Asn G->A 1 7 1 p.7 7 CGC TGC C->T ND 1 p.7 7 CGC CAC G->A ND 1 p.pro7pro 7 CCC CCT Pro Pro C->T NR 1 p.val7met 7 GTG ATG Val Met G->A 7.99 17 p.pro8leu 8 CCG CTG Pro Leu C->T 8. 18 p.pro8pro 8 CCG CCA Pro Pro G->A 1 NR 19 p.ala8val 8 GCG GTG Ala Val C->T 1. p.ala8ala 8 GCG GCA Ala Ala G->A NR 1 p.tyr17tyr 17 TAC TAT Tyr Tyr C->T NR p.gly18ser 18 GGT AGT Gly Ser G->A 1 1.7 p.11 11 TGT C->T 1 1.91 p.11 11 CAT G->A.7 p.thr1met 1 ACG ATG Thr Met C->T 11 1. p.thr1thr 1 ACG ACA Thr Thr G->A NR 7 p.pro1leu 1 CCG CTG Pro Leu C->T 8 9. 8 p.pro1pro 1 CCG CCA Pro Pro G->A 1 NR 9 p.pro1pro 1 CCC CCT Pro Pro C->T 1 NR p.gly1ser 1 GGC AGC Gly Ser G->A 1 11.7 1 p.1 1 CGC TGC C->T 8.9 p.1 1 CGC CAC G->A 1 18. p.1 1 CGC C->T 1 NR p.val17ile 17 GTC ATC Val Ile G->A 1.7 p.18 18 CGC TGC C->T 1.

p.18 18 CGC CAC G->A 1 8.78 7 p.18 18 CGC C->T NR 8 p.ala19thr 19 GCC ACC Ala Thr G->A 7 8. 9 p.thr17met 17 ACG ATG Thr Met C->T 9.8 p.thr17thr 17 ACG ACA Thr Thr G->A 8 NR 1 p.17 17 CGC TGC C->T 1. p.17 17 CGC CAC G->A 11 1.1 p.181 181 CGC TGC C->T.1 p.181 181 CGC CAC G->A.7 p.ser18ser 18 AGC AGT Ser Ser C->T 1 NR p.asp18asn 18 GAT AAT Asp Asn G->A 1.8 7 p.19stop 19 CGA TGA Stop C->T 9 8 p.19gln 19 CGA CAA Gln G->A 7. 9 p. TGT C->T 9. p. CAT G->A 8 9.8 1 p.1stop 1 CGA TGA Stop C->T 81 p.1gln 1 CGA CAA Gln G->A 9.19 p.proleu CCG CTG Pro Leu C->T.9 p.propro CCG CCA Pro Pro G->A NR p.glygly GGC GGT Gly Gly C->T 11 NR p.glyser GGC AGC Gly Ser G->A 7 p. C->T 7 8 p.gln CAG Gln G->A 88 9 p.7 7 C->T 1.8 p.7gln 7 CAG Gln G->A 1 9.7 1 p. TGT C->T 8.91 p. CAT G->A 79 1.1 p.8 8 C->T 7. p.8gln 8 CAG Gln G->A 7.18 p.8 8 CGC TGC C->T.7 p.8 8 CGC CAC G->A 17. 7 p.9 9 CGC TGC C->T 11. 8 p.9 9 CGC CAC G->A 7. 9 p.97 97 CAC CAT C->T 1 NR 7 p.glu98lys 98 GAG AAG Glu Lys G->A 8.1 71 p.stop CGA TGA Stop C->T 1 7 p.gln CGA CAA Gln G->A 1 18. 7 p. TGT C->T ND 7 p. CAT G->A 1.7 7 p. TGT C->T ND 7 p. CAT G->A 1 8.88 77 p.7 7 CGC TGC C->T 1 11.8 78 p.7 7 CGC CAC G->A 8.17 79 p.phe8phe 8 TTC TTT Phe Phe C->T NR 8 p.glu9lys 9 GAG AAG Glu Lys G->A ND 81 p.stop CGA TGA Stop C->T 8 p.gln CGA CAA Gln G->A.9 p.79 79 CGC TGC G->A 1 8 p.79 79 CGC CAC C->T 1

CpG type I: CGN NUMBER OF MUTATIONS 1 1 8 9 8 7 1 11 1 181 7 8 9 7 79 11 1 18 17 181 19 1 7 8 8 9 7 79 CpG POSITION Mutation targeting the C Mutation targeting the G

NUMBER OF MUTATIONS 9 8 7 1 CpG type II: NCG Mutation targeting the C Mutation targeting the G 7 8 8 1 1 17 CpG POSITION

CpG type III: NNC GNN 1 1 1 9/1_ 1/11_ /1 8/9 7/7 17/18 1/1 1/17 18/19 18/18 / 97/98 8/9 NUMBER OF MUTATIONS 9/1_ 1/11_ /1 8/9 7/7 17/18 1/1 1/17 18/19 18/18 / 97/98 8/9 CpG POSITION Mutation targeting the C Mutation targeting the G

p mutant position 17 8 1 19 18 1 7 1 181 9 7 181 17 8 18 8 7 1 8 1 7 17 1 9 1 11 17 98 8 19 19 18 7 11 7 1 79 11 18 8 9 1 p Activity (%) Number of Mutant 1 1 p Activity (%) Number of Mutant