BRCA1 / 2 testing by massive sequencing highlights, shadows or pitfalls? Giovanni Luca Scaglione, PhD ------------------------ Laboratory of Clinical Molecular Diagnostics and Personalized Medicine, Institute of Biochemistry and Clinical Biochemistry, Catholic University of Rome, Italy
Twenty years since the discovery of BRCA1 1994 BRCA1 gene positionally cloned and sequenced 1,2 1995 BRCA2 gene identified and sequenced 3,4 2002 BRCA mutation life time cancer risks: breast (60 85%), ovarian (15 40%) 5 7 2005 Synthetic lethality of PARP inhibition in BRCA-deficient cell lines 8,9 2009 2013 Clinical antitumour activity of PARP inhibitor shown in BRCA-mutation associated cancer 10 Initiation of Phase III trials of PARP inhibitors in BRCA-mutant ovarian and breast cancers 2014: BRCA as PGx gene? 1. Friedman et al. Nat Genet 1994;8:399 404; 2. Miki et al. Science 1994;266:66 71; 3. Wooster et al. Nature 1995;378:789 792; 4. Tavtigian et al. Nat Genet 1996;12:333 337; 5. Brose et al. J Natl Cancer Inst 2002;94:1365 1372; 6. Thompson et al. J Natl Cancer Inst 2002;94:1358 1365; 7. Wooster, Weber. N Engl J Med 2003;348:2339 2347; 8. Farmer et al. Nature 2005;434:917 921; 9. Brody. N Engl J Med 2005;353:949 950; 10. Fong et al. N Engl J Med 2009;361:123 134
Highlights Summary Working on 454 GS junior (Roche Diagnostics) 1. Up to 12 patients & multigenic profile within a run 2. Complete BRCA1/2 screening (100% coding regions) 3. Fast variant call report by means of bioinformatics tool: Amplicon Suite Shadows 1. Pre-NGS Quality Control >>> Fragment Analysis 2. NGS data robustness: Inter-run & Intra-run >>> Read coverage profile 3. Post-NGS Validation Experiments >>> Sanger Chemistry Pitfalls 1. PCR Bias (Primer Binding Energies, GC-content) 2. empcr errors 3. CNV Copy Number Variations
Summary Patients Sample 1. w/ breast cancer 2. and/or ovarian cancer 3. w/ a positive family history Collection whole blood sample (K 3 EDTA) 8-12 patients per run (3/4 day) DNA extraction
Complete BRCA1/2 screening (100% coding regions) Genomic DNA Fragment Library Barcoded Library Multiplex PCR, Amplification of Specific Targets 5 PCR multiplex 93 Amplicons Universal PCR, Addition of MIDs and 454 adaptor sequences
Complete BRCA1/2 screening (100% coding regions) Plex A B C D E empcr water-in-oil emulsion
Summary Gene Scan Profile Pre-emulsion QC step check-point Sanger Pre-NGS Quality Control (Fragment Analysis) Mut + or Re-Extraction Mut - NGS
Complete BRCA1/2 screening (100% coding regions) Pyrosequencing GS 454 Junior 454 GS Junior Clinical Lab Report Data Analysis
Summary Quantitative Analysis: Read Coverage > 38x NGS data robustness: Inter-run & Intra-run read cover profile
Case Report Read Coverage Warning!
Summary Fast variant call report by means of bioinformatics tool: Amplicon Suite
Summary QC Mut - NGS Mut + Sanger Post-NGS Validation Experiments (Sanger Chemistry)
Summary >Library Preparation< 1. Contamination (not completely excluded) 2. PCR Bias (Primer Binding Energies, GC-content) 3. Chimeras (up to 45%, high sequence similarity) 4. PCR errors (454 protocol TAQ error rate 1:9000 bases) >empcr< 1. empcr errors 2. Artificial Duplicates >Sequencing< Pitfalls Known Error Patterns 1. Well Cross-Talk Type I ( Ghost well ) 2. Well Cross-Talk Type II ( Optical Bleeding ) 3. Homopolymer Errors
Pitfalls Known Error Patterns >Library Preparation< 1. Contamination (not completely excluded) 2. PCR Bias (Primer Binding Energies, GC-content) Plex 3. Chimeras A B (up C D to 45%, E high sequence similarity) 4. PCR errors (454 protocol TAQ error rate 1:9000 bases) n=117 >empcr< 1. empcr errors 2. Artificial Duplicates >Sequencing< n of reads per plex 1. Well Cross-Talk Type [95% Conf. I ( Ghost Interval] well ) Variabl e (Mean ± SE) 2. Plex A Well Cross-Talk 120 ± 4 Type 107 II - ( Optical 124 Bleeding ) Plex B 180 ± 7 163-192 3. Homopolymer Errors Plex C 130 ±5 120-140 Plex D 110 ± 4 105-120 Plex E 150 ± 5 135-155
Case Report Read Coverage Warning! BRCA1 exon 20 1305x MLPA Negative
Case Report Read Coverage Warning!
>Library Preparation< Pitfalls Known Error Patterns 1. Contamination (not completely excluded) 2. PCR Bias (Primer Binding Energies, GC-content) 3. Chimeras (up to 45%, high sequence similarity) 4. PCR errors (454 protocol TAQ error rate 1:9000 bases) >empcr< 1. empcr errors 2. Artificial Duplicates >Sequencing< 1. Well Cross-Talk Type I ( Ghost well ) 2. Well Cross-Talk Type II ( Optical Bleeding ) 3. Homopolymer Errors
Plex B - Amplicon Read Coverage PLEX B Variable n of reads per plex (Mean ± SE) [95% Conf. Interval ] Plex A 120 ± 4 107-124 Plex B 180 ± 7 163-192 Plex C 130 ±5 120-140 Plex D 110 ± 4 105-120 Plex E 150 ± 5 135-155
Conclusions Amplicon Suite tool could be tuned for the Copy Number Variation (CNV) analysis in BRCA1/2 testing due to his highly flexible and fast capability of finding out data from GS Junior 454 runs The analysis performed not only in the inter-run series but also in intra-run series is the driving force in the setup of CNV analysis NGS data analysis could not be separated from amplicon library data analysis
Thank You