SUPPLEMENTARY METHODS



Similar documents
MATCH IT! DNA Software Quick Reference Guide

Human Leukocyte Antigens - HLA

HISTOCOMPATIBILITY. and IMMUNOGENETICS. Prospectus

Single-Cell Whole Genome Sequencing on the C1 System: a Performance Evaluation

Illumina TruSight HLA Sequencing Panel Automated on the Biomek FX P HLA SP Liquid Handler

THE INFLUENCE OF TISSUE (IN)COMPATIBILITY IN UMBILICAL CORD BLOOD TRANSPLANTATION

Histocompatibility Matching for Hematopoietic Stem Cell Transplant: Can we Close the Gaps? Michael Aubrey, CHS

14/12/2012. HLA typing - problem #1. Applications for NGS. HLA typing - problem #1 HLA typing - problem #2

Focusing on results not data comprehensive data analysis for targeted next generation sequencing

AS Replaces Page 1 of 50 ATF. Software for. DNA Sequencing. Operators Manual. Assign-ATF is intended for Research Use Only (RUO):

HLA data analysis in anthropology: basic theory and practice

Tutorial for Windows and Macintosh. Preparing Your Data for NGS Alignment

Step-by-Step Guide to Bi-Parental Linkage Mapping WHITE PAPER

SOP 3 v2: web-based selection of oligonucleotide primer trios for genotyping of human and mouse polymorphisms

GENETIC STUDIES OF AUTOIMMUNE DISEASES. Benedicte Alexandre Lie Institute of Immunology Rikshospitalet University Hospital

HISTO SPOT SSO System. The most convenient automated HLA typing system. BAG Health Care the experts for HLA and blood group diagnostics

Gyper: A graph-based HLA genotyper using aligned DNA sequences

Genetics of Rheumatoid Arthritis Markey Lecture Series

Assuring the Quality of Next-Generation Sequencing in Clinical Laboratory Practice. Supplementary Guidelines

Services. Updated 05/31/2016

Simplifying Data Interpretation with Nexus Copy Number

Next Generation Sequencing: Technology, Mapping, and Analysis

GAW 15 Problem 3: Simulated Rheumatoid Arthritis Data Full Model and Simulation Parameters

ANTHONY NOLAN SEARCH ALGORITHM FOR A BASIC CORD BLOOD UNIT SELECTION BY SERGIO QUEROL AND IRINA EVSEEVA FEBRUARY 2012

A guide to the analysis of KASP genotyping data using cluster plots

RETRIEVING SEQUENCE INFORMATION. Nucleotide sequence databases. Database search. Sequence alignment and comparison

SeqScape Software Version 2.5 Comprehensive Analysis Solution for Resequencing Applications

Single-Cell DNA Sequencing with the C 1. Single-Cell Auto Prep System. Reveal hidden populations and genetic diversity within complex samples

A Primer of Genome Science THIRD

T cell Epitope Prediction

Using Illumina BaseSpace Apps to Analyze RNA Sequencing Data

Name: Class: Date: ID: A

Drawing a histogram using Excel

2. True or False? The sequence of nucleotides in the human genome is 90.9% identical from one person to the next. False (it s 99.

HISTO SPOT SSO System. The most convenient automated HLA typing system. BAG Health Care the experts for HLA and blood group diagnostics

Single Nucleotide Polymorphisms (SNPs)

SICKLE CELL ANEMIA & THE HEMOGLOBIN GENE TEACHER S GUIDE

Version 5.0 Release Notes

Towards Integrating the Detection of Genetic Variants into an In-Memory Database

Lectures 1 and February 7, Genomics 2012: Repetitorium. Peter N Robinson. VL1: Next- Generation Sequencing. VL8 9: Variant Calling

When you install Mascot, it includes a copy of the Swiss-Prot protein database. However, it is almost certain that you and your colleagues will want

The donor search: the best donor or cord blood unit

Forensic DNA Testing Terminology

HISTO TYPE SSP Immunogenetic Diagnostics. Robust and fast - validated Taq Polymerase included

The Power of Next-Generation Sequencing in Your Hands On the Path towards Diagnostics

Basic Analysis of Microarray Data

How To Find Rare Variants In The Human Genome

History of DNA Sequencing & Current Applications

UKB_WCSGAX: UK Biobank 500K Samples Genotyping Data Generation by the Affymetrix Research Services Laboratory. April, 2015

Genotyping and quality control of UK Biobank, a large- scale, extensively phenotyped prospective resource

New HLA class I epitopes defined by murine monoclonal antibodies

SeattleSNPs Interactive Tutorial: Web Tools for Site Selection, Linkage Disequilibrium and Haplotype Analysis

NEXT GENERATION SEQUENCING

Custom TaqMan Assays For New SNP Genotyping and Gene Expression Assays. Design and Ordering Guide

SNP Essentials The same SNP story

Searching Nucleotide Databases

Logistic Regression (1/24/13)

Transcription and Translation of DNA

Instructions for Use. CyAn ADP. High-speed Analyzer. Summit G June Beckman Coulter, Inc N. Harbor Blvd. Fullerton, CA 92835

Immunogenetics of Type 1 Diabetes

CHAPTER 6 ANTIBODY GENETICS: ISOTYPES, ALLOTYPES, IDIOTYPES

AN INTEGRATED APPROACH TO TEACHING SPREADSHEET SKILLS. Mindell Reiss Nitkin, Simmons College. Abstract

GeneChip Sequence Analysis Software (GSEQ) is used to analyze data from the Resequencing Arrays

An example of bioinformatics application on plant breeding projects in Rijk Zwaan

Real-time qpcr Assay Design Software

Advances in RainDance Sequence Enrichment Technology and Applications in Cancer Research. March 17, 2011 Rendez-Vous Séquençage

ALLEN Mouse Brain Atlas

Data Analysis for Ion Torrent Sequencing

Typing in the NGS era: The way forward!

Analysis of gene expression data. Ulf Leser and Philippe Thomas

Delivering the power of the world s most successful genomics platform

(1-p) 2. p(1-p) From the table, frequency of DpyUnc = ¼ (p^2) = #DpyUnc = p^2 = ¼(1-p)^2 + ½(1-p)p + ¼(p^2) #Dpy + #DpyUnc

Descriptive Statistics

Genetics Lecture Notes Lectures 1 2

Molecular typing of VTEC: from PFGE to NGS-based phylogeny

School of Nursing. Presented by Yvette Conley, PhD

Next Generation Sequencing: Adjusting to Big Data. Daniel Nicorici, Dr.Tech. Statistikot Suomen Lääketeollisuudessa

The correct answer is c A. Answer a is incorrect. The white-eye gene must be recessive since heterozygous females have red eyes.

Supplementary Online Content

Inflammatory bowel disease: Ulcerative Colitis and Crohn s disease. Crohn's disease (CD) and ulcerative colitis (UC) are related inflammatory

TIPS FOR DOING STATISTICS IN EXCEL

Overview One of the promises of studies of human genetic variation is to learn about human history and also to learn about natural selection.

Milk protein genetic variation in Butana cattle

Improving the Performance of Data Mining Models with Data Preparation Using SAS Enterprise Miner Ricardo Galante, SAS Institute Brasil, São Paulo, SP

Terms: The following terms are presented in this lesson (shown in bold italics and on PowerPoint Slides 2 and 3):

Years after US Student to Teacher Ratio

APPLYING BENFORD'S LAW This PDF contains step-by-step instructions on how to apply Benford's law using Microsoft Excel, which is commonly used by

Deterministic computer simulations were performed to evaluate the effect of maternallytransmitted

Microsoft Excel. Qi Wei

SAP HANA Enabling Genome Analysis

SEQUENCING. From Sample to Sequence-Ready

Transcription:

SUPPLEMENTARY METHODS Description of parameter selection for the automated calling algorithm The first analyses of the HLA data were performed with the haploid cell lines described by Horton et al. (1). For each sample and locus we mapped all reads to the collection of available reference sequences and saw that the allele with the highest number of perfect alignable reads was the reference allele. For the evaluation of heterozygous samples we had to generate a valid sample set. Here we chose 20 samples for which we had Sanger-determined HLA allele data available (see Supplementary Methods Table 1 below). For these samples we performed the targeted enrichment with a bait design that also covered the Affymetrix SNP array 6.0 SNVs of the HLA region. Employing our pibase software (2) for SNP calling and HLA imputation (3), we calculated two field HLA types for that sample set (Supplementary Methods Table 1). These twenty samples and their Sanger and imputation based HLA types were the reference data with which we set up the analysis method. Next, we ran the afore-mentioned perfect mapping count approach on the heterozygous samples. When we sorted all alleles by the number of mapped reads (descending), we found allele A at the top of the list followed by many alleles that showed high similarity to that allele A. Allele B was very rarely ranked second place. Therefore, additional parameters had to be established to identify the correct allele combination for the sample s genotype. We ended up with 5 parameters: First parameter As described in the Methods section of the main manuscript we removed all redundancies regarding the mapped reads. Based on these results we calculated an ideal coverage, which reflects the maximal possible coverage. The area under the curve (AUC) was calculated as the fraction of ideal coverage that was achieved. The AUC parameter was then used instead of the number of perfect alignable reads. In Supplementary Methods Figure 1a we visualized the distribution of the AUC for all possible genotypes built from all fully covered alleles. The distribution for the reference genotypes is shown in blue. One can see that the AUC of the reference genotype is usually at the upper end of the distribution. Second parameter For heterozygous samples we assumed to cover the SNV positions of both DNA copies with about the same number of reads. Taking this into account we calculated the read equality (REQ) and counted the read mappings that mapped to only one of both alleles of a genotype. Then we divided the minimum of both by the maximum of both and received a value between 0 and 1 (1 stands for an exact even distribution). Supplementary Methods Figure 1b shows that the read equality of the reference genotypes is again close to the upper-side of the distribution. The zero values come from the homozygous genotypes. Third parameter A further review of the results showed that not only the read equality is a criteria that is maximized for heterozygous samples, but also the number of allele specific mappings per base (ASM). The ditribution of ASM is visualized in Supplementary Methods Figure 2a. The values of the reference genotypes are again located at the upper side of the destribution, but less than it is for the first two parameters (Supplementary Methods Figure 1). This observation already indicates a probable minor weighting of this parameter (compared to auc and req) when we try to find the genotype that has all parameters as high as possible later. Fourth parameter For the 4 th parameter we calculated the number of mapped pairs per read (MPPR) for each allele of every genotype. In case the input data comes from a paired end run, we assumed that the correct allele mapping shows as many paired end mappings as possible. The distribution of these values is shown in Supplementary Methods Figure 2b. The distribution again indicates a minor weighting of that parameter.

Fifth parameter It also turned out that the IMGT/HLA database often only has sequence information for a reduced number of exons. These are, as expected, exons 2 & 3 for class I and exon 2 for class II genes. These short cdna references have the tendency to show high values for allele specific mappings per base. On the other hand these short cdnas cannot explain where many of the locus specific reads stem from. To correct for such instances, we introduced the length of the mappable sequence (MSL) as parameter five. For this parameter we assumed that the longer allele version is present if we detect the sequence for this allele. The distribution of these values is shown in Supplementary Methods Figure 2c and a minor weighting is again indicated. After all parameters were calculated for all genotypes, they were scaled between zero and one for each sample and locus separate. If a value was by definition already scaled between zero and one this step was skipped (AUC, REQ). In the next step we calculated the harmonic mean for the rescaled parameters. Like the distribution of the values already indicated, some parameters might be weighted less. For ASM, MPPR and MSL we tested the weightings 1, 0.5 and 0.1. The best results for our validation set (Supplementary Methods Table 1) could be achieved with ASM weighted 0.5, MPPR weighted 0.1 and MSL weighted 0.1. The exact formula can be found in the Methods section of the main manuscript. Supplementary Methods Figure 1: Distribution of the parameter values that were used for the automated calling algorithm. a) Histograms of the area-under-the-curve and b) read-equality. For every sample and all loci we took all the alleles that were covered by 100%, following our mapping criteria described in the publication. For each locus we build all possible genotypes of the available alleles and calculated the parameters AUC (area under the curve) and read equality (see main paper). The values were sample- and locus-specific scaled between zero and one ([0;1]). The histograms for these values are shown here. The black framed bars were generated from the whole data set, the blue bars are derived from the values calculated for the reference alleles. We see for both parameters that these values are at the upper end of the distribution. The zero values for read equality stem from the homozygous genotypes. The panel between the x-axes and axis labels show vertical lines for each value that was used to to generate the histogram. Black lines are the values for the black framed histogram and blue for the blue histogram.

Supplementary Methods Figure 2: Distribution of the parameter values that were used for the automated calling algorithm. Please see legend of Supplementary Methods Figure 1 for a description of the histogram type. The parameters visualized here are a) allele specific reads per base, b) mapped pairs per read and c) length of the mappable sequence. For the allele specific reads per base one can see a tendency that the reference genotypes are concentrated at the upper side of the distribution even in instances where it is not so clear as for the AUC and the read equality. For the score calculation we weigh allele specific reads per base with 0.5. For the remaining two parameters this effect is less strong and we weight those with 0.1 only.

Supplementary Methods Table 1. Reference data set that was used for the implementation of the calling algorithm. The table shows the sample set used to develop the HLA calling algorithm. The first column listst the sample identifier, the second the HLA locus for which the following columns show the allele calls. Columns 3 and 4 show both alleles determined by the in silico SNP2HLA allelle imputation. Columns 5 and 6 show the classically Sanger determined genotype. Columns 7 and 8 show the HLA calling results of our tool after manual review of the NGS based calls. The fully automated calling that was based on our feature selection and weighting showed an overall 97% concordance to these results. HLA imputation Sanger NGS based caling Sample locus allele 1 allele 2 allele 1 allele 2 allele 1 allele 2 B0708 HLA-A A*01:01 A*31:01 A*01 A*19 A*01:01:01 A*31:01:02 B0708 HLA-B B*08:01 B*40:01 B*08 B*40 B*08:01:01 B*40:01:02 B0708 HLA-C C*03:04 C*07:01 C*03 C*07 C*03:04:01 C*07:01:01 B0708 HLA-DQA DQA*03:01 DQA*05:01 DQA1*03:01:01 DQA1*05:01:01 B0708 HLA-DQB DQB*02:01 DQB*03:02 DQB1*02:01 DQB1*03:02 DQB1*02:01:01 DQB1*03:02:01 B0708 HLA-DRB1 DRB1*03:01 DRB1*04:04 DRB1*03:01 DRB1*04:04 DRB1*03:01:01 DRB1*04:04:01 B0709 HLA-A A*01:01 A*03:01 A*01 A*03:26 A*01:01:01 A*03:26 B0709 HLA-B B*08:01 B*35:03 B*08 B*35 B*08:01:01 B*35:03:01 B0709 HLA-C C*04:01 C*07:01 C*04 C*07 C*04:01:01 C*07:01:01 B0709 HLA-DQA DQA*03:01 DQA*05:01 DQA1*03:01:01 DQA1*05:01:01 B0709 HLA-DQB DQB*02:01 DQB*03:02 DQB1*02:01 DQB1*03:02 DQB1*02:01:01 DQB1*03:02:01 B0709 HLA-DRB1 DRB1*03:01 DRB1*04:04 DRB1*03:01 DRB1*04:03 DRB1*03:01:01 DRB1*04:03:01 B0710 HLA-A A*02:01 A*24:02 A*02 A*24:02 A*02:01:01 A*24:02:01 B0710 HLA-B B*35:01 B*40:01 B*35 B*40 B*35:01:01 B*40:01:02 B0710 HLA-C C*03:04 C*04:01 C*03 C*04 C*03:04:01 C*04:01:01 B0710 HLA-DQA DQA*01:01 DQA*03:01 DQA1*01:01:01 DQA1*03:01:01 B0710 HLA-DQB DQB*03:02 DQB*05:01 DQB1*06:02 DQB1*06:03 DQB1*03:02:01 DQB1*05:01:01 B0710 HLA-DRB1 DRB1*01:01 DRB1*04:04 DRB1*01:03 DRB1*04:04 DRB1*01:03 DRB1*04:04:01 B0711 HLA-A A*02:01 A*02:01 A*02 A*02 A*02:01:01 A*02:01:01 B0711 HLA-B B*27:05 B*40:01 B*27 B*40 B*27:05:02 B*40:01:02 B0711 HLA-C C*02:02 C*03:04 C*02 C*03 C*02:02:02 C*03:04:01 B0711 HLA-DQA DQA*01:02 DQA*01:03 DQA1*01:02:01 DQA1*01:03:01 B0711 HLA-DQB DQB*06:03 DQB*06:04 DQB1*06:03 DQB1*06:04 DQB1*06:03:01 DQB1*06:04:01 B0711 HLA-DRB1 DRB1*13:01 DRB1*13:02 DRB1*13:01 DRB1*13:02 DRB1*13:01:01 DRB1*13:02:01 B0712 HLA-A A*01:01 A*02:01 A*01:01 A*02:01 A*01:01:01 A*02:01:01 B0712 HLA-B B*15:01 B*39:06 B*14 B*16 B*15:01:01 B*39:06:02 B0712 HLA-C C*03:03 C*07:02 C*03 C*07 C*03:03:01 C*07:02:01 B0712 HLA-DQA DQA*01:03 DQA*04:01 DQA1*01:03:01 DQA1*04:01:01 B0712 HLA-DQB DQB*04:02 DQB*06:03 DQB1*04 DQB1*06:03 DQB1*04:02:01 DQB1*06:03:01 B0712 HLA-DRB1 DRB1*08:01 DRB1*13:01 DRB1*08:01 DRB1*13:01 DRB1*08:01:01 DRB1*13:01:01 B0713 HLA-A A*01:01 A*68:01 A*01:01 A*68:01 A*01:01:01 A*68:01:02 B0713 HLA-B B*51:01 B*57:01 B*05 B*17 B*51:01:01 B*57:01:01 B0713 HLA-C C*06:02 C*14:02 C*06 C*14 C*06:02:01 C*14:02:01 B0713 HLA-DQA DQA*01:01 DQA*01:03 DQA1*01:01:01 DQA1*01:03:01 B0713 HLA-DQB DQB*05:01 DQB*06:03 DQB1*05:01 DQB1*06:03 DQB1*05:01:01 DQB1*06:03:01 B0713 HLA-DRB1 DRB1*01:01 DRB1*13:01 DRB1*01:01 DRB1*13:01 DRB1*01:01:01 DRB1*13:01:01

HLA imputation Sanger NGS based caling Sample locus allele 1 allele 2 allele 1 allele 2 allele 1 allele 2 B0714 HLA-A A*01:01 A*24:02 A*01:01 A*24:02 A*01:01:01 A*24:02:01 B0714 HLA-B B*08:01 B*51:01 B*05 B*08 B*08:01:01 B*51:01:01 B0714 HLA-C C*07:01 C*15:02 C*07 C*15 C*07:01:01 C*15:02:01 B0714 HLA-DQA DQA*01:03 DQA*05:01 DQA1*01:03:01 DQA1*05:01:01 B0714 HLA-DQB DQB*02:01 DQB*06:03 DQB1*02:01 DQB1*06:03 DQB1*02:01:01 DQB1*06:03:01 B0714 HLA-DRB1 DRB1*03:01 DRB1*13:01 DRB1*03:01 DRB1*13:01 DRB1*03:01:01 DRB1*13:01:01 B0715 HLA-A A*02:01 A*11:01 A*02:01 A*11:01 A*02:01:01 A*11:01:01 B0715 HLA-B B*15:01 B*44:02 B*12 B*15 B*15:01:01 B*44:02:01 B0715 HLA-C C*03:03 C*05:01 C*03 C*05 C*03:03:01 C*05:01:01 B0715 HLA-DQA DQA*01:02 DQA*01:03 DQA1*01:02:01 DQA1*01:03:01 B0715 HLA-DQB DQB*06:02 DQB*06:03 DQB1*06:02 DQB1*06:03 DQB1*06:02:01 DQB1*06:03:01 B0715 HLA-DRB1 DRB1*13:01 DRB1*15:01 DRB1*13:01 DRB1*15:01 DRB1*13:01:01 DRB1*15:01:01 B0716 HLA-A A*31:01 A*68:01 A*19:01 A*28 A*31:01:02 A*68:01:02 B0716 HLA-B B*27:05 B*35:01 B*27 B*35 B*27:05:02 B*35:01:01 B0716 HLA-C C*01:02 C*14:02 C*01 C*14 C*01:02:01 C*14:02:01 B0716 HLA-DQA DQA*01:01 DQA*01:02 DQA1*01:01:01 DQA1*01:02:01 B0716 HLA-DQB DQB*05:01 DQB*06:02 DQB1*05:01 DQB1*06:02 DQB1*05:01:01 DQB1*06:02:01 B0716 HLA-DRB1 DRB1*01:01 DRB1*15:01 DRB1*01:01 DRB1*15:01 DRB1*01:01:01 DRB1*15:01:01 B0717 HLA-A A*02:01 A*68:01 A*02 A*28 A*02:01:01 A*68:01:01 B0717 HLA-B B*13:02 B*35:03 B*13 B*35 B*13:02:01 B*35:03:01 B0717 HLA-C C*04:01 C*06:02 C*04 C*06 C*04:01:01 C*06:02:01 B0717 HLA-DQA DQA*01:01 DQA*02:01 DQA1*01:01:01 DQA1*02:01 B0717 HLA-DQB DQB*02:02 DQB*05:01 DQB1*05:01 DQB1*02:02 DQB1*02:02:01 DQB1*05:01:01 B0717 HLA-DRB1 DRB1*01:01 DRB1*07:01 DRB1*01:01 DRB1*07:01 DRB1*01:01:01 DRB1*07:01:01 B0718 HLA-A A*24:02 A*29:02 A*09 A*19 A*24:02:01 A*29:02:01 B0718 HLA-B B*07:02 B*44:03 B*07 B*12 B*07:02:01 B*44:03:01 B0718 HLA-C C*07:02 C*16:01 C*07 C*16 C*07:02:01 C*16:01:01 B0718 HLA-DQA DQA*01:01 DQA*02:01 DQA1*01:04:01 DQA1*02:01 B0718 HLA-DQB DQB*02:02 DQB*05:03 DQB1*05:03 DQB1*02:02 DQB1*02:02:01 DQB1*05:03:01 B0718 HLA-DRB1 DRB1*07:01 DRB1*14:01 DRB1*07:01 DRB1*14:01 DRB1*07:01:01 DRB1*14:54:01 B0719 HLA-A A*03:01 A*03:01 A*03 A*03 A*03:01:01 A*03:01:01 B0719 HLA-B B*07:02 B*44:02 B*07 B*12 B*07:02:01 B*44:02:01 B0719 HLA-C C*05:01 C*07:02 C*05 C*07 C*05:01:01 C*07:02:01 B0719 HLA-DQA DQA*01:01 DQA*01:02 DQA1*01:02:01 DQA1*05:05:01 B0719 HLA-DQB DQB*03:01 DQB*06:02 DQB1*03:01 DQB1*06:02 DQB1*03:01:01 DQB1*06:02:01 B0719 HLA-DRB1 DRB1*01:03 DRB1*15:01 DRB1*01:03 DRB1*15:01 DRB1*01:03 DRB1*15:01:01 B0720 HLA-A A*25:01 A*26:01 A*10 A*10 A*25:01:01 A*26:01:01 B0720 HLA-B B*08:01 B*15:01 B*08 B*15 B*08:01:01 B*15:01:01 B0720 HLA-C C*03:03 C*07:01 C*03 C*07 C*03:03:01 C*07:01:01 B0720 HLA-DQA DQA*03:01 DQA*05:01 DQA1*03:03:01 DQA1*05:01:01 B0720 HLA-DQB DQB*02:01 DQB*03:01 DQB1*02:01 DQB1*03:01 DQB1*02:01:01 DQB1*03:01:01 B0720 HLA-DRB1 DRB1*03:01 DRB1*04:01 DRB1*03:01 DRB1*04:08 DRB1*03:01:01 DRB1*04:08:01

HLA imputation Sanger NGS based caling Sample locus allele 1 allele 2 allele 1 allele 2 allele 1 allele 2 B0721 HLA-A A*01:01 A*02:01 A*01 A*02 A*01:01:01 A*02:01:01 B0721 HLA-B B*15:01 B*57:01 B*15 B*17 B*15:01:01 B*57:01:01 B0721 HLA-C C*03:03 C*06:02 C*03 C*06 C*03:03:01 C*06:02:01 B0721 HLA-DQA DQA*01:03 DQA*01:03 DQA1*01:03:01 DQA1*01:03:01 B0721 HLA-DQB DQB*06:03 DQB*06:03 DQB1*06:03 DQB1*06:03 DQB1*06:03:01 DQB1*06:03:01 B0721 HLA-DRB1 DRB1*13:01 DRB1*13:01 DRB1*13:01 DRB1*13:02 DRB1*13:01:01 DRB1*13:01:01 B1512 HLA-A A*01:01 A*02:01 A*01:01:01 A*02:01:01:01 A*01:01:01 A*02:01:01 B1512 HLA-B B*35:01 B*52:01 B*35:01:01 B*52:01:01 B*35:01:01 B*52:01:01 B1512 HLA-C C*04:01 C*12:02 C*04:01:01:01 C*12:02:01 C*04:01:01 C*12:02:02 B1512 HLA-DQA DQA*01:01 DQA*01:01 DQA1*01:01:01 DQA1*01:04:01 B1512 HLA-DQB DQB*05:01 DQB*05:03 DQB1*05:01 DQB1*05:03 DQB1*05:01:01 DQB1*05:03:01 B1512 HLA-DRB1 DRB1*01:01 DRB1*14:01 DRB1*01:01 DRB1*14:11 DRB1*01:01:01 DRB1*14:11 B1513 HLA-A A*01:01 A*02:01 A*01:02 A*02:01/09 A*01:02 A*02:01:01 B1513 HLA-B B*14:01 B*27:05 B*27:05 B*81:01 B*27:05:02 B*81:01 B1513 HLA-C C*02:02 C*08:02 C*02:02 C*08:04 C*02:02:02 C*08:04:01 B1513 HLA-DQA DQA*01:01 DQA*03:01 DQA1*01:05 DQA1*03:01:01 B1513 HLA-DQB DQB*03:02 DQB*05:01 DQB1*01:05 DQB1*03:01 DQB1*03:02:01 DQB1*05:01:01 B1513 HLA-DRB1 DRB1*04:04 DRB1*13:01 DRB1*04:04 DRB1*12:01 DRB1*04:04:01 DRB1*12:01:01 B1985 HLA-A A*01:01 A*02:01 A*01 A*02 A*01:01:01 A*02:01:01 B1985 HLA-B B*44:02 B*57:01 B*12 B*17 B*44:02:01 B*57:01:01 B1985 HLA-C C*05:01 C*06:02 C*05 C*06 C*05:01:01 C*06:02:01 B1985 HLA-DQA DQA*03:01 DQA*05:05 DQA1*03:03:01 DQA1*05:05:01 B1985 HLA-DQB DQB*03:01 DQB*03:01 DQB1*03:01 DQB1*03:01 DQB1*03:01:01 DQB1*03:01:01 B1985 HLA-DRB1 DRB1*04:01 DRB1*12:01 DRB1*04:01 DRB1*12:01 DRB1*04:01:01 DRB1*12:01:01 B1986 HLA-A A*02:01 A*32:01 A*02 A*19 A*02:01:01 A*32:01:01 B1986 HLA-B B*40:01 B*44:02 B*0 B*40 B*40:01:02 B*40:01:02 B1986 HLA-C C*03:04 C*05:01 C*0 C*03 C*03:04:01 C*05:01:01 B1986 HLA-DQA DQA*03:01 DQA*05:05 DQA1*03:01:01 DQA1*03:01:01 B1986 HLA-DQB DQB*03:02 DQB*03:03 DQB1*03:02 DQB1*03:03 DQB1*03:02:01 DQB1*03:03:02 B1986 HLA-DRB1 DRB1*04:01 DRB1*09:01 DRB1*04:01 DRB1*09:01 DRB1*04:01:01 DRB1*09:01:02 B1987 HLA-A A*01:01 A*03:01 A*01 A*03 A*01:01:01 A*03:01:01 B1987 HLA-B B*08:01 B*15:01 B*08 B*15 B*08:01:01 B*15:01:01 B1987 HLA-C C*03:04 C*07:01 C*03 C*07 C*03:04:01 C*07:01:01 B1987 HLA-DQA DQA*01:01 DQA*05:01 DQA1*01:05 DQA1*05:01:01 B1987 HLA-DQB DQB*02:01 DQB*05:01 DQB1*02:01 DQB1*05:01 DQB1*02:01:01 DQB1*05:01:01 B1987 HLA-DRB1 DRB1*03:01 DRB1*10:01 DRB1*03:01 DRB1*10:01 DRB1*03:01:01 DRB1*10:01:01 B1988 HLA-A A*01:01 A*03:01 A*01 A*03 A*01:01:01 A*03:01:01 B1988 HLA-B B*37:01 B*44:02 B*12 B*37 B*37:01:01 B*44:27:01 B1988 HLA-C C*06:02 C*07:04 C*06 C*07 C*06:02:01 C*07:04:01 B1988 HLA-DQA DQA*01:02 DQA*03:01 DQA1*01:02:02 DQA1*03:03:01 B1988 HLA-DQB DQB*03:01 DQB*05:02 DQB1*03:01 DQB1*05:02 DQB1*03:01:01 DQB1*05:02:01 B1988 HLA-DRB1 DRB1*04:01 DRB1*16:01 DRB1*04:01 DRB1*16:01 DRB1*04:01:01 DRB1*16:01:01

REFERENCES 1. Horton,R., Wilming,L., Rand,V., Lovering,R.C., Bruford,E.A., Khodiyar,V.K., Lush,M.J., Povey,S., Talbot,C.C. Jr., Wright,M.W., Wain,H.M., Trowsdale,J., Ziegler,A., Beck,S. (2004) Gene map of the extended human MHC. Nature Reviews Genetics, 5, 889-899. 2. Forster, M., Forster,P., Elsharawy,A., Hemmrich,G., Kreck,B., Wittig,M., Thomsen,I., Stade,B., Barann,M., Ellinghaus,D., Petersen,B.S., May,S., Melum,E., Schilhabel,M.B., Keller,A., Schreiber,S., Rosenstiel,P., Franke,A. (2013) From next-generation sequencing alignments to accurate comparison and validation of single-nucleotide variants: the pibase software. Nucleic Acids Res., 41, e16 3. Jia,X., Han,B., Onengut-Gumuscu,S., Chen,W.M., Concannon,P.J., Rich,S.S., Raychaudhuri,S., de Bakker,P.I. (2013) Imputing amino acid polymorphisms in human leukocyte antigens. PLoS One., 8, e64683