Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity.

Corcoran MM, Phad GE, Vázquez Bernat , Stahl-Hennig C, Sumida N, Persson MA, Martin M, Karlsson Hedestam GB

Nat Commun 7 (-) 13642 [2016-12-20; online 2016-12-20]

Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool.

Affiliated researcher

QC bibliography QC xrefs

PubMed 27995928

DOI 10.1038/ncomms13642

Crossref 10.1038/ncomms13642

ncomms13642

pmc PMC5187446