Probabilistic inference of lateral gene transfer events.

Khan MA, Mahmudi O, Ullah I, Arvestad L, Lagergren J

BMC Bioinformatics 17 (Suppl 14) 431 [2016-11-11; online 2016-11-11]

Lateral gene transfer (LGT) is an evolutionary process that has an important role in biology. It challenges the traditional binary tree-like evolution of species and is attracting increasing attention of the molecular biologists due to its involvement in antibiotic resistance. A number of attempts have been made to model LGT in the presence of gene duplication and loss, but reliably placing LGT events in the species tree has remained a challenge. In this paper, we propose probabilistic methods that samples reconciliations of the gene tree with a dated species tree and computes maximum a posteriori probabilities. The MCMC-based method uses the probabilistic model DLTRS, that integrates LGT, gene duplication, gene loss, and sequence evolution under a relaxed molecular clock for substitution rates. We can estimate posterior distributions on gene trees and, in contrast to previous work, the actual placement of potential LGT, which can be used to, e.g., identify "highways" of LGT. Based on a simulation study, we conclude that the method is able to infer the true LGT events on gene tree and reconcile it to the correct edges on the species tree in most cases. Applied to two biological datasets, containing gene families from Cyanobacteria and Molicutes, we find potential LGTs highways that corroborate other studies as well as previously undetected examples.

Affiliated researcher

PubMed 28185583

DOI 10.1186/s12859-016-1268-2

Crossref 10.1186/s12859-016-1268-2

pii: 10.1186/s12859-016-1268-2
pmc: PMC5123345


Publications 9.5.0