A survey of mapping algorithms in the long-reads era.

Sahlin K, Baudeau T, Cazaux B, Marchet C

Genome Biol. 24 (1) 133 [2023-06-01; online 2023-06-01]

It has been over a decade since the first publication of a method dedicated entirely to mapping long-reads. The distinctive characteristics of long reads resulted in methods moving from the seed-and-extend framework used for short reads to a seed-and-chain framework due to the seed abundance in each read. The main novelties are based on alternative seed constructs or chaining formulations. Dozens of tools now exist, whose heuristics have evolved considerably. We provide an overview of the methods used in long-read mappers. Since they are driven by implementation-specific parameters, we develop an original visualization tool to understand the parameter settings ( http://bcazaux.polytech-lille.net/Minimap2/ ).

Kristoffer Sahlin

SciLifeLab Fellow

PubMed 37264447

DOI 10.1186/s13059-023-02972-3

Crossref 10.1186/s13059-023-02972-3

pii: 10.1186/s13059-023-02972-3

Publications 9.5.0