Ke Y, Sharma E, Wayment-Steele HK, Becker WR, Ho A, Marklund E, Greenleaf WJ
Nat Commun 16 (1) 5572 [2025-07-01; online 2025-07-01]
DNA folding thermodynamics are central to many biological processes and biotechnological applications involving base-pairing. Current methods for predicting stability from DNA sequence use nearest-neighbor models that struggle to accurately capture the diverse sequence dependence of secondary structural motifs beyond Watson-Crick base pairs, likely due to insufficient experimental data. In this work, we introduce a massively parallel method, Array Melt, that uses fluorescence-based quenching signals to measure the equilibrium stability of millions of DNA hairpins simultaneously on a repurposed Illumina sequencing flow cell. By leveraging this dataset of 27,732 sequences with two-state melting behaviors, we derive a NUPACK-compatible model (dna24), a rich parameter model that exhibits higher accuracy, and a graph neural network (GNN) model that identifies relevant interactions within DNA beyond nearest neighbors. All models show improved accuracy in predicting DNA folding thermodynamics, enabling more effective in silico design of qPCR primers, oligo hybridization probes, and DNA origami.
PubMed 40593545
DOI 10.1038/s41467-025-60455-4
Crossref 10.1038/s41467-025-60455-4
pmc: PMC12216960
pii: 10.1038/s41467-025-60455-4