MSBack: Multiscale Backmapping of Highly Coarse-Grained Proteins Using Constrained Diffusion.

Publication date: Jun 01, 2025

Coarse-grained (CG) molecular dynamics is a powerful tool for simulating the collective behavior of biomolecules. However, the structural information lost during coarse-graining prevents the CG configurations from being more widely useful (e. g., for ligand binding). Regenerating the lost all-atom coordinates, or backmapping, is an unmet challenge for protein CG at resolutions lower than one coarse-grain site or bead per amino acid residue. This low resolution is computationally necessary to simulate many protein complexes including viruses like SARS-CoV-2 and HIV-1. We propose MSBack, a method to backmap highly CG proteins using a diffusion model for the all-atom coordinates constrained to fit the CG coordinates. This diffusion process works by perturbing a known all-atom structure and does not require retraining. We show that this stochastically generates a distribution of α-carbon traces that match the CG coordinates. By combining this with physics-based methods for smaller-length backmapping, we fully backmap a mature HIV-1 capsid bound with the small molecule inositol hexakisphosphate at 1 A resolution.

Concepts Keywords
Backmap Atom
Biomolecules Backmap
Hexakisphosphate Backmapping
Hiv Coarse
Physics Constrained
Coordinates
Diffusion
Dynamics
Grained
Hiv
Lost
Molecular
Msback
Multiscale
Resolution

Semantics

Type Source Name
disease IDO protein
disease IDO site
disease IDO process
drug DRUGBANK Activated charcoal

Original Article

(Visited 1 times, 1 visits today)