Learning the language of antibody hypervariability.

Publication date: Jan 07, 2025

Protein language models (PLMs) have demonstrated impressive success in modeling proteins. However, general-purpose “foundational” PLMs have limited performance in modeling antibodies due to the latter’s hypervariable regions, which do not conform to the evolutionary conservation principles that such models rely on. In this study, we propose a transfer learning framework called Antibody Mutagenesis-Augmented Processing (AbMAP), which fine-tunes foundational models for antibody-sequence inputs by supervising on antibody structure and binding specificity examples. Our learned feature representations accurately predict mutational effects on antigen binding, paratope identification, and other key antibody properties. We experimentally validate AbMAP for antibody optimization by applying it to refine a set of antibodies that bind to a SARS-CoV-2 peptide, and obtain an 82% hit-rate and up to 22-fold increase in binding affinity. AbMAP also unlocks large-scale analyses of immune repertoires, revealing that B-cell receptor repertoires of individuals, while remarkably different in sequence, converge toward similar structural and functional coverage. Importantly, AbMAP’s transfer learning approach can be readily adapted to advances in foundational PLMs. We anticipate AbMAP will accelerate the efficient design and modeling of antibodies, expedite the discovery of antibody-based therapeutics, and deepen our understanding of humoral immunity.

Open Access PDF

Concepts Keywords
Antibody Antibodies
Efficient Antibodies
Models Antibodies, Viral
Mutagenesis Antibodies, Viral
antibody modeling
COVID-19
Humans
Mutagenesis
protein language models
SARS-CoV-2
transfer learning

Semantics

Type Source Name
disease IDO protein
drug DRUGBANK Spinosad
disease IDO cell
disease IDO process
pathway REACTOME Immune System
drug DRUGBANK Hexadecanal
disease MESH Tryptophan
drug DRUGBANK L-Tryptophan
disease MESH dissociation
disease MESH point mutations
drug DRUGBANK Sodium lauryl sulfate
disease MESH influenza
disease IDO algorithm
disease IDO site
drug DRUGBANK Pidolic Acid
disease IDO assay
disease MESH COVID 19
drug DRUGBANK Amino acids
drug DRUGBANK Coenzyme M
drug DRUGBANK Isoxaflutole
pathway REACTOME Translation
drug DRUGBANK Flunarizine
drug DRUGBANK Ethionamide
disease IDO object
drug DRUGBANK Lauric Acid

Original Article

(Visited 1 times, 1 visits today)