Advancing real-time infectious disease forecasting using large language models.

Publication date: Jun 06, 2025

Forecasting the short-term spread of an ongoing disease outbreak poses a challenge owing to the complexity of contributing factors, some of which can be characterized through interlinked, multi-modality variables, and the intersection of public policy and human behavior. Here we introduce PandemicLLM, a framework with multi-modal large language models (LLMs) that reformulates real-time forecasting of disease spread as a text-reasoning problem, with the ability to incorporate real-time, complex, non-numerical information. This approach, through an artificial intelligence-human cooperative prompt design and time-series representation learning, encodes multi-modal data for LLMs. The model is applied to the COVID-19 pandemic, and trained to utilize textual public health policies, genomic surveillance, spatial and epidemiological time-series data, and is tested across all 50 states of the United States for a duration of 19 months. PandemicLLM opens avenues for incorporating various pandemic-related data in heterogeneous formats and shows performance benefits over existing models.

Concepts Keywords
19months Advancing
Genomic Infectious
Large Large
Learning Llms
Pandemic Modal
Models
Multi
Pandemic
Pandemicllm
Public
Real
Series
Short
Spread
Time

Semantics

Type Source Name
disease MESH infectious disease
pathway REACTOME Infectious disease
drug DRUGBANK Sulpiride
disease MESH COVID-19 pandemic

Original Article

(Visited 1 times, 1 visits today)