Understanding global research trends in the control and prevention of infectious diseases for children: Insights from text mining and topic modeling.

Publication date: Feb 21, 2024

The emergence of novel infectious diseases has amplified the urgent need for effective prevention strategies, especially ones targeting vulnerable populations such as children. Factors such as the high incidence of both emerging and existing infectious diseases, delays in vaccinations, and routine exposure in communal settings heighten children’s susceptibility to infections. Despite this pressing need, a comprehensive exploration of research trends in this domain remains lacking. This study aims to address this gap by employing text mining and modeling techniques to conduct a comprehensive analysis of the existing literature, thereby identifying emerging research trends in infectious disease prevention among children. A cross-sectional text mining approach was adopted, focusing on journal articles published between January 1, 2003, and August 31, 2022. These articles, related to infectious disease prevention in children, were sourced from databases such as PubMed, CINAHL, MEDLINE (Ovid), Scopus, and Korean RISS. The data underwent preprocessing using the Natural Language Toolkit (NLTK) in Python, with a semantic network analysis and topic modeling conducted using R software. The final dataset comprised 509 journal articles extracted from multiple databases. The study began with a word frequency analysis to pinpoint relevant themes, subsequently visualized through a word cloud. Dominant terms encompassed “vaccination,” “adolescent,” “infant,” “parent,” “family,” “school,” “country,” “household,” “community,” “HIV,” “HPV,” “COVID-19,” “influenza,” and “diarrhea. ” The semantic analysis identified “age” as a key term across infection, control, and intervention discussions. Notably, the relationship between “hand” and “handwashing” was prominent, especially in educational contexts linked with “school” and “absence. ” Latent Dirichlet Allocation (LDA) topic modeling further delineated seven topics related to infectious disease prevention for children, encompassing (1) educational programs, (2) vaccination efforts, (3) family-level responses, (4) care for immunocompromised individuals, (5) country-specific responses, (6) school-based strategies, and (7) persistent threats from established infectious diseases. The study emphasizes the indispensable role of personalized interventions tailored for various child demographics, highlighting the pivotal contributions of both parental guidance and school participation. The study provides insights into the complex public health challenges associated with preventing and managing infectious diseases in children. The insights derived could inform the formulation of evidence-based public health policies, steering practical interventions and fostering interdisciplinary synergy for holistic prevention strategies.

Concepts Keywords
August children
Immunocompromised infectious disease
Influenza prevention strategies
Korean text mining
Python topic modeling


Type Source Name
disease MESH infectious diseases
disease VO effective
disease IDO susceptibility
disease MESH infections
disease VO Gap
pathway REACTOME Infectious disease
disease IDO infectious disease
disease VO frequency
disease VO vaccination
disease IDO country
disease MESH COVID-19
disease MESH influenza
disease IDO infection
disease IDO intervention

Original Article

(Visited 1 times, 1 visits today)