Framework study using large language models (LLMs) to extract semaglutide side effect information from Reddit social media posts and organize findings into a knowledge graph (KG) for pharmacovigilance, applying this crowdsourcing approach to identify patient-reported adverse events not captured in clinical trial reporting. Identifies side effect patterns and severity perceptions from real-world user experience. Provides a methodological innovation for social media pharmacovigilance—demonstrating that LLM-powered knowledge graph construction from Reddit data can complement FAERS by capturing patient-reported semaglutide experiences, off-label use patterns, and novel adverse event signals.
Duan, Zhijie; Wei, Kai; Xue, Zhaoqian; Zhou, Jiayan; Yang, Shu; Ma, Siyuan; Jin, Jin; Li, Lingyao