好色先生

好色先生

Explore the latest content from across our publications

Log In

Forgot Password?
Create New Account

Loading... please wait

Abstract Details

Generative AI-assisted Screening Improves Efficiency in Neurology Systematic Reviews
好色先生, Research, and Methodology
P1 - Poster Session 1 (8:00 AM-9:00 AM)
15-001
To evaluate the impact of generative artificial intelligence(AI) assistance on the speed and accuracy of title and abstract screening for systematic reviews in neurology.

Systematic reviews are critical for evidence-based neurology but are time-consuming and resource-intensive, with screening phases often creating significant bottlenecks. While large language models(LLMs) show potential to streamline this process, their real-world impact on reviewer performance in a supportive, human-in-the-loop role is understudied in neurology.

Four neurology trainees were grouped into two pairs based on previous screening experience. Pair A(A1, A2) consisted of less experienced trainees(1–2 SR), while Pair B(B1, B2) consisted of more experienced trainees(≥3 SRs). Within each pair, one reviewer was assigned to a traditional screening method(A2, B2), while the other was assigned to a generative AI-assisted method(A1, B1). The AI-assisted screening utilized PICOS(Population, Intervention/Exposure, Comparison, Outcome, Study design) summaries derived from titles and abstracts using an open-source LLM (Mistral-Nemo-Instruct-2407). All reviewers independently screened the same set of 1,003 articles against predefined criteria.  Screening times were recorded, and performance metrics were calculated. Post-screening surveys assessed usability, confidence, and perceived cognitive workload.

AI-assisted reviewers(A1:116 min; B1:90 min) screened four times faster than those without(A2:463 min; B2:370 min), reducing workload by ~75%. Sensitivity was perfect for AI-assisted reviewers(100%), whereas it was lower for those without assistance(88.0% and 92.0%). Furthermore, AI-assisted reviewers demonstrated higher accuracy(99.9%), specificity(99.9%), F1 scores(98.0%), and strong inter-rater reliability(Cohen's Kappa of 99.8%). Less experienced reviewer with AI-assistance(A1) outperformed experienced reviewer(B2) without assistance in both efficiency and sensitivity. All reviewers reported reduced cognitive load and improved decision confidence.
Generative AI assistance substantially improves efficiency, accuracy, and user experience of systematic review screening in neurology. By enhancing rather than replacing human decision-making, this hybrid workflow offers a scalable approach to accelerate evidence synthesis and reduce reviewer fatigue. 
Authors/Disclosures
Sai Krishna Vallamchetla, MBBS (Mayo Clinic, Florida)
PRESENTER
Mr. Vallamchetla has nothing to disclose.
Omar Abdelkader, MD (Westchester Medical Center) Dr. Abdelkader has nothing to disclose.
Md Manjurul Islam Shourav, MBBS Mr. Shourav has nothing to disclose.
Michelle P. Lin, CRC (Mayo Clinic Florida) Dr. Lin has nothing to disclose.