- Systematic Review of Large Language Models and Natural Language Processing in Stroke Care: Applications, Challenges, and Future Directions. [Review]Stroke Vasc Interv Neurol. 2026 May; 6(3):e002261.SV
- Stroke, a leading cause of mortality, manifests as ischemic (87%) or hemorrhagic (13%), demanding rapid intervention to mitigate irreversible damage. Despite advances in artificial intelligence, systematic reviews addressing the integration of large language models and natural language processing into clinical stroke care remain limited. Such a review is critical given large language models' pote…
- PMC Free PDF
- Retinal structural alterations in Parkinson's disease: association between temporal optic disc thickening and visual hallucinations. [Journal Article]Neurodegener Dis Manag. 2026 May 21; :1-6. [Online ahead of print]ND
- CONCLUSIONS: In this preliminary analysis, temporal optic disc thickening showed a nominal association with VH in PD independent of non-motor symptom severity. OCT may provide structural insights into neuropsychiatric heterogeneity in PD. Further studies are needed to clarify the biological basis of this finding and its potential clinical relevance.
- Publisher Full Text (DOI)
- Abnormal periodic and aperiodic resting-state electroencephalographic markers in Lewy body and Alzheimer's diseases with cognitive decline. [Journal Article]Geroscience. 2026 May 22. [Online ahead of print]G
- Lewy body disease (LBD) and Alzheimer's disease (AD) are the most common causes of cognitive decline and dementia and are associated with characteristic alterations in resting-state electroencephalographic (rsEEG) activity. This multicenter exploratory study investigated periodic and aperiodic rsEEG features in patients with cognitive decline due to Lewy body disease (LBCD) and Alzheimer's diseas…
- Publisher Full Text (DOI)
- Patient-friendly simplification and translation of neuroradiology impressions using artificial intelligence. [Journal Article]Sci Rep. 2026 May 21. [Online ahead of print]SR
- Healthcare systems in the United States are now mandated to provide patients with immediate access to medical results. Access to complex results prior to appropriate clinical follow-up may cause confusion and fear, especially in populations with limited health literacy and English language proficiency. In this study, we demonstrate a use case of large language models (LLMs) to simplify and transl…
- Publisher Full Text (DOI)
- Hallucination risk and trustworthiness of generative AI systems based on IVMPF-MABAC decision-making strategies. [Journal Article]Sci Rep. 2026 May 21. [Online ahead of print]SR
- Hallucination risk and trustworthiness of a generative artificial intelligence system play an important and vital role in daily-life scenarios. The terms Hallucination risk and trustworthiness are very famous for the generative artificial intelligence techniques to confirm that the final results or outputs are safe, accurate, and reliable for users or customers, particularly in education, finance…
- Publisher Full Text (DOI)
- Multi-Agent System for Early Sepsis Management Support: A Follow-up Evaluation Study. [Case Reports]Healthc Inform Res. 2026 Apr; 32(2):190-195.HI
- CONCLUSIONS: In this exploratory study, the MA system shows preliminary promise for early sepsis support but requires human oversight to mitigate hallucinations. Code is available in GitHub; further validation is needed.
- Publisher Full Text (DOI)
- Managing auditory hallucination symptoms in patients with schizophrenia in China: a best practice implementation project. [Journal Article]JBI Evid Implement. 2026 May 22. [Online ahead of print]JE
- CONCLUSIONS: The application of best evidence in managing auditory hallucinations in patients with schizophrenia can enhance nursing practices, including nurses' knowledge and confidence, and effectively reduce the severity of patients' auditory hallucinations.
- Publisher Full Text (DOI)
- Evaluation of AI Citation Accuracy in Anterior Segment Research. [Journal Article]Cesk Slov Oftalmol. 2026; 82(Ahead of Print):1-5.CS
- CONCLUSIONS: This pilot study indicates that contemporary AI models, particularly those like DeepSeek, show potential in assisting with citation generation. However, the observed error rates, including instances of hallucination, remain substantial. These findings underscore that rigorous human verification is indispensable when using AI for academic referencing in specialized medical fields, and highlight the need for continuous, version-specific benchmarking as these tools evolve.
- Publisher Full Text (DOI)
- The next paradigm in bioinformatics: a review of multi-agent systems and foundational models for end-to-end scientific discovery. [Review]Brief Bioinform. 2026 May 04; 27(3).BB
- Bioinformatics is entering a new phase characterized by the integration of universal biological models and multi-agent systems to enable end-to-end scientific discoveries. This review argues that the next paradigm shift will go beyond traditional predictive models and generative artificial intelligence (AI) toward agentic AI: systems capable of planning, acting through tools, reflecting on result…
- Publisher Full Text (DOI)
- Large Language Models in Clinical Trial Recruitment: Sociotechnical and Economic Framework Development Study. [Journal Article]JMIR AI. 2026 May 20; 5:e95899.JA
- CONCLUSIONS: LECRA offers a more deployment-sensitive account of when LLM-enabled recruitment is likely to create value and when the benefits may be offset by coordination, compliance, or trust-related frictions. This framework is intended to support future empirical studies and more realistic implementation decisions rather than to claim validated superiority of LLM-assisted recruitment.
- PMC Free PDF
- Safety and Effectiveness of Guanfacine Hydrochloride Extended-Release in Adult Patients with ADHD in Japan: A Post-Marketing Surveillance Study. [Journal Article]
- CONCLUSIONS: No ADRs requiring new safety measures were observed. The effectiveness of GXR in adult ADHD was evident among patients remaining on treatment.
- Publisher Full Text (DOI)
- Comparative evaluation of treatment recommendations generated by generative AI and breast cancer specialists for advanced and recurrent breast cancer: a multidimensional assessment of evidence interpretation and clinical decision support. [Journal Article]
- CONCLUSIONS: LLMs were able to generate comprehensive and structured treatment recommendations based on the provided clinical information. However, these findings primarily reflect differences in information synthesis under predefined evaluation criteria rather than superiority in real-world clinical decision-making. Given the potential risk of hallucinations, AI should be positioned as an assistive tool to support specialist-led decision-making in breast cancer care.
- Publisher Full Text (DOI)
- Large language models and large multimodal models in radiology: opportunities, challenges, and the path toward sustainable long-term clinical integration. [Review]
- Large language models (LLMs), built on transformer architecture, have emerged as a fundamental tool in natural language processing and contextual reasoning, and have been extended to multimodal data interpretation, which has been termed large multimodal models (LMMs). Radiology as a medical discipline, having undergone full digital transformation over the past two decades, is uniquely positioned …
- Publisher Full Text (DOI)
- A temporally Anchored Retrieval-Augmented Generation Framework for Metabolic and Bariatric Surgery Patient Education: An IFSO Artificial Intelligence Task Force Multinational Validation Study. [Journal Article]
- CONCLUSIONS: This proof-of-concept study suggests that a multi-module RAG framework with temporal stability anchoring can improve expert-rated LLM response quality for bariatric surgery domain knowledge, though prospective validation in patient-facing settings is needed before clinical implementation.
- Publisher Full Text (DOI)
- Comparative evaluation of seven large language models in providing home phototherapy care guidance for neonatal hyperbilirubinemia. [Journal Article]Transl Pediatr. 2026 Apr 30; 15(4):121.TP
- CONCLUSIONS: DeepSeek-R1 and ChatGPT-4o achieved near-expert accuracy, potentially reliable for basic queries under oversight. Claude 3.5 Sonnet and GLM-4 showed moderate performance with notable gaps. Copilot, Gemini, and ERNIE demonstrated critical errors. However, all models exhibited hallucinated references and a lack of patient-specific reasoning. LLMs should not replace professional medical judgment in treatment decisions. Safe implementation requires restricting high-performing LLMs to narrowly defined procedural questions only, with mandatory physician verification.
- PMC Free PDF