Agentic, Tool Augmented Large Language Models for Clinical Decision Support: The CURE-Bench Agentic Reasoning Pipeline and Evaluation

Initial Paper Release ABSTRACT As large language models (LLMs) are increasingly proposed for clinical decision support, a critical question remains: can agentic architectures that combine generative LLM reasoning with domain tools (literature search, interaction checkers,…