-
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Paper • 2412.18925 • Published • 104 -
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning
Paper • 2502.19634 • Published • 63 -
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications
Paper • 2409.07314 • Published • 56 -
On the Compositional Generalization of Multimodal LLMs for Medical Imaging
Paper • 2412.20070 • Published • 45
Collections
Discover the best community collections!
Collections including paper arxiv:2409.07314
-
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications
Paper • 2409.07314 • Published • 56 -
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks
Paper • 2407.21072 • Published • 2 -
Named Clinical Entity Recognition Benchmark
Paper • 2410.05046 • Published • 16 -
MEDIC Benchmark
📊49View and compare medical LLM evaluations
-
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper • 2312.00752 • Published • 146 -
Elucidating the Design Space of Diffusion-Based Generative Models
Paper • 2206.00364 • Published • 18 -
GLU Variants Improve Transformer
Paper • 2002.05202 • Published • 4 -
StarCoder 2 and The Stack v2: The Next Generation
Paper • 2402.19173 • Published • 151
-
epfl-llm/meditron-70b
Text Generation • 69B • Updated • 400 • 255 -
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications
Paper • 2409.07314 • Published • 56 -
m42-health/Llama3-Med42-70B
Text Generation • 71B • Updated • 2.24k • • 65 -
Med42-v2: A Suite of Clinical LLMs
Paper • 2408.06142 • Published • 52
-
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Paper • 2408.00765 • Published • 14 -
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent
Paper • 2407.21646 • Published • 18 -
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection
Paper • 2408.04284 • Published • 26 -
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability
Paper • 2408.07852 • Published • 16
-
MedS^3: Towards Medical Small Language Models with Self-Evolved Slow Thinking
Paper • 2501.12051 • Published -
Bridging Language Barriers in Healthcare: A Study on Arabic LLMs
Paper • 2501.09825 • Published • 14 -
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators
Paper • 2501.09484 • Published • 19 -
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
Paper • 2501.07171 • Published • 55
-
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Paper • 2412.18925 • Published • 104 -
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning
Paper • 2502.19634 • Published • 63 -
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications
Paper • 2409.07314 • Published • 56 -
On the Compositional Generalization of Multimodal LLMs for Medical Imaging
Paper • 2412.20070 • Published • 45
-
epfl-llm/meditron-70b
Text Generation • 69B • Updated • 400 • 255 -
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications
Paper • 2409.07314 • Published • 56 -
m42-health/Llama3-Med42-70B
Text Generation • 71B • Updated • 2.24k • • 65 -
Med42-v2: A Suite of Clinical LLMs
Paper • 2408.06142 • Published • 52
-
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications
Paper • 2409.07314 • Published • 56 -
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks
Paper • 2407.21072 • Published • 2 -
Named Clinical Entity Recognition Benchmark
Paper • 2410.05046 • Published • 16 -
MEDIC Benchmark
📊49View and compare medical LLM evaluations
-
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Paper • 2408.00765 • Published • 14 -
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent
Paper • 2407.21646 • Published • 18 -
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection
Paper • 2408.04284 • Published • 26 -
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability
Paper • 2408.07852 • Published • 16
-
Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Paper • 2312.00752 • Published • 146 -
Elucidating the Design Space of Diffusion-Based Generative Models
Paper • 2206.00364 • Published • 18 -
GLU Variants Improve Transformer
Paper • 2002.05202 • Published • 4 -
StarCoder 2 and The Stack v2: The Next Generation
Paper • 2402.19173 • Published • 151
-
MedS^3: Towards Medical Small Language Models with Self-Evolved Slow Thinking
Paper • 2501.12051 • Published -
Bridging Language Barriers in Healthcare: A Study on Arabic LLMs
Paper • 2501.09825 • Published • 14 -
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators
Paper • 2501.09484 • Published • 19 -
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature
Paper • 2501.07171 • Published • 55