Collections including paper arxiv:2409.07314

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 104
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning

Paper • 2502.19634 • Published Feb 26 • 63
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 56
On the Compositional Generalization of Multimodal LLMs for Medical Imaging

Paper • 2412.20070 • Published Dec 28, 2024 • 45

Comprehensive Evaluations

Model evaluation framework for Clinical Application

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 56
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks

Paper • 2407.21072 • Published Jul 29, 2024 • 2
Named Clinical Entity Recognition Benchmark

Paper • 2410.05046 • Published Oct 7, 2024 • 16
MEDIC Benchmark 📊

Running • 49 • View and compare medical LLM evaluations

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 56

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 146
Elucidating the Design Space of Diffusion-Based Generative Models

Paper • 2206.00364 • Published Jun 1, 2022 • 18
GLU Variants Improve Transformer

Paper • 2002.05202 • Published Feb 12, 2020 • 4
StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 151

epfl-llm/meditron-70b

Text Generation • 69B • Updated Dec 7, 2023 • 400 • 255
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 56
m42-health/Llama3-Med42-70B

Text Generation • 71B • Updated Aug 20, 2024 • 2.24k • 65
Med42-v2: A Suite of Clinical LLMs

Paper • 2408.06142 • Published Aug 12, 2024 • 52

Industry models

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 56

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Paper • 2408.00765 • Published Aug 1, 2024 • 14
Towards Achieving Human Parity on End-to-end Simultaneous Speech Translation via LLM Agent

Paper • 2407.21646 • Published Jul 31, 2024 • 18
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection

Paper • 2408.04284 • Published Aug 8, 2024 • 26
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

Paper • 2408.07852 • Published Aug 14, 2024 • 16

Biomedical NLP papers

Papers posted on @[email protected] (Clinical, Healthcare & Biomedical NLP)

MedS^3: Towards Medical Small Language Models with Self-Evolved Slow Thinking

Paper • 2501.12051 • Published Jan 21
Bridging Language Barriers in Healthcare: A Study on Arabic LLMs

Paper • 2501.09825 • Published Jan 16 • 14
Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators

Paper • 2501.09484 • Published Jan 16 • 19
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Paper • 2501.07171 • Published Jan 13 • 55

