File size: 3,205 Bytes
cc15807 5b0d7e6 cc15807 8a7b38c cc15807 633201c b0651ac 633201c cc15807 5b0d7e6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 |
---
base_model:
- LiquidAI/LFM2-1.2B
tags:
- text-generation-inference
- transformers
- unsloth
- lfm2
license: apache-2.0
language:
- en
datasets:
- miriad/miriad-4.4M
---
# 🦷 DentaInstruct-1.2B
<img src="banner.png" width="800" />
**DentaInstruct-1.2B** is a fine-tuned instruction-following language model designed to assist with dental domain queries. It was trained using clinically focused questions and answers from the [`miriad/miriad-4.4M`](https://huggingface.co/datasets/miriad/miriad-4.4M) dataset, specifically filtered for the `Dental & Oral Medicine` specialty. The base model used is [`LiquidAI/LFM2-1.2B`](https://huggingface.co/LiquidAI/LFM2-1.2B), optimized with the [Unsloth](https://github.com/unslothai/unsloth) fine-tuning library.
---
## Model Details
- **Architecture**: LFM2-1.2B (decoder-only transformer)
- **Base Model**: [`LiquidAI/LFM2-1.2B`](https://huggingface.co/LiquidAI/LFM2-1.2B)
- **Fine-tuning Library**: [Unsloth](https://github.com/unslothai/unsloth)
- **Instruction Format**: Chat-style, formatted using `apply_chat_template`
- **Trained On**: Dental subset of [`miriad-4.4M`](https://huggingface.co/datasets/miriad/miriad-4.4M)
- **Compute**: Trained using Google Colab T4 (free tier)
---
## Benchmark Summary
The model was evaluated on a curated set of dental prompts with rich terminology across endodontics, periodontics, prosthodontics, and oral surgery. Responses were assessed for fluency, accuracy, and domain relevance.
- ✅ **Terminology Handling**: Excellent coverage of dental-specific terms
- ✅ **Instruction Following**: Clear, context-aware responses
- ✅ **Answer Structure**: Consistently professional and clinically coherent
- ⚠️ **Minor Hallucinations**: A few outputs demonstrated factual drift in rare cases
---
## Limitations & Warnings
This model was fine-tuned using the MIRIAD dataset, which comes with the following caution:
> *This model is trained on data that has not been manually reviewed by medical experts. It should not be used for diagnostic purposes or to inform medical decision-making. It is intended for research and educational purposes only.*
- **Not a substitute for professional dental care**
- **Do not use this model for clinical diagnosis or treatment advice**
---
## Intended Use
- Educational Q&A for dental students
- Conversational chatbots focused on oral health
- Research on clinical-domain instruction tuning
---
## Citation
If you use this model or parts of it, please consider citing:
```bibtex
@misc{miriad2024,
title={MIRIAD: Medical Instructional Record with Interactions and Answers Dataset},
author={Xue, Yutong and others},
year={2024},
url={https://huggingface.co/datasets/miriad/miriad-4.4M}
}
````
---
## Acknowledgements
* [LiquidAI](https://huggingface.co/LiquidAI) for the LFM2 model series
* [Unsloth](https://github.com/unslothai/unsloth) for training acceleration
* [MIRIAD authors](https://huggingface.co/datasets/miriad/miriad-4.4M) for the publicly available medical dataset
---
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth) |