DentaInstruct-1.2B / README.md
yasserrmd's picture
Update README.md
8a7b38c verified
---
base_model:
- LiquidAI/LFM2-1.2B
tags:
- text-generation-inference
- transformers
- unsloth
- lfm2
license: apache-2.0
language:
- en
datasets:
- miriad/miriad-4.4M
---
# 🦷 DentaInstruct-1.2B
<img src="banner.png" width="800" />
**DentaInstruct-1.2B** is a fine-tuned instruction-following language model designed to assist with dental domain queries. It was trained using clinically focused questions and answers from the [`miriad/miriad-4.4M`](https://huggingface.co/datasets/miriad/miriad-4.4M) dataset, specifically filtered for the `Dental & Oral Medicine` specialty. The base model used is [`LiquidAI/LFM2-1.2B`](https://huggingface.co/LiquidAI/LFM2-1.2B), optimized with the [Unsloth](https://github.com/unslothai/unsloth) fine-tuning library.
---
## Model Details
- **Architecture**: LFM2-1.2B (decoder-only transformer)
- **Base Model**: [`LiquidAI/LFM2-1.2B`](https://huggingface.co/LiquidAI/LFM2-1.2B)
- **Fine-tuning Library**: [Unsloth](https://github.com/unslothai/unsloth)
- **Instruction Format**: Chat-style, formatted using `apply_chat_template`
- **Trained On**: Dental subset of [`miriad-4.4M`](https://huggingface.co/datasets/miriad/miriad-4.4M)
- **Compute**: Trained using Google Colab T4 (free tier)
---
## Benchmark Summary
The model was evaluated on a curated set of dental prompts with rich terminology across endodontics, periodontics, prosthodontics, and oral surgery. Responses were assessed for fluency, accuracy, and domain relevance.
-**Terminology Handling**: Excellent coverage of dental-specific terms
-**Instruction Following**: Clear, context-aware responses
-**Answer Structure**: Consistently professional and clinically coherent
- ⚠️ **Minor Hallucinations**: A few outputs demonstrated factual drift in rare cases
---
## Limitations & Warnings
This model was fine-tuned using the MIRIAD dataset, which comes with the following caution:
> *This model is trained on data that has not been manually reviewed by medical experts. It should not be used for diagnostic purposes or to inform medical decision-making. It is intended for research and educational purposes only.*
- **Not a substitute for professional dental care**
- **Do not use this model for clinical diagnosis or treatment advice**
---
## Intended Use
- Educational Q&A for dental students
- Conversational chatbots focused on oral health
- Research on clinical-domain instruction tuning
---
## Citation
If you use this model or parts of it, please consider citing:
```bibtex
@misc{miriad2024,
title={MIRIAD: Medical Instructional Record with Interactions and Answers Dataset},
author={Xue, Yutong and others},
year={2024},
url={https://huggingface.co/datasets/miriad/miriad-4.4M}
}
````
---
## Acknowledgements
* [LiquidAI](https://huggingface.co/LiquidAI) for the LFM2 model series
* [Unsloth](https://github.com/unslothai/unsloth) for training acceleration
* [MIRIAD authors](https://huggingface.co/datasets/miriad/miriad-4.4M) for the publicly available medical dataset
---
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)