File size: 3,205 Bytes
cc15807
5b0d7e6
 
cc15807
 
 
 
 
 
 
 
8a7b38c
 
cc15807
 
633201c
 
b0651ac
 
633201c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
cc15807
 
 
5b0d7e6
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
---
base_model:
- LiquidAI/LFM2-1.2B
tags:
- text-generation-inference
- transformers
- unsloth
- lfm2
license: apache-2.0
language:
- en
datasets:
- miriad/miriad-4.4M
---

# 🦷 DentaInstruct-1.2B

<img src="banner.png" width="800" />

**DentaInstruct-1.2B** is a fine-tuned instruction-following language model designed to assist with dental domain queries. It was trained using clinically focused questions and answers from the [`miriad/miriad-4.4M`](https://huggingface.co/datasets/miriad/miriad-4.4M) dataset, specifically filtered for the `Dental & Oral Medicine` specialty. The base model used is [`LiquidAI/LFM2-1.2B`](https://huggingface.co/LiquidAI/LFM2-1.2B), optimized with the [Unsloth](https://github.com/unslothai/unsloth) fine-tuning library.

---

## Model Details

- **Architecture**: LFM2-1.2B (decoder-only transformer)
- **Base Model**: [`LiquidAI/LFM2-1.2B`](https://huggingface.co/LiquidAI/LFM2-1.2B)
- **Fine-tuning Library**: [Unsloth](https://github.com/unslothai/unsloth)
- **Instruction Format**: Chat-style, formatted using `apply_chat_template`
- **Trained On**: Dental subset of [`miriad-4.4M`](https://huggingface.co/datasets/miriad/miriad-4.4M)
- **Compute**: Trained using Google Colab T4 (free tier)

---

## Benchmark Summary

The model was evaluated on a curated set of dental prompts with rich terminology across endodontics, periodontics, prosthodontics, and oral surgery. Responses were assessed for fluency, accuracy, and domain relevance.

-**Terminology Handling**: Excellent coverage of dental-specific terms
-**Instruction Following**: Clear, context-aware responses
-**Answer Structure**: Consistently professional and clinically coherent
- ⚠️ **Minor Hallucinations**: A few outputs demonstrated factual drift in rare cases

---

## Limitations & Warnings

This model was fine-tuned using the MIRIAD dataset, which comes with the following caution:

> *This model is trained on data that has not been manually reviewed by medical experts. It should not be used for diagnostic purposes or to inform medical decision-making. It is intended for research and educational purposes only.*

- **Not a substitute for professional dental care**
- **Do not use this model for clinical diagnosis or treatment advice**

---

## Intended Use

- Educational Q&A for dental students
- Conversational chatbots focused on oral health
- Research on clinical-domain instruction tuning

---

## Citation

If you use this model or parts of it, please consider citing:

```bibtex
@misc{miriad2024,
  title={MIRIAD: Medical Instructional Record with Interactions and Answers Dataset},
  author={Xue, Yutong and others},
  year={2024},
  url={https://huggingface.co/datasets/miriad/miriad-4.4M}
}
````

---

## Acknowledgements

* [LiquidAI](https://huggingface.co/LiquidAI) for the LFM2 model series
* [Unsloth](https://github.com/unslothai/unsloth) for training acceleration
* [MIRIAD authors](https://huggingface.co/datasets/miriad/miriad-4.4M) for the publicly available medical dataset

---




[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)