Deep Ignorance
This collection contains the model and data artifacts from O'Brien et al. (2025). https://deepignorance.ai
-
Paper • 2508.06601 • Published • 5
EleutherAI/deep-ignorance-unfiltered
Text Generation • 7B • Updated • 7.58k • 2Note Fully Trained — Unfiltered Baseline Model - Pretraining Filtering: None - Annealing Filtering: None - Results Location: Main Paper
EleutherAI/deep-ignorance-e2e-strong-filter
Text Generation • 7B • Updated • 130Note Fully Trained - Pretraining Filtering: Strong Filter - Annealing Filtering: Strong Filter - Results Location: Main Paper (Strong Filter)
EleutherAI/deep-ignorance-strong-filter-pt-weak-filter-anneal
Text Generation • 7B • Updated • 107Note Fully Trained - Pretraining Filtering: Strong Filter - Annealing Filtering: Weak Filter - Results Location: Main Paper (Weak Filter)
EleutherAI/deep-ignorance-e2e-weak-filter
Text Generation • 7B • Updated • 169Note Fully Trained - Pretraining Filtering: Weak Filter - Annealing Filtering: Weak Filter - Results Location: Appendix
EleutherAI/deep-ignorance-weak-filter-pt-strong-filter-anneal
Text Generation • 7B • Updated • 77Note Fully Trained - Pretraining Filtering: Weak Filter - Annealing Filtering: Strong Filter
EleutherAI/deep-ignorance-pretraining-stage-unfiltered
Text Generation • 7B • Updated • 25.7kNote Pretrained model that has not undergone annealing or any data filtering. - Pretraining Filtering: None - Results Location: Not Included
EleutherAI/deep-ignorance-pretraining-stage-strong-filter
Text Generation • 7B • Updated • 115Note Pretrained model that has not undergone annealing. - Pretraining Filtering: Strong Filter - Results Location: Not Included
EleutherAI/deep-ignorance-pretraining-stage-weak-filter
Text Generation • 7B • Updated • 124Note Pretrained model which has not undergone annealing. - Pretraining Filtering: Weak Filter - Results Location: Not Included
EleutherAI/deep-ignorance-e2e-extra-weak-filter
7B • Updated • 80Note Fully Trained - Pretraining Filtering: Extra Weak Filter - Annealing Filtering: Extra Weak Filter - Results Location: Not Included
EleutherAI/deep-ignorance-pretraining-stage-extra-weak-filter
7B • Updated • 120Note Pretrained model that has not undergone annealing. - Pretraining Filtering: Extra Weak Filter - Results Location: Not Included
EleutherAI/deep-ignorance-e2e-strong-filter-cb-lat
Text Generation • 7B • Updated • 75Note Fully Trained with Circuit Breaking & Latent Adversarial Training - Pretraining Filtering: Strong Filter - Annealing Filtering: Strong Filter - Post-training: Circuit Breaking + Latent Adversarial Training - Results Location: Main Paper (Strong Filter + CB + LAT)
EleutherAI/deep-ignorance-strong-filter-pt-weak-filter-anneal-cb-lat
Text Generation • 7B • Updated • 74Note Fully Trained with Circuit Breaking & Latent Adversarial Training - Pretraining Filtering: Strong Filter - Annealing Filtering: Weak Filter - Post-training: Circuit Breaking + Latent Adversarial Training - Results Location: Main Paper (Weak Filter + CB + LAT)
EleutherAI/deep-ignorance-unfiltered-cb
Text Generation • 7B • Updated • 79Note Fully Trained — Unfiltered Baseline Model with Circuit Breaking - Pretraining Filtering: None - Annealing Filtering: None - Post-training: Circuit Breaking - Results Location: Main Paper (CB)
EleutherAI/deep-ignorance-unfiltered-cb-lat
Text Generation • 7B • Updated • 96Note Fully Trained — Unfiltered Baseline Model with Circuit Breaking & Latent Adversarial Training - Pretraining Filtering: None - Annealing Filtering: None - Post-training: Circuit Breaking + Latent Adversarial Training - Results Location: Main Paper (CB + LAT)
EleutherAI/deep-ignorance-e2e-strong-filter-cb
Text Generation • 7B • Updated • 81Note Fully Trained with Circuit Breaking - Pretraining Filtering: Strong Filter - Annealing Filtering: Strong Filter - Post-training: Circuit Breaking - Results Location: Main Paper (Strong Filter + CB)
EleutherAI/deep-ignorance-strong-filter-pt-weak-filter-anneal-cb
Text Generation • 7B • Updated • 80Note Fully Trained with Circuit Breaking - Pretraining Filtering: Strong Filter - Annealing Filtering: Weak Filter - Post-training: Circuit Breaking - Results Location: Main Paper (Weak Filter + CB)
EleutherAI/deep-ignorance-e2e-strong-filter-weak-knowledge-corrupted
Text Generation • 7B • Updated • 81Note Fully Trained - Pretraining Filtering: Strong Filter - Annealing Filtering: Strong Filter - Post-training: Weak Knowledge Corruption via Synthetic Document Fine-Tuning - Results Location: Main Paper & Appendix
EleutherAI/deep-ignorance-e2e-strong-filter-strong-knowledge-corrupted
Text Generation • 7B • Updated • 92Note Fully Trained - Pretraining Filtering: Strong Filter - Annealing Filtering: Strong Filter - Post-training: Strong Knowledge Corruption via Synthetic Document Fine-Tuning - Results Location: Main Paper & Appendix
EleutherAI/wmdp_bio_cloze
Viewer • Updated • 1.27k • 2.73kNote All prompts from WMDP-Bio that can be evaluated using a cloze-style prompt.
EleutherAI/wmdp_bio_robust_mcqa
Viewer • Updated • 1.27k • 471Note WMDP-Bio, where data is broken down by topic category and whether it contains likely shortcuts.
EleutherAI/mmlu_test_task_training_mix
Viewer • Updated • 200k • 27Note General knowledge multiple-choice and cloze-style prompts that are used to ensure that models are familiar with the MCQA test benchmarks, like WMDP and MMLU.
EleutherAI/deep-ignorance-annealing-mix
Viewer • Updated • 89M • 556 • 1Note The original annealing dataset for training the LLMs. This dataset is not filtered.
EleutherAI/deep-ignorance-pretraining-mix
Viewer • Updated • 410M • 2.16k • 2Note The original pretraining dataset for training the LLMs. This dataset is not filtered.
-
EleutherAI/deep-ignorance-random-init
Text Generation • 7B • Updated • 174