Steelskull committed on
Commit b7ea7ed · verified · 1 Parent(s): 10fb146

Update README.md

Files changed (1)
  1. README.md +11 -16
README.md CHANGED
@@ -1898,28 +1898,23 @@ library_name: transformers
1898
  </details>
1899
  </li>
1900
  <li><span class="model-component"><a href="https://huggingface.co/EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.0" target="_blank">EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.0</a></span> Core capabilities</li>
1901
- <li><span class="model-component"><a href="https://huggingface.co/LatitudeGames/Wayfarer-Large-70B-Llama-3.3" target="_blank">LatitudeGames/Wayfarer-Large-70B-Llama-3.3</a></span> Enhanced reasoning</li>
1902
- <li><span class="model-component"><a href="https://huggingface.co/Sao10K/L3.3-70B-Euryale-v2.3" target="_blank">Sao10K/L3.3-70B-Euryale-v2.3</a></span> Improved capabilities</li>
1903
  <li><span class="model-component"><a href="https://huggingface.co/Sao10K/70B-L3.3-Cirrus-x1" target="_blank">Sao10K/70B-L3.3-Cirrus-x1</a></span> Improved coherence</li>
1904
  <li><span class="model-component"><a href="https://huggingface.co/Sao10K/L3.1-70B-Hanami-x1" target="_blank">Sao10K/L3.1-70B-Hanami-x1</a></span> Balanced responses</li>
1905
  <li><span class="model-component"><a href="https://huggingface.co/TheDrummer/Anubis-70B-v1" target="_blank">TheDrummer/Anubis-70B-v1</a></span> Enhanced detail</li>
1906
- <li><span class="model-component"><a href="https://huggingface.co/SicariusSicariiStuff/Negative_LLAMA_70B" target="_blank">SicariusSicariiStuff/Negative_LLAMA_70B</a></span> Reduced bias</li>
 
1907
  </ul>
1908
  <div class="model-description">
1909
  <h4>Model Series Overview</h4>
1910
- <p>L3.3-Electra-R1-70b represents the foundational release in a three-part model series, followed by L3.3-Cu-Mai-R1-70b (Version A) and L3.3-Mokume-Gane-R1-70b (Version C). The name "Electra" draws inspiration from the electric-powered aesthetic of the model's mascot, representing the powerful capabilities and lightning-fast responses that define this model's performance.</p>
1911
  <h4>Technical Architecture</h4>
1912
- <p>Built on a custom DeepSeek R1 Distill base (TheSkullery/L3.1x3.3-Hydroblated-R1-70B-v4.4), Electra-R1 integrates specialized components through the SCE merge method with a select_topk parameter of 0.16. The model uses float32 dtype during processing with a bfloat16 output dtype for optimized performance.</p>
1913
- <ul>
1914
- <li>EVA and Wayfarer foundations for creative expression and scene comprehension</li>
1915
- <li>Euryale, Cirrus and Hanami elements for enhanced reasoning capabilities</li>
1916
- <li>Anubis components for detailed scene description</li>
1917
- <li>Negative_LLAMA integration for balanced perspective and response</li>
1918
- </ul>
1919
  <h4>Core Capabilities</h4>
1920
- <p>As the OG model in the series, Electra-R1 serves as the gold standard and reliable baseline. User feedback consistently highlights its superior intelligence, coherence, and unique ability to provide deep character insights. Through proper prompting, the model demonstrates advanced reasoning capabilities and an "X-factor" that enables unprompted exploration of character inner thoughts and motivations.</p>
1921
  <h4>Base Architecture</h4>
1922
- <p>The model utilizes the custom Hydroblated-R1 base, engineered for stability and enhanced reasoning. The SCE merge method's settings are precisely tuned based on extensive community feedback, ensuring optimal component integration while maintaining model coherence and reliability. This foundation establishes Electra-R1 as the benchmark upon which its variant models build and expand.</p>
1923
  </div>
1924
  </div>
1925
  </div>
@@ -2230,7 +2225,7 @@ library_name: transformers
2230
  </div>
2231
  <div class="settings-content">
2232
  <div class="setting-item">
2233
- <p>'<span style="color: var(--electra-primary)">&lt;think&gt;</span> OK, as an objective, detached narrative analyst, let's think this through carefully:'</p>
2234
  </div>
2235
  </div>
2236
  </div>
@@ -2241,11 +2236,11 @@ library_name: transformers
2241
  <div class="settings-content">
2242
  <div class="setting-item">
2243
  <span class="setting-label">Prefix:</span>
2244
- <span class="setting-value">'<span style="color: var(--electra-primary)">&lt;think&gt;</span>'</span>
2245
  </div>
2246
  <div class="setting-item">
2247
  <span class="setting-label">Suffix:</span>
2248
- <span class="setting-value">'<span style="color: var(--electra-primary)">&lt;/think&gt;</span>'</span>
2249
  </div>
2250
  </div>
2251
  </div>
 
1898
  </details>
1899
  </li>
1900
  <li><span class="model-component"><a href="https://huggingface.co/EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.0" target="_blank">EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.0</a></span> Core capabilities</li>
1901
+ <li><span class="model-component"><a href="https://huggingface.co/LatitudeGames/Wayfarer-Large-70B-Llama-3.3" target="_blank">LatitudeGames/Wayfarer-Large-70B-Llama-3.3</a></span> Enhanced Storytelling and RP</li>
1902
+ <li><span class="model-component"><a href="https://huggingface.co/Sao10K/L3.3-70B-Euryale-v2.3" target="_blank">Sao10K/L3.3-70B-Euryale-v2.3</a></span> Improved all-rounder capabilities</li>
1903
  <li><span class="model-component"><a href="https://huggingface.co/Sao10K/70B-L3.3-Cirrus-x1" target="_blank">Sao10K/70B-L3.3-Cirrus-x1</a></span> Improved coherence</li>
1904
  <li><span class="model-component"><a href="https://huggingface.co/Sao10K/L3.1-70B-Hanami-x1" target="_blank">Sao10K/L3.1-70B-Hanami-x1</a></span> Balanced responses</li>
1905
  <li><span class="model-component"><a href="https://huggingface.co/TheDrummer/Anubis-70B-v1" target="_blank">TheDrummer/Anubis-70B-v1</a></span> Enhanced detail</li>
1906
+ <li><span class="model-component"><a href="https://huggingface.co/SicariusSicariiStuff/Negative_LLAMA_70B" target="_blank">SicariusSicariiStuff/Negative_LLAMA_70B</a></span> Reduced bias - Base</li>
1907
+ <li><span class="model-component"><a href="https://huggingface.co/TheDrummer/Fallen-Llama-3.3-R1-70B-v1" target="_blank">TheDrummer/Fallen-Llama-3.3-R1-70B-v1</a></span> Reduced bias - Base</li>
1908
  </ul>
1909
  <div class="model-description">
1910
  <h4>Model Series Overview</h4>
1911
+ <p>L3.3-Electra-R1-70b is the newest release of the Unnamed series and its 6th iteration, based on user feedback.</p>
1912
  <h4>Technical Architecture</h4>
1913
+ <p>Built on a custom DeepSeek R1 Distill base (TheSkullery/L3.1x3.3-Hydroblated-R1-70B-v4.4), Electra-R1 integrates specialized components through the SCE merge method. The model uses float32 dtype during processing with a bfloat16 output dtype for optimized performance.</p>
 
 
 
 
 
 
1914
  <h4>Core Capabilities</h4>
1915
+ <p>Electra-R1 serves as the newest gold standard and baseline. User feedback consistently highlights its superior intelligence, coherence, and unique ability to provide deep character insights. Through proper prompting, the model demonstrates advanced reasoning capabilities and unprompted exploration of character inner thoughts and motivations.</p>
1916
  <h4>Base Architecture</h4>
1917
+ <p>The model utilizes the custom Hydroblated-R1 base, created for stability and enhanced reasoning. The SCE merge method's settings are precisely tuned based on extensive community feedback (from over 10 different models, from Nevoria to Cu-Mai), ensuring optimal component integration while maintaining model coherence and reliability. This foundation establishes Electra-R1 as the benchmark upon which its variant models build and expand.</p>
1918
  </div>
1919
  </div>
1920
  </div>
 
2225
  </div>
2226
  <div class="settings-content">
2227
  <div class="setting-item">
2228
+ <p>'<span style="color: #00b2ff">&lt;think&gt;</span> OK, as an objective, detached narrative analyst, let's think this through carefully:'</p>
2229
  </div>
2230
  </div>
2231
  </div>
 
2236
  <div class="settings-content">
2237
  <div class="setting-item">
2238
  <span class="setting-label">Prefix:</span>
2239
+ <span class="setting-value">'<span style="color: #00b2ff">&lt;think&gt;</span>'</span>
2240
  </div>
2241
  <div class="setting-item">
2242
  <span class="setting-label">Suffix:</span>
2243
+ <span class="setting-value">'<span style="color: #00b2ff">&lt;/think&gt;</span>'</span>
2244
  </div>
2245
  </div>
2246
  </div>
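
The Prefix and Suffix values in the last hunk mark where the model's reasoning starts and ends in its output. As a rough illustration of how a client might use them, here is a minimal Python sketch that separates the reasoning block from the visible reply. The function name `split_reasoning` and the sample text are hypothetical; the excerpt does not name the front end that consumes these settings, so this is only one possible way to handle them.

```python
import re

# Values copied from the Prefix/Suffix settings shown in the diff.
THINK_PREFIX = "<think>"
THINK_SUFFIX = "</think>"


def split_reasoning(text, prefix=THINK_PREFIX, suffix=THINK_SUFFIX):
    """Return (reasoning, reply) for the first prefix...suffix block in text.

    Hypothetical helper: if no complete block is found, reasoning is an
    empty string and reply is the whole text with outer whitespace removed.
    """
    # Escape the markers so they are matched literally; DOTALL lets the
    # reasoning span multiple lines.
    pattern = re.escape(prefix) + r"(.*?)" + re.escape(suffix)
    match = re.search(pattern, text, flags=re.DOTALL)
    if match is None:
        return "", text.strip()
    reasoning = match.group(1).strip()
    # The reply is everything outside the matched block.
    reply = (text[:match.start()] + text[match.end():]).strip()
    return reasoning, reply


if __name__ == "__main__":
    sample = "<think> OK, plan the scene. </think>\nThe door creaks open."
    reasoning, reply = split_reasoning(sample)
    print("reasoning:", reasoning)
    print("reply:", reply)
```

Only the first complete block is extracted; output with an opening tag but no closing tag is returned unchanged as the reply, which keeps a truncated generation from being silently dropped.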