SentenceTransformer based on thebajajra/RexBERT-base-embed-pf-v0.1

This is a sentence-transformers model finetuned from thebajajra/RexBERT-base-embed-pf-v0.1 on the nomic-embed-supervised-data dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 1024, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
queries = [
    "None of this proves that Loral wasn\u0027t disloyal or criminally negligent in its dealings with China.",
]
documents = [
    'There is nothing that proves that Loral was negligent in dealing with China. ',
    'Trimipramine is a tricyclic antidepressant. Trimipramine affects chemicals in the brain that may become unbalanced. Trimipramine is used to treat symptoms of depression.Trimipramine may also be used for purposes not listed in this medication guide. You should not take trimipramine if you have recently had a heart attack. Do not use trimipramine if you have used an MAO inhibitor in the past 14 days.A dangerous drug interaction could occur.ou should not use trimipramine if you are allergic to it, or if you have: 1  if you have recently had a heart attack; or. 2  if you are allergic to antidepressants such as amitriptyline, amoxapine, clomipramine, desipramine, doxepin, imipramine, nortriptyline, or protriptyline.',
    'Lorraine Dunn Lorraine Dunn (12 September 1942 – 16 October 2003) was a Panamanian sprinter. She competed in the 4 × 100 metres relay at the 1960 Summer Olympics and the 1964 Summer Olympics. Dunn finished third in the 200 metres and finished fourth in the 80 metres hurdles at the 1963 Pan American Games.',
]
query_embeddings = model.encode_query(queries)
document_embeddings = model.encode_document(documents)
print(query_embeddings.shape, document_embeddings.shape)
# [1, 768] [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(query_embeddings, document_embeddings)
print(similarities)
# tensor([[ 0.7875, -0.0236, -0.0537]])

Training Details

Training Dataset

nomic-embed-supervised-data

  • Dataset: nomic-embed-supervised-data at 13eef8a
  • Size: 1,611,024 training samples
  • Columns: query, positive, and negative
  • Approximate statistics based on the first 1000 samples:
    query positive negative
    type string string list
    details
    • min: 4 tokens
    • mean: 37.16 tokens
    • max: 1024 tokens
    • min: 5 tokens
    • mean: 99.03 tokens
    • max: 1024 tokens
    • min: 20 elements
    • mean: 155.14 elements
    • max: 209 elements
  • Samples:
    query positive negative
    Japan's biggest daily newspaper, Asahi Shimbun, published an interview Sunday with Woody Allen, who, asked to sum up the 20 th century in a single word, replied, Disappointing. Woody Allen described the 20th century as "disappointing'' in an interview with the Japaense newspaper, Asahi Shimbun. ['Billy Joel described the 20th century as "outlandish" in a recent interview with Japan's leading newspaper. ', "aside for the kids' education and--BOP--I have so much trouble with that the", 'An old man is jogging.', 'It was my favorite spot.', 'Two women are playing volleyball.', ...]
    Sedimentation is the term for when clumped solids sink to the bottom of the water. Deposition occurs where the water motion slows. ['The law of superposition is best described by: in undisturbed layers of sedimentary rock, the lowest layers contain the older rocks.', "Review: I felt this film - throughout. I waas impressed with Russell Crowe's talent in developing his relationship with Lillie, such a typical Aussie blend of softly softly approach, a bit self depreciating and very persistent. Really loved the cinematography and direction. Pace was just right and the portrayals of nearly all characters was impressive.Gosh, didn't Russell's talent even in 1993 shine! .. and I have yet to see Gladiator. Question: is it a negative review?", 'Question: How many times did Chopin and Liszy perform together in public? (Answer: seven).', 'Fact 1: Birth may be followed by a period of parental care of the offspring. \nFact 2: Human birth is the rarest of all births.', 'acts. Id. at 963. The Lawson court stated that a "critical distinction” of Pylant from the facts before it in Lawson was that in Pylant, there was "not a factu...
    Volleyball involves techniques like jumping. Volleyball Volleyball is a team sport in which two teams of six players are separated by a net . Each team tries to score points by grounding a ball on the other team 's court under organized rules . It has been a part of the official program of the Summer Olympic Games since 1964 . The complete rules are extensive . But simply , play proceeds as follows : a player on one of the teams begins a ` rally ' by serving the ball ( tossing or releasing it and then hitting it with a hand or arm ) , from behind the back boundary line of the court , over the net , and into the receiving team 's court . The receiving team must not let the ball be grounded within their court . The team may touch the ball up to 3 times but individual players may not touch the ball twice consecutively . Typically , the first two touches are used to set up for an attack , an attempt to direct the ball back over the net in such a way that the serving team is unable to prevent it from being grounded in their court . ... ['Volleyball jump serve The Volleyball jump Serve is a type of volleyball serve where the player increases the power and height of their serve by jumping into the hit . The Jump Serve itself was popularized by the brazilian national volleyball team in 1984 on the Olympics .', "Volleyball variations As volleyball is one of the world 's most popular team sports , second only to football ( soccer ) in the number of players , there are numerous variations of the basic rules . The rules have changed around the world since its creation in 1895 , as skills have developed , to make the game more suited for spectators , for learning or other special needs . Some variations have been included as a change in the international rules by Fédération Internationale de Volleyball ( FIVB ) , others have resulted in variations with specific names . The more notable variations include :", 'Volleyball (ball) A volleyball is a ball used to play indoor volleyball , beach volleyball , or other less common v...
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim",
        "gather_across_devices": false
    }
    

Evaluation Dataset

nomic-embed-supervised-data

  • Dataset: nomic-embed-supervised-data at 13eef8a
  • Size: 84,795 evaluation samples
  • Columns: query, positive, and negative
  • Approximate statistics based on the first 1000 samples:
    query positive negative
    type string string list
    details
    • min: 4 tokens
    • mean: 34.78 tokens
    • max: 1024 tokens
    • min: 5 tokens
    • mean: 91.63 tokens
    • max: 1024 tokens
    • min: 15 elements
    • mean: 151.78 elements
    • max: 209 elements
  • Samples:
    query positive negative
    I think people are upset that they're "copying" a mechanic. Which is ridiculous. Every platformer copied from Mario. That doesn't mean they weren't good games. And this says nothing of the end result of the flow of the game. The people who say spiderman is copying another game. Are the same retarded little children who think any game that has drivable vehicles in it, is a gta rip off.

    You can't take them serious.
    ['i unfollowed all of them earlier this year theres too many and all their tweets and instaposts made up damn near half my feed i said id just wait for the music and then iridesence came out and was pretty mid so ', 'There are five, FIVE fucking boxes of doughnuts by the printer in the office this morning and I am overtired and hungry and in the worst mood, and I’m sorry r/1200isplenty, tea is not a goddamn dessert substitute; I keep drinking it and not only am I not satiated, I have to pee every ten minutes, so I’m here guzzling tea and growling ”nothing tastes as good as skinny feels” under my breath and if I sound crazy, it’s because I am. \n\nEdit: down to one box. Just checked the calories for one doughnut—380. Three hundred. And eighty. Calories. For one doughnut. ', 'What a strange thing to call your cock.', "It doesn't. That's a very personal decision. You're not right or wrong on that. I was just replying because you were clear you don't want to raise the child as your own....
    duties of a medical assistant for resume Include clinical skills in a health care resume. Those that apply to a medical assistant job may be: 1 Preparing patients, including taking of medical histories. 2 The ability to take vital signs. 3 Preparing medication and treatments. 4 Basic first aid, CPR and infection control knowledge. Assisting physicians with exams. ['Include clinical skills in a health care resume. Those that apply to a medical assistant job may be: 1 Preparing patients, including taking of medical histories. 2 The ability to take vital signs. 3 Preparing medication and treatments. Basic first aid, CPR and infection control knowledge.', 'Clinical Medical Assistant Resume. The position of a medical assistant is a key position in any hospital, clinic or a health care center. A medical assistant may perform a variety of duties, pertaining to administrative, front office or clinical duties. Here we will discuss the job profile of a clinical medical assistant. Also, there is a clinical medical assistant resume example provided below, for your further reference.', "Community Q&A. A medical assistant is a member of a health care team that undertakes administrative and clinical job duties. A health care resume must emphasize skills as well as education and job experience. Learn how to write a resume for a medical assistant job and yo...
    when did british rule end in south africa History of South Africa British colonies: Cape Colony, Natal Colony, Transvaal Colony, and Orange River Colony. The country became a self-governing nation state within the British Empire, in 1934 following enactment of the Status of the Union Act. The dominion came to an end on 31 May 1961 as the consequence of a 1960 referendum, which legitimised the country becoming a sovereign state named Republic of South Africa. A republican constitution was adopted. From 1948–1994, South African politics were dominated by Afrikaner nationalism. Racial segregation and white minority rule known officially as apartheid, an Afrikaans word meaning "separateness”, came into existence in 1948 (under ["South Africa 1910. The Union was a dominion that included the former territories of the Cape, Transvaal and Natal colonies, as well as the Orange Free State republic. The Natives' Land Act of 1913 severely restricted the ownership of land by blacks; at that stage natives controlled only 7% of the country. The amount of land reserved for indigenous peoples was later marginally increased. In 1931, the union was fully sovereign from the United Kingdom with the passage of the Statute of Westminster, which abolished the last powers of the British Government on the country. In 1934, the South African Party and", 'History of South Africa conducted raids in Free State territories. Both sides adopted scorched-earth tactics, with large swathes of pasturage and cropland being destroyed. Faced with starvation, Moshoeshoe signed a peace treaty on 15 October 1858, though crucial boundary issues remained unresolved. War broke out again in 1865. After an unsuccessful appeal for aid from the British ...
  • Loss: MultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim",
        "gather_across_devices": false
    }
    

Training Hyperparameters

Non-Default Hyperparameters

  • eval_strategy: steps
  • per_device_train_batch_size: 256
  • per_device_eval_batch_size: 128
  • learning_rate: 0.0001
  • num_train_epochs: 20
  • warmup_ratio: 0.1
  • bf16: True
  • dataloader_num_workers: 10
  • dataloader_prefetch_factor: 10

All Hyperparameters

Click to expand
  • overwrite_output_dir: False
  • do_predict: False
  • eval_strategy: steps
  • prediction_loss_only: True
  • per_device_train_batch_size: 256
  • per_device_eval_batch_size: 128
  • per_gpu_train_batch_size: None
  • per_gpu_eval_batch_size: None
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 0.0001
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1.0
  • num_train_epochs: 20
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: {}
  • warmup_ratio: 0.1
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • save_safetensors: True
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • no_cuda: False
  • use_cpu: False
  • use_mps_device: False
  • seed: 42
  • data_seed: None
  • jit_mode_eval: False
  • bf16: True
  • fp16: False
  • fp16_opt_level: O1
  • half_precision_backend: auto
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: 0
  • ddp_backend: None
  • tpu_num_cores: None
  • tpu_metrics_debug: False
  • debug: []
  • dataloader_drop_last: True
  • dataloader_num_workers: 10
  • dataloader_prefetch_factor: 10
  • past_index: -1
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_min_num_params: 0
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • fsdp_transformer_layer_cls_to_wrap: None
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • parallelism_config: None
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch_fused
  • optim_args: None
  • adafactor: False
  • group_by_length: False
  • length_column_name: length
  • project: huggingface
  • trackio_space_id: trackio
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • use_legacy_prediction_loop: False
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: None
  • hub_always_push: False
  • hub_revision: None
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_inputs_for_metrics: False
  • include_for_metrics: []
  • eval_do_concat_batches: True
  • fp16_backend: auto
  • push_to_hub_model_id: None
  • push_to_hub_organization: None
  • mp_parameters:
  • auto_find_batch_size: False
  • full_determinism: False
  • torchdynamo: None
  • ray_scope: last
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • include_tokens_per_second: False
  • include_num_input_tokens_seen: no
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • use_liger_kernel: False
  • liger_kernel_config: None
  • eval_use_gather_object: False
  • average_tokens_across_devices: True
  • prompts: None
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: proportional
  • router_mapping: {}
  • learning_rate_mapping: {}

Training Logs

Click to expand
Epoch Step Training Loss Validation Loss
0.0127 10 1.278 -
0.0254 20 1.2803 -
0.0382 30 1.2583 -
0.0509 40 1.2256 -
0.0636 50 1.2424 -
0.0763 60 1.148 -
0.0891 70 1.1033 -
0.1018 80 1.0141 -
0.1145 90 0.8885 -
0.1272 100 0.7322 -
0.1399 110 0.5795 -
0.1527 120 0.4847 -
0.1654 130 0.4266 -
0.1781 140 0.3894 -
0.1908 150 0.3716 -
0.2036 160 0.3361 -
0.2163 170 0.33 -
0.2290 180 0.3192 -
0.2417 190 0.3084 -
0.2545 200 0.3092 -
0.2672 210 0.298 -
0.2799 220 0.2783 -
0.2926 230 0.2855 -
0.3053 240 0.2767 -
0.3181 250 0.2743 -
0.3308 260 0.2677 -
0.3435 270 0.2644 -
0.3562 280 0.2662 -
0.3690 290 0.2595 -
0.3817 300 0.2548 -
0.3944 310 0.2587 -
0.4071 320 0.2538 -
0.4198 330 0.2525 -
0.4326 340 0.2522 -
0.4453 350 0.2545 -
0.4580 360 0.2408 -
0.4707 370 0.2548 -
0.4835 380 0.2419 -
0.4962 390 0.2276 -
0.5 393 - 0.1599
0.5089 400 0.2377 -
0.5216 410 0.2309 -
0.5344 420 0.2322 -
0.5471 430 0.2277 -
0.5598 440 0.2286 -
0.5725 450 0.2338 -
0.5852 460 0.2247 -
0.5980 470 0.2239 -
0.6107 480 0.2317 -
0.6234 490 0.2262 -
0.6361 500 0.2282 -
0.6489 510 0.2165 -
0.6616 520 0.2186 -
0.6743 530 0.2169 -
0.6870 540 0.2197 -
0.6997 550 0.2203 -
0.7125 560 0.2183 -
0.7252 570 0.2188 -
0.7379 580 0.2167 -
0.7506 590 0.2026 -
0.7634 600 0.2174 -
0.7761 610 0.2129 -
0.7888 620 0.207 -
0.8015 630 0.2123 -
0.8142 640 0.2079 -
0.8270 650 0.2176 -
0.8397 660 0.2123 -
0.8524 670 0.2061 -
0.8651 680 0.2058 -
0.8779 690 0.2027 -
0.8906 700 0.2138 -
0.9033 710 0.2025 -
0.9160 720 0.2015 -
0.9288 730 0.2112 -
0.9415 740 0.2142 -
0.9542 750 0.2 -
0.9669 760 0.2046 -
0.9796 770 0.1986 -
0.9924 780 0.2044 -
1.0 786 - 0.1360
1.0051 790 0.1905 -
1.0178 800 0.1724 -
1.0305 810 0.1899 -
1.0433 820 0.1739 -
1.0560 830 0.1682 -
1.0687 840 0.1804 -
1.0814 850 0.1794 -
1.0941 860 0.1735 -
1.1069 870 0.1792 -
1.1196 880 0.1849 -
1.1323 890 0.178 -
1.1450 900 0.1824 -
1.1578 910 0.1841 -
1.1705 920 0.1874 -
1.1832 930 0.1759 -
1.1959 940 0.1737 -
1.2087 950 0.1779 -
1.2214 960 0.1866 -
1.2341 970 0.1806 -
1.2468 980 0.174 -
1.2595 990 0.1803 -
1.2723 1000 0.1707 -
1.2850 1010 0.1796 -
1.2977 1020 0.1769 -
1.3104 1030 0.1885 -
1.3232 1040 0.1752 -
1.3359 1050 0.1856 -
1.3486 1060 0.1782 -
1.3613 1070 0.1772 -
1.3740 1080 0.1841 -
1.3868 1090 0.1774 -
1.3995 1100 0.1822 -
1.4122 1110 0.1837 -
1.4249 1120 0.176 -
1.4377 1130 0.1865 -
1.4504 1140 0.1841 -
1.4631 1150 0.1736 -
1.4758 1160 0.1788 -
1.4885 1170 0.1821 -
1.5 1179 - 0.1318
1.5013 1180 0.1753 -
1.5140 1190 0.1739 -
1.5267 1200 0.1841 -
1.5394 1210 0.1749 -
1.5522 1220 0.1847 -
1.5649 1230 0.1731 -
1.5776 1240 0.1901 -
1.5903 1250 0.1777 -
1.6031 1260 0.1812 -
1.6158 1270 0.1861 -
1.6285 1280 0.1808 -
1.6412 1290 0.1834 -
1.6539 1300 0.1797 -
1.6667 1310 0.1822 -
1.6794 1320 0.1846 -
1.6921 1330 0.1829 -
1.7048 1340 0.1838 -
1.7176 1350 0.1832 -
1.7303 1360 0.1766 -
1.7430 1370 0.186 -
1.7557 1380 0.1886 -
1.7684 1390 0.1794 -
1.7812 1400 0.1851 -
1.7939 1410 0.1785 -
1.8066 1420 0.1887 -
1.8193 1430 0.1766 -
1.8321 1440 0.1746 -
1.8448 1450 0.1826 -
1.8575 1460 0.1861 -
1.8702 1470 0.1758 -
1.8830 1480 0.1801 -
1.8957 1490 0.178 -
1.9084 1500 0.1771 -
1.9211 1510 0.17 -
1.9338 1520 0.1827 -
1.9466 1530 0.185 -
1.9593 1540 0.1814 -
1.9720 1550 0.1824 -
1.9847 1560 0.1738 -
1.9975 1570 0.1805 -
2.0 1572 - 0.1268
2.0102 1580 0.1264 -
2.0229 1590 0.1183 -
2.0356 1600 0.1155 -
2.0483 1610 0.1232 -
2.0611 1620 0.1133 -
2.0738 1630 0.1208 -
2.0865 1640 0.1218 -
2.0992 1650 0.111 -
2.1120 1660 0.1159 -
2.1247 1670 0.1194 -
2.1374 1680 0.1235 -
2.1501 1690 0.1207 -
2.1628 1700 0.1193 -
2.1756 1710 0.1279 -
2.1883 1720 0.1229 -
2.2010 1730 0.1252 -
2.2137 1740 0.1189 -
2.2265 1750 0.1242 -
2.2392 1760 0.1321 -
2.2519 1770 0.1243 -
2.2646 1780 0.125 -
2.2774 1790 0.1218 -
2.2901 1800 0.1324 -
2.3028 1810 0.1342 -
2.3155 1820 0.1272 -
2.3282 1830 0.1252 -
2.3410 1840 0.1261 -
2.3537 1850 0.1297 -
2.3664 1860 0.1224 -
2.3791 1870 0.1294 -
2.3919 1880 0.119 -
2.4046 1890 0.1269 -
2.4173 1900 0.1286 -
2.4300 1910 0.1262 -
2.4427 1920 0.1255 -
2.4555 1930 0.1229 -
2.4682 1940 0.129 -
2.4809 1950 0.127 -
2.4936 1960 0.1226 -
2.5 1965 - 0.1274
2.5064 1970 0.1292 -
2.5191 1980 0.1306 -
2.5318 1990 0.1241 -
2.5445 2000 0.1229 -
2.5573 2010 0.1293 -
2.5700 2020 0.1271 -
2.5827 2030 0.123 -
2.5954 2040 0.1344 -
2.6081 2050 0.1252 -
2.6209 2060 0.126 -
2.6336 2070 0.1307 -
2.6463 2080 0.1272 -
2.6590 2090 0.1264 -
2.6718 2100 0.1322 -
2.6845 2110 0.1245 -
2.6972 2120 0.1278 -
2.7099 2130 0.133 -
2.7226 2140 0.1309 -
2.7354 2150 0.1366 -
2.7481 2160 0.1279 -
2.7608 2170 0.1292 -
2.7735 2180 0.1236 -
2.7863 2190 0.1273 -
2.7990 2200 0.1308 -
2.8117 2210 0.135 -
2.8244 2220 0.1302 -
2.8372 2230 0.1248 -
2.8499 2240 0.1206 -
2.8626 2250 0.1254 -
2.8753 2260 0.1229 -
2.8880 2270 0.1305 -
2.9008 2280 0.1288 -
2.9135 2290 0.1284 -
2.9262 2300 0.1293 -
2.9389 2310 0.1354 -
2.9517 2320 0.127 -
2.9644 2330 0.124 -
2.9771 2340 0.1307 -
2.9898 2350 0.1326 -
3.0 2358 - 0.1239

Framework Versions

  • Python: 3.11.13
  • Sentence Transformers: 5.1.2
  • Transformers: 4.57.1
  • PyTorch: 2.8.0+cu129
  • Accelerate: 1.11.0
  • Datasets: 4.3.0
  • Tokenizers: 0.22.1

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

MultipleNegativesRankingLoss

@misc{henderson2017efficient,
    title={Efficient Natural Language Response Suggestion for Smart Reply},
    author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
    year={2017},
    eprint={1705.00652},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}
Downloads last month
55
Safetensors
Model size
0.1B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for thebajajra/RexBERT-base-embed-pf-v0.4a

Finetuned
(2)
this model

Dataset used to train thebajajra/RexBERT-base-embed-pf-v0.4a