Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

lindafei001
/
llama-8b-instruct-medical-reward-10epochs-2e-5-64-128-1024

Text Generation
PEFT
TensorBoard
Safetensors
Transformers
llama
lora
reward-trainer
trl
conversational
text-generation-inference
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-8b-instruct-medical-reward-10epochs-2e-5-64-128-1024
1.66 GB
  • 1 contributor
History: 2 commits
lindafei001's picture
lindafei001
Upload trained model checkpoint
0103325 verified about 2 months ago
  • checkpoint-140
    Upload trained model checkpoint about 2 months ago
  • checkpoint-144
    Upload trained model checkpoint about 2 months ago
  • checkpoint-148
    Upload trained model checkpoint about 2 months ago
  • checkpoint-150
    Upload trained model checkpoint about 2 months ago
  • checkpoint-52
    Upload trained model checkpoint about 2 months ago
  • runs
    Upload trained model checkpoint about 2 months ago
  • .gitattributes
    1.9 kB
    Upload trained model checkpoint about 2 months ago
  • README.md
    1.63 kB
    Upload trained model checkpoint about 2 months ago
  • adapter_config.json
    972 Bytes
    Upload trained model checkpoint about 2 months ago
  • adapter_model.safetensors
    97.3 MB
    xet
    Upload trained model checkpoint about 2 months ago
  • chat_template.jinja
    3.83 kB
    Upload trained model checkpoint about 2 months ago
  • config.json
    937 Bytes
    Upload trained model checkpoint about 2 months ago
  • special_tokens_map.json
    325 Bytes
    Upload trained model checkpoint about 2 months ago
  • tokenizer.json
    17.2 MB
    xet
    Upload trained model checkpoint about 2 months ago
  • tokenizer_config.json
    50.6 kB
    Upload trained model checkpoint about 2 months ago