#93 opened over 1 year ago
by
liketheflower
deepspeed inference tensor parallelism memory footprint doesn't decrease with deepspeed tp_size increase.
6
#92 opened over 1 year ago
by
jiangtaozh
why put MistralRotaryEmbedding in each attention layer instead of putting only once before the first attention layer?
#91 opened over 1 year ago
by
liougehooa
How to use this model in next js?
2
#90 opened over 1 year ago
by
shreyassihasane
Model doesn't stop generation after answering the user question.
1
2
#88 opened over 1 year ago
by
jerinjude
How does v0.2 manage to support 32k token context without Sliding Window Attention?
4
#85 opened over 1 year ago
by
Andriy
will Mistral-7B-Instruct-v0.2 let me generate a response of around 8k tokens in one go?
#84 opened over 1 year ago
by
akshat1311
How to prune layers in AutoModelForCausalModel
5
#83 opened over 1 year ago
by
badri369
[AUTOMATED] Model Memory Requirements
#82 opened over 1 year ago
by
model-sizer-bot
Update README.md
#81 opened over 1 year ago
by
Austinc2003
Quantized version taking too long with CPUs
#80 opened over 1 year ago
by
SukanyaM
Model inconsistency Issue
#79 opened over 1 year ago
by
adityar23
LangChain Agent with Mistral-7B-Instruct-v0.2
12
#78 opened over 1 year ago
by
deeplearner123
Training Data difference from v0.1
#77 opened over 1 year ago
by
tsavage68
Update README.md
#76 opened over 1 year ago
by
mixxz
Why was Sliding-Window Attention deprecated?
12
#75 opened over 1 year ago
by
matrixssy
Update config.json to accurately reflect the 32k context window.
2
4
#73 opened over 1 year ago
by
Kearm
Was this model based on Mistral-7B-v0.2 from the start?
17
4
#72 opened over 1 year ago
by
stduhpf
Can someone from Mistral comment on what the knowledge cutoff is?
1
#69 opened over 1 year ago
by
MarginallyEffective
Mistral-7B-Instruct-v0.2 loopy text generation with custom chat template
4
#68 opened over 1 year ago
by
ercanucan
User input repetition after finetuning
1
#67 opened over 1 year ago
by
nuratamton
What is the max context length of this model?
2
1
#66 opened over 1 year ago
by
flexwang
Inference API
1
#65 opened over 1 year ago
by
Shivkumar27
cm_test
#64 opened over 1 year ago
by
chenmin2001
Fine-tuned model generating both user and assistant dialogues during inference
1
#63 opened over 1 year ago
by
sabber
Has anybody gotten this example to work for converting string data into valid JSON?
2
#62 opened over 1 year ago
by
capnchat
Is mistral7b instruct v0.2 down for everybody?
2
#61 opened over 1 year ago
by
SzymonSt2808
Friendly Reminder
#60 opened over 1 year ago
by
AnzaniAI
Is it possible to see embeddings once you have fine-tuned it?
#59 opened over 1 year ago
by
RikoteMaster
ValueError: Bfloat16 is only supported on GPUs with compute capability of at least 8.0
3
#58 opened over 1 year ago
by
itod
instruction fine tuning template
2
#57 opened over 1 year ago
by
Iamexperimenting
sliding_window appears to be None. TypeError: bad operand type for unary -: 'NoneType'
2
4
#56 opened over 1 year ago
by
narai
value for sliding_window in config.json updated
1
#55 opened over 1 year ago
by
manaschauhan
Fix the command format of "Installing transformers from source"
#53 opened over 1 year ago
by
musfiqdehan
System prompt
4
#52 opened over 1 year ago
by
VladimirNGIT
Process finished with exit code -1073741819 (0xC0000005)
1
#51 opened over 1 year ago
by
aminev
Is there any vllm support for this version?
9
#49 opened over 1 year ago
by
Aloukik21
Mistral does not finish the answers
9
#48 opened almost 2 years ago
by
expiderman
Special token (</s>) not generating in the model.generate() method
7
#47 opened almost 2 years ago
by
Pradeep1995
Can we save the finetuned Mistral model by exporting to TorchScript
1
#46 opened almost 2 years ago
by
Pradeep1995
deploying on aws sagemaker.
1
3
#45 opened almost 2 years ago
by
adhiltortil
Update config.json
#44 opened almost 2 years ago
by
adhiltortil
What is the max. content length of Mistral-7B-Instruct-v0.2?
17
#43 opened almost 2 years ago
by
hanshupe
Response time
3
1
#42 opened almost 2 years ago
by
Majidni
SFT Results So Bad
2
2
#41 opened almost 2 years ago
by
GokhanAI
OpenMindedChatBot Based on Mistral
#40 opened almost 2 years ago
by
mghafiri
What is the version of the HuggingChat?
14
#39 opened almost 2 years ago
by
aledane
OSError cached file and config.json
10
#38 opened almost 2 years ago
by
shivrajhug
What is the maximum length of Mistral-7B-Instruct-v0.2?
5
#37 opened almost 2 years ago
by
xcjthu
create panda dataframe and interact
1
#36 opened almost 2 years ago
by
DonYar