mistralai/Mistral-7B-Instruct-v0.2

#90 opened over 1 year ago by

shreyassihasane

Model doesn't stop generation after answering the user question.

👍 1

#88 opened over 1 year ago by

jerinjude

How does v0.2 manage to support 32k token context without Sliding Window Attention?

#85 opened over 1 year ago by

Andriy

Will Mistral-7B-Instruct-v0.2 let me generate a response of around 8k tokens in one go?

#84 opened over 1 year ago by

akshat1311

How to prune layers in AutoModelForCausalLM

5

#83 opened over 1 year ago by

badri369

[AUTOMATED] Model Memory Requirements

#82 opened over 1 year ago by

model-sizer-bot

Update README.md

#81 opened over 1 year ago by

Austinc2003

Quantized version taking too long with CPUs

#80 opened over 1 year ago by

SukanyaM

Model inconsistency Issue

#79 opened over 1 year ago by

adityar23

LangChain Agent with Mistral-7B-Instruct-v0.2

12

#78 opened over 1 year ago by

deeplearner123

Training Data difference from v0.1

#77 opened over 1 year ago by

tsavage68

Update README.md

#76 opened over 1 year ago by

mixxz

Why was Sliding-Window Attention deprecated?

👀 12

#75 opened over 1 year ago by

matrixssy

Update config.json to accurately reflect the 32k context window.

🤗 🧠 2

#73 opened over 1 year ago by

Kearm

Was this model based on Mistral-7B-v0.2 from the start?

👀 17

#72 opened over 1 year ago by

stduhpf

Can someone from Mistral comment on what the knowledge cutoff is?

#69 opened over 1 year ago by

MarginallyEffective

Mistral-7B-Instruct-v0.2 loopy text generation with custom chat template

#68 opened over 1 year ago by

ercanucan

User input repetition after finetuning

#67 opened over 1 year ago by

nuratamton

What is the max context length of this model?

👍 2

#66 opened over 1 year ago by

flexwang

Inference API

#65 opened over 1 year ago by

Shivkumar27

cm_test

#64 opened over 1 year ago by

chenmin2001

Fine-tuned model generating both user and assistant dialogue during inference

#63 opened over 1 year ago by

sabber

Has anybody gotten this example to work for converting string data into valid JSON?

#62 opened over 1 year ago by

capnchat

Is Mistral-7B-Instruct-v0.2 down for everybody?

#61 opened over 1 year ago by

SzymonSt2808

Friendly Reminder

#60 opened over 1 year ago by

AnzaniAI

Is it possible to see embeddings once you have fine-tuned it?

#59 opened over 1 year ago by

RikoteMaster

ValueError: Bfloat16 is only supported on GPUs with compute capability of at least 8.0

3

#58 opened over 1 year ago by

itod

Instruction fine-tuning template

#57 opened over 1 year ago by

Iamexperimenting

sliding_window appears to be None. TypeError: bad operand type for unary -: 'NoneType'

👍 2

#56 opened over 1 year ago by

narai

value for sliding_window in config.json updated

#55 opened over 1 year ago by

manaschauhan

Fix the command format of "Installing transformers from source"

#53 opened over 1 year ago by

musfiqdehan

System prompt

#52 opened over 1 year ago by

VladimirNGIT

Process finished with exit code -1073741819 (0xC0000005)

#51 opened over 1 year ago by

aminev

Is there any vLLM support for this version?

9

#49 opened over 1 year ago by

Aloukik21

Mistral does not finish the answers

9

#48 opened almost 2 years ago by

expiderman

Special token (</s>) not generated by the model.generate() method

7

#47 opened almost 2 years ago by

Pradeep1995

Can we save the fine-tuned Mistral model by exporting to TorchScript?

#46 opened almost 2 years ago by

Pradeep1995

Deploying on AWS SageMaker

❤️ 1

3

#45 opened almost 2 years ago by

adhiltortil

Update config.json

#44 opened almost 2 years ago by

adhiltortil

What is the max. context length of Mistral-7B-Instruct-v0.2?

17

#43 opened almost 2 years ago by

hanshupe

Response time

👍 3

#42 opened almost 2 years ago by

Majidni

SFT Results So Bad

👍 2

#41 opened almost 2 years ago by

GokhanAI

OpenMindedChatBot Based on Mistral

#40 opened almost 2 years ago by

mghafiri

What is the version of the HuggingChat?

14

#39 opened almost 2 years ago by

aledane

OSError cached file and config.json

10

#38 opened almost 2 years ago by

shivrajhug

What is the maximum length of Mistral-7B-Instruct-v0.2?

👍 5

#37 opened almost 2 years ago by

xcjthu

Create pandas DataFrame and interact