Here is the unofficial benchmark: https://huggingface.co/mistral-community/Mixtr...

bevekspldnw · 2024-04-11T15:15:58.000000Z

Wish it had GPT-4, that’s the one to beat still.

GuB-42 · 2024-04-11T16:13:55.000000Z

It is there, not for all the benchmarks, but for those where it is included, GPT-4 scores much higher.

Not surprising since GPT-4 is still state-of-the-art and much bigger. Where Mistral has been particularly impressive is when you take the size of the model into account.

mirekrusin · 2024-04-11T17:23:20.000000Z

GPT-4 is instruct tuned model, of course it's going to score higher, apples and oranges.

bevekspldnw · 2024-04-11T18:20:02.000000Z

Yeah and the instruct tunes provided by Mistral on other models are pretty great.