With open models, yes we are at the performance of at least the first release of...

sp332 · 2024-04-10T22:02:00.000000Z

Could you recommend one or a few in particular?

sanjiwatsuki · 2024-04-10T22:58:00.000000Z

The current best open weights model is probably Cohere Command-R+. The memory requirements on it are quite high, though.

bevekspldnw · 2024-04-11T18:18:59.000000Z

I really want to see some benchmarks with performance weighted by energy use. I think Mistral 7B performance to watt would be the leader by a huge margin. On many tasks I get equal performance on zero shot classification tasks on Mistral than in bigger models.