Hacker News new | past | comments | ask | show | jobs | submit login

With open models, yes we are at the performance of at least the first release of ChatGPT 4.



Could you recommend one or a few in particular?


The current best open weights model is probably Cohere Command-R+. The memory requirements on it are quite high, though.


I really want to see some benchmarks with performance weighted by energy use. I think Mistral 7B performance to watt would be the leader by a huge margin. On many tasks I get equal performance on zero shot classification tasks on Mistral than in bigger models.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search:
  翻译: