AFAIK, 2-bit quant leads to too much loss of performance, such that you're bette...

aydyn 5 months ago | parent | context | favorite | on: Mistral AI Launches New 8x22B MOE Model

AFAIK, 2-bit quant leads to too much loss of performance, such that you're better off using a different smaller model altogether. See here:

翻译：