Please release official lower-bit QAT versions

#3
by nawoalanor - opened

When community members make lower-bit quantizations of MiniMax, they start from the officially distributed 8-bit version. This causes avoidable quality loss, because the weights are compressed twice: first from the internal 32-bit or 16-bit version to the official 8-bit version, then again to 4-bit. Community versions are also likely made with an inferior toolchain compared to MiniMax's internal quantization process.
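To illustrate the repeated-compression point, here is a toy sketch in plain Python. It rests on several assumptions that are not from MiniMax: the weights are random Gaussian stand-ins, both steps use simple symmetric round-to-nearest integer quantization with one scale per tensor, and the function names are made up for this example. Real releases use different formats (FP8, block-wise scales and so on), so this only shows the mechanism, not the size of the effect on an actual model.

```python
import random

def quantize(values, scale, bits):
    """Round each value to the nearest point on a signed integer grid, then dequantize."""
    qmax = 2 ** (bits - 1) - 1
    return [max(-qmax, min(qmax, round(v / scale))) * scale for v in values]

def mse(reference, approx):
    """Mean squared error between two equally long lists."""
    return sum((r - a) ** 2 for r, a in zip(reference, approx)) / len(reference)

random.seed(0)
# Hypothetical stand-in for the internal full-precision weights.
weights = [random.gauss(0.0, 1.0) for _ in range(10_000)]
absmax = max(abs(w) for w in weights)

scale8 = absmax / 127  # 8-bit grid
scale4 = absmax / 7    # 4-bit grid (same grid for both 4-bit paths)

# Path A: quantize straight from full precision to 4-bit.
direct_4bit = quantize(weights, scale4, bits=4)

# Path B: full precision -> 8-bit release -> 4-bit community quantization.
released_8bit = quantize(weights, scale8, bits=8)
requantized_4bit = quantize(released_8bit, scale4, bits=4)

err_direct = mse(weights, direct_4bit)
err_two_step = mse(weights, requantized_4bit)
print(f"direct 4-bit MSE:   {err_direct:.6f}")
print(f"two-step 4-bit MSE: {err_two_step:.6f}")
```

On a shared 4-bit grid, rounding the original value directly is the closest choice available, so the two-step path can only match it or do worse: some values get nudged across a rounding boundary by the 8-bit step and land on a farther 4-bit level.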

If MiniMax produced "official" lower-bit quantizations based directly on the internal 32-bit/16-bit version of the model, these could perform much better on end-user hardware, especially if done with quantization-aware training (QAT).

Thank you

nawoalanor changed discussion title from Please release an official 4-bit QAT version to Please release official lower-bit QAT versions

It would be dope to see a collaboration between MiniMaxAI and Nvidia to do a real nice NVFP4.
