Hugging Face: Quanto: PyTorch quantization backend for Optimum enables int2/int4/int8 and float8 quantization in Transformers workflows | SignalBreak | SignalBreak