Hugging Face: Q8-Chat: 8-bit LLM inference on Intel Xeon with SmoothQuant and Optimum Intel | SignalBreak | SignalBreak