Hugging Face: Optimum-NVIDIA enables one-line LLM inference on NVIDIA GPUs with FP8 and up to 28x throughput | SignalBreak | SignalBreak