Hugging Face: Assisted Generation: a low-latency path using Flash Attention, INT8, and tensor parallelism | SignalBreak | SignalBreak