Hugging Face: OpenAI releases agent skill for custom CUDA kernels for LLM training | SignalBreak | SignalBreak