InfoCapability

OpenAI releases gpt-oss-safeguard for safety classification

AI Impact Summary

OpenAI is releasing gpt-oss-safeguard, a new set of open-weight reasoning models focused on safety classification. These models offer policy inference at runtime and are designed for flexible, explainable safety and rapid policy iteration. This capability allows developers to integrate safety checks directly into their applications, improving transparency and control over model behavior.

Affected Systems

Hugging Face

Business Impact

Developers can now integrate safety classification directly into their applications, improving model transparency and control.

Models affected

Date: Date not specified
Change type: capability
Severity: info

OpenAI releases gpt-oss-safeguard for safety classification

More from OpenAI

Get alerts for OpenAI