OpenAI releases gpt-oss-safeguard for safety classification
AI Impact Summary
OpenAI is releasing gpt-oss-safeguard, a new set of open-weight reasoning models focused on safety classification. These models offer policy inference at runtime and are designed for flexible, explainable safety and rapid policy iteration. This capability allows developers to integrate safety checks directly into their applications, improving transparency and control over model behavior.
Affected Systems
Business Impact
Developers can now integrate safety classification directly into their applications, improving model transparency and control.
Models affected
- Date
- Date not specified
- Change type
- capability
- Severity
- info