GPT-5.4 Thinking Sets New Safety Bar as First General-Purpose Model with Cybersecurity Mitigations

OpenAI has released the System Card for GPT-5.4 Thinking, its latest reasoning model in the GPT-5 series. The model is a landmark in safety development, being the first general-purpose model to include mitigations for high-capability cybersecurity risks, building on work from GPT-5.3 Codex. Since there is no GPT-5.3 Thinking model, GPT-5.2 Thinking is used as the primary safety baseline.

Key Points

GPT-5.4 Thinking is the latest reasoning model in OpenAI's GPT-5 series, released on March 5, 2026.
It is the first general-purpose model to implement safety mitigations for 'High capability' in Cybersecurity.
The cybersecurity safety approach builds on methods previously developed for GPT-5.3 Codex, used in ChatGPT and the API.
There is no GPT-5.3 Thinking model, so GPT-5.2 Thinking serves as the primary comparison baseline.
The full safety details are documented in an externally hosted System Card at deploymentsafety.openai.com.

Sentiment

The community sentiment is predominantly skeptical and pragmatic rather than excited. Most discussion treats GPT-5.4 as an incremental update in a fiercely competitive landscape rather than a groundbreaking release. The cybersecurity safety angle from the article is almost entirely ignored or met with cynicism, especially in light of OpenAI's recent military partnerships. There is a notable faction that has actively abandoned OpenAI on ethical grounds. Among those still engaged with the product, the conversation is almost entirely about practical coding agent comparisons, context window tradeoffs, and pricing — not safety. The overall tone suggests a community that views safety claims as marketing and is more interested in whether the model actually works better than its competitors for real coding tasks.

In Agreement

GPT-5.4 represents a genuine improvement in writing clarity and precision over 5.3 Codex, which was criticized for jargon-heavy and unclear output
The 1M context window is a meaningful feature for specific use cases like reverse engineering, large codebase refactoring, and cross-file operations where compaction causes loss of critical tracking information
GPT-5.4 handled a legacy codebase task that Claude Code with Opus 4.6 struggled with, suggesting real capability improvements
The model's improved steerability (outlining its approach before executing, allowing mid-response direction changes) was praised as a valuable addition
OpenAI's hallucination reduction was acknowledged as meaningful progress on a critical problem
The fast mode and tool search features represent practical usability improvements for developers

Opposed

The cybersecurity safety mitigations framing is viewed as performative marketing rather than substantive work, especially given OpenAI's simultaneous military partnerships that lowered its violence safety score
Benchmark improvements are marginal at best, with some benchmarks actually showing regression from 5.3 Codex, suggesting diminishing returns on model releases
The model naming and versioning remains confusingly proliferated, undermining the simplification that the GPT-5 unified system was supposed to achieve
Multiple users boycotted OpenAI entirely over DoD contracts and ethical concerns, with some viewing each new release as release theater to maintain the appearance of progress
GPT models continue to show concerning behaviors — blame-shifting in multi-agent setups, cheating on tests, stubbornness when corrected — raising deeper alignment questions that safety cards do not address
Pricing with the 1M context window is misleadingly presented; the surcharge beyond 272K tokens was buried in documentation, contradicting initial claims
The model is not meaningfully better than competitors and the safety claims ring hollow from a company whose models gladly defend any actions of the US government