OpenAI Drops GPT-5-Codex Bombshell: Week of 1 September 2025
OpenAI just dropped the biggest coding model release since GitHub Copilot went mainstream. GPT-5-Codex launched this week with deep CLI and VS Code integration, forcing every development team to reconsider their AI toolchain strategy.
The Big Moves
OpenAI's GPT-5-Codex Changes the Developer Game
OpenAI's GPT-5-Codex release on 1 September represents the most significant advancement in AI-assisted development since the original Codex. This isn't just another model update; it's a complete reimagining of how developers interact with AI.
The integration with Codex CLI and Visual Studio Code extension creates a seamless workflow that existing tools simply can't match. Unlike previous iterations that required constant context switching, GPT-5-Codex embeds directly into developer environments. Early access requires registration through the Foundry portal, with existing GPT-5 users automatically gaining access.
What makes this release critical isn't just the technical capabilities; it's the migration pressure. Development teams using competing solutions now face a stark choice: stick with increasingly outdated tools or invest in significant code updates to integrate GPT-5-Codex. The model's enhanced code generation capabilities and reasoning improvements suggest this isn't a decision teams can postpone indefinitely.
The timing couldn't be more strategic. With GitHub Copilot facing increased competition and developer productivity becoming a board-level concern, OpenAI has positioned itself as the definitive solution for AI-assisted development. Teams that delay adoption risk falling behind competitors who embrace the new capabilities immediately.
Google Introduces Hidden Costs in Vertex AI RAG Engine
Google's decision to introduce Spanner database billing charges for Vertex AI RAG Engine's serverless option, effective 3 September, represents a significant shift in cloud AI economics. This change transforms what appeared to be a simple, predictable pricing model into something far more complex.
The serverless RAG Engine previously abstracted away database costs, making it attractive for teams wanting to avoid infrastructure management. Now, customers face additional Spanner charges that could substantially impact their AI budgets. This is particularly problematic for organisations that built their cost models around the previous pricing structure.
What's concerning isn't just the additional cost; it's the precedent. Google has effectively moved from transparent, usage-based pricing to a model where core infrastructure costs are separated and charged independently. This mirrors traditional cloud pricing complexity that many teams hoped to avoid with serverless AI solutions.
Organisations using Vertex AI RAG Engine need to immediately reassess their cost projections and consider whether the convenience of managed RAG justifies the new pricing structure. For some, this change may trigger a migration to alternative solutions or a move to self-managed RAG implementations.
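As a starting point for that reassessment, the added Spanner charge can be sketched as a simple cost model. The rates below are hypothetical placeholders, not Google's published prices; substitute the Spanner compute and storage rates for your region before relying on the numbers.

```python
# Rough cost-model sketch for the new Spanner charges on the serverless
# RAG Engine. Both rates are HYPOTHETICAL placeholders for illustration;
# look up the actual Cloud Spanner prices for your region.
HYPOTHETICAL_NODE_HOUR_USD = 0.90          # assumed compute rate per node-hour
HYPOTHETICAL_STORAGE_GB_MONTH_USD = 0.30   # assumed storage rate per GB-month

def estimate_monthly_spanner_cost(node_hours: float, storage_gb: float) -> float:
    """Estimate the added monthly Spanner bill for a RAG corpus."""
    compute = node_hours * HYPOTHETICAL_NODE_HOUR_USD
    storage = storage_gb * HYPOTHETICAL_STORAGE_GB_MONTH_USD
    return round(compute + storage, 2)

# Example: one node running all month (730 node-hours) with a 50 GB corpus.
print(estimate_monthly_spanner_cost(730, 50))  # → 672.0 under the assumed rates
```

Even a crude model like this makes it obvious whether the managed convenience still beats a self-managed vector store at your corpus size.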
Amazon OpenSearch Serverless Achieves FIPS 140-2 Compliance
Amazon OpenSearch Serverless gained FIPS 140-2 support on 5 September, opening significant new market opportunities, particularly in government and regulated industries. The addition removes a major barrier that had prevented many organisations from adopting OpenSearch Serverless for sensitive workloads.
FIPS 140-2 compliance isn't just a checkbox; it's a gateway to entire market segments that were previously inaccessible. Government contractors, financial services firms, and healthcare organisations can now consider OpenSearch Serverless for applications that handle classified or regulated data. This represents a substantial competitive advantage over providers that haven't achieved similar compliance levels.
The implementation appears seamless, with no reported migration requirements for existing users. This suggests AWS has built FIPS support into the underlying infrastructure rather than requiring customer-side changes. For organisations with compliance requirements, this eliminates the need to manage separate FIPS-compliant infrastructure while maintaining the benefits of serverless architecture.
Worth Watching
Groq Launches Compound Model with 25% Accuracy Boost
Groq's Compound model release on 4 September deserves attention for its integrated agentic tools and significant accuracy improvements. Built on GPT-OSS-120B and Llama foundations, Compound offers web search, code execution, and browser automation capabilities that competing platforms provide as separate services. The 25% accuracy increase over previous models, combined with enhanced rate limits, positions Groq as a serious alternative for teams seeking advanced reasoning capabilities without the complexity of multiple tool integrations.
Groq Adds Kimi K2-0905 with 256K Context Window
The addition of Moonshot AI's Kimi K2-0905 model to GroqCloud on 5 September brings impressive specifications: a 256K context window and prompt caching that can reduce costs by up to 50%. The model's enhanced agentic coding capabilities and frontend development performance improvements make it particularly relevant for development teams. The prompt caching feature addresses one of the biggest cost concerns with large context models, potentially making complex applications more economically viable.
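The economics of prompt caching are easy to reason about with a blended-rate calculation. The sketch below uses the "up to 50%" discount Groq cites; the per-million-token price is an assumed placeholder, not a published rate.

```python
def blended_input_cost(total_tokens: int, cached_fraction: float,
                       price_per_mtok: float, cache_discount: float = 0.5) -> float:
    """Blend cached and uncached input-token cost.

    cache_discount=0.5 reflects the 'up to 50%' reduction cited for
    Kimi K2-0905's prompt caching; price_per_mtok is a placeholder,
    not a published Groq rate.
    """
    cached = total_tokens * cached_fraction
    uncached = total_tokens - cached
    cost = (uncached + cached * (1 - cache_discount)) * price_per_mtok / 1_000_000
    return round(cost, 4)

# A 200K-token prompt with an 80% cache-hit rate, at an assumed $1.00/M tokens:
print(blended_input_cost(200_000, 0.8, 1.00))  # → 0.12, vs 0.20 with no caching
```

For agentic coding loops that resend a large, mostly static context on every turn, the cached fraction tends to be high, which is exactly where this discount compounds.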
Perplexity Expands with File Attachments and Search API
Perplexity's dual release on 1 September, file attachment support and a standalone Search API, signals its evolution from a simple search product into a comprehensive information-processing platform. File attachment support enables document analysis and multi-language processing, while the Search API provides developers with direct access to ranked search results. These additions position Perplexity as a more versatile alternative to traditional search and document processing solutions.
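A Search API call of this shape can be sketched with the standard library alone. Note that the endpoint path and request fields below are assumptions for illustration; the authoritative names live in Perplexity's API documentation.

```python
import json
import urllib.request

# Assumed endpoint path and field names, shown for illustration only;
# confirm both against Perplexity's API reference before use.
SEARCH_URL = "https://api.perplexity.ai/search"

def build_search_request(query: str, api_key: str,
                         max_results: int = 5) -> urllib.request.Request:
    """Construct (but do not send) a bearer-authenticated search request."""
    body = json.dumps({"query": query, "max_results": max_results}).encode()
    return urllib.request.Request(
        SEARCH_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_search_request("vector database benchmarks", "YOUR_API_KEY")
print(req.full_url, req.get_method())
```

Sending the request is then a single `urllib.request.urlopen(req)` call, with the ranked results returned as JSON.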
Groq SDK Updates Improve OpenAI Compatibility
Groq's SDK updates (v0.31.1 and v0.32.0) released on 4 September enhance OpenAI model compatibility and introduce new Compound tools including Wolfram Alpha, Browser Automation, and Visit Website functionality. While not requiring immediate action, these updates expand platform capabilities and could simplify migration paths for teams currently using OpenAI models. The improved compatibility reduces integration friction for organisations considering Groq as an alternative or supplementary provider.
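Because Groq exposes an OpenAI-compatible Chat Completions endpoint, request bodies follow the familiar OpenAI schema and only the base URL changes. The sketch below builds such a payload with the standard library; the model id "groq/compound" is an assumption, so check Groq's current model list.

```python
import json

# Groq's OpenAI-compatible base URL; requests use the standard
# Chat Completions schema. The model id below is an ASSUMPTION —
# verify it against Groq's published model list.
BASE_URL = "https://api.groq.com/openai/v1"

def chat_payload(model: str, user_message: str) -> str:
    """Build an OpenAI-style chat completion request body as JSON."""
    return json.dumps({
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful coding assistant."},
            {"role": "user", "content": user_message},
        ],
    })

payload = chat_payload("groq/compound", "Summarise this diff.")
print(json.loads(payload)["model"])
```

This schema compatibility is why migration friction is low: an existing OpenAI client can typically be repointed at `BASE_URL` with a Groq API key and a Groq model id, leaving the rest of the integration untouched.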
Quick Hits
- Together AI released Kimi-K2-Instruct-0905, a 1-trillion-parameter Mixture-of-Experts model, expanding high-performance model options
- AWS Bedrock introduced a new bearer token condition key for more granular API authentication control
- Amazon published a CloudFormation tutorial for Bedrock resource creation, improving onboarding for new users
- Perplexity launched automated API key rotation with zero downtime, enhancing security and operational efficiency
The Week Ahead
Expect immediate market reaction to OpenAI's GPT-5-Codex release as competing providers scramble to match its developer-focused capabilities. Google's RAG Engine pricing change will likely prompt cost reassessments across organisations using Vertex AI, potentially driving migration discussions.
Watch for announcements from GitHub regarding Copilot enhancements, as Microsoft will need to respond to OpenAI's developer tool advancement. AWS may accelerate compliance-related announcements to capitalise on the FIPS 140-2 momentum.
The convergence of coding assistance, document processing, and search capabilities across multiple providers suggests we're entering a phase where platform differentiation will depend less on individual model performance and more on integrated workflow solutions. Teams should prepare for more complex migration decisions as providers bundle previously separate capabilities into comprehensive platforms.