Google Gemini / Vertex AI
model_owner
306 signals tracked
March 16, 2026
Vertex AI v1
16 Mar 2026
InfoOtherMarch 24, 2026
Generative AI on Vertex AI v1 Deprecated Imagen generation GA endpoints deprecation The following table describes image generation endpoints that are deprecated and their replacements. We recommend updating your model endpoints before June 30, 2026, to avoid service disruption. Discontinued endpoints Recommended endpoint migration imagegeneration@002 gemini-2.5-flash-image imagegeneration@003 gemini-2.5-flash-image imagegeneration@004 gemini-2.5-flash-image imagegeneration@005 gemini-2.5-flash-image imagegeneration@006 gemini-2.5-flash-image imagetext@001 gemini-2.5-flash-image imagen-3.0-capability-001 gemini-2.5-flash-image imagen-3.0-capability-002 gemini-2.5-flash-image imagen-3.0-fast-generate-001 gemini-2.5-flash-image imagen-3.0-generate-001 gemini-2.5-flash-image imagen-3.0-generate-002 gemini-2.5-flash-image imagen-4.0-fast-generate-001 gemini-2.5-flash-image imagen-4.0-generate-001 gemini-2.5-flash-image imagen-4.0-ultra-generate-001 gemini-2.5-flash-image Deprecated Video generation GA endpoints deprecation The following table describes video generation endpoints that are deprecated and their replacements. We recommend updating your model endpoints before June 30, 2026, to avoid service disruption. Discontinued endpoints Recommended endpoint migration veo-3.0-generate-001 veo-3.1-generate-001 veo-3.0-fast-generate-001 veo-3.1-fast-generate-001 veo-2.0-generate-001 veo-3.1-generate-001
24 Mar 2026
InfoDeprecationColab Enterprise: Generally Available SQL Cells Now Available
Write, edit, and run SQL queries directly from Colab Enterprise notebooks.
Date not specified
HighCapabilityColab Enterprise: Visualization Cells Generally Available
Generate interactive and editable visualizations directly from notebooks with configurable chart types, aggregation, colors, and labels.
Date not specified
HighCapabilityAnthropic Claude Opus 4.7 Available in Vertex AI Model Garden
Anthropic's Claude Opus 4.7 model is now available through Vertex AI Model Garden.
Date not specified
HighCapabilityVertex AI Gemini API global endpoint error rates resolved — review error handling
Vertex AI Gemini API customers experienced increased error rates when accessing the global Gemini endpoint. The incident has been resolved. Reference: https://status.cloud.google.com/incidents/41E5S3mkTGDfkZuJZH5k
Date not specified
CriticalIncidentGoogle releases Vertex AI v1 with GLM 4.7 and Veo 3.1 Lite
Generative AI on Vertex AI v1 Feature GLM 4.7 is available as an experimental launch in Model Garden. This model is designed for core or vibe coding, tool use, and complex reasoning. GLM 4.7 is available as a managed API in Model Garden. To learn more, see GLM 4.7 .
Date not specified
HighCapabilityGoogle releases RAG Cross Corpus Retrieval in Vertex AI
Generative AI on Vertex AI v1 Feature RAG Cross Corpus Retrieval RAG Cross Corpus Retrieval is available in public preview . This feature allows you to retrieve relevant contexts or generate answers from multiple RAG corpora simultaneously using the AsyncRetrieveContexts and AskContexts APIs. For more information, see RAG Cross Corpus Retrieval .
Date not specified
HighCapabilityVertex AI: Imagen 3, Gemma 2 & Model Updates - Deprecation Alert
Generative AI on Vertex AI v1 Feature New Imagen on Vertex AI image generation model and features The Imagen 3 image generation models ( imagen-3.0-generate-001 and the low-latency version imagen-3.0-fast-generate-001 ) are Generally Available to approved users . These models offer the following additional features: Additional aspect ratios (1:1, 3:4, 4:3, 9:16, 16:9) Digital watermark ( SynthID ) enabled by default Watermark verification User-configurable safety features (safety setting, person/face setting) For more information, see Model versions and Generate images using text prompts . Feature The following models have been added to Model Garden : Gemma 2 2B : A foundation LLM by Google DeepMind. Qwen2 : An LLM series by Alibaba Cloud. Phi-3 : An LLM series by Microsoft. Change Resource and deployment settings were made to the following models: Added GPU inferences for gemma2-27b and gemma2-27b-it with verified performances. Added verified deployment settings for Mistral AI models that are deployed from Huggingface, including mistralai/mistral-nemo-instruct-2407 , mistralai/mistral-nemo-base-2407 , mistralai/mistral-large-instruct-2407 , and mistralai/codestral-22b-v0.1 . Added multiple deployment settings with A100 (40G), A100 (80G) and H100 (80G) for select models, such as llama3.1 , llama3 , gemma2 , gemma , and mistral-7b .
Date not specified
HighCapabilityVertex AI Releases New Capabilities & Deprecates Image/Video Generation Endpoints
Generative AI on Vertex AI v1 Feature Vertex AI SDK for Python supports token listing and counting for prompts without the need to make API calls. This feature is available in ( Preview ). For details, see List and count tokens . Vertex AI v1 Feature The Vertex AI Model Registry now offers Preview support for model copy across different projects. For information about how to copy your model projects and regions, see Copy models in Model Registry .
Date not specified
HighCapabilityGoogle Colab Enterprise Updates: New Languages & Deprecations
Colab Enterprise Fixed Fixed an issue in which users weren't able to access the Colab Enterprise UI when Colab Service Status was OFF for everyone in Google Workspace. Generative AI on Vertex AI v1 Change The translation LLM now supports Arabic, Hindi, and Russian. For the full list of supported languages, see the Translate text page.
Date not specified
HighCapabilityVertex AI Workbench v2 M124 Release: Kernel Fixes, New UI, Endpoint Deprecations
Vertex AI Workbench v2 Feature M124 release The M124 release of Vertex AI Workbench instances includes the following: Fixed a bug that prevented kernels from appearing when the Cloud Resource Manager API is turned off and Dataproc is enabled. Spark notebooks on Dataproc: The Serverless Spark runtime template creation screen now has an easy-to-use UI for configuring resource allocation, autoscaling, and GPU settings. v1 Feature M124 release The M124 release of Vertex AI Workbench managed notebooks includes the following: Fixed a bug that prevented kernels from appearing when the Cloud Resource Manager API is turned off and Dataproc is enabled.
Date not specified
HighCapabilityVertex AI adds CMEK, VPC Service Controls, and Data Residency support
Vertex AI v1 Change Generative AI on Vertex AI supports CMEK, VPC Service Controls, and Data Residency. For more information, see Security controls .
Date not specified
HighCapabilityVertex AI Workbench v1 M124 Release: Pytorch 2.3, Python 3.10, and New Capabilities
Vertex AI Workbench v1 Feature M124 release The M124 release of Vertex AI Workbench user-managed notebooks includes the following: Pytorch 2.3.0 with CUDA 12.1 and Python 3.10 user-managed notebooks instances are now available. Fixed a bug that prevented kernels from appearing when the Cloud Resource Manager API is turned off and Dataproc is enabled.
Date not specified
CriticalCapabilityGoogle Adds AI21 Labs Models to Vertex AI
Generative AI on Vertex AI v1 Change AI21 Labs Managed models from AI21 Labs are available on Vertex AI. To use a AI21 Labs model on Vertex AI, send a request directly to the Vertex AI API endpoint. For more information, see AI21 models .
Date not specified
HighCapabilityVertex AI: New ResponseMimeType for Generative Models (text/x.enum)
Generative AI on Vertex AI v1 Change For controlled generation, you can have the model respond with an enum value in plain text, as defined in your response schema. Set the responseMimeType to text/x.enum . For more information, see Control generated output .
Date not specified
HighCapabilityVertex AI Announces New Feature Schedule & Deprecations
Vertex AI v1 Feature Schedule Vertex AI custom training jobs based on resource availability. For details, see the Vertex AI documentation .
Date not specified
HighCapabilityVertex AI: Image Generation Endpoints Deprecated - Migration Required
Vertex AI v1 Feature To ensure that VM resources are available when your custom training and prediction jobs need them, you can now use Compute Engine reservations. Reservations provide a high level of assurance in obtaining capacity for Compute Engine resources. This feature is available in Preview for A2 and A3 machine series reservations. For more information, see Use reservations with training and Use reservations with prediction . Feature To reduce the cost of running your training and prediction jobs, you can now use Spot VMs. Spot VMs are virtual machine (VM) instances that are excess Compute Engine capacity. Spot VMs have significant discounts, but Compute Engine might preemptively stop or delete Spot VMs to reclaim the capacity at any time. This feature is available in Preview . For more information, see Use Spot VMs with training and Use Spot VMs with prediction .
Date not specified
HighDeprecationVertex AI Adds Metadata Labels to Generative AI APIs
Generative AI on Vertex AI v1 Feature Add label metadata to generateContent and streamGenerateContent API calls. For details, see Add labels to API calls .
Date not specified
HighCapabilityGoogle Gemini Model Updates on Vertex AI - New Features & Deprecations
Generative AI on Vertex AI v1 Announcement New stable versions of Gemini 1.5 Pro ( gemini-1.5-pro-002 ) and Gemini 1.5 Flash ( gemini-1.5-flash-002 ) are Generally Available . These models introduce broad quality improvements over the previous 001 versions, with significant gains in the following categories: Factuality and reduce model hallucinations Openbook Q&A for RAG use cases Instruction following Multilingual understanding in 102 languages, especially in Korean, French, German, Spanish, Japanese, Russian, and Chinese. SQL generation Audio understanding Document understanding Long context Math and reasoning For more information about differences with the previous model versions, see Model versions and lifecycle . Feature Use Gemini to directly analyze YouTube videos and publicly available media (such as images, audio, and video) by using a link. This feature is in Public Preview . Announcement The 2M context window with Gemini 1.5 Pro is now in Generally Available , which opens up long-form multimodal use cases that only Gemini can support. Feature The Vertex AI prompt optimizer adapts your prompts using the optimal instructions and examples to elicit the best performance from your chosen model. This feature is available in Preview . To learn more, see Optimize prompts . Change Gemini 1.5 Pro and Gemini 1.5 Flash Tuning is now available in GA . Tune Gemini with text, image, audio, and document data types using the latest models: gemini-1.5-pro-002 gemini-1.5-flash-002 Gemini 1.0 tuning remains in preview. For more information on tuning Gemini, see Tune Gemini models by using supervised fine-tuning . Feature Gemini 1.5 Pro and Gemini 1.5 Flash now support multimodal input with function calling . This feature is in Preview . Change The latest versions of Gemini 1.5 Flash ( gemini-1.5-flash-002 ) and Gemini 1.5 Pro ( gemini-1.5-pro-002 ) use dynamic shared quota , which distributes on-demand capacity among all queries being processed. Dynamic shared quota is Gene
Date not specified
HighCapability
Get alerts for Google Gemini / Vertex AI
Never miss a breaking change. SignalBreak monitors Google Gemini / Vertex AI and dozens of other AI providers in real time.
Sign up free — no credit card required