Replicate
inference_host
62 signals tracked
Replicate adds model metadata environment variables
Replicate now exposes metadata like username, model name, Docker image URI, version ID, and deployment name as environment variables in running containers. These variables can be used for debugging and instrumentation purposes. No action required, this is an additive feature for enhanced observability.
Date not specified
LowCapabilityReplicate reduces model metadata size by 5KB per object
Replicate updated their public API to return smaller response objects by removing unnecessary OpenAPI schema metadata from model objects. This performance improvement reduces response sizes by over 1MB for operations returning multiple models, dramatically improving response times for LLMs consuming this data. No action required as this is a backward-compatible optimization.
Date not specified
LowCapabilityReplicate launches FLUX.1 Kontext and Veo 3 models with platform updates
Replicate added new AI models including FLUX.1 Kontext from Black Forest Labs and Veo 3 from Google. The platform also received navigation updates, improved pricing displays for official models, and enhanced model card information with last pushed dates. These are incremental improvements that enhance user experience without requiring any action.
Date not specified
InfoCapabilityReplicate adds audio support, streaming text, and playground enhancements
Replicate has enhanced their platform with new audio and streaming text capabilities in the playground, along with improved model navigation features. These additions include better model discovery through 'More by this user' dropdowns and direct content copying to LLMs. No action required - these are additive features that improve the user experience.
Date not specified
LowCapabilityReplicate launches fast FLUX model fine-tuning trainer
Replicate has released a new FLUX fine-tuning trainer that enables model customization in under 2 minutes for less than $2. The service provides both runnable models and downloadable LoRA weights, with plans to open-source the underlying work. This is a new feature addition that expands Replicate's fine-tuning capabilities.
Date not specified
MediumCapabilityReplicate Playground adds streaming and text output models
Replicate's playground now supports models that output and stream text, enabling users to compare different language models. New supported models include salesforce/blip and yorickvp/llava-13b. This is a feature enhancement that expands playground functionality without requiring any user action.
Date not specified
InfoCapabilityReplicate: Added web URL property for browser viewing of predictions
Replicate has added a new 'webproperty' field to the URLs object in prediction responses, allowing users to view predictions directly in their browser. This is an additive enhancement that provides additional functionality without breaking existing integrations. No action is required from users, but they can now leverage the web URL for easier prediction viewing.
Date not specified
LowCapabilityReplicate adds playground integration and audio preview support
Replicate introduced a new "Iterate in the playground" button on model pages for easier testing workflows. The playground now also supports audio preview functionality, allowing users to generate and listen to audio content within the same interface. This is a quality-of-life improvement that enhances the user experience without requiring any action from existing users.
Date not specified
LowCapabilityReplicate adds NVIDIA H100 GPUs and multi-GPU configurations
Replicate has added support for NVIDIA H100 GPUs and expanded multi-GPU configurations (2x, 4x, 8x) for A100 and L40S GPUs to custom models and training runs. This enhances compute options for users requiring high-performance GPU resources. No action required, this is an additive capability expansion.
Date not specified
MediumCapabilityReplicate: 'More by this user' button added to model pages
Replicate has added a new UI feature allowing users to easily discover other models created by the same author through a 'More by this user' button on model pages. This is a minor usability enhancement that improves model discovery on the platform. No action required from users.
Date not specified
LowCapabilityReplicate Platform UI Improvements: Pricing & Playground Enhancements
Replicate updated their pricing page design and improved navigation to make the playground easier to find. Additional improvements include fixing Node API code examples display issues and adding support for more input types in the playground including secrets and multi-file uploads. These are user experience improvements that don't require any action from users.
Date not specified
LowCapabilityReplicate adds LLM-friendly code snippets to documentation
Replicate has enhanced their documentation with copy-paste friendly code snippets that include direct links for use in ChatGPT and Claude. This improvement makes it easier for developers to reference Replicate docs while working with LLMs. No action required - this is a documentation enhancement feature.
Date not specified
InfoCapabilityCog Platform Improvements: Optional Inputs, Caching, Navigation Updates
Replicate added support for optional inputs in cog predict, implemented performance and reliability improvements through caching, and rolled out new navigation. Additional platform fixes include better error display for organization creation and improved Explore page performance.
Date not specified
LowCapabilityReplicate releases Cog 0.14 with async/await concurrent prediction support
Replicate has released Cog version 0.14 (currently at 0.14.3) featuring models that can run predictions concurrently using async/await functionality. This enhancement improves performance for concurrent model inference workloads. No immediate action required for existing users.
Date not specified
MediumCapabilityReplicate simplifies organization creation — no GitHub organization required
Replicate has simplified the organization creation process by removing the GitHub organization requirement. Organizations allow teams to share access to models, API tokens, billing, and dashboards with centralized billing. This is a new feature that provides more flexibility for team collaboration without impacting existing functionality.
Date not specified
MediumCapabilityReplicate Platform Improvements: Org Creation, Feature Flags, UI Enhancements
Replicate has made several platform improvements focusing on user experience enhancements. The updates include streamlined organization creation, better feature flag support, clearer error messaging for training failures, documentation fixes, and improved visual indicators for predictions with multiple outputs. No action required from users as these are backend improvements and UI enhancements.
Date not specified
InfoCapabilityPlatform enhancements: filtering, GitHub-independent organizations, UI fixes
Replicate launched better predictions list filtering and the ability to create organizations without GitHub integration. Additional updates include various UI bug fixes and display improvements across the deployment overview and predictions interface.
Date not specified
MediumCapabilityReplicate adds enhanced prediction filtering and browsing
Replicate has added new filtering options for predictions in the last 24 hours, allowing users to filter by model, deployment, version, and status. Users can also browse predictions by date for improved monitoring and analysis. This is a new feature enhancement that improves user experience without requiring any action.
Date not specified
LowCapabilityReplicate UI Enhancements and Terms of Service Updates
Replicate made several UI enhancements including deployment predictions overview, browser navigation fixes, and playground improvements. Minor updates were also made to the terms of service, though specific changes were not detailed.
Date not specified
LowPolicyReplicate launches official models and platform improvements
Replicate has launched official models and added several platform improvements including latest predictions on deployment pages, better logged-out user experience, and expanded training input support beyond zip files. These are enhancement updates that improve functionality without requiring user action.
Date not specified
LowCapability
Get alerts for Replicate
Never miss a breaking change. SignalBreak monitors Replicate and dozens of other AI providers in real time.
Sign up free — no credit card required