Hugging Face
inference_host
338 signals tracked
Nemotron ColEmbed V2: Advanced Multimodal Retrieval with ViDoRe V3
New model release improving multimodal retrieval performance with ViDoRe V3 technology
Date not specified
MediumCapabilityAnthropic Claude Assists in CUDA Kernel Development
AI model demonstrated ability to build CUDA kernels and help train open models
Date not specified
MediumCapabilityIntroducing Daggr: Chain AI Apps Programmatically
New tool for programmatically chaining applications with visual inspection capabilities
Date not specified
InfoCapabilityHugging Face expands AI infrastructure deployment options
Hugging Face has enhanced deployment options including Inference Endpoints, support for AWS, Azure, and Google Cloud, and new inference optimization toolkits for text generation and embeddings
Date not specified
InfoCapabilityHugging Face expands model deployment with AWS, Azure, Google Cloud integrations
Hugging Face expanded model deployment capabilities including dedicated Inference Endpoints, AWS, Azure, and Google Cloud integrations for deploying and training AI models
Date not specified
InfoCapabilityGoogle launches Text Generation Inference (TGI) for optimized language model serving
New optimized toolkit for serving language models with improved performance and scalability
Date not specified
InfoCapabilityHugging Face launches Text Embeddings Inference (TEI) service
Hugging Face introduced optimized Text Embeddings Inference toolkit for serving embedding models efficiently
Date not specified
InfoCapabilityOpenAI Enhances Model Training Toolkits with PEFT, TRL, Lighteval, and bitsandbytes
New tools like PEFT, TRL, Lighteval, and bitsandbytes added to support parameter-efficient fine-tuning, reinforcement learning, and model optimization.
Date not specified
MediumCapabilityNew Open Source AI Tools Released: Transformers.js, LeRobot, Smolagents
Multiple new libraries introduced including Transformers.js for browser-based ML, LeRobot for robotics AI, and smolagents for building AI agents.
Date not specified
InfoCapabilityHugging Face expands AI deployment options to Azure and Google Cloud
Hugging Face now offers model deployment capabilities on Microsoft Azure and Google Cloud, expanding infrastructure options for AI model hosting and training.
Date not specified
InfoCapabilityOverworld releases Waypoint-1: Real-time Interactive Video Diffusion Model
New interactive video diffusion model from Overworld with real-time capabilities
Date not specified
MediumCapabilityClaude Assists in CUDA Kernel Development for Open Models
AI model Claude demonstrated capability to help build CUDA kernels and support open model development
Date not specified
MediumCapabilityDaggr: Programmatic App Chaining and Visual Inspection Tool Released
New development tool for programmatically connecting and visually inspecting application chains
Date not specified
MediumCapabilityNVIDIA Introduces New Policy for Advanced Robot Control
NVIDIA introduces advanced robotic control policy leveraging AI technologies
Date not specified
HighPolicyAzure and Google Cloud now support AI model deployment
Enhanced model deployment options now available on Microsoft Azure and Google Cloud, broadening infrastructure choices for AI model hosting.
Date not specified
InfoCapabilityIntroducing Text Embeddings Inference (TEI) Service
New optimized toolkit for serving embedding models, providing dedicated infrastructure for efficient embeddings model deployment.
Date not specified
InfoCapabilityHugging Face releases new Text Generation Inference (TGI) Toolkit
Hugging Face launched an optimized toolkit for serving language models with Text Generation Inference (TGI), enabling more efficient model deployment and inference.
Date not specified
InfoCapabilityOpenAI releasing Text Generation & Embeddings Inference Toolkits (TGI & TEI)
New optimized toolkits (TGI and TEI) for serving language models and embedding models with improved performance
Date not specified
InfoCapabilityHugging Face Launches Managed Inference Endpoints on AWS, Azure, and Google Cloud
Hugging Face launched dedicated infrastructure for deploying models on managed endpoints with support for AWS, Microsoft Azure, and Google Cloud
Date not specified
InfoCapabilityHugging Face updates open responses policy
Hugging Face introduces new guidelines for open responses, potential impact on model interactions
Date not specified
HighPolicy
Get alerts for Hugging Face
Never miss a breaking change. SignalBreak monitors Hugging Face and dozens of other AI providers in real time.
Sign up free — no credit card required