Tag: Inference

Revolutionizing AI-as-a-Service: Rafay’s Serverless Inference for GPU Cloud Providers

Summary: Rafay has launched a Serverless Inference offering to assist NVIDIA Cloud Partners and GPU Cloud Providers in

Red Hat’s Latest Product Expansion: Introducing the AI Inference Server

Summary: 1. Red Hat introduces the Red Hat AI Inference Server, enhancing generative AI applications' speed and efficiency.

Stealthy Seattle Startup Secures $16M Funding for Cutting-Edge AI Inference Technology

Summary: 1. ElastixAI, a new Seattle startup, is developing technology to optimize large language model deployment. 2. Led

VSORA Secures $46 Million to Launch AI Inference Chip

French deep-tech company VSORA has secured $46 million in fresh funding to expedite the development of its cutting-edge

NTT debuts breakthrough AI chip for real-time 4K inference at the edge

NTT has introduced the world’s first AI inference large-scale integration (LSI) chip that enables real-time 4K video processing

Google Launches Ironwood TPU For Next-Gen AI Inference

Google has introduced Ironwood, its seventh-generation AI chip, aimed at handling demanding AI inference workloads at a large