Tag: Inference

Nvidia’s Open-Source Inference Models: Unlocking 10x Cost Savings

Summary: Nvidia has improved the cost per token from 20 cents to 5 cents by upgrading to the

Nokia and Blaize Partner to Advance Edge AI Inference for APAC Networks

Nokia and Blaize, a leading AI-enabled edge computing chip company, have entered into a strategic Memorandum of Understanding

Introducing Maia 200: Microsoft’s Next-Gen AI Inference Chip

Summary: Microsoft introduces Maia 200, a powerful chip capable of running large AI models with ease. Maia 200

Microsoft’s Cutting-Edge Maia 200 Chip Revolutionizes In-House Inference

Summary: 1. Microsoft has launched the Maia 200, an inference accelerator designed to enhance AI performance for large-scale

OpenAI Partners with Cerebras to Revolutionize AI Inference Infrastructure

The Future of AI Workloads in Data Centers Summary: 1. Analysts predict that AI workloads will become more

Groq Steps in to Bridge the GPU Inference Gap for NVIDIA

Summary: NVIDIA has engaged in a licensing agreement with Groq, where NVIDIA will compensate Groq for the utilization

Adapting to the Rising Inference Costs: The Evolution of AI Infrastructure in Enterprises

Summary: 1. AI spending in Asia Pacific is increasing, but many companies struggle to derive value from their

Edge-Optimized AI: Akamai’s Enhanced Inference with NVIDIA Technology

Akamai has recently unveiled the Akamai Inference Cloud, a groundbreaking platform that brings AI inference to the edge,

Zenlayer Revolutionizes Global AI Scaling with Enhanced Edge Infrastructure for Distributed Inference

Discovering a new breakthrough in global AI scaling, Zenlayer, a leading hyperconnected cloud company, unveiled its latest innovation,

Maximizing Efficiency: How ATLAS Adaptive Speculator Achieved a 400% Inference Speedup Through Real-Time Workload Learning

Summary: 1. Enterprises expanding AI deployments face a performance wall due to static speculators unable to keep up

The AI Inference Revolution: Unleashing the Power of Specialized Processors in the Silicon Arms Race

The landscape of AI processors is rapidly evolving as tech giants and chip designers compete to create the

Alibaba’s Advancements in AI Technology: The Development of an Inference Chip Despite US Export Restrictions

Summary: Alibaba's new AI chip aims to address the limitations of Huawei's AI chips by being compatible with