Distributed Cache System Design

VAST Data Redesigns AI Inference Architecture for the Agentic Era with NVIDIA

VAST Data , the AI Operating System company, today announced a new inference architecture that enables the NVIDIA Inference Context Memory Storage Platform – deployments for the era of long-lived, ...

Nvidia debuts Rubin chip with 336B transistors and 50 petaflops of AI performance

The GPU made its debut at CES alongside five other data center chips. Customers can deploy them together in a rack called the Vera Rubin NVL72 that Nvidia says ships with 220 trillion transistors, ...

InfoQ

Engineering Speed at Scale — Architectural Lessons from Sub-100-ms APIs

Sub‑100-ms APIs emerge from disciplined architecture using latency budgets, minimized hops, async fan‑out, layered caching, ...

InfoQ

Google’s Eight Essential Multi-Agent Design Patterns

Google recently published a guide outlining eight essential design patterns for multi-agent systems, ranging from sequential ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results