#gpu-lpu-hybrid-architecture

[ follow ]
Tech industry
fromTheregister
3 days ago

Nvidia slaps Groq into new LPX racks for faster AI response

Nvidia integrates Groq's language processing units into Vera Rubin systems to dramatically accelerate LLM inference, enabling hundreds to thousands of tokens per second per user.
[ Load more ]