#ai-inference

Artificial intelligence
from IT Pro
1 week ago

'TPUs just work': Why Google Cloud is betting big on its custom chips

Google's seventh generation TPU, 'Ironwood', aims to lead in AI workload efficiency and cost-effectiveness.
TPUs were developed with a cohesive hardware-software synergy, enhancing their utility for AI applications.
#nvidia
Artificial intelligence
from ZDNET
8 months ago

AI startup Cerebras debuts 'world's fastest inference' service - with a twist

Cerebras Systems aims to capture a share of the rapidly growing AI inference market, challenging Nvidia's dominance with its advanced AI services.
Silicon Valley
from Business Insider
1 month ago

2 reasons why Nvidia's Jensen Huang isn't worried

Nvidia CEO Jensen Huang is confident in sustained demand for Nvidia chips due to new powerful GPUs and an industry shift towards AI inference.
from Business Insider
1 week ago
Artificial intelligence

AMD's CTO says AI inference will move out of data centers and increasingly to phones and laptops

AMD is positioning itself to capitalize on the shift to AI inference, targeting market segments traditionally dominated by Nvidia.
from The Register
1 month ago
Tech industry

Nvidia unveils 288 GB Blackwell Ultra GPUs

Nvidia's Blackwell Ultra architecture enhances AI inference with massive performance and increased memory capacity.
Improved throughput enables AI models like DeepSeek-R1 to respond rapidly, boosting efficiency.
from InfoQ
2 months ago
Agile

Hugging Face Expands Serverless Inference Options with New Provider Integrations

Hugging Face now integrates four serverless inference providers into its platform, enhancing ease of access and speed for AI model inference.
#edge-computing
from InfoQ
5 months ago
Artificial intelligence

Efficient Resource Management with Small Language Models (SLMs) in Edge Computing

Small Language Models (SLMs) enable AI inference on edge devices without overwhelming resource limitations.
from The Register
6 months ago
Miscellaneous

Supermicro crams 18 GPUs into a 3U box

Supermicro's SYS-322GB-NR efficiently accommodates 18 GPUs in a compact design for edge AI and visualization tasks.
#cloud-computing
from www.nextplatform.com
7 months ago
Artificial intelligence

The Battle Begins For AI Inference Compute In The Datacenter

Cloud builders rely heavily on Nvidia GPUs for AI training, limiting options for emerging chip startups.
from The Register
6 months ago
Miscellaneous

TensorWave bags $43M to add 'thousands' of AMD accelerators

TensorWave raised $43 million to scale its cloud platform with AMD accelerators, joining the wave of startups in the generative AI market.
from TechCrunch
6 months ago
Artificial intelligence

Runware uses custom hardware and advanced orchestration for fast AI inference

Runware offers rapid image generation through optimized servers, seeking to disrupt traditional GPU rental models with an API-based pricing structure.