Vista Equity Partners and Cambium Launch Vector Core Compute — the World’s First Inference Cloud Powered by CPUs, GPUs and RDUs
Vista Equity Partners and Cambium Capital launched Vector Core Compute (VC2), the world’s first commercially available enterprise cloud for disaggregated inference, built on technologies from SambaNova, Intel, and NVIDIA.
Backed by a $3.5 billion compute commitment to SambaNova and support from Intel, VC2 launched live from the Los Angeles facility. Additional sites are in development in Chicago, Seattle, and Phoenix, and planned across 50+ U.S. metros. VC2’s distributed deployment model is designed to scale into strategic markets globally, with additional regions informed by strategic partners and customer demand.
Enterprise agentic AI is a sequence of distinct computational demands, each with different performance requirements. VC2 is the first real-world demonstration of a disaggregated inference architecture, bringing together Intel® Xeon® CPUs for orchestration and execution, SambaNova SN40 RDUs for decode, and NVIDIA Blackwell GPUs for prefill:
- Intel Xeon CPUs (orchestration and execution): designed for inference and agentic AI, Xeon orchestrates the workflow end to end, routing work across silicon in real time
- SambaNova RDUs (decode): built for fast token generation at scale
- NVIDIA Blackwell GPUs (prefill): handle the high-compute burst of processing incoming requests
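The division of labor in the list above can be pictured with a minimal Python sketch. It is an illustration only and does not represent VC2's actual software: the names `Stage`, `Request`, `run_stage`, and `serve` are hypothetical, and the sketch records which class of processor would handle each stage rather than running a model.

```python
from dataclasses import dataclass, field
from enum import Enum


class Stage(Enum):
    ORCHESTRATE = "orchestrate"
    PREFILL = "prefill"
    DECODE = "decode"


# Stage-to-processor mapping taken from the announcement's description.
STAGE_TO_PROCESSOR = {
    Stage.ORCHESTRATE: "CPU",  # Intel Xeon: routes work across silicon
    Stage.PREFILL: "GPU",      # NVIDIA Blackwell: processes the incoming request
    Stage.DECODE: "RDU",       # SambaNova: generates output tokens
}


@dataclass
class Request:
    """Hypothetical inference request; `trace` logs where each step ran."""
    prompt: str
    max_new_tokens: int
    trace: list = field(default_factory=list)


def run_stage(request: Request, stage: Stage) -> str:
    """Record that `stage` ran on its assigned processor type."""
    processor = STAGE_TO_PROCESSOR[stage]
    request.trace.append((stage.value, processor))
    return processor


def serve(request: Request) -> list:
    """Walk one request through orchestration, prefill, and decode."""
    run_stage(request, Stage.ORCHESTRATE)      # CPU accepts and plans the request
    run_stage(request, Stage.PREFILL)          # whole prompt is processed once
    for _ in range(request.max_new_tokens):    # one decode step per output token
        run_stage(request, Stage.DECODE)
    return request.trace


if __name__ == "__main__":
    for step in serve(Request(prompt="Summarize this claim", max_new_tokens=3)):
        print(step)
```

The sketch reflects the general idea behind disaggregation: prefill is a single compute-heavy pass over the prompt, while decode is a long series of small steps, so each phase can be assigned to hardware suited to it.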
VC2 is the first cloud infrastructure to bring these three processors together in production, with a rollout planned to reach commercial scale. Rather than concentrating capacity in a handful of remote mega-sites, VC2 aims to distribute compute across the top 50 U.S. metropolitan markets, putting inference endpoints close to the enterprises and customers that use them.
Together AI, the AI Native Cloud serving 400 trillion tokens a month, is the first commercial customer for VC2’s novel agentic cloud, which will allow Together AI to bring significantly more inference capacity to its customers. Vista Equity Partners has secured early access to the company’s research-powered inference platform for its 90+ portfolio companies, which serve more than 2.5 million enterprise customers and 750 million users worldwide.
“Agentic AI is producing real work at enterprise scale: decisions made, code written, claims processed, customers served. The constraint is no longer the model; it is access to the infrastructure that makes it economically viable to run at scale. Vista believes purpose-built inference infrastructure is a key competitive enabler for enterprise software — distributing workloads like always-on monitoring, high-volume data processing, and complex multi-step orchestration across specialized hardware to reduce cost. Securing early access to this type of specialized inference cloud puts that infrastructure directly in the hands of our portfolio companies doing this work today. As our portfolio companies scale enterprise agentic solutions and expand value to their customers, innovative inference infrastructure ensures they capture more of that value with improved inference economics.” — Robert F. Smith, Founder, Chairman and CEO, Vista Equity Partners
“The rapid growth of AI training over the past decade has resulted in an exponential increase in inference and agentic AI workloads—driving the need for a new model to meet customer demand for high-performance and low-cost inference at scale,” said Lip-Bu Tan, CEO, Intel. “Today’s demonstration of fully disaggregated inference represents a breakthrough moment for customers seeking a cost-efficient and high-performance compute model to accelerate the deployment of AI workloads into production. Together, Intel and its partners are redefining and advancing the economics of running inference at scale.”
“SambaNova was founded in 2017 before the mechanics of generative inference were understood. We built a chip that was purpose-built for AI; and today, the RDU is perfectly-suited for the agentic workloads of the enterprise,” said Rodrigo Liang, Co-Founder and CEO of SambaNova. “VC2 is the largest commercial deployment of SambaNova technology in our history and we’re proud to partner with the industry’s strongest leaders.”
“Cambium has been investing in advanced compute longer than most firms in venture today. We know one thing for certain: the agentic era will not be served by a single chip optimized for a single task,” said Landon Downs, Co-Founder and Managing Partner of Cambium Capital. “The disaggregated architecture is the only way to give each stage of an agentic workflow the silicon it needs.”
“We continue to see exponential demand for inference tokens, now serving over 400T tokens a month of open models for agentic use cases,” said Vipul Ved Prakash, Co-Founder and CEO of Together AI. “We are excited to collaborate with Vector Core Compute to bring significantly more inference capacity to companies building the next generation of agentic applications, including early access for 90+ portfolio companies of Vista Equity Partners.”