[bsfp-cryptocurrency style=”widget-18″ align=”marquee” columns=”6″ coins=”selected” coins-count=”6″ coins-selected=”BTC,ETH,XRP,LTC,EOS,ADA,XLM,NEO,LTC,EOS,XEM,DASH,USDT,BNB,QTUM,XVG,ONT,ZEC,STEEM” currency=”USD” title=”Cryptocurrency Widget” show_title=”0″ icon=”” scheme=”light” bs-show-desktop=”1″ bs-show-tablet=”1″ bs-show-phone=”1″ custom-css-class=”” custom-id=”” css=”.vc_custom_1523079266073{margin-bottom: 0px !important;padding-top: 0px !important;padding-bottom: 0px !important;}”]

CoreWeave Announces Agreement to Power Perplexity’s AI Inference Workloads

CoreWeave, Inc. Logo

CoreWeave, Inc. , The Essential Cloud for AI™, announced that it has entered into a multi-year strategic partnership with Perplexity to support its inference workloads on CoreWeave Cloud and pilot new services across both organizations.

Also Read: AiThority Interview With Arun Subramaniyan, Founder & CEO, Articul8 AI

Perplexity builds AI-native products and services that operate continuously in real-world environments, where inference performance and reliability directly impact user experience. The CoreWeaveCloud platform is purpose-built to meet these requirements, delivering consistent performance with low latency, predictable cost characteristics, and the ability to scale rapidly as usage grows. CoreWeave enables customers to move from development to sustained production without re-architecting systems or tooling.

Under the agreement, Perplexity will power its next-generation inference workloads on CoreWeave’s platform. By utilizing dedicated NVIDIA GB200 NVL72-powered clusters, CoreWeave ensures that its infrastructure keeps pace with Perplexity’s rapid growth and the sophisticated requirements of the Sonar and Search API ecosystem. CoreWeave will also roll out Perplexity Enterprise Max across its organization, enabling employees to search the web and internal knowledge, run deep multi-step research, visualize and analyze data, and work with the most advanced AI models available — all within one platform.

Related Posts
1 of 42,210

“We’re proud to partner with Perplexity as they scale their inference workloads on CoreWeave’s AI cloud,” said Max Hjelm, senior vice president of revenue at CoreWeave. “AI applications running in production require more than just access to raw infrastructure – they require best-in-class performance and reliability as well as a cloud platform designed end-to-end for AI that simplifies compute operations.”

“We were impressed by the combination of CoreWeave’s technical aptitude and partner-first mindset that help AI-native companies accelerate their growth and scaling goals,” said Dmitry Shevelenko, chief business officer at Perplexity. “CoreWeave is an essential partner in our efforts to optimize our infrastructure and the models we use to provide Perplexity users across industries with the strongest AI tools and agents on the market.”

Perplexity has begun running inference workloads with CoreWeave Kubernetes Service as part of the initial phase of the deployment and is leveraging W&B Models to help train, fine-tune, and manage models from experimentation to production. The collaboration reflects Perplexity’s multi-cloud strategy and underscores CoreWeave’s role as a specialized AI cloud provider for companies operating advanced AI systems in high-demand production environments.

CoreWeave consistently sets new standards for performance, demonstrated by industry-leading MLPerf benchmark results and its position as the only AI cloud to earn top Platinum ranking in both SemiAnalysis ClusterMAX™ 1.0 and 2.0, which evaluate AI cloud performance, efficiency, and reliability.

Also Read: Cheap and Fast: The Strategy of LLM Cascading (Frugal GPT)

[To share your insights with us, please write to psen@itechseries.com]

Comments are closed.