Deci Collaborates with Intel to Achieve 11.8x Accelerated Inference Speed at MLPerf

By AIT News Desk On Dec 1, 2020

NewThe Deci-Intel collaboration marks a significant step towards enabling deep learning inference at scale on CPUs

Deci, the deep learning company building the next generation of AI, announced its inference results that were submitted to the open division of the MLPerf v0.7 inference benchmark. On several popular Intel CPUs, Deci’s AutoNAC (Automated Neural Architecture Construction) technology accelerated the inference speed of the well-known ResNet-50 neural network. It reduced the submitted models’ latency by a factor of up to 11.8x and increased throughput by up to 11x– all while preserving the model’s accuracy within 1%.

“Billions of dollars have been spent on building dedicated AI chips, some of which are focused on computer vision inference,” says Yonatan Geifman, CEO and co-founder of Deci. “At MLPerf we demonstrated that Deci’s AutoNAC algorithmic acceleration, together with Intel’s OpenVino toolkit, enables the use of standard CPUs for deep learning inference at scale.”

According to MLPerf rules, Deci’s goal was to reduce the latency, or increase throughput, while staying within 1% accuracy of ResNet-50 trained on the Imagenet dataset. Deci’s optimized model improved latency between 5.16x and 11.8x when compared to vanilla ResNet-50. When compared to competing submissions, Deci achieved throughput per core that was three times higher than models of other submitters.

Elite Site Optimizer Launches AI Tools Suite and AI Readiness Audit for the Next Generation of Search

Jul 21, 2026

Dell PowerEdge XE7740 supported up to 74 concurrent AI agents in Principled Technologies testing

Jul 21, 2026

Google Maps Scraper by Outscraper Helps Businesses Scale Lead Generation

Jul 21, 2026

Prev Next 1 of 42,602

“Intel’s collaboration with Deci takes a significant step towards enabling deep learning inference on CPU, a longstanding challenge for AI practitioners across the globe,” said Guy Boudoukh from Intel AI. “Accelerating the latency of inference by a factor of 11x enables new applications and deep learning inference tasks in a real-time environment on CPU edge devices and dramatically cuts cloud costs for large scale inference scenarios.”

MLPerf gathers expert deep learning leaders to build fair and useful benchmarks for measuring training and inference performance of ML hardware, software, and services. The models submitted were optimized using Deci’s AutoNAC technology and quantized with Intel’s OpenVINO to 8-bit precision.

Deci’s patent-pending AutoNAC technology uses machine learning to redesign any model and maximize its inference performance on any hardware – all while preserving its accuracy.