
Llemma Outperforms Google’s Minerva Model

An Open Mathematical Language Model

The team at EleutherAI has released Llemma, an open mathematical language model, along with the Proof-Pile-2 dataset. The academic and scientific community has taken a keen interest in the release because Llemma was built through continued pretraining of Code Llama.

Like Minerva, a closed model developed by Google Research specifically for mathematics, Llemma targets mathematical reasoning, yet EleutherAI's model outperforms Minerva when the two are compared on an equal-parameter basis. Llemma is also unusual among mathematical language models in that it supports a wider range of tasks, including tool use and formal mathematics.
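The tool-use setting is easy to picture: the model is prompted to write a short Python program, and a harness executes that program to obtain the final answer. The sketch below illustrates the idea with the Hugging Face transformers library; the checkpoint name, prompt format, and in-process exec() call are assumptions for illustration, not details taken from the Llemma paper.

```python
# Minimal sketch of tool-use style inference: the model writes Python,
# and the host executes it to obtain the answer.
# Assumption: "EleutherAI/llemma_7b" and the prompt format are illustrative.
import io
import contextlib

from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "EleutherAI/llemma_7b"  # assumed checkpoint name

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

prompt = (
    "Problem: What is the sum of the first 100 positive integers?\n"
    "Write a Python program that prints the answer.\n```python\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128, do_sample=False)
completion = tokenizer.decode(
    output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)

# Keep everything up to the closing code fence and run it in-process.
# (A real harness would sandbox this; exec() here is only for illustration.)
code = completion.split("```")[0]
buffer = io.StringIO()
with contextlib.redirect_stdout(buffer):
    exec(code, {})
print("Model's computed answer:", buffer.getvalue().strip())
```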

The paper’s first author, Zhangir Azerbayev, explains that the development of Llemma began with the compilation of a massive dataset of mathematical tokens: the ArXiv subset of RedPajama, the recently released OpenWebMath dataset, and the newly introduced AlgebraicStack, a code dataset built specifically for mathematics. Together, these sources gave the team an unprecedented 55 billion tokens of mathematical training data.
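In practice, a corpus like Proof-Pile-2 is consumed as a weighted mixture of its components. The sketch below shows one way to assemble such a mixture with the Hugging Face datasets library; the repository name, config names, and sampling weights are assumptions for illustration rather than the values used for Llemma.

```python
# Minimal sketch of assembling a Proof-Pile-2-style training mixture.
# Assumptions: the repository name, the config names ("arxiv",
# "open-web-math", "algebraic-stack"), the mixture weights, and the "text"
# field are illustrative, not taken from the paper.
from datasets import load_dataset, interleave_datasets

arxiv = load_dataset("EleutherAI/proof-pile-2", "arxiv",
                     split="train", streaming=True)
web = load_dataset("EleutherAI/proof-pile-2", "open-web-math",
                   split="train", streaming=True)
code = load_dataset("EleutherAI/proof-pile-2", "algebraic-stack",
                    split="train", streaming=True)

# Interleave the three sources with fixed sampling weights so the stream
# mixes papers, web mathematics, and mathematical code.
mixture = interleave_datasets([arxiv, web, code],
                              probabilities=[0.6, 0.3, 0.1], seed=42)

for i, example in enumerate(mixture):
    print(example["text"][:200])  # assumes each record has a "text" field
    if i == 2:
        break
```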


Llemma Outperforms Minerva


Llemma stands out because, at both the 7-billion and 34-billion parameter scales, it outperforms every other open base model as well as Google's Minerva when compared at equal parameter counts. The feat is all the more impressive given that the 34-billion-parameter Llemma approaches the performance of Google's 62-billion-parameter Minerva with roughly half as many parameters.

The Llemma models were initialized with Code Llama weights and trained on 256 A100 GPUs on StabilityAI's Ezra cluster. The 7-billion-parameter model was trained on 200 billion tokens, taking 23,000 A100 hours, while the 34-billion-parameter model was trained on 50 billion tokens, taking 47,000 A100 hours.
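Conceptually, this is continued pretraining: load the Code Llama checkpoint, then keep running the standard causal-language-modeling objective on the mathematical corpus. The sketch below shows the shape of such a run with the Hugging Face Trainer on a tiny placeholder corpus; the hyperparameters and data are stand-ins, not the settings used on the Ezra cluster.

```python
# Minimal sketch of continued pretraining from Code Llama weights.
# Assumptions: the placeholder corpus, sequence length, and hyperparameters
# are illustrative only; Llemma itself was trained on Proof-Pile-2.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

BASE = "codellama/CodeLlama-7b-hf"  # initialize from Code Llama weights

tokenizer = AutoTokenizer.from_pretrained(BASE)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(BASE)  # needs a large GPU in practice

# Tiny in-memory placeholder corpus standing in for the mathematical dataset.
raw = Dataset.from_dict({"text": [
    "Theorem: the sum of two even integers is even.",
    "def gcd(a, b):\n    return a if b == 0 else gcd(b, a % b)",
]})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=2048)

tokenized = raw.map(tokenize, batched=True, remove_columns=["text"])

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="llemma-continued-pretrain",
        per_device_train_batch_size=1,
        gradient_accumulation_steps=16,
        learning_rate=1e-5,
        max_steps=10,
        logging_steps=5,
    ),
    train_dataset=tokenized,
    # mlm=False gives the causal language-modeling objective.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```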

Llemma outperforms Minerva on chain-of-thought tasks when the two systems are compared at equal parameter counts, and the advantage holds when majority voting is used to select the final answer.
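Majority voting (often called self-consistency) simply samples several chain-of-thought completions per problem, extracts a final answer from each, and keeps the most frequent one. The sketch below shows that loop in isolation; generate_cot and the answer-extraction regex are hypothetical stand-ins, not the evaluation harness used in the paper.

```python
# Minimal sketch of majority voting over sampled chain-of-thought completions:
# sample several reasoning paths, extract the final answer from each, and
# return the most common one. `generate_cot` is a hypothetical stand-in for
# any temperature-sampled call to the model.
import re
from collections import Counter
from typing import Callable, List


def extract_answer(completion: str) -> str:
    """Pull a final numeric answer out of a chain-of-thought completion.
    Here we simply take the last number mentioned; real harnesses use a
    stricter answer format."""
    numbers = re.findall(r"-?\d+(?:\.\d+)?", completion)
    return numbers[-1] if numbers else ""


def majority_vote(problem: str,
                  generate_cot: Callable[[str], str],
                  num_samples: int = 16) -> str:
    """Sample `num_samples` reasoning chains and return the most frequent answer."""
    answers: List[str] = []
    for _ in range(num_samples):
        completion = generate_cot(problem)  # one sampled reasoning chain
        answer = extract_answer(completion)
        if answer:
            answers.append(answer)
    if not answers:
        return ""
    return Counter(answers).most_common(1)[0][0]


if __name__ == "__main__":
    # Dummy generator standing in for the language model, to show the flow.
    import random
    fake_completions = ["... so the answer is 42.",
                        "... therefore the result is 42.",
                        "... which gives 41."]
    print(majority_vote("toy problem",
                        lambda p: random.choice(fake_completions)))
```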


The development of Llemma is the result of teamwork amongst researchers at several universities and institutes, including Princeton, EleutherAI, the University of Toronto, Vector Institute, the University of Cambridge, Carnegie Mellon, and the University of Washington.

