Sep 13

Nvidia's Llama-3.1 Nemotron Ultra Beats DeepSeek R1 with Fewer Parameters

Nvidia's Llama-3.1 Nemotron Ultra outperforms DeepSeek R1 in benchmarks, showcasing advanced reasoning in a lightweight 253B model.

Editorial StaffEditor

Published September 13, 20251 min read709 views1 tags

Originally reported byventurebeat

Nvidia has introduced its latest large language model, the Llama-3.1 Nemotron Ultra, boasting 253 billion parameters and designed for advanced reasoning and AI assistant tasks. Announced on April 7, 2025, this fully open-source model outperforms Meta's DeepSeek R1, despite having less than half its total parameters. The model is now publicly available on Hugging Face, featuring open weights and post-training data. At the core of the Llama-3.1 Nemotron Ultra model is an architecture optimized for efficient inference, fine-tuned through Neural Architecture Search (NAS). This new design incorporates features like skipped attention layers and compressed feedforward networks, allowing deployment on a single 8x H100 GPU node, while reducing memory usage and computational requirements. The model is compatible with Nvidia's B100 and Hopper microarchitectures and can operate in two modes to handle varying complexity in tasks. Performance evaluations indicate significant improvements, particularly in reasoning-enabled mode. For example, the model scored 97% on the MATH500 benchmark, up from 80.4% when not enabled for reasoning. Such gains highlight its effectiveness in instruction following and general reasoning tasks, surpassing DeepSeek R1 in numerous areas. Developers can integrate the model with the Hugging Face Transformers library and customize performance based on specific task needs. With multilingual capabilities, Llama-3.1 Nemotron Ultra supports various applications, including chatbots, code generation, and retrieval-augmented generation. Released under the Nvidia Open Model License, the model is prepared for commercial use, with guidance on assessing its alignment and safety. Oleksii Kuchaiev from Nvidia expressed excitement about the model's launch, highlighting its innovative design and potential applications in AI development.

#news

Editorial StaffEditor

The Editorial Staff at AIChief is a team of professional content writers with extensive experience in AI and marketing. Founded in 2025, AIChief has quickly grown into the largest free AI resource hub in the industry.

View all posts

Reader feedback

What did you think of this story?

User Comments

Filter:

No comments yet. Be the first to comment!

View all news

YouTuber Hank Green Calls His AI Use 'Unhealthy

#ainews#hankgreen#chatgpt#unhealthyuse#authenticity

Hank Green, the accomplished novelist, comedian, and YouTuber boasting 3.2 million subscribers, recently issued an apology to his extensive audience regarding his increasing reliance on AI chatbots. T...

3 min readAugust 2, 2026

20m ago

Hot 100 Hit: Is It Just AI Slop?

#ainews#aimusic#fenixflexin#rubberz#aidetection

While absolute certainty remains elusive, evidence strongly suggests the involvement of artificial intelligence. Fenix Flexin, primarily recognized as half of the Los Angeles rap duo Shoreline Mafia,...

6 min readAugust 1, 2026

1h ago

Altman's persistent push: ChatGPT for parents.

#ainews#openai#chatgpt#parenting#aiethics

OpenAI CEO Sam Altman recently shared what he enthusiastically described as a "cool use case" for the company's new product, ChatGPT Work. On Friday, he posted that parents could "connect your family...

2 min readAugust 1, 2026

3h ago