At CES 2025, NVIDIA CEO Jensen Huang launched new Nemotron models, including the Llama Nemotron large language models (LLMs) and Cosmos Nemotron vision language models (VLMs), to improve agentic AI and boost enterprise productivity.
The Llama Nemotron models, built on Llama foundation models, allow developers to create AI agents for applications like customer support, fraud detection, and supply chain optimisation.
“Llama 3.1 is a complete phenomenon, with the downloads reaching 650,000 times. It has been derived and turned into other models, about 60,000 different models. It is singularly the reason why every single enterprise and every single industry has been activated to start working on AI,” said Huang.
“We realized that the Llama models could really be better fine-tuned for enterprise use, so we fine-tuned them using our expertise and capabilities and turned them into the Llama Nemotron suite of open models,” he added.
The Nemotron families will be offered in Nano, Super, and Ultra sizes to suit deployment needs, from low-latency real-time applications to high-accuracy data center use cases. Optimised for computing efficiency and accuracy, these models support agentic AI tasks like instructions for following, coding, and math.
“Agentic AI is the next frontier of AI development, and delivering on this opportunity requires full-stack optimization across a system of LLMs to deliver efficient, accurate AI agents,” said Ahmad Al-Dahle, vice president and head of GenAI at Meta.
NVIDIA announced that the models will be available as downloadable resources or as microservices for deployment across various computing platforms, including data centers and edge devices. Llama Nemotron and Cosmos Nemotron models will be available soon on build.nvidia.com, Hugging Face, and through the NVIDIA Developer Program.
Enterprise-grade deployments will be supported via the NVIDIA AI Enterprise platform on accelerated cloud and data center infrastructure.
NVIDIA’s Cosmos Nemotron models extend AI capabilities to vision and video tasks, allowing agents to analyse and respond to images and videos. These tools aim to support industries like autonomous systems, healthcare, retail, and media.
NVIDIA also unveiled Cosmos world foundation models for physics-aware video generation in robotics and autonomous vehicle applications.
NVIDIA NeMo microservices allow enterprises to customise these models for specific domains and workflows.
Leading AI platform providers, such as SAP and ServiceNow, have backed the Nemotron models. SAP plans to incorporate them into its Joule platform to improve enterprise user productivity, while ServiceNow seeks to utilise the models for AI agent services across various industries.
The models are built using NVIDIA’s NeMo platform for distillation, pruning, and alignment, ensuring high accuracy and throughput across various hardware configurations. NVIDIA NeMo Retriever allows integration with enterprise data, boosting model functionality through retrieval-augmented generation capabilities.
The post NVIDIA Unveils New Llama Nemotron Models to Build AI Agents appeared first on Analytics India Magazine.