Nvidia revealed the Nemotron Model Families during CES 2025. Showcasing Agentic AI which starts a new era of artificial intelligence where specialized agents can solve complex problems and automate repetitive tasks. Enterprises can achieve unprecedented productivity with custom AI agents, which require multiple generative AI models optimized for agentic AI functions.
NVIDIA has announced the Llama Nemotron family of open large language models (LLMs) to provide a foundation for enterprise agentic AI, enabling developers to create and deploy AI agents across various applications.
Moreover, the company has introduced new Cosmos Nemotron vision language models (VLMs) and NVIDIA NIM microservices for video search and summarization, allowing developers to create agents capable of analyzing and responding to images and videos from various sources, including autonomous machines, hospitals, stores, sports events, movies, and news.
NVIDIA: Open Llama Nemotron Models Optimize Compute Efficiency and Accuracy for AI Agents
The company has released the Llama Nemotron models, which are optimized for AI agent development. These models are trained using the company’s latest techniques and high-quality datasets, enhancing agentic capabilities. They excel in instruction following, chat, function calls, coding, and math, and are size-optimized for running on various computing platforms.
In addition, the Llama Nemotron model family is available as downloadable models and NVIDIA NIM microservices, offering industry-leading performance and seamless integration into agentic AI application workflows.
Customize and Connect to Business Knowledge With NeMo
The Llama Nemotron and Cosmos Nemotron model families come in Nano, Super, and Ultra sizes to provide options for deploying AI agents at every scale.
● Nano: The most cost-effective model optimized for real-time applications with low latency, ideal for deployment on PCs and edge devices.
● Super: A high-accuracy model offering exceptional throughput on a single GPU.
● Ultra: The highest-accuracy model, designed for data-center-scale applications demanding the highest performance.
NeMo microservices enable enterprises to customize models for specific use cases, accelerate model customization, and apply guardrails. Developers can integrate RAG capabilities and use NVIDIA Blueprints for agentic AI to create applications quickly. NVIDIA Cosmos Nemotron, Llama Nemotron, and NeMo Retriever are also available.
NeMo, NeMo Retriever, and NVIDIA Blueprints are all available with the NVIDIA AI Enterprise software platform.
Llama Nemotron and Cosmos Nemotron models will be available as hosted APIs, free for Developer Program members, and can be run on accelerated data center and cloud infrastructure.
Started his freelancing adventure in 2018 and began doing freelance Audio Engineering work and then started freelance writing a few years later.
Currently he writes for Gadget Pilipinas and Grit.PH.
He is also a musician, foody, gamer, and PC enthusiast.