NVLM 1.0 - a nvidia Collection

nvidia 's Collections

NVIDIA Nemotron v3

Nemotron-Cascade 2

BioNeMo - Design

MedTech Open Models

Nemotron-Terminal

Nemotron Speech

Inference Optimized Checkpoints (with Model Optimizer)

Nemotron OCR and Object Detection

Nemotron ColEmbed V2

Steering Reasoning VLAs

NVIDIA Cosmos 2

Nemotron-Cascade

Nemotron-Post-Training-v3

Nemotron-Pre-Training-Datasets

NVIDIA Nemotron V2

Speculative Decoding Modules

Cosmos-Drive-Dreams

Reward Models 10-2025

BioNeMo - Understand

BioNeMo - Optimize

Cosmos-Predict2.5

Nemotron-Personas

Llama-Embed-Nemotron-8B

Reasoning Efficiency Research

OpenReasoning-Nemotron

Cosmos-Predict2

Reward Models 06-2025

Cosmos-Transfer2.5

Describe Anything

OpenMathReasoning

OpenCodeReasoning

OpenCodeReasoning-II

Llama Nemotron Feedback-Edit Inference-Time Scaling

Scoring Verifiers

Nemotron-UltraLong

Cosmos-Transfer1

Cosmos-Tokenize1

Cosmos-Predict1

Cosmos-Tokenizer

Llama-3.1-Nemotron-70B

NVILA-Speech-Audio-Setups

NeMo Audio Codecs

Optimized ONNX models for NVIDIA RTX GPUs

Nemotron 4 340B

Llama3-ChatQA-1.5

PS3: Scaling Vision Pre-Training to 4K Resolution

Llama3-ChatQA-2

NeMo Curator - Classifier Models

Nemotron v3 Pre-Training

NVLM 1.0

updated 8 days ago

A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks.