nvidia

GitHub
https://github.com/nvidia

nvidia / nemotron-3-nano-30b-a3b

Nemotron-3-Nano-30B-A3B is a large language model (LLM) trained from scratch by NVIDIA

252 runs
Public

nvidia / chronoedit

ChronoEdit-14B enables physics-aware image editing and action-conditioned world simulation through temporal reasoning.

433 runs
Public

nvidia / nemotron-nano-v2-12b-vl

A multi-modal AI model for visual Q&A, summarization, and data extraction, supporting text, images, and video.

822 runs
Public

nvidia / canary-qwen-2.5b

🎤The best open-source speech-to-text model as of Jul 2025, transcribing audio with record 5.63% WER and enabling AI tasks like summarization directly from speech✨

8.9K runs
Public

nvidia / pdf-to-podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

812 runs
Public

nvidia / sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

1M runs
Public

nvidia / sana

A fast image model with wide artistic range and resolutions up to 4096x4096

226.1K runs
Public

nvidia / parakeet-rnnt-1.1b

🗣️ Nvidia + Suno.ai's speech-to-text conversion with high accuracy and efficiency 📝

20.4K runs
Public

nvidia / prismer

A Vision-Language Model with An Ensemble of Experts

1.7K runs
Public