nvidia

GitHub
https://github.com/nvidia

nvidia / chronoedit

ChronoEdit-14B enables physics-aware image editing and action-conditioned world simulation through temporal reasoning.

167 runs
Public

nvidia / nemotron-nano-v2-12b-vl

A multi-modal AI model for visual Q&A, summarization, and data extraction, supporting text, images, and video.

674 runs
Public

nvidia / canary-qwen-2.5b

🎤 The best open-source speech-to-text model as of Jul 2025, transcribing audio with a record 5.63% WER and enabling AI tasks like summarization directly from speech ✨

6.6K runs
Public

nvidia / sana-sprint-1.6b

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

836K runs
Public

nvidia / pdf-to-podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

732 runs
Public

nvidia / sana

A fast image model with wide artistic range and resolutions up to 4096x4096

209.4K runs
Public

nvidia / parakeet-rnnt-1.1b

🗣️ NVIDIA + Suno.ai's speech-to-text model with high accuracy and efficiency 📝

18.9K runs
Public

nvidia / prismer

A Vision-Language Model with An Ensemble of Experts

1.7K runs
Public