⌘KCtrl+K

Your Privacy Choices

Copyright © 2026 NVIDIA Corporation

Models

Deploy and scale models on your GPU infrastructure of choice with NVIDIA NIM inference microservices

Optimized by NVIDIA Launch from Hugging FaceBeta

Filters

Free Endpoint

52

Partner Endpoint

56

Download Available

107

Use Case

Retrieval Augmented Generation

14

Drug Discovery

13

Image-to-Text

11

Code Generation

10

Speech-to-Text

9

Inference Providers

Deep Infra

42

Together AI

32

GMI Cloud

18

Bitdeer AI

17

CoreWeave

10

Publisher

NVIDIA

82

Meta

11

Mistral AI

10

Google

6

Qwen

6

156 models

Sort By

Free Endpoint

deepseek-v4-flash

DeepSeek V4 Flash is a 284B MoE model with 1M-token context optimized for fast coding and agents.

3d

Items per page

of 7 pages

213K

Free Endpoint

deepseek-v4-pro

DeepSeek V4 scales to 1M-token context windows with efficient MoE architecture for coding tasks.

265K

3d

Downloadable

glm-5.1

GLM-5.1 is a flagship LLM for agentic workflows, coding, and long-horizon reasoning tasks.

1.27M

1w

Free Endpoint

glm-4.7

GLM-4.7 is a multilingual agentic coding partner with stronger reasoning, tool use, and UI skills.

3.55M

1w

Downloadable

NVIDIA AI for Media Relighting

Re-illuminate people in video to match target lighting from a 360 HDRI environment map.

224

1w

Free Endpoint

nemotron-3-content-safety

Multilingual, multimodal model for detecting unsafe and toxic content.

19.1K

1w

DownloadableFree Endpoint

synthetic-video-detector

NVIDIA Synthetic Video Detector is an AI-powered micro-service for detecting AI‑generated (synthetic) videos.

6.18K

1w

DownloadableFree Endpoint

Active Speaker Detection

Detect and track speaker identities across video frames.

216

1w

Downloadable

LipSync

Generative lip dubbing that syncs lips in a video to input audio.

1w

Downloadable

ising-calibration-1-35b-a3b

Open VLM for quantum computer calibration chart understanding across a range of qubit modalities.

74.27K

1w

Free Endpoint

minimax-m2.7

MiniMax M2.7 is a 230B-parameter text-to-text AI model excelling in coding, reasoning, and office tasks.

3.6M

2w

Downloadable

gemma-4-31b-it

Dense 31B model delivering frontier reasoning for coding, agentic workflows, and fine-tuning.

3.22M

3w

Downloadable

llama-nemotron-rerank-vl-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

7K

3w

Downloadable

mistral-small-4-119b-2603

Hybrid MoE model unifying instruct, reasoning, and coding with multimodal input and 256k context

code generation

7.94M

1mo

Free Endpoint

nemotron-voicechat

Nemotron 3 Voicechat

3.46K

1mo

Downloadable

nemotron-asr-streaming

Real-time speech recognition for English

Automatic Speech Recognition

23.17K

1mo

Black-forest-labs

Downloadable

flux.2-klein-4b

FLUX.2-klein-4B is a distilled image generation and editing model, producing outputs at lighting speed

104K

1mo

Downloadable

nemotron-ocr-v1

Powerful OCR model for fast, accurate real-world image text extraction, layout, and structure analysis.

Table Extraction

1.85M

1mo

Downloadable

nemotron-3-super-120b-a12b

Open, efficient hybrid Mamba-Transformer MoE with 1M context, excelling in agentic reasoning, coding, planning, tool calling, and more

42.61M

1mo

Downloadable

llama-nemotron-rerank-1b-v2

GPU-accelerated model optimized for providing a probability score that a given passage contains the information to answer a question.

176K

1mo

Downloadable

qwen3.5-122b-a10b

122B MoE LLM (10B active) for coding, reasoning, multimodal chat. Agent-ready.

7.52M

1mo

Downloadable

nemotron-table-structure-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

15.6K

1mo

Downloadable

nemotron-page-elements-v3

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

29.09K

1mo

Downloadable

nemotron-graphic-elements-v1

Model for object detection, fine-tuned to detect charts, tables, and titles in documents.

Object Detection

9.49K

1mo