Generic NIM GPU recommendations

This page lists suggested minimum and scaled GPU configurations for each NIM image, for quick planning alongside the Generic configuration guide. Each entry has the form NxGPU: for example, 2xH100 means two H100 GPUs.

Name GPU Requirements
nvcr.io/nim/baidu/paddleocr:1.3.0 1xA100, 1xA10G, 1xT4
nvcr.io/nim/bigcode/starcoder2-7b:1.8.1 1xH100, 2xH100
nvcr.io/nim/black-forest-labs/flux.1-dev:1.0.1 1xA100, 1xA10G, 1xB200, 1xH100, 1xL4, 1xL40S
nvcr.io/nim/colabfold/msa-search:1.0.0 1xA100, 1xH100, 1xL40S
nvcr.io/nim/deepseek-ai/deepseek-r1-distill-llama-8b:1.5.2 1xA100, 2xA10G, 1xH100, 4xL40S
nvcr.io/nim/deepseek-ai/deepseek-r1-distill-qwen-14b:1.1.0 1xH20, 1xH200, 1xL20, 1xL40S
nvcr.io/nim/deepseek-ai/deepseek-r1-distill-qwen-32b:1.1.0 1xH100, 1xH20, 1xH200, 2xH200, 1xL20, 2xL20, 2xL40S
nvcr.io/nim/deepseek-ai/deepseek-r1-distill-qwen-7b:1.1.0 1xA10G, 1xL40S
nvcr.io/nim/defog/llama-3-sqlcoder-8b:1.2.3 1xA10G, 2xA10G, 4xA10G, 1xH100, 2xH100, 1xL40S, 2xL40S
nvcr.io/nim/google/gemma-2-2b-instruct:1.4.0 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/google/gemma-2-9b-it:1.4.0 1xA100, 1xA10G, 1xH100, 1xL40S, 2xT4
nvcr.io/nim/ipd/proteinmpnn:1 1xA10G, 1xL40S
nvcr.io/nim/ipd/rfdiffusion:2 1xA100, 1xH100, 1xL40S
nvcr.io/nim/meta/codellama-13b-instruct:1.2.2 2xA100, 4xA100, 4xA10G, 8xA10G, 2xH100, 4xH100
nvcr.io/nim/meta/codellama-34b-instruct:1.2.2 2xA100, 4xA100, 4xA10G, 8xA10G, 2xH100, 4xH100, 4xL40S
nvcr.io/nim/meta/codellama-70b-instruct:1.2.2 4xA100, 8xA100, 8xA10G, 4xH100, 8xH100
nvcr.io/nim/meta/llama-2-13b-chat:1.0.3 1xA100, 2xA100, 1xH100, 2xH100, 1xL40S, 2xL40S
nvcr.io/nim/meta/llama-2-70b-chat:1.0.3 4xA100, 8xA100, 2xH100, 4xH100, 8xH100, 4xL40S
nvcr.io/nim/meta/llama-2-7b-chat:1.0.3 1xA100, 2xA100, 1xH100, 2xH100, 1xL40S, 2xL40S
nvcr.io/nim/meta/llama-3.1-70b-instruct-pb24h2:1.3.6 4xA100, 8xA100, 2xH100, 4xH100, 8xH100, 1xH200, 2xH200, 4xH200, 4xL40S
nvcr.io/nim/meta/llama-3.1-70b-instruct:1.8.4 4xA100, 8xA100, 2xH100, 4xH100, 8xH100, 1xH200, 2xH200, 4xH200, 4xL40S
nvcr.io/nim/meta/llama-3.1-8b-base:1.1.2 1xA100, 2xA100, 2xA10G, 4xA10G, 1xH100, 2xH100, 2xL40S
nvcr.io/nim/meta/llama-3.1-8b-instruct-pb24h2:1.3.6 2xA10G, 1xH100, 1xL40S
nvcr.io/nim/meta/llama-3.1-8b-instruct:1.8.4 2xA10G, 1xH100, 1xL40S
nvcr.io/nim/meta/llama-3.2-11b-vision-instruct:1.1.1 1xA100, 2xA100, 4xA10G, 8xA10G, 1xH100, 2xH100, 1xH200, 2xH200, 2xL40S, 4xL40S
nvcr.io/nim/meta/llama-3.2-1b-instruct:1.8.5 1xA10G, 1xL40S
nvcr.io/nim/meta/llama-3.2-3b-instruct:1.8.4 1xA100, 2xA10G, 1xH100, 2xL40S
nvcr.io/nim/meta/llama-3.2-90b-vision-instruct:1.1.1 4xA100, 8xA100, 2xH100, 4xH100, 8xH100, 1xH200, 2xH200, 4xH200, 8xL40S
nvcr.io/nim/meta/llama-3.3-70b-instruct:1.8.5 8xA100, 8xH100, 4xH200, 8xL40S
nvcr.io/nim/meta/llama3-70b-instruct:1.0.3 4xA100, 8xA10G, 4xH100, 8xH100, 8xL40S
nvcr.io/nim/meta/llama3-8b-instruct:1.0.3 1xA100, 2xA100, 1xA10G, 2xA10G, 1xH100, 2xH100, 1xL40S, 2xL40S
nvcr.io/nim/microsoft/phi-3-mini-4k-instruct:1.2.3 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/mistralai/mistral-7b-instruct-v0.3:1.3.0 1xA100, 2xA100, 2xA10G, 4xA10G, 1xH100, 2xH100, 1xL40S, 2xL40S
nvcr.io/nim/mistralai/mixtral-8x22b-instruct-v01:1.2.2 8xA100, 8xH100
nvcr.io/nim/mistralai/mixtral-8x7b-instruct-v01:1.3 2xA100, 4xA100, 8xA10G, 2xH100, 4xH100, 4xL40S
nvcr.io/nim/nv-mistralai/mistral-nemo-12b-instruct:1.2.2 1xA100, 2xA100, 8xA10G, 1xH100, 2xH100
nvcr.io/nim/nv-mistralai/mistral-nemo-minitron-8b-8k-instruct:1.2.3 1xA100, 2xA100, 2xA10G, 4xA10G, 1xH100, 2xH100, 1xL40S, 2xL40S
nvcr.io/nim/nvidia/cosmos-predict1-7b-text2world:1.0.0 1xH100
nvcr.io/nim/nvidia/cosmos-predict1-7b-video2world:1.0.0 1xH100
nvcr.io/nim/nvidia/domino-automotive-aero:1.0.0 1xA100, 1xH100, 1xL40S
nvcr.io/nim/nvidia/genmol:1.0.0 1xA100, 1xA10G, 1xA6000, 1xH100, 1xL40S
nvcr.io/nim/nvidia/llama-3.1-nemoguard-8b-content-safety:1.0.0 1xL40S, 4xL40S, 8xL40S
nvcr.io/nim/nvidia/llama-3.1-nemoguard-8b-topic-control:1.0.0 1xL40S, 4xL40S, 8xL40S
nvcr.io/nim/nvidia/llama-3.1-nemotron-70b-instruct:1.2 4xA100, 8xA100, 2xH100, 4xH100, 8xH100, 8xL40S
nvcr.io/nim/nvidia/llama-3.1-nemotron-nano-8b-v1:1.8.3 1xA100, 2xA100, 2xA10G, 1xH100, 2xH100, 1xH200, 2xH200, 1xL40S, 2xL40S, 4xL40S
nvcr.io/nim/nvidia/llama-3.1-nemotron-ultra-253b-v1:1.8.4 4xB100, 4xH100, 8xH100
nvcr.io/nim/nvidia/llama-3.2-nv-embedqa-1b-v2:1.6.0 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/nvidia/llama-3.2-nv-rerankqa-1b-v2:1.5.0 1xA100, 1xA10G, 1xH100, 1xL4, 1xL40S
nvcr.io/nim/nvidia/llama-3.3-nemotron-super-49b-v1:1.8.4 4xA100, 8xA100, 8xA10G, 1xH100, 2xH100, 4xH100, 8xH100, 1xH200, 2xH200, 4xH200, 4xL40S, 8xL40S
nvcr.io/nim/nvidia/molmim:1.0.0 1xA10G, 1xL40S
nvcr.io/nim/nvidia/nemoguard-jailbreak-detect:1.0.0 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/nvidia/nemoretriever-graphic-elements-v1:1.3 1xA100, 1xA10G, 1xB200, 1xH100, 1xL4, 1xL40S
nvcr.io/nim/nvidia/nemoretriever-page-elements-v2:1.3 1xA100, 1xA10G, 1xB200, 1xH100, 1xL4, 1xL40S
nvcr.io/nim/nvidia/nemoretriever-parse:1.2 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/nvidia/nemoretriever-table-structure-v1:1.3.0 1xA100, 1xA10G, 1xB200, 1xH100, 1xL4, 1xL40S
nvcr.io/nim/nvidia/nv-embedqa-e5-v5-pb24h2:1.2.3 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/nvidia/nv-embedqa-e5-v5:1.6.0 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/nvidia/nv-embedqa-mistral-7b-v2:1.0.1 1xA100, 1xH100, 1xL40S
nvcr.io/nim/nvidia/nv-rerankqa-mistral-4b-v3:1.0.2 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/nvidia/nv-yolox-page-elements-v1:1.1.0-rtx 1xA100, 1xA10G, 1xL40S
nvcr.io/nim/nvidia/nvclip:2.0.0 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/openfold/openfold2:1.0 1xA100, 1xH100
nvcr.io/nim/qwen/qwen-2.5-7b-instruct:1.0.0 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/snowflake/arctic-embed-l:1.0.1 1xA100, 1xA10G, 1xH100, 1xL40S
nvcr.io/nim/tokyotech-llm/llama-3-swallow-70b-instruct-v0.1:1.1.2 2xA100, 4xA10G, 2xH100, 4xH100, 2xL40S
nvcr.io/nim/tokyotech-llm/llama-3.1-swallow-70b-instruct-v0.1:1.3.2 4xA100, 4xH100, 2xH200, 8xL40S
nvcr.io/nim/yentinglin/llama-3-taiwan-70b-instruct:1.1.2 2xA100, 4xA10G, 2xH100, 4xH100, 2xL40S
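
For scripted planning, the rows above can be read mechanically. The following is a minimal Python sketch, not part of any NIM tooling: the helper names `parse_row` and `configs_for` are hypothetical, and it assumes each row is a single line with the image name first, followed by comma-separated entries of the form NxGPU.

```python
import re

# Matches entries such as "1xA100" or "8xL40S": a count, "x", then a GPU model.
GPU_ENTRY = re.compile(r"^(\d+)x([A-Za-z0-9]+)$")


def parse_row(row):
    """Split one table row into the image name and its (count, model) pairs.

    Hypothetical helper: assumes the image name contains no spaces and the
    GPU entries are comma-separated.
    """
    image, _, requirements = row.strip().partition(" ")
    configs = []
    for entry in requirements.split(","):
        match = GPU_ENTRY.match(entry.strip())
        if match is None:
            raise ValueError(f"unrecognized GPU entry: {entry!r}")
        configs.append((int(match.group(1)), match.group(2)))
    return image, configs


def configs_for(configs, model):
    """Return the sorted GPU counts listed for one GPU model."""
    return sorted(count for count, name in configs if name == model)


row = "nvcr.io/nim/bigcode/starcoder2-7b:1.8.1 1xH100, 2xH100"
image, configs = parse_row(row)
print(image, configs_for(configs, "H100"))
```

Reading the smallest count for a GPU model as the minimum configuration and larger counts as scaled ones is an interpretation of the table, so check it against the Generic configuration guide before relying on it.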