ModelList.md

Model List

LLM Models

Model Name	Supported Runtimes
Deepseek R1 Distill Llama 8B	Intel CPU, Intel GPU, Intel NPU
Deepseek R1 Distill Qwen 1.5B	Qualcomm NPU, Qualcomm GPU, AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Deepseek R1 Distill Qwen 14B	NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Deepseek R1 Distill Qwen 7B	AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Llama 3.1 8B Instruct	Qualcomm NPU, AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Llama 3.2 1B Instruct	Qualcomm NPU, Qualcomm GPU, AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Mistral 7B Instruct V0.2	AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Mistral 7B Instruct V0.3	Intel CPU, Intel GPU
Phi 3 Mini 128K Instruct	Qualcomm NPU, AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Phi 3 Mini 4K Instruct	Qualcomm NPU, AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Phi 3.5 Mini Instruct	Qualcomm NPU, Qualcomm GPU, AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Phi 4	NVIDIA TensorRT for RTX, Intel CPU, Intel GPU
Phi 4 Mini Instruct	Qualcomm NPU, AMD NPU, Intel CPU, Intel GPU, Intel NPU
Phi 4 Mini Reasoning	AMD NPU, Intel CPU, Intel GPU, Intel NPU
Phi 4 Reasoning	Intel NPU
Phi 4 Reasoning Plus	Intel NPU
Qwen2.5 0.5B Instruct	AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Qwen2.5 1.5B Instruct	Qualcomm NPU, Qualcomm GPU, AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Qwen2.5 14B Instruct	NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Qwen2.5 3B Instruct	Intel CPU, Intel GPU, Intel NPU
Qwen2.5 7B Instruct	Qualcomm NPU, AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Qwen2.5 Coder 0.5B Instruct	AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Qwen2.5 Coder 1.5B Instruct	AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Qwen2.5 Coder 14B Instruct	NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU
Qwen2.5 Coder 3B Instruct	Intel CPU, Intel GPU, Intel NPU
Qwen2.5 Coder 7B Instruct	AMD NPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU

Non-LLM Models

Model Name	Supported Runtimes
Bert Base Multilingual Cased	Qualcomm NPU, Qualcomm GPU, AMD NPU, AMD GPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Bert Base Uncased Mrpc	Qualcomm NPU, Qualcomm GPU, AMD NPU, AMD GPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Chinese Clip Vit Base Patch16	Intel CPU, Intel GPU, Intel NPU
Clip Vit B 32 Laion2B S34B B79K	Qualcomm NPU, Qualcomm GPU, AMD NPU, AMD GPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Clip Vit Base Patch16	Qualcomm NPU, Qualcomm GPU, AMD NPU, AMD GPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Clip Vit Base Patch32	Qualcomm NPU, Qualcomm GPU, AMD NPU, AMD GPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Clip Vit Large Patch14	Qualcomm NPU, AMD NPU, AMD GPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Resnet 50	Qualcomm NPU, Qualcomm GPU, AMD NPU, AMD GPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Stable Diffusion V1 5	Qualcomm NPU, Intel CPU, Intel GPU
Vit Base Patch16 224	Qualcomm NPU, Qualcomm GPU, AMD NPU, AMD GPU, NVIDIA TensorRT for RTX, Intel CPU, Intel GPU, Intel NPU, DirectML
Whisper Large V3 Turbo	Qualcomm NPU
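
The tables above can also be queried programmatically, for example to find every model that lists a given runtime. The following is a minimal sketch, assuming the rows are available as tab-separated text with comma-separated runtimes, as laid out here. The names `ROWS`, `parse_rows`, and `models_for` are hypothetical helpers introduced for illustration and are not part of any toolkit API; only four rows are copied in to keep the example short.

```python
# A few rows copied from the tables above, in the same layout:
# model name, a tab, then a comma-separated list of runtimes.
ROWS = """\
Phi 4\tNVIDIA TensorRT for RTX, Intel CPU, Intel GPU
Phi 4 Reasoning\tIntel NPU
Stable Diffusion V1 5\tQualcomm NPU, Intel CPU, Intel GPU
Whisper Large V3 Turbo\tQualcomm NPU
"""


def parse_rows(text):
    """Map each model name to the set of runtimes listed for it."""
    table = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        name, runtimes = line.split("\t", 1)
        table[name.strip()] = {r.strip() for r in runtimes.split(",")}
    return table


def models_for(table, runtime):
    """Return the sorted model names that list the given runtime."""
    return sorted(name for name, rts in table.items() if runtime in rts)


table = parse_rows(ROWS)
print(models_for(table, "Qualcomm NPU"))
# → ['Stable Diffusion V1 5', 'Whisper Large V3 Turbo']
```

Storing the runtimes as a set makes the lookup an exact match on the full runtime name, so a query for "Intel GPU" does not accidentally match "Intel CPU" or a partial string.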
