ONNX 模型库
返回模型

暂无说明文档

onnxruntime/DeepSeek-R1-Distill-ONNX

作者 onnxruntime

text-generation transformers
↓ 8 ♥ 8

创建时间: 2025-01-29 22:44:14+00:00

更新时间: 2026-02-11 00:13:23+00:00

在 Hugging Face 上查看

文件 (52)

.gitattributes
LICENSE
README.md
config.json
deepseek-r1-distill-llama-8B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/genai_config.json
deepseek-r1-distill-llama-8B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/model.onnx ONNX
deepseek-r1-distill-llama-8B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/model.onnx.data
deepseek-r1-distill-llama-8B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/special_tokens_map.json
deepseek-r1-distill-llama-8B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.json
deepseek-r1-distill-llama-8B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer_config.json
deepseek-r1-distill-llama-8B/gpu/gpu-int4-rtn-block-32/genai_config.json
deepseek-r1-distill-llama-8B/gpu/gpu-int4-rtn-block-32/model.onnx ONNX
deepseek-r1-distill-llama-8B/gpu/gpu-int4-rtn-block-32/model.onnx.data
deepseek-r1-distill-llama-8B/gpu/gpu-int4-rtn-block-32/special_tokens_map.json
deepseek-r1-distill-llama-8B/gpu/gpu-int4-rtn-block-32/tokenizer.json
deepseek-r1-distill-llama-8B/gpu/gpu-int4-rtn-block-32/tokenizer_config.json
deepseek-r1-distill-qwen-1.5B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/genai_config.json
deepseek-r1-distill-qwen-1.5B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/model.onnx ONNX
deepseek-r1-distill-qwen-1.5B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/model.onnx.data
deepseek-r1-distill-qwen-1.5B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/special_tokens_map.json
deepseek-r1-distill-qwen-1.5B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.json
deepseek-r1-distill-qwen-1.5B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer_config.json
deepseek-r1-distill-qwen-1.5B/gpu/gpu-int4-rtn-block-32/genai_config.json
deepseek-r1-distill-qwen-1.5B/gpu/gpu-int4-rtn-block-32/model.onnx ONNX
deepseek-r1-distill-qwen-1.5B/gpu/gpu-int4-rtn-block-32/model.onnx.data
deepseek-r1-distill-qwen-1.5B/gpu/gpu-int4-rtn-block-32/special_tokens_map.json
deepseek-r1-distill-qwen-1.5B/gpu/gpu-int4-rtn-block-32/tokenizer.json
deepseek-r1-distill-qwen-1.5B/gpu/gpu-int4-rtn-block-32/tokenizer_config.json
deepseek-r1-distill-qwen-14B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/genai_config.json
deepseek-r1-distill-qwen-14B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/model.onnx ONNX
deepseek-r1-distill-qwen-14B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/model.onnx.data
deepseek-r1-distill-qwen-14B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/special_tokens_map.json
deepseek-r1-distill-qwen-14B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.json
deepseek-r1-distill-qwen-14B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer_config.json
deepseek-r1-distill-qwen-14B/gpu/gpu-int4-rtn-block-32/genai_config.json
deepseek-r1-distill-qwen-14B/gpu/gpu-int4-rtn-block-32/model.onnx ONNX
deepseek-r1-distill-qwen-14B/gpu/gpu-int4-rtn-block-32/model.onnx.data
deepseek-r1-distill-qwen-14B/gpu/gpu-int4-rtn-block-32/special_tokens_map.json
deepseek-r1-distill-qwen-14B/gpu/gpu-int4-rtn-block-32/tokenizer.json
deepseek-r1-distill-qwen-14B/gpu/gpu-int4-rtn-block-32/tokenizer_config.json
deepseek-r1-distill-qwen-7B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/genai_config.json
deepseek-r1-distill-qwen-7B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/model.onnx ONNX
deepseek-r1-distill-qwen-7B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/model.onnx.data
deepseek-r1-distill-qwen-7B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/special_tokens_map.json
deepseek-r1-distill-qwen-7B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.json
deepseek-r1-distill-qwen-7B/cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer_config.json
deepseek-r1-distill-qwen-7B/gpu/gpu-int4-rtn-block-32/genai_config.json
deepseek-r1-distill-qwen-7B/gpu/gpu-int4-rtn-block-32/model.onnx ONNX
deepseek-r1-distill-qwen-7B/gpu/gpu-int4-rtn-block-32/model.onnx.data
deepseek-r1-distill-qwen-7B/gpu/gpu-int4-rtn-block-32/special_tokens_map.json
deepseek-r1-distill-qwen-7B/gpu/gpu-int4-rtn-block-32/tokenizer.json
deepseek-r1-distill-qwen-7B/gpu/gpu-int4-rtn-block-32/tokenizer_config.json