ONNX Model Library

No description available.

microsoft/Phi-3-mini-4k-instruct-onnx

Author: microsoft

Tags: text-generation, transformers
Downloads: 495 · Likes: 144

Created: 2024-04-23 02:19:22+00:00

Updated: 2025-12-10 21:04:48+00:00

Files (56)

.gitattributes
LICENSE
README.md
config.json
configuration_phi3.py
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/added_tokens.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/config.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/configuration_phi3.py
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/genai_config.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/phi3-mini-4k-instruct-cpu-int4-rtn-block-32-acc-level-4.onnx
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/phi3-mini-4k-instruct-cpu-int4-rtn-block-32-acc-level-4.onnx.data
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/special_tokens_map.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.model
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer_config.json
cpu_and_mobile/cpu-int4-rtn-block-32/added_tokens.json
cpu_and_mobile/cpu-int4-rtn-block-32/config.json
cpu_and_mobile/cpu-int4-rtn-block-32/configuration_phi3.py
cpu_and_mobile/cpu-int4-rtn-block-32/genai_config.json
cpu_and_mobile/cpu-int4-rtn-block-32/phi3-mini-4k-instruct-cpu-int4-rtn-block-32.onnx
cpu_and_mobile/cpu-int4-rtn-block-32/phi3-mini-4k-instruct-cpu-int4-rtn-block-32.onnx.data
cpu_and_mobile/cpu-int4-rtn-block-32/special_tokens_map.json
cpu_and_mobile/cpu-int4-rtn-block-32/tokenizer.json
cpu_and_mobile/cpu-int4-rtn-block-32/tokenizer.model
cpu_and_mobile/cpu-int4-rtn-block-32/tokenizer_config.json
cuda/cuda-fp16/added_tokens.json
cuda/cuda-fp16/config.json
cuda/cuda-fp16/configuration_phi3.py
cuda/cuda-fp16/genai_config.json
cuda/cuda-fp16/phi3-mini-4k-instruct-cuda-fp16.onnx
cuda/cuda-fp16/phi3-mini-4k-instruct-cuda-fp16.onnx.data
cuda/cuda-fp16/special_tokens_map.json
cuda/cuda-fp16/tokenizer.json
cuda/cuda-fp16/tokenizer.model
cuda/cuda-fp16/tokenizer_config.json
cuda/cuda-int4-rtn-block-32/added_tokens.json
cuda/cuda-int4-rtn-block-32/config.json
cuda/cuda-int4-rtn-block-32/configuration_phi3.py
cuda/cuda-int4-rtn-block-32/genai_config.json
cuda/cuda-int4-rtn-block-32/phi3-mini-4k-instruct-cuda-int4-rtn-block-32.onnx
cuda/cuda-int4-rtn-block-32/phi3-mini-4k-instruct-cuda-int4-rtn-block-32.onnx.data
cuda/cuda-int4-rtn-block-32/special_tokens_map.json
cuda/cuda-int4-rtn-block-32/tokenizer.json
cuda/cuda-int4-rtn-block-32/tokenizer.model
cuda/cuda-int4-rtn-block-32/tokenizer_config.json
data_summary_card.md
directml/directml-int4-awq-block-128/added_tokens.json
directml/directml-int4-awq-block-128/config.json
directml/directml-int4-awq-block-128/configuration_phi3.py
directml/directml-int4-awq-block-128/genai_config.json
directml/directml-int4-awq-block-128/model.onnx
directml/directml-int4-awq-block-128/model.onnx.data
directml/directml-int4-awq-block-128/special_tokens_map.json
directml/directml-int4-awq-block-128/tokenizer.json
directml/directml-int4-awq-block-128/tokenizer.model
directml/directml-int4-awq-block-128/tokenizer_config.json
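
The listing above groups the model files into five variant folders, each with its own ONNX graph, weights file, and tokenizer files. As a minimal sketch, assuming the `huggingface_hub` package is installed, one way to fetch a single variant instead of the whole repository is to pass a folder pattern to `snapshot_download`. The helper names `variant_patterns` and `download_variant` and the default `local_dir` value are hypothetical and chosen only for illustration.

```python
from typing import List

REPO_ID = "microsoft/Phi-3-mini-4k-instruct-onnx"

# Variant folders exactly as they appear in the file listing.
VARIANTS = [
    "cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4",
    "cpu_and_mobile/cpu-int4-rtn-block-32",
    "cuda/cuda-fp16",
    "cuda/cuda-int4-rtn-block-32",
    "directml/directml-int4-awq-block-128",
]


def variant_patterns(variant: str) -> List[str]:
    """Return glob patterns matching only the files under one variant folder."""
    if variant not in VARIANTS:
        raise ValueError(f"unknown variant folder: {variant!r}")
    return [f"{variant}/*"]


def download_variant(variant: str, local_dir: str = "phi3-onnx") -> str:
    """Download one variant folder and return the local snapshot path.

    Hypothetical helper: it needs network access and the huggingface_hub package.
    """
    # Imported lazily so variant_patterns() works without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=variant_patterns(variant),
        local_dir=local_dir,
    )


if __name__ == "__main__":
    # Only prints the pattern; call download_variant(...) to actually fetch files.
    print(variant_patterns("cuda/cuda-fp16"))
```

Restricting the download to one folder avoids pulling the weights for the other four variants, which are not needed when targeting a single execution provider.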