返回模型
暂无说明文档
microsoft/Phi-3-mini-128k-instruct-onnx
作者 microsoft
text-generation
transformers
↓ 105
♥ 192
创建时间: 2024-04-23 02:20:03+00:00
更新时间: 2025-05-30 21:00:32+00:00
在 Hugging Face 上查看文件 (55)
.gitattributes
LICENSE
README.md
config.json
configuration_phi3.py
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/added_tokens.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/config.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/configuration_phi3.py
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/genai_config.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/phi3-mini-128k-instruct-cpu-int4-rtn-block-32-acc-level-4.onnx
ONNX
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/phi3-mini-128k-instruct-cpu-int4-rtn-block-32-acc-level-4.onnx.data
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/special_tokens_map.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.model
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer_config.json
cpu_and_mobile/cpu-int4-rtn-block-32/added_tokens.json
cpu_and_mobile/cpu-int4-rtn-block-32/config.json
cpu_and_mobile/cpu-int4-rtn-block-32/configuration_phi3.py
cpu_and_mobile/cpu-int4-rtn-block-32/genai_config.json
cpu_and_mobile/cpu-int4-rtn-block-32/phi3-mini-128k-instruct-cpu-int4-rtn-block-32.onnx
ONNX
cpu_and_mobile/cpu-int4-rtn-block-32/phi3-mini-128k-instruct-cpu-int4-rtn-block-32.onnx.data
cpu_and_mobile/cpu-int4-rtn-block-32/special_tokens_map.json
cpu_and_mobile/cpu-int4-rtn-block-32/tokenizer.json
cpu_and_mobile/cpu-int4-rtn-block-32/tokenizer.model
cpu_and_mobile/cpu-int4-rtn-block-32/tokenizer_config.json
cuda/cuda-fp16/added_tokens.json
cuda/cuda-fp16/config.json
cuda/cuda-fp16/configuration_phi3.py
cuda/cuda-fp16/genai_config.json
cuda/cuda-fp16/phi3-mini-128k-instruct-cuda-fp16.onnx
ONNX
cuda/cuda-fp16/phi3-mini-128k-instruct-cuda-fp16.onnx.data
cuda/cuda-fp16/special_tokens_map.json
cuda/cuda-fp16/tokenizer.json
cuda/cuda-fp16/tokenizer.model
cuda/cuda-fp16/tokenizer_config.json
cuda/cuda-int4-rtn-block-32/added_tokens.json
cuda/cuda-int4-rtn-block-32/config.json
cuda/cuda-int4-rtn-block-32/configuration_phi3.py
cuda/cuda-int4-rtn-block-32/genai_config.json
cuda/cuda-int4-rtn-block-32/phi3-mini-128k-instruct-cuda-int4-rtn-block-32.onnx
ONNX
cuda/cuda-int4-rtn-block-32/phi3-mini-128k-instruct-cuda-int4-rtn-block-32.onnx.data
cuda/cuda-int4-rtn-block-32/special_tokens_map.json
cuda/cuda-int4-rtn-block-32/tokenizer.json
cuda/cuda-int4-rtn-block-32/tokenizer.model
cuda/cuda-int4-rtn-block-32/tokenizer_config.json
directml/directml-int4-awq-block-128/added_tokens.json
directml/directml-int4-awq-block-128/config.json
directml/directml-int4-awq-block-128/configuration_phi3.py
directml/directml-int4-awq-block-128/genai_config.json
directml/directml-int4-awq-block-128/model.onnx
ONNX
directml/directml-int4-awq-block-128/model.onnx.data
directml/directml-int4-awq-block-128/special_tokens_map.json
directml/directml-int4-awq-block-128/tokenizer.json
directml/directml-int4-awq-block-128/tokenizer.model
directml/directml-int4-awq-block-128/tokenizer_config.json