ONNX 模型库
返回模型

说明文档


license: mit pipeline_tag: text-generation tags:

  • ONNX
  • DML
  • ONNXRuntime
  • phi3
  • nlp
  • conversational
  • custom_code inference: false

FusionQuill/Phi-3-mini-128k-instruct-onnx

作者 FusionQuill

text-generation transformers
↓ 4 ♥ 0

创建时间: 2024-05-17 16:15:13+00:00

更新时间: 2024-05-17 16:20:07+00:00

在 Hugging Face 上查看

文件 (57)

.gitattributes
LICENSE
README.md
config.json
configuration_phi3.py
cpu.zip
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/added_tokens.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/config.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/configuration_phi3.py
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/genai_config.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/phi3-mini-128k-instruct-cpu-int4-rtn-block-32-acc-level-4.onnx ONNX
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/phi3-mini-128k-instruct-cpu-int4-rtn-block-32-acc-level-4.onnx.data
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/special_tokens_map.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.json
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer.model
cpu_and_mobile/cpu-int4-rtn-block-32-acc-level-4/tokenizer_config.json
cpu_and_mobile/cpu-int4-rtn-block-32/added_tokens.json
cpu_and_mobile/cpu-int4-rtn-block-32/config.json
cpu_and_mobile/cpu-int4-rtn-block-32/configuration_phi3.py
cpu_and_mobile/cpu-int4-rtn-block-32/genai_config.json
cpu_and_mobile/cpu-int4-rtn-block-32/phi3-mini-128k-instruct-cpu-int4-rtn-block-32.onnx ONNX
cpu_and_mobile/cpu-int4-rtn-block-32/phi3-mini-128k-instruct-cpu-int4-rtn-block-32.onnx.data
cpu_and_mobile/cpu-int4-rtn-block-32/special_tokens_map.json
cpu_and_mobile/cpu-int4-rtn-block-32/tokenizer.json
cpu_and_mobile/cpu-int4-rtn-block-32/tokenizer.model
cpu_and_mobile/cpu-int4-rtn-block-32/tokenizer_config.json
cuda/cuda-fp16/added_tokens.json
cuda/cuda-fp16/config.json
cuda/cuda-fp16/configuration_phi3.py
cuda/cuda-fp16/genai_config.json
cuda/cuda-fp16/phi3-mini-128k-instruct-cuda-fp16.onnx ONNX
cuda/cuda-fp16/phi3-mini-128k-instruct-cuda-fp16.onnx.data
cuda/cuda-fp16/special_tokens_map.json
cuda/cuda-fp16/tokenizer.json
cuda/cuda-fp16/tokenizer.model
cuda/cuda-fp16/tokenizer_config.json
cuda/cuda-int4-rtn-block-32/added_tokens.json
cuda/cuda-int4-rtn-block-32/config.json
cuda/cuda-int4-rtn-block-32/configuration_phi3.py
cuda/cuda-int4-rtn-block-32/genai_config.json
cuda/cuda-int4-rtn-block-32/phi3-mini-128k-instruct-cuda-int4-rtn-block-32.onnx ONNX
cuda/cuda-int4-rtn-block-32/phi3-mini-128k-instruct-cuda-int4-rtn-block-32.onnx.data
cuda/cuda-int4-rtn-block-32/special_tokens_map.json
cuda/cuda-int4-rtn-block-32/tokenizer.json
cuda/cuda-int4-rtn-block-32/tokenizer.model
cuda/cuda-int4-rtn-block-32/tokenizer_config.json
directml/directml-int4-awq-block-128/added_tokens.json
directml/directml-int4-awq-block-128/config.json
directml/directml-int4-awq-block-128/configuration_phi3.py
directml/directml-int4-awq-block-128/genai_config.json
directml/directml-int4-awq-block-128/model.onnx ONNX
directml/directml-int4-awq-block-128/model.onnx.data
directml/directml-int4-awq-block-128/special_tokens_map.json
directml/directml-int4-awq-block-128/tokenizer.json
directml/directml-int4-awq-block-128/tokenizer.model
directml/directml-int4-awq-block-128/tokenizer_config.json
dml.zip