ONNX 模型库
返回模型

说明文档

任务: image-classification
后端: sagemaker-training
后端参数: {'instance_type': 'ml.m5.2xlarge', 'supported_instructions': 'avx512'}
评估样本数量: 全部数据集

固定参数:

  • 数据集: [{'path': 'beans', 'eval_split': 'validation', 'data_keys': {'primary': 'image'}, 'ref_keys': ['labels'], 'name': None, 'calibration_split': 'train'}]
  • name_or_path: nateraw/vit-base-beans
  • from_transformers: True
  • node_exclusion: []
  • 校准:
    • 方法: percentile
    • num_calibration_samples: 128
    • calibration_histogram_percentile: 99.999

基准测试参数:

  • 框架: onnxruntime, pytorch
  • 量化方法: dynamic, static
  • 待量化算子: ['Add', 'MatMul'], ['Add']
  • 逐通道: False, True
  • 框架参数: {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4}, {}
  • 缩减范围: True, False
  • 应用量化: True, False

评估

非时间指标

框架 量化方法 待量化算子 逐通道 框架参数 缩减范围 应用量化 准确率
onnxruntime None None None {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} None False | 0.977
onnxruntime dynamic ['Add', 'MatMul'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 0.977
onnxruntime dynamic ['Add', 'MatMul'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 0.977
onnxruntime dynamic ['Add', 'MatMul'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 0.977
onnxruntime dynamic ['Add', 'MatMul'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 0.977
onnxruntime dynamic ['Add'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 0.977
onnxruntime dynamic ['Add'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 0.977
onnxruntime dynamic ['Add'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 0.977
onnxruntime dynamic ['Add'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 0.977
onnxruntime static ['Add', 'MatMul'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 0.421
onnxruntime static ['Add', 'MatMul'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 0.421
onnxruntime static ['Add', 'MatMul'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 0.316
onnxruntime static ['Add', 'MatMul'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 0.451
onnxruntime static ['Add'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 0.361
onnxruntime static ['Add'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 0.361
onnxruntime static ['Add'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 0.361
onnxruntime static ['Add'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 0.361
pytorch None None None {} None None | 0.977

时间指标

时间基准测试每个配置运行 15 秒。

以下是批大小 = 1,输入长度 = 224 的时间指标。

框架 量化方法 待量化算子 逐通道 框架参数 缩减范围 应用量化 平均延迟 (ms) 吞吐量 (/s)
onnxruntime None None None {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} None False | 130.41 | 7.73
onnxruntime dynamic ['Add', 'MatMul'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 102.44 | 9.80
onnxruntime dynamic ['Add', 'MatMul'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 101.57 | 9.87
onnxruntime dynamic ['Add', 'MatMul'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 102.37 | 9.80
onnxruntime dynamic ['Add', 'MatMul'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 102.36 | 9.80
onnxruntime dynamic ['Add'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 130.67 | 7.67
onnxruntime dynamic ['Add'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 131.29 | 7.67
onnxruntime dynamic ['Add'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 132.65 | 7.60
onnxruntime dynamic ['Add'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 131.03 | 7.67
onnxruntime static ['Add', 'MatMul'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 127.99 | 7.87
onnxruntime static ['Add', 'MatMul'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 128.27 | 7.80
onnxruntime static ['Add', 'MatMul'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 131.10 | 7.67
onnxruntime static ['Add', 'MatMul'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 130.29 | 7.73
onnxruntime static ['Add'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 164.55 | 6.13
onnxruntime static ['Add'] False {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 168.61 | 5.93
onnxruntime static ['Add'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} False True | 164.52 | 6.13
onnxruntime static ['Add'] True {'opset': 13, 'optimization_level': 1, 'intra_op_num_threads': 4} True True | 165.31 | 6.07
pytorch None None None {} None None | 149.23 | 6.73

fxmarty/20220911-h13m59s08_vit_beans_quantization

作者 fxmarty

image-classification
↓ 0 ♥ 0

创建时间: 2022-09-11 14:42:53+00:00

更新时间: 2022-09-11 14:44:14+00:00

在 Hugging Face 上查看

文件 (105)

.gitattributes
20220911-h14m03s04_0/augmented_model.onnx ONNX
20220911-h14m03s04_0/calibration_histograms.npy
20220911-h14m03s04_0/model.onnx ONNX
20220911-h14m03s04_0/ort_config.json
20220911-h14m03s04_0/quantized_model.onnx ONNX
20220911-h14m03s04_0/results.json
20220911-h14m04s10_1/model.onnx ONNX
20220911-h14m04s10_1/ort_config.json
20220911-h14m04s10_1/quantized_model.onnx ONNX
20220911-h14m04s10_1/results.json
20220911-h14m05s03_2/model.onnx ONNX
20220911-h14m05s03_2/ort_config.json
20220911-h14m05s03_2/quantized_model.onnx ONNX
20220911-h14m05s03_2/results.json
20220911-h14m09s58_3/augmented_model.onnx ONNX
20220911-h14m09s58_3/calibration_histograms.npy
20220911-h14m09s58_3/model.onnx ONNX
20220911-h14m09s58_3/ort_config.json
20220911-h14m09s58_3/quantized_model.onnx ONNX
20220911-h14m09s58_3/results.json
20220911-h14m14s51_4/augmented_model.onnx ONNX
20220911-h14m14s51_4/calibration_histograms.npy
20220911-h14m14s51_4/model.onnx ONNX
20220911-h14m14s51_4/ort_config.json
20220911-h14m14s51_4/quantized_model.onnx ONNX
20220911-h14m14s51_4/results.json
20220911-h14m18s36_5/augmented_model.onnx ONNX
20220911-h14m18s36_5/calibration_histograms.npy
20220911-h14m18s36_5/model.onnx ONNX
20220911-h14m18s36_5/ort_config.json
20220911-h14m18s36_5/quantized_model.onnx ONNX
20220911-h14m18s36_5/results.json
20220911-h14m19s29_6/model.onnx ONNX
20220911-h14m19s29_6/ort_config.json
20220911-h14m19s29_6/quantized_model.onnx ONNX
20220911-h14m19s29_6/results.json
20220911-h14m20s17_7/model.onnx ONNX
20220911-h14m20s17_7/ort_config.json
20220911-h14m20s17_7/quantized_model.onnx ONNX
20220911-h14m20s17_7/results.json
20220911-h14m25s21_8/augmented_model.onnx ONNX
20220911-h14m25s21_8/calibration_histograms.npy
20220911-h14m25s21_8/model.onnx ONNX
20220911-h14m25s21_8/ort_config.json
20220911-h14m25s21_8/quantized_model.onnx ONNX
20220911-h14m25s21_8/results.json
20220911-h14m29s08_9/augmented_model.onnx ONNX
20220911-h14m29s08_9/calibration_histograms.npy
20220911-h14m29s08_9/model.onnx ONNX
20220911-h14m29s08_9/ort_config.json
20220911-h14m29s08_9/quantized_model.onnx ONNX
20220911-h14m29s08_9/results.json
20220911-h14m29s56_10/model.onnx ONNX
20220911-h14m29s56_10/ort_config.json
20220911-h14m29s56_10/quantized_model.onnx ONNX
20220911-h14m29s56_10/results.json
20220911-h14m31s00_11/model.onnx ONNX
20220911-h14m31s00_11/ort_config.json
20220911-h14m31s00_11/quantized_model.onnx ONNX
20220911-h14m31s00_11/results.json
20220911-h14m34s46_12/augmented_model.onnx ONNX
20220911-h14m34s46_12/calibration_histograms.npy
20220911-h14m34s46_12/model.onnx ONNX
20220911-h14m34s46_12/ort_config.json
20220911-h14m34s46_12/quantized_model.onnx ONNX
20220911-h14m34s46_12/results.json
20220911-h14m35s34_13/model.onnx ONNX
20220911-h14m35s34_13/ort_config.json
20220911-h14m35s34_13/quantized_model.onnx ONNX
20220911-h14m35s34_13/results.json
20220911-h14m36s21_14/model.onnx ONNX
20220911-h14m36s21_14/ort_config.json
20220911-h14m36s21_14/quantized_model.onnx ONNX
20220911-h14m36s21_14/results.json
20220911-h14m41s28_15/augmented_model.onnx ONNX
20220911-h14m41s28_15/calibration_histograms.npy
20220911-h14m41s28_15/model.onnx ONNX
20220911-h14m41s28_15/ort_config.json
20220911-h14m41s28_15/quantized_model.onnx ONNX
20220911-h14m41s28_15/results.json
20220911-h14m42s13_16/model.onnx ONNX
20220911-h14m42s13_16/results.json
20220911-h14m42s53_17/results.json
README.md
runs.json
tensorboard/1662907378.8460534/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.1
tensorboard/1662907378.8475587/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.2
tensorboard/1662907378.848857/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.3
tensorboard/1662907378.8500683/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.4
tensorboard/1662907378.8512115/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.5
tensorboard/1662907378.8523538/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.6
tensorboard/1662907378.8535051/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.7
tensorboard/1662907378.8546307/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.8
tensorboard/1662907378.855868/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.9
tensorboard/1662907378.857226/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.10
tensorboard/1662907378.8584979/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.11
tensorboard/1662907378.8597944/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.12
tensorboard/1662907378.8610377/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.13
tensorboard/1662907378.8623042/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.14
tensorboard/1662907378.8636096/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.15
tensorboard/1662907378.8649743/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.16
tensorboard/1662907378.8668153/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.17
tensorboard/1662907378.868103/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.18
tensorboard/events.out.tfevents.1662907378.ip-10-0-138-55.ec2.internal.1.0