说明文档

Whisper Base ONNX

这是 OpenAI whisper-base 模型的 ONNX 转换版本，针对 Transformers.js 进行了优化。

模型详情

模型类型: Whisper（编码器-解码器）
任务: 自动语音识别
格式: ONNX（INT8 量化）
大小: ~75MB（从 ~300MB 量化而来）

使用方法

import { pipeline } from '@huggingface/transformers';

const transcriber = await pipeline('automatic-speech-recognition', 'markusingvarsson/whisper-test');
const result = await transcriber('audio.wav');
console.log(result.text);

转换详情

该模型通过自定义转换流程进行转换，该流程：

下载原始 HuggingFace 模型
导出为支持 KV 缓存的 ONNX 格式
应用 INT8 量化以减小体积
添加 Whisper 专用的对齐头以支持时间戳功能

量化后的模型体积约为原始模型的四分之一，同时保持了准确性。

markusingvarsson/whisper-test

作者 markusingvarsson

automatic-speech-recognition transformers.js

↓ 1 ♥ 0

创建时间: 2025-10-31 18:58:24+00:00

更新时间: 2025-11-01 11:51:34+00:00

在 Hugging Face 上查看

文件 (20)

.gitattributes

README.md

added_tokens.json

config.json

generation_config.json

merges.txt

normalizer.json

onnx/decoder_model.onnx ONNX

onnx/decoder_model_merged.onnx ONNX

onnx/decoder_model_merged_quantized.onnx ONNX

onnx/decoder_model_quantized.onnx ONNX

onnx/decoder_with_past_model.onnx ONNX

onnx/decoder_with_past_model_quantized.onnx ONNX

onnx/encoder_model.onnx ONNX

onnx/encoder_model_quantized.onnx ONNX

preprocessor_config.json

special_tokens_map.json

tokenizer.json

tokenizer_config.json

vocab.json