返回模型

说明文档

<div align="center"> <h1>FireRedChat-pVAD</h1> </div> <div align="center"> <a href="https://fireredteam.github.io/demos/firered_chat/">演示</a> • <a href="https://arxiv.org/pdf/2509.06502">论文</a> • <a href="https://huggingface.co/FireRedTeam">Huggingface</a> </div>

简介

FireRedChat 的个性化语音活动检测（pVAD）模型，一个支持说话人嵌入更新的开放权重语音活动检测模型。LiveKit 插件可在此获取

支持说话人嵌入更新，以提升语音活动检测效果。
该插件需要兼容的 LiveKit Agents 分支或修改版本，以便为用户的首次语音输入包含 update_speaker 调用。

路线图

[x] 2025/09
- [x] 发布 pVAD 模型权重和 LiveKit 插件。

使用方法

进行推理时，请使用 LiveKit 插件。安装和配置如下：

from livekit.plugins import fireredchat_pvad as pvad

def prewarm(proc: JobProcess):
    proc.userdata["vad"] = pvad.VAD.load(activation_threshold=0.5)

# 在首次语音输入后（或当基于 RMS 切换主要说话人时），调用 VADStream 的 update_speaker() 来更新说话人嵌入。

许可证

模型权重和插件代码采用 Apache-2.0 许可证授权。

致谢

说话人嵌入模型：speechbrain/spkrec-ecapa-voxceleb

FireRedTeam/FireRedChat-pvad

作者 FireRedTeam

voice-activity-detection

↓ 0 ♥ 12

创建时间: 2025-09-15 07:32:14+00:00

更新时间: 2025-09-22 05:10:53+00:00

在 Hugging Face 上查看

文件 (12)

.gitattributes

NOTICE

README.md

pvad.onnx ONNX

spkrec-ecapa-voxceleb/README.md

spkrec-ecapa-voxceleb/classifier.ckpt

spkrec-ecapa-voxceleb/config.json

spkrec-ecapa-voxceleb/embedding_model.ckpt

spkrec-ecapa-voxceleb/hyperparams.yaml

spkrec-ecapa-voxceleb/label_encoder.ckpt

spkrec-ecapa-voxceleb/label_encoder.txt

spkrec-ecapa-voxceleb/mean_var_norm_emb.ckpt