ONNX 模型库
返回模型

说明文档

警告: 请不要将此仓库中的任何内容视为可用于生产环境。

检查点

  • siglip_swinv2_base_2025_02_22_18h56m54s
    在冻结的 SmilingWolf/wd-swinv2-tagger-v3 基础上训练的文本编码器,基本上是 SigLIT 风格。与现有的 DeepGHS 索引/嵌入兼容。
  • siglip_swinv2_base_2025_05_02_22h02m36s
    基于 siglip_swinv2_base_2025_02_22_18h56m54s,图像编码器已解冻。所以这应该是热启动的 SigLIP。
  • siglip_eva02_base_2025_05_02_21h53m54s
    使用不同架构的测试,从头开始使用 SigLIP 训练。

使用示例

请参阅 deepghs/search_image_by_image_or_text 查看示例用法。

兼容性

此仓库中的检查点已结构化以与 dghs-imgutils 包兼容。

您可以通过以下两种方式在本地运行

通过代码推理

pip install dghs-imgutils>=0.17.0
from imgutils.generic import siglip_predict

pred = siglip_predict(
    images=[
        'https://huggingface.co/datasets/narugo1992/nzb_files/resolve/main/eshuushuu_51.webp',
    ],
    texts=[
        # 短标签
        '1girl',
        '1boy',
        'orange_hair',
        'blue_hair',

        # 长文本
        '1girl, solo, thighhighs, orange_shirt, twintails, brown_hair, hair_bun, long_hair, double_bun, '
        'zettai_ryouiki, jar, sitting, bow, school_uniform, long_sleeves, smile, pink_bow, skirt, orange_skirt, '
        'very_long_hair, black_thighhighs, orange_dress, miniskirt',
        'food, halo, red_eyes, side_ponytail, skirt, macaron, pink_hair, sailor_collar, holding, '
        'black_sailor_collar, cake, long_hair, drumsticks, black_skirt, pleated_skirt, pink_halo, 1girl, '
        'ahoge, red_neckerchief, chibi, neckerchief, long_sleeves, holding_food, sash, blush, holding_drumsticks, '
        'multiple_views, white_cardigan, looking_at_viewer,'
    ],
    repo_id='deepghs/siglip_beta',
    model_name='smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s'
)
print(pred)
# [[2.5059912e-02 1.7571157e-04 2.1646977e-03 1.8494057e-04 1.0000000e+00
#   3.8877626e-15]]

启动 Gradio 演示

pip install dghs-imgutils[demo]>=0.17.0
from imgutils.generic import SigLIPModel

SigLIPModel(
    repo_id='deepghs/siglip_beta',
).launch_demo(
    default_model_name='smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s'
)

deepghs/siglip_beta

作者 deepghs

zero-shot-image-classification dghs-imgutils
↓ 0 ♥ 9

创建时间: 2025-05-04 12:11:11+00:00

更新时间: 2025-06-02 04:27:26+00:00

在 Hugging Face 上查看

文件 (17)

.gitattributes
README.md
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/image_encode.onnx ONNX
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/meta.json
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/preprocessor.json
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/text_encode.onnx ONNX
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/tokenizer.json
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/image_encode.onnx ONNX
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/meta.json
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/preprocessor.json
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/text_encode.onnx ONNX
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/tokenizer.json
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/image_encode.onnx ONNX
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/meta.json
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/preprocessor.json
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/text_encode.onnx ONNX
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/tokenizer.json