返回模型
说明文档
警告: 请不要将此仓库中的任何内容视为可用于生产环境。
检查点
- siglip_swinv2_base_2025_02_22_18h56m54s
在冻结的 SmilingWolf/wd-swinv2-tagger-v3 基础上训练的文本编码器,基本上是 SigLIT 风格。与现有的 DeepGHS 索引/嵌入兼容。 - siglip_swinv2_base_2025_05_02_22h02m36s
基于siglip_swinv2_base_2025_02_22_18h56m54s,图像编码器已解冻。所以这应该是热启动的 SigLIP。 - siglip_eva02_base_2025_05_02_21h53m54s
使用不同架构的测试,从头开始使用 SigLIP 训练。
使用示例
请参阅 deepghs/search_image_by_image_or_text 查看示例用法。
兼容性
此仓库中的检查点已结构化以与 dghs-imgutils 包兼容。
您可以通过以下两种方式在本地运行
通过代码推理
pip install dghs-imgutils>=0.17.0
from imgutils.generic import siglip_predict
pred = siglip_predict(
images=[
'https://huggingface.co/datasets/narugo1992/nzb_files/resolve/main/eshuushuu_51.webp',
],
texts=[
# 短标签
'1girl',
'1boy',
'orange_hair',
'blue_hair',
# 长文本
'1girl, solo, thighhighs, orange_shirt, twintails, brown_hair, hair_bun, long_hair, double_bun, '
'zettai_ryouiki, jar, sitting, bow, school_uniform, long_sleeves, smile, pink_bow, skirt, orange_skirt, '
'very_long_hair, black_thighhighs, orange_dress, miniskirt',
'food, halo, red_eyes, side_ponytail, skirt, macaron, pink_hair, sailor_collar, holding, '
'black_sailor_collar, cake, long_hair, drumsticks, black_skirt, pleated_skirt, pink_halo, 1girl, '
'ahoge, red_neckerchief, chibi, neckerchief, long_sleeves, holding_food, sash, blush, holding_drumsticks, '
'multiple_views, white_cardigan, looking_at_viewer,'
],
repo_id='deepghs/siglip_beta',
model_name='smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s'
)
print(pred)
# [[2.5059912e-02 1.7571157e-04 2.1646977e-03 1.8494057e-04 1.0000000e+00
# 3.8877626e-15]]
启动 Gradio 演示
pip install dghs-imgutils[demo]>=0.17.0
from imgutils.generic import SigLIPModel
SigLIPModel(
repo_id='deepghs/siglip_beta',
).launch_demo(
default_model_name='smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s'
)
deepghs/siglip_beta
作者 deepghs
zero-shot-image-classification
dghs-imgutils
↓ 0
♥ 9
创建时间: 2025-05-04 12:11:11+00:00
更新时间: 2025-06-02 04:27:26+00:00
在 Hugging Face 上查看文件 (17)
.gitattributes
README.md
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/image_encode.onnx
ONNX
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/meta.json
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/preprocessor.json
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/text_encode.onnx
ONNX
smilingwolf/siglip_eva02_base_2025_05_02_21h53m54s/tokenizer.json
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/image_encode.onnx
ONNX
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/meta.json
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/preprocessor.json
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/text_encode.onnx
ONNX
smilingwolf/siglip_swinv2_base_2025_02_22_18h56m54s/tokenizer.json
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/image_encode.onnx
ONNX
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/meta.json
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/preprocessor.json
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/text_encode.onnx
ONNX
smilingwolf/siglip_swinv2_base_2025_05_02_22h02m36s/tokenizer.json