返回模型

说明文档

MobileNetV3 — ONNX，量化版

🔥 轻量级移动端模型，用于图像分类，分为两类：

document（文档）（扫描件、收据、纸张、发票）
photo（照片）（常规手机照片：场景、人物、自然等）

🟢 概述

专为移动设备设计（手机和平板，Android/iOS），非常适合实时端侧推理！
架构：MobileNetV2
格式：ONNX（包含 float32 和量化 int8 版本）
在平衡的真实世界开源数据集上训练，涵盖文档和照片。
适用于以下任务：
- 相册/相机胶卷中的文档检测
- 截图、收据、照片和 PDF 预览分类
- 隐私优先的离线 AI 助手的图像排序

🏷️ 模型类别

0 — document（文档）
1 — photo（照片）

⚡️ 版本

mobilenet_v3_small.onnx — 标准 float32 版本，精度最高（最适合 ARM/CPU）
mobilenet_v3_small_quant.onnx — 量化 int8 版本，推理更快，文件更小（最适合低功耗或边缘设备）

🚀 为什么选择这个模型？

超小体积（~10-15MB），在大多数手机上实现实时推理（<100ms）
100% 离线运行（隐私保护，无需云端）
易于集成，支持任何框架，包括 React Native（onnxruntime-react-native）、Android（ONNX Runtime）和 iOS。

🗃️ 数据集

照片： alfredplpl/Japanese-photos
文档： 3sara/colpali_italian_documents

🤖 作者

@vlad-m-dev 为边缘 AI/手机/平板离线图像分类构建：文档 vs 照片 Telegram: https://t.me/dwight_schrute_engineer

🛠️ 使用示例

import onnxruntime as ort
import numpy as np

session = ort.InferenceSession(MODEL_PATH)
img = np.random.randn(1, 3, 224, 224).astype(np.float32)  # 替换为你的图像预处理！
output = session.run(None, {"input": img})
pred_class = np.argmax(output[0])
print(pred_class)  # 0 = document（文档）, 1 = photo（照片）

vlad-m-dev/mobilenet_v3_small_onnx_photo_doc

作者 vlad-m-dev

image-classification

↓ 0 ♥ 2

创建时间: 2025-06-15 11:21:06+00:00

更新时间: 2025-06-15 12:18:47+00:00

在 Hugging Face 上查看

文件 (4)

.gitattributes

README.md

mobilenet_v3_small.onnx ONNX

mobilenet_v3_small_quant.onnx ONNX