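PaddleOCR ONNX Models
PaddleOCR PP-OCRv5 models converted to ONNX format for efficient OCR inference.
Chinese Description
This repository provides ONNX-format versions of the PaddleOCR PP-OCRv5 models for efficient text recognition inference.
Model Files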
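| File | Description |
|---|---|
| PP-OCRv5_server_det_infer.onnx | Text detection model: locates text regions in an image |
| PP-OCRv5_server_rec_infer.onnx | Text recognition model: recognizes the text content |
| PP-LCNet_x1_0_textline_ori_infer.onnx | Text line orientation classification model |
| PP-LCNet_x1_0_doc_ori_infer.onnx | Document orientation correction model |
| UVDoc_infer.onnx | Document unwarping model |
| PP-OCRv5_server_rec_infer.yml | Character dictionary config file |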
Quick Start
from huggingface_hub import hf_hub_download

# Download the text detection model
det_model_path = hf_hub_download(
    repo_id="marsena/paddleocr-onnx-models",
    filename="PP-OCRv5_server_det_infer.onnx",
)

# Download the text recognition model
rec_model_path = hf_hub_download(
    repo_id="marsena/paddleocr-onnx-models",
    filename="PP-OCRv5_server_rec_infer.onnx",
)
Download All Models at Once
from huggingface_hub import snapshot_download

# Download all model files to a local folder
snapshot_download(
    repo_id="marsena/paddleocr-onnx-models",
    local_dir="./paddleocr_onnx",
)
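Supported Languages and Text Types
- Languages: Simplified Chinese, Traditional Chinese, English, Japanese
- Text Types: Printed text, handwriting, vertical text, rotated text, curved text
- Input Format: Image files (JPEG, PNG)
- Runtime Environment: CPU and GPU inference supported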
English
Model Files
| File | Description |
|---|---|
| PP-OCRv5_server_det_infer.onnx | Text detection model |
| PP-OCRv5_server_rec_infer.onnx | Text recognition model |
| PP-LCNet_x1_0_textline_ori_infer.onnx | Text orientation classification |
| PP-LCNet_x1_0_doc_ori_infer.onnx | Document orientation correction |
| UVDoc_infer.onnx | Document unwarping |
| PP-OCRv5_server_rec_infer.yml | Character dictionary config |
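The character dictionary is what turns the recognition model's per-timestep scores into text. The sketch below shows greedy CTC decoding under stated assumptions: the recognition output for one image is a `(T, C)` score matrix, class index 0 is the CTC blank, and indices 1..N map to the dictionary entries in order. The function name `ctc_greedy_decode` is hypothetical, and how the character list is read from `PP-OCRv5_server_rec_infer.yml` is not shown because the file layout is not described here.

```python
import numpy as np

BLANK_INDEX = 0  # assumption: class 0 is the CTC blank symbol


def ctc_greedy_decode(scores: np.ndarray, charset: list[str]) -> str:
    """Decode a (T, C) score matrix into text with greedy CTC decoding.

    Assumes charset[i] corresponds to class index i + 1.
    """
    best = scores.argmax(axis=1)  # most likely class per timestep
    chars = []
    previous = BLANK_INDEX
    for index in best:
        index = int(index)
        # Keep a class only if it is not blank and not a repeat of the
        # immediately preceding timestep.
        if index != BLANK_INDEX and index != previous:
            chars.append(charset[index - 1])
        previous = index
    return "".join(chars)
```

A blank between two identical classes is what allows a doubled character to survive the repeat-collapsing step.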
Source Models
These ONNX models are converted from official PaddlePaddle PP-OCRv5 models:
- Detection Model: PaddlePaddle/PP-OCRv5_server_det
- Recognition Model: PaddlePaddle/PP-OCRv5_server_rec
- Official Documentation: PP-OCRv5 Introduction
Usage
Download Specific Model
from huggingface_hub import hf_hub_download

# Download detection model
det_model_path = hf_hub_download(
    repo_id="marsena/paddleocr-onnx-models",
    filename="PP-OCRv5_server_det_infer.onnx",
)

# Download recognition model
rec_model_path = hf_hub_download(
    repo_id="marsena/paddleocr-onnx-models",
    filename="PP-OCRv5_server_rec_infer.onnx",
)
Download All Models
from huggingface_hub import snapshot_download

# Download all model files to a local directory
snapshot_download(
    repo_id="marsena/paddleocr-onnx-models",
    local_dir="./paddleocr_onnx",
)
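After a snapshot download it can be useful to confirm that every file listed in the model table actually arrived. The following is a minimal sketch using only the standard library; the helper name `missing_model_files` is hypothetical, and the directory matches the `local_dir` used above.

```python
from pathlib import Path

# File names taken from the model table in this README.
EXPECTED_FILES = [
    "PP-OCRv5_server_det_infer.onnx",
    "PP-OCRv5_server_rec_infer.onnx",
    "PP-LCNet_x1_0_textline_ori_infer.onnx",
    "PP-LCNet_x1_0_doc_ori_infer.onnx",
    "UVDoc_infer.onnx",
    "PP-OCRv5_server_rec_infer.yml",
]


def missing_model_files(local_dir: str) -> list[str]:
    """Return the expected file names that are not present in local_dir."""
    root = Path(local_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_model_files("./paddleocr_onnx")
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        print("All expected model files are present.")
```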
ONNX Runtime Inference
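import numpy as np
import onnxruntime as ort

# Load the detection model
session = ort.InferenceSession("PP-OCRv5_server_det_infer.onnx")

# Placeholder input for illustration only: a float32 NCHW batch (assumed layout).
# Real use requires a preprocessed image in place of random values.
input_data = np.random.rand(1, 3, 640, 640).astype(np.float32)

# Run inference
input_name = session.get_inputs()[0].name
output = session.run(None, {input_name: input_data})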
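The detection model takes a preprocessed tensor, not a raw image file. The sketch below shows one plausible way to build that tensor with NumPy alone. Several things here are assumptions rather than facts from this README: the ImageNet mean and standard deviation, the requirement that each side be a multiple of 32, and the channel order of the input image. The sketch also pads instead of resizing, which is a simplification, and `preprocess_for_detection` is a hypothetical helper name. Check these choices against the source model's configuration before relying on them.

```python
import numpy as np

# Assumed normalization constants (ImageNet mean/std); verify against the
# source model's preprocessing config.
MEAN = np.array([0.485, 0.456, 0.406], dtype=np.float32)
STD = np.array([0.229, 0.224, 0.225], dtype=np.float32)


def preprocess_for_detection(image: np.ndarray, multiple: int = 32) -> np.ndarray:
    """Convert an HxWx3 uint8 image into a 1x3xH'xW' float32 tensor.

    H' and W' are H and W rounded up to the next multiple of `multiple`.
    """
    if image.ndim != 3 or image.shape[2] != 3:
        raise ValueError("expected an HxWx3 image")
    height, width = image.shape[:2]
    # Round each side up to the next multiple (ceiling division).
    padded_h = -(-height // multiple) * multiple
    padded_w = -(-width // multiple) * multiple
    # Scale to [0, 1], then normalize per channel.
    normalized = (image.astype(np.float32) / 255.0 - MEAN) / STD
    # Place the image in the top-left corner of a zero-filled canvas.
    canvas = np.zeros((padded_h, padded_w, 3), dtype=np.float32)
    canvas[:height, :width] = normalized
    # HWC -> CHW, add a batch dimension, and make the memory contiguous.
    return np.ascontiguousarray(canvas.transpose(2, 0, 1)[np.newaxis, ...])
```

The returned array can be passed as `input_data` to `session.run`. Detected box coordinates then refer to the padded canvas, whose top-left corner coincides with the original image.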
Model Specifications
- Languages: Simplified Chinese, Traditional Chinese, English, Japanese
- Text Types: Printed text, handwriting, vertical text, rotated text
- Input Format: Images (JPEG, PNG)
- Output Format: Bounding boxes + recognized text
- Runtime: ONNX Runtime 1.16+
- Hardware: CPU and GPU inference supported
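Whether inference runs on CPU or GPU depends on which execution providers the installed ONNX Runtime build offers. A minimal sketch of picking providers follows, assuming the GPU build exposes `CUDAExecutionProvider`; the helper names `choose_providers` and `create_session` are hypothetical.

```python
def choose_providers(available: list[str], prefer_gpu: bool = True) -> list[str]:
    """Return an ordered provider list, always ending with the CPU fallback."""
    providers = []
    if prefer_gpu and "CUDAExecutionProvider" in available:
        providers.append("CUDAExecutionProvider")
    providers.append("CPUExecutionProvider")
    return providers


def create_session(model_path: str, prefer_gpu: bool = True):
    """Create an InferenceSession using the providers chosen above."""
    # Imported lazily so choose_providers stays usable without onnxruntime.
    import onnxruntime as ort

    providers = choose_providers(ort.get_available_providers(), prefer_gpu)
    return ort.InferenceSession(model_path, providers=providers)
```

For example, `create_session("PP-OCRv5_server_det_infer.onnx")` would use CUDA when it is available and fall back to CPU otherwise.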
License
These models are released under the Apache License 2.0, consistent with the original PaddleOCR project.
- PaddleOCR Repository: https://github.com/PaddlePaddle/PaddleOCR
- License Details: Apache License 2.0
Conversion Information
- Conversion Tool: Paddle2ONNX
- ONNX Version: 1.12+
- Source Framework: PaddlePaddle 2.5+
- Conversion Date: January 2025
Citation
If you use these models in your research, please cite the original PaddleOCR paper:
@misc{paddleocr2020,
title={PaddleOCR: Awesome multilingual OCR toolkits},
author={PaddlePaddle Authors},
howpublished = {\url{https://github.com/PaddlePaddle/PaddleOCR}},
year={2020}
}
Issues
Please report model usage issues to the original PaddleOCR repository:
- PaddleOCR Issues: https://github.com/PaddlePaddle/PaddleOCR/issues
Repository Information
- Repository: marsena/paddleocr-onnx-models
- Author: marsena
- Tags: image-to-text, onnxruntime
- Created: 2025-09-01 06:02:04+00:00
- Updated: 2025-09-01 06:15:25+00:00