PaddleOCR tool library v2.8.1

Python

2.8.1

Download

PaddleOCR aims to create a rich, leading and practical OCR tool library to help users train better models and implement them.

PP-OCR is a practical ultra-lightweight OCR system. It mainly consists of three parts: DB text detection, detection frame correction and CRNN text recognition. The system adopts 19 effective strategies from eight aspects: backbone network selection and adjustment, prediction head design, data enhancement, learning rate transformation strategy, regularization parameter selection, use of pre-training models, and automatic model cropping and quantification. The model was optimized and slimmed down, and finally an ultra-lightweight Chinese and English OCR with an overall size of 3.5M and an English digital OCR of 2.8M were obtained.

characteristic

1. PPOCR series high-quality pre-training model, accurate recognition effect

Ultra-light ppocr_mobile mobile series: detection (2.6M) + direction classifier (0.9M) + recognition (4.6M) = 8.1M

General ppocr_server series: detection (47.2M) + direction classifier (0.9M) + recognition (107M) = 155.1M

Ultra-lightweight compression ppocr_mobile_slim series: detection (1.4M) + direction classifier (0.5M) + recognition (1.6M) = 3.5M

2. Support Chinese and English number combination recognition, vertical text recognition, and long text recognition

3. Support multi-language recognition: Korean, Japanese, German, French

4. Support user-defined training and provide rich predictive reasoning deployment solutions

5. Support PIP quick installation and use

6. Can run on Linux, Windows, MacOS and other systems

Expand

Additional Information

Version 2.8.1
Type Python
Update Time 2024-10-19
size 109.42MB

Related Applications

feilong development tool library v4.1.2

2024-11-14
DbUtils database query tool kit v1.8.1

2024-11-13
PaddleOCR tool library v2.9.1

2024-11-13
PaddleNLP v2.8.1

2024-11-13
PaddleOCR

2024-11-09
ofdrw reading and writing library v2.3.3

2024-10-19

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
Google Blog Converters (blog data converter)

Python

1.0 R54
Nuitka

Python

1.0.0
smartchart data visualization platform v6.9

Python

6.9
waymo open dataset

Other source code

December 2023 Update
termwind

Other categories

v2.3.0
wp functions

Other categories

1.0.0

Related Information All

PaddleOCR tool library v2.8.1

characteristic